What is COSMIC?

COSMIC: The world's leading curated knowledgebase empowering your cancer genomics investigations

COSMIC is the world’s most comprehensive and respected knowledgebase for somatic mutations in human cancers. Established for more than 20 years, COSMIC offers standardised, curated, and streamlined data to academics, drug discovery researchers, clinical scientists, diagnostic companies and beyond. Whether you are exploring the impact of somatic mutations or driving discoveries, COSMIC is your trusted partner.

Meticulously curated by our expert Scientists Curators, our knowledgebase collates high-quality published data and transforms it into highly standardised, continuously enriched datasets, saving our users hours and days of work while assisting in making confident decisions. Our data sources include over 29,000 peer-reviewed publications, enriched by major datasets like The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC). COSMIC integrates genome-wide screens and targeted analyses, enabling robust, reliable insights into cancer genomics.

COSMIC in numbers

Total Samples

1.6 Million

COSMIC contains samples from a wide range of primary sites and cancer types

Breakdown per tumour source

The depth of COSMIC's cancer genomics data, highlighted by the number of samples in COSMIC, categorised by tumour source

Tumour Source Knowledgebase Samples
Solid Cancers 1,150,000
Blood & Lymphatic Cancers 444,000
Circulating Tumour DNA 6,000

Breakdown per primary site

The diversity of cancer types covered in COSMIC demonstrated by the proportion of samples by primary site

Sample count for 5 most prevalent cancers worldwide

The vast number of samples in COSMIC for each of the world's most prevalent cancers according to The WHO

Cancer type Knowledgebase Samples
Trachea, bronchus and lung 217,049
Breast 62,902
Colorectum 216,352
Prostate 26,103
Stomach 29,858

Total Variants

29 Million

COSMIC includes mutations curated from peer-reviewed scientific publications and other public datasets

Breakdown per variant type

The breadth of mutation data available in COSMIC demonstrated by the count of variants, categorised by variant type

 

Variant type Knowledgebase variants
SNV 23,000,000
Insertions & Deletions 2,000,000
Structural & Copy Number Variants 4,300,000
Fusions 20,000