Posted on 2020-10-09, 08:42. Authored by Yanping Xie, Brittny C. Davis Lynn, Nicholas Moir, David A. Cameron, Jonine D. Figueroa, Andrew H. Sims
Summary
This metadata record provides details of the data supporting the related manuscript: “Breast cancer gene expression datasets do not reflect the disease at the population level”.
The related study aimed to determine how representative publicly available tumor gene expression datasets are of clinical populations.
As the data are all publicly available in appropriate community repositories, no primary data are included with this metadata record. Instead, the attached spreadsheet lists the 70 publicly available datasets with their details, including the repositories in which they are stored and their accession numbers. Together, the 70 datasets represent 16,130 breast carcinomas.
Data access
All of the gene expression datasets analysed in the study are already publicly available, and their accession numbers and original publication references are listed in the Supplementary Table included with this metadata record.
The 70 datasets were identified by searching publicly available data and restricting the results to studies representing a minimum of 50 breast cancer patients with primary tumours.
Funding
Breast Cancer Now
Cancer Research UK
U.S. Department of Health & Human Services | NIH | NCI | Division of Cancer Epidemiology and Genetics, National Cancer Institute
Temporal trends in incidence and mortality of molecular subtypes of breast cancer to inform public health, policy and prevention