Dataset containing genotype data for single nucleotide polymorphism markers in sporadic breast cancer related genes in a Sri Lankan case-control cohort of postmenopausal women

This dataset consists of three Excel spreadsheets in .xlsx format, containing the raw genotype data of a study which investigated the association of single nucleotide polymorphisms (SNPs) in breast cancer related genes with the risk and clinicopathological profiles of sporadic breast cancer among Sri Lankan postmenopausal women.

A case-control study design was adopted to conduct SNP marker disease association testing using single variant and haplotype association testing strategies in an existing blood resource obtained from a cohort of Sri Lankan postmenopausal women with clinically phenotyped sporadic breast cancer and healthy controls. The list of haplotype-tagging SNP markers for genotyping was selected based on the information available in the published literature and the use of bioinformatics tools and databases.

During the study, 350 Sri Lankan postmenopausal women with histopathologically confirmed invasive breast cancer (cases), and 350 healthy postmenopausal women (controls) were genotyped for 57 haplotype-tagging single nucleotide polymorphisms (SNPs) in 36 candidate genes associated with sporadic breast cancer using the iPLEX Sequenom Mass-Array platform.

The dataset contains the following files:

Data file 1_Genotypes_cases.xlsx consists of a single spreadsheet, containing the raw genotype data for the 57 SNPs in the 350 postmenopausal women with sporadic breast cancer (cases).

Data file 2_Genotypes_controls.xlsx consists of a single spreadsheet, containing raw genotype data for the 57 SNPs for 350 healthy postmenopausal women (controls).

Datafile 3_Primer Sequences consists of a single spreadsheet, containing the primer sequences used to amplify each of the 57 SNPs.

The .xlsx files can be opened using Microsoft Excel or any open spreadsheet software.