Description

The Australian Medical Genome Reference Bank (MGRB) collected whole-genome sequencing data of 4,011 healthy elderly individuals who lived ≥70 years, to make sure that the dataset is depleted of damaging genetic variants. Age and sex summary graphs are available from the MGRB website.

Data Access

Due to license restrictions, the data for this track cannot be downloaded from the UCSC Genome Browser. The Table Browser, Data Integrator, and download server are not available for this track.

VCF access can be requested via a form from Sydney Genomics.

Methods

The 4,011 MGRB samples underwent whole-genome sequencing on Illumina HiSeq X instruments at KCCG under ISO 15189 accreditation, using paired-end TruSeq DNA Nano libraries sequenced one lane per sample. Alignment of sequence reads to the hg38 reference genome assembly was with bwa 0.7.15-r1140. Variants were called following the Genome Analysis Toolkit (GATK) best practices procedure using GATK 4.1.4.0. A sites-only VCF with only passing variants (FILTER=PASS) was made with bcftools 1.20.

We received VCF files from m.hobbs@garvan.org.au via a transfer link and imported these VCFs. We provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track. For some tracks, python scripts were necessary and are also available from GitHub.

References

Lacaze P, Pinese M, Kaplan W, Stone A, Brion MJ, Woods RL, McNamara M, McNeil JJ, Dinger ME, Thomas DM. The Medical Genome Reference Bank: a whole-genome data resource of 4000 healthy elderly individuals. Rationale and cohort design. Eur J Hum Genet. 2019 Feb;27(2):308-316. PMID: 30353151; PMC: PMC6336775