Description

This track shows high-confidence structural variants (SVs) identified by Oxford Nanopore long-read sequencing of 3,622 Icelanders recruited through the deCODE genetics population cohort. The release contains 133,886 SVs (55,649 deletions, 75,050 insertions and 3,187 combined insertion/deletion events). Variants are site-level (no per-sample genotypes) and have been filtered to a high-confidence subset validated in the accompanying population-scale analysis.

Note that this release does not include allele counts or allele frequencies: each row represents a site that was called with high confidence in the cohort, but the number of carrier samples is not provided, so the track cannot be filtered by AF/AC.

Display Conventions and Configuration

Items are colored by SV type:

Insertions are placed at the insertion site with a width of 1 bp; deletions span the deleted interval; INSDEL events span the affected reference region and have SVLEN=0 because the reference and alternate alleles differ in both sequence and length. Filters are available for SV type and SV length.

Where a variant falls inside an annotated tandem-repeat region, the detail page also shows the coordinates of that region (TRRBEGIN / TRREND from the source VCF), which can be useful context for repeat-mediated insertions and deletions.

Methods

Oxford Nanopore whole-genome sequencing was performed on 3,622 Icelandic participants enrolled through deCODE genetics. Reads were aligned to GRCh38 and structural variants were called and merged across the cohort following the pipeline described in Beyter et al. (2021), which combined multiple callers and a joint reassessment of candidate variants against the long reads. The high-confidence set released here corresponds to the filtered callset with strong read support and consistent representation across samples.

Data Access

The data can be explored interactively in table format with the Table Browser or the Data Integrator and exported from there to spreadsheet or tab-sep tables. From scripts, the data can be accessed through our API, track=decodeSv.

The annotation is stored as a bigBed file that can be downloaded from our download server as decodeSv.bb. Individual regions or the whole annotation can be obtained with the bigBedToBed utility, available from our utilities page. Example: bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hg38/lrSv/decodeSv.bb -chrom=chr21 -start=0 -end=100000000 stdout.

The original VCF is available from the deCODE genetics LRS_SV_sets GitHub repository.

Credits

Thanks to the deCODE genetics team and the Icelandic study participants for making this dataset publicly available.

References

Beyter D, Ingimundardottir H, Oddsson A, Eggertsson HP, Bjornsson E, Jonsson H, Atlason BA, Kristmundsdottir S, Mehringer S, Hardarson MT et al. Long-read sequencing of 3,622 Icelanders provides insight into the role of structural variants in human diseases and other traits. Nat Genet. 2021 Jun;53(6):779-786. PMID: 33972781