This track shows structural variants (SVs) identified by long-read sequencing of 945 Han Chinese individuals. The dataset contains 111,288 SVs merged across samples using SURVIVOR, including 49,518 deletions, 42,300 insertions, 13,503 duplications, 5,595 inversions, and 372 translocations.
Items are colored by SV type:
Filters are available for SV type, SV length, allele frequency, and number of supporting samples. For insertions, the item is placed at the insertion site with a width of 1 bp. For translocations, only the first breakpoint is shown; the second breakpoint chromosome and position are listed in the item details.
Long-read sequencing was performed on 945 Han Chinese individuals. Structural variants were called per sample and then merged across all samples using SURVIVOR (v1.0.6). The merged VCF was converted to bigBed format for display. Allele frequencies and per-sample support information were extracted from the INFO fields of the merged VCF. The study identified two notable variants: an ancestral deletion in GSDMD associated with bone density and kidney injury risk, and a modern human-specific variant in WWP2 influencing height, body composition, and facial features.
The raw VCF data was obtained from the OMIX repository (accession OED00945268) at the National Genomics Data Center (NGDC), China National Center for Bioinformation.
Thanks to Gong et al. for making their structural variant calls publicly available.
Gong J, Sun H, Wang K, Zhao Y, Huang Y, Chen Q, Qiao H, Gao Y, Zhao J, Ling Y et al. Long-read sequencing of 945 Han individuals identifies structural variants associated with phenotypic diversity and disease susceptibility. Nat Commun. 2025 Feb 10;16(1):1494. PMID: 39929826; PMC: PMC11811171