Description

The Simons Foundation Autism Research Initiative (SFARI) recruited a large cohort of families with autistic children who provided DNA samples and phenotypes. 54,558 families, parents and their children were sequenced, a total of 142,357 individuals with whole-exome (WES) and 12,519 with whole-genome sequencing (WGS). The data contains 32,559 trios and 8,895 quads (one sibling without autism), and 824 twins.

The same frequencies shown here are also available publicly on the SFARI Genome Browser. See (SPARK et al, Neuron 2018) for details.

Data Access

The data can be explored interactively with the Table Browser or the Data Integrator. For programmatic access, our REST API can be used; the track name is sfariSparkExomes. For bulk download, the VCF file can be obtained from our download server.

Allele frequencies can also be displayed on the SFARI Genome Browser. Full CRAMs and VCFs with genotypes are available from SFARI Base. They require a data access request, which is usually reviewed quickly. More information is available in the SPARK Welcome Packet.

Methods

The genome browser track project was approved by the Simons Foundation under request number 14584.1. WES and WGS data were downloaded from SFARI Base. pVCFs were downloaded, anonymized with a script using bcftools and its "fill-tags" plugin and normalized. There was no minimum allele frequency cutoff.

The methods are documented as follows by SFARI:

We provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track. For some tracks, python scripts were necessary and are also available from GitHub.

References

SPARK Consortium. Electronic address: pfeliciano@simonsfoundation.org, SPARK Consortium. SPARK: A US Cohort of 50,000 Families to Accelerate Autism Research. Neuron. 2018 Feb 7;97(3):488-493. PMID: 29420931; PMC: PMC7444276