The October 2005 Drosophila persimilis genome assembly, which was produced by the Broad Institute at MIT and Harvard, can be downloaded here. To review a list of the assembly's largest 1000 supercontigs ranked by size, click here.

Sample position queries

A genome position can be specified by a supercontig coordinate range, the accession number of an mRNA or RefSeq entry, a gene name, or keywords from the GenBank description of an mRNA. The following list shows examples of valid position queries for the D. persimilis genome. Note that some position queries (e.g. "huntingtin") may return matches to the mRNA records of other species. In these cases, the mRNAs are mapped to their homologs in D. persimilis. See the User's Guide for more information.

Request:
   Genome Browser Response:
super_0 Displays all of super_0
super_0:1-50,000 Displays first 50,000 bases of super_0

CG3727-RA Displays region of genome aligning to FlyBase accession CG3727-RA
dock Displays list of sequences aligning to genome region associated with gene dock (a D. melanogaster gene)
NM_057677 Displays region of genome aligning to RefSeq accession with identifier NM_057677 (a D. melanogaster RefSeq entry)

huntingtin Lists RefSeq and protein alignments associated with huntingtin protein (other species)


Assembly details

The Broad Institute produced the 28 October, 2005 assembly using the Arachne assembler. The assembly contains 12,838 supercontigs ranging in size from 204 bases to 11,822,988 bases, with a mean size of 14673.2 and median of 1671.

Bulk downloads of the sequence and annotation data are available via the Genome Browser FTP server or the Downloads page. The droPer1 annotation tracks were generated by UCSC and collaborators worldwide. See the Credits page for a detailed list of the organizations and individuals who contributed to this release.


GenBank Pipeline Details

For the purposes of the GenBank alignment pipeline, this assembly is considered to be: low-coverage.