Long-read sequence assembly of the gorilla genome

David Gordon,* John Huddleston,* Mark J. P. Chaisson,* Christopher M. Hill,* Zev N. Kronenberg,* Katherine M. Munson, Maika Malig, Archana Raja, Ian Fiddes, LaDeana W. Hillier, Christopher Dunn, Carl Baker, Joel Armstrong, Mark Diekhans, Benedict Paten, Jay Shendure, Richard K. Wilson, David Haussler, Chen-Shan Chin, Evan E. Eichler†

† Corresponding author. E-mail: eee@gs.washington.edu
* These authors contributed equally to this work.

Assembly data

The initial Susie genome assembly are available through the European Nucleotide Archive (ENA). While additional versions of the assembly are processing at the ENA, sequence data are hosted here.

Assembly Sequence Annotations
Initial gorilla assembly (Susie3a) GCA_900006655.1 Genes (BED)
Gorilla assembly with misassembly correction (Susie3b) FASTA (gz, fai, gzi)
Gorilla assembly with Illumina error correction (Susie3.2a) FASTA (gz, fai, gzi) Genes (BED)
Gorilla assembly with misassembly and Illumina error correction (Susie3.2b) FASTA (gz, fai, gzi)
Gorilla assembly with Illumina error corrected contigs assigned to chromosomes FASTA (gz, fai, gzi)

Sequence data

All sequence data including whole-genome sequence (WGS) and clones are available through the European Nucleotide Archive (ENA) and GenBank.

Data set ENA accession
PacBio whole-genome sequence (78.4-fold) PRJEB10880
Gorilla clone sequences PRJEB10880

Additional annotations

Software