Danecek, Petr Auton, Adam Abecasis, Goncalo Albers, Cornelis A Banks, Eric DePristo, Mark A Handsaker, Robert E Lunter, Gerton Marth, Gabor T Sherry, Stephen T
...
Published in
Bioinformatics (Oxford, England)
The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was develop...
R, Karchin K, Karplus David Haussler
Published in
Bioinformatics
Motivation: The enormous amount of protein sequence data uncovered by genome research has increased the demand for computer software that can automate the recognition of new proteins. We discuss the relative merits of various automated methods for recognizing G-Protein Coupled Receptors (GPCRs), a superfamily of cell membrane proteins. GPCRs are fo...
Hickey, Glenn Paten, Benedict Earl, Dent Zerbino, Daniel Haussler, David
Published in
Bioinformatics (Oxford, England)
Large multiple genome alignments and inferred ancestral genomes are ideal resources for comparative studies of molecular evolution, and advances in sequencing and computing technology are making them increasingly obtainable. These structures can provide a rich understanding of the genetic relationships between all subsets of species they contain. C...