Completeness in structural genomics.

Printer-friendly versionPrinter-friendly versionPDF versionPDF version
TitleCompleteness in structural genomics.
Publication TypeJournal Article
Year of Publication2001
AuthorsVitkup, D, Melamud, E, Moult, J, Sander, C
JournalNat Struct Biol
Date Published2001 Jun
KeywordsAlgorithms, Animals, Chromosomes, Computational Biology, Databases as Topic, Genome, Genomics, Humans, Models, Molecular, Protein Structure, Tertiary, Proteome, Research Design, Sensitivity and Specificity, Sequence Alignment, Sequence Homology, Amino Acid

Structural genomics has the goal of obtaining useful, three-dimensional models of all proteins by a combination of experimental structure determination and comparative model building. We evaluate different strategies for optimizing information return on effort. The strategy that maximizes structural coverage requires about seven times fewer structure determinations compared with the strategy in which targets are selected at random. With a choice of reasonable model quality and the goal of 90% coverage, we extrapolate the estimate of the total effort of structural genomics. It would take approximately 16,000 carefully selected structure determinations to construct useful atomic models for the vast majority of all proteins. In practice, unless there is global coordination of target selection, the total effort will likely increase by a factor of three. The task can be accomplished within a decade provided that selection of targets is highly coordinated and significant funding is available.

Alternate JournalNat. Struct. Biol.
PubMed ID11373627