Explanatory Notes for Progress Table



ProtID   Each protein selected for analysis is given a unique ID that is derived in the following way. The first protein or domain selected from a family of similar amino-acid sequences is identified by P followed by an integer, e.g. P6 (P006 in the table). If only a portion of a protein is selected for analysis, the integer is followed by a lower-case letter that uniquely identifies the protein fragment, e.g. P6a (or P006a) for amino acids 20-108 of P6. Any different fragment from the same protein receives its own letter suffix. Additional proteins selected from a family are given the same P number followed by a dash and successive integers as unique identifiers of different proteins, e.g. P111-1, P111-2, etc. for different family members. Letter suffixes to identify protein fragments are placed at the end, e.g., P111-1a for a fragment of P111-1. In distinguishing among different proteins in the same family, it may sometimes be useful to designate the initial family member with the suffix -0, as in P111-0 (which is equivalent to P111).

aa   When an entire protein is analyzed, a single number gives the number of amino acids in the protein. When a protein fragment is analyzed, a pair of numbers gives the first and last amino acids in the fragment, relative to the whole protein.

Org   A three-letter code identifies the species from which the protein derives.
  Sce Saccharomyces cerevisiae
  Mja Methanococcus jannaschii

Gene   The gene name is linked to a relevant genomic database.

Access#   The accession number is for SwissProt, if available; otherwise, another source

Colors green indicates a successful outcome and the likelihood of further work
  yellow indicates a problem such as a mutant, poor yield, or difficult data
  red indicates a likely block to further work, such as poor expression or insolubility

Clone verified DNA sequencing has verified a correct coding sequence
  cloned a clone has been obtained but not verified by DNA sequencing
  fragment a fragment of the total coding sequence has been cloned and verified
  mutant DNA sequencing showed the clone to be a missense mutant
  indel DNA sequencing found an insertion or deletion in the coding sequence
  failure attempted cloning was unsuccessful
  no entry cloning has not yet been attempted

Express +++ the protein is the dominant band in total cell proteins
  ++ the protein band is at least as intense as the most prominent cell proteins
  + the protein is apparent but not particularly intense relative to cell proteins
  ? the protein may be expressed but has not been confirmed
  0 the protein appears not to be expressed

Soluble soluble expressed protein is largely or entirely soluble under some condition
  partial a fraction of expressed protein appears to be soluble under some condition
  not expressed protein appears to be insoluble under all conditions tested

Purified purified expressed protein has been purified
  low yld expressed protein was purified but the yield was low

Crystal good crystals have been obtained that diffract to better than 3 Å
  likely crystals look promising, by size or initial diffraction test
  micro indications of crystal formation have been seen

Structure PDB ID green indicates structures deposited by NYSGRC;
no color indicates structures deposited by others; the extent of progress indicates where work stopped when we became aware of these structures
Related structure   Entries show significant BLAST hits to a protein in the structure PDB or a protein known to have been selected as a target for structure determination by one or more structural genomics group
  PDB PDB www.rcsb.org/pdb
  A LANL/UCLA Consortium www.doe-mbi.ucla.edu/TB/
  B LBNL Consortium www.strgen.org/
  C CARB/TIGR Consortium http://s2f.umbi.umd.edu/
  E Northeast Structural Genomics Consortium http://www.nesg.org/
  J New Jersey Consortium www.cabm.umdnj.edu/
  P APS Consortium www.mcsg.anl.gov/
  Y New York Structural Genomics Research Consortium http://www.nysgrc.org

Summary of Progress (at head of table):
  Cloned sum of all entries except indels and failures
  Express sum of all entries designated +, ++, or +++
  Soluble sum of all entries designated soluble or partial
  Purified sum of all entries, both purified and low yld
  Good crystals sum of all entries designated good
  Structure defined NYSGRC structures with a PDB ID (green) or non-NYSGRC structures with a PDB ID (white)

Return to Progress Table.