| ProtID | Each protein selected for analysis is given a unique ID that is derived in the following way. The first protein or domain selected from a family of similar amino-acid sequences is identified by P followed by an integer, e.g. P6 (P006 in the table). If only a portion of a protein is selected for analysis, the integer is followed by a lower-case letter that uniquely identifies the protein fragment, e.g. P6a (or P006a) for amino acids 20-108 of P6. Any different fragment from the same protein receives its own letter suffix. Additional proteins selected from a family are given the same P number followed by a dash and successive integers as unique identifiers of different proteins, e.g. P111-1, P111-2, etc. for different family members. Letter suffixes to identify protein fragments are placed at the end, e.g., P111-1a for a fragment of P111-1. In distinguishing among different proteins in the same family, it may sometimes be useful to designate the initial family member with the suffix -0, as in P111-0 (which is equivalent to P111). |
|
| aa | When an entire protein is analyzed, a single number gives the number of amino acids in the protein. When a protein fragment is analyzed, a pair of numbers gives the first and last amino acids in the fragment, relative to the whole protein. |
|
| Org | A three-letter code identifies the species from which the protein derives. |
|
| Sce | Saccharomyces cerevisiae |
|
| Mja | Methanococcus jannaschii |
|
| Gene | The gene name is linked to a relevant genomic database. |
|
| Access# | The accession number is for SwissProt, if available; otherwise, another source |
|
| Colors | green | indicates a successful outcome and the likelihood of further work |
| yellow | indicates a problem such as a mutant, poor yield, or difficult data |
|
| red | indicates a likely block to further work, such as poor expression or insolubility |
|
| Clone | verified | DNA sequencing has verified a correct coding sequence |
| cloned | a clone has been obtained but not verified by DNA sequencing |
|
| fragment | a fragment of the total coding sequence has been cloned and verified |
|
| mutant | DNA sequencing showed the clone to be a missense mutant |
|
| indel | DNA sequencing found an insertion or deletion in the coding sequence |
|
| failure | attempted cloning was unsuccessful |
|
| no entry | cloning has not yet been attempted |
|
| Express | +++ | the protein is the dominant band in total cell proteins |
| ++ | the protein band is at least as intense as the most prominent cell proteins |
|
| + | the protein is apparent but not particularly intense relative to cell proteins |
|
| ? | the protein may be expressed but has not been confirmed |
|
| 0 | the protein appears not to be expressed |
|
| Soluble | soluble | expressed protein is largely or entirely soluble under some condition |
| partial | a fraction of expressed protein appears to be soluble under some condition |
|
| not | expressed protein appears to be insoluble under all conditions tested |
|
| Purified | purified | expressed protein has been purified |
| low yld | expressed protein was purified but the yield was low |
|
| Crystal | good | crystals have been obtained that diffract to better than 3 Å |
| likely | crystals look promising, by size or initial diffraction test |
|
| micro | indications of crystal formation have been seen |
|
| Structure | PDB ID | green indicates structures deposited by NYSGRC; no color indicates structures deposited by others; the extent of progress indicates where work stopped when we became aware of these structures |
| Related structure | Entries show significant BLAST hits to a protein in the structure PDB or a protein known to have been selected as a target for structure determination by one or more structural genomics group |
|
| PDB | PDB www.rcsb.org/pdb |
|
| A | LANL/UCLA Consortium www.doe-mbi.ucla.edu/TB/ |
|
| B | LBNL Consortium www.strgen.org/ |
|
| C | CARB/TIGR Consortium http://s2f.umbi.umd.edu/ |
|
| E | Northeast Structural Genomics Consortium http://www.nesg.org/ |
|
| J | New Jersey Consortium www.cabm.umdnj.edu/ |
|
| P | APS Consortium www.mcsg.anl.gov/ |
|
| Y | New York Structural Genomics Research Consortium http://www.nysgrc.org |
|
Summary of Progress (at head of table): |
||
| Cloned | sum of all entries except indels and failures |
|
| Express | sum of all entries designated +, ++, or +++ |
|
| Soluble | sum of all entries designated soluble or partial |
|
| Purified | sum of all entries, both purified and low yld |
|
| Good crystals | sum of all entries designated good |
|
| Structure defined | NYSGRC structures with a PDB ID (green) or non-NYSGRC structures with a PDB ID (white) |
|