This section contains records used to describe the experiment and the biological macromolecules present in the entry: HEADER, OBSLTE, TITLE, SPLIT, CAVEAT, COMPND, SOURCE, KEYWDS, EXPDTA, AUTHOR, REVDAT, SPRSDE, JRNL, and REMARK records.
Overview
The HEADER record uniquely identifies a PDB entry through the idCode field. This record also provides a classification for the entry. Finally, it contains the date when the coordinates were deposited to the PDB archive.
Record Format
COLUMNS       DATA  TYPE     FIELD             DEFINITION
------------------------------------------------------------------------------------
 1 -  6       Record name    "HEADER"
11 - 50       String(40)     classification    Classifies the molecule(s).
51 - 59       Date           depDate           Deposition date. This is the date the
                                               coordinates  were received at the PDB.
63 - 66       IDcode         idCode            This identifier is unique within the PDB.
Details
Verification/Validation/Value Authority Control
The verification program checks that the deposition date is a legitimate date and that the ID code is well-formed.
PDB coordinate entry ID codes do not begin with 0. “No coordinates”, or NOC files, given as 0xxx codes, contained no structural information and were bibliographic only. These entries were subsequently removed from PDB archive.
Relationships to Other Record Types
The classification found in HEADER also appears in KEYWDS, unabbreviated and in no strict order.
Example
         1         2         3         4         5         6         7         8
12345678901234567890123456789012345678901234567890123456789012345678901234567890
HEADER    PHOTOSYNTHESIS                          28-MAR-07   2UXK              
HEADER    TRANSFERASE/TRANSFERASE INHIBITOR       17-SEP-04   1XH6              
HEADER    MEMBRANE PROTEIN, TRANSPORT PROTEIN     20-JUL-06   2HRT              
Overview
OBSLTE appears in entries that have been removed from public distribution.
This record acts as a flag in an entry that has been removed (“obsoleted”) from the PDB's full release. It indicates which, if any, new entries have replaced the entry that was obsoleted. The format allows for the case of multiple new entries replacing one existing entry.
Record Format
COLUMNS DATA TYPE FIELD DEFINITION --------------------------------------------------------------------------------------- 1 - 6 Record name "OBSLTE" 9 - 10 Continuation continuation Allows concatenation of multiple records 12 - 20 Date repDate Date that this entry was replaced. 22 - 25 IDcode idCode ID code of this entry. 32 - 35 IDcode rIdCode ID code of entry that replaced this one. 37 - 40 IDcode rIdCode ID code of entry that replaced this one. 42 - 45 IDcode rIdCode ID code of entry that replaced this one. 47 - 50 IDcode rIdCode ID code of entry that replaced this one. 52 - 55 IDcode rIdCode ID code of entry that replaced this one. 57 - 60 IDcode rIdCode ID code of entry that replaced this one. 62 - 65 IDcode rIdCode ID code of entry that replaced this one. 67 - 70 IDcode rIdCode ID code of entry that replaced this one. 72 - 75 IDcode rIdCode ID code of entry that replaced this one.
Details
Verification/Validation/Value Authority Control
wwPDB staff adds this record at the time an entry is removed from release.
Relationships to Other Record Types
None.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 OBSLTE 31-JAN-94 1MBP 2MBP
Overview
The TITLE record contains a  title for the experiment or analysis that is represented in the entry. 
  It should identify an entry in  the same way that a citation title identifies a publication. 
Record Format
COLUMNS DATA TYPE FIELD DEFINITION ---------------------------------------------------------------------------------- 1 - 6 Record name "TITLE " 9 - 10 Continuation continuation Allows concatenation of multiple records. 11 - 80 String title Title of the experiment.
Details
- Experiment type. - Description of the mutation. - The fact that only alpha carbon coordinates have been provided in the entry.
Verification/Validation/Value Authority Control
This record is free text so no  verification of format is required. The title is supplied by the depositor, but  staff may exercise editorial judgment in consultation with depositors in 
  assigning the title. 
Relationships to Other Record Types
COMPND, SOURCE, EXPDTA, and REMARKs provide information that may also be found in TITLE. You may think of the title as describing the experiment, and the compound record as describing the molecule(s).
Examples
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 TITLE RHIZOPUSPEPSIN COMPLEXED WITH REDUCED PEPTIDE INHIBITOR TITLE STRUCTURE OF THE TRANSFORMED MONOCLINIC LYSOZYME BY TITLE 2 CONTROLLED DEHYDRATION TITLE NMR STUDY OF OXIDIZED THIOREDOXIN MUTANT (C62A,C69A,C73A) TITLE 2 MINIMIZED AVERAGE STRUCTURE
Overview
The SPLIT record is used in instances where a specific entry composes part of a large macromolecular complex. It will identify the PDB entries that are required to reconstitute a complete complex.
Record Format
COLUMNS DATA TYPE FIELD DEFINITION ---------------------------------------------------------------------------------- 1 - 6 Record name "SPLIT " 9 - 10 Continuation continuation Allows concatenation of multiple records. 12 - 15 IDcode idCode ID code of related entry. 17 - 20 IDcode idCode ID code of related entry. 22 - 25 IDcode idCode ID code of related entry. 27 – 30 IDcode idCode ID code of related entry. 32 - 35 IDcode idCode ID code of related entry. 37 - 40 IDcode idCode ID code of related entry. 42 - 45 IDcode idCode ID code of related entry. 47 - 50 IDcode idCode ID code of related entry. 52 - 55 IDcode idCode ID code of related entry. 57 - 60 IDcode idCode ID code of related entry. 62 - 65 IDcode idCode ID code of related entry. 67 - 70 IDcode idCode ID code of related entry. 72 - 75 IDcode idCode ID code of related entry. 77 - 80 IDcode idCode ID code of related entry.
Details
Verification/Validation/Value  Authority Control 
  This record will be generated at the  time of processing the component PDB files of the large macromolecular complex  when all complex constituents are deposited.
Relationships to Other Record Types
REMARK 350 will contain an amended statement to reflect the entire complex.
Examples
         1         2         3         4         5         6         7         8
12345678901234567890123456789012345678901234567890123456789012345678901234567890
SPLIT      1VOQ 1VOR 1VOS 1VOU 1VOV 1VOW 1VOX 1VOY 1VP0 1VOZ
Overview
CAVEAT warns of errors and unresolved issues in the entry. Use caution when using an entry containing this record.
Record Format
COLUMNS DATA TYPE FIELD DEFINITION --------------------------------------------------------------------------------------- 1 - 6 Record name "CAVEAT" 9 - 10 Continuation continuation Allows concatenation of multiple records. 12 - 15 IDcode idCode PDB ID code of this entry. 20 - 79 String comment Free text giving the reason for the CAVEAT.
Details
Verification/Validation/Value Authority Control
CAVEAT will be added to entries known to be incorrect.
Overview
The COMPND record describes the macromolecular contents of an entry. Some cases where the entry contains a standalone drug or inhibitor, the name of the non-polymeric molecule will appear in this record. Each macromolecule found in the entry is described by a set of token: value pairs, and is referred to as a COMPND record component. Since the concept of a molecule is difficult to specify exactly, staff may exercise editorial judgment in consultation with depositors in assigning these names.
Record Format
COLUMNS       DATA TYPE       FIELD         DEFINITION
----------------------------------------------------------------------------------
 1 -  6       Record name     "COMPND"  
 8 - 10       Continuation    continuation  Allows concatenation of multiple records.
11 - 80       Specification   compound      Description of the molecular components.
              list  
Details
TOKEN                  VALUE DEFINITION
-------------------------------------------------------------------------
MOL_ID                 Numbers each component; also used in  SOURCE to associate
                       the information.
MOLECULE               Name of the macromolecule.
CHAIN                  Comma-separated list of chain  identifier(s).
FRAGMENT               Specifies a domain or region of the  molecule.
SYNONYM                Comma-separated list of synonyms for  the MOLECULE.
EC                     The Enzyme Commission number associated  with the molecule.
                       If there is more than one EC number,  they are presented
                       as a comma-separated list.
ENGINEERED             Indicates that the molecule was  produced using
                       recombinant technology or by purely  chemical synthesis.
MUTATION               Indicates if there is a mutation.
OTHER_DETAILS          Additional comments.
Verification/Validation/Value Authority Control
CHAIN must match the chain identifiers(s) of the molecule(s). EC numbers are also checked.
Relationships to Other Record Types
In the case of mutations, the SEQADV records will present differences from the reference molecule. REMARK records may further describe the contents of the entry. Also see verification above.
Examples
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 COMPND MOL_ID: 1; COMPND 2 MOLECULE: HEMOGLOBIN ALPHA CHAIN; COMPND 3 CHAIN: A, C; COMPND 4 SYNONYM: DEOXYHEMOGLOBIN ALPHA CHAIN; COMPND 5 ENGINEERED: YES; COMPND 6 MUTATION: YES; COMPND 7 MOL_ID: 2; COMPND 8 MOLECULE: HEMOGLOBIN BETA CHAIN; COMPND 9 CHAIN: B, D; COMPND 10 SYNONYM: DEOXYHEMOGLOBIN BETA CHAIN; COMPND 11 ENGINEERED: YES; COMPND 12 MUTATION: YES COMPND MOL_ID: 1; COMPND 2 MOLECULE: COWPEA CHLOROTIC MOTTLE VIRUS; COMPND 3 CHAIN: A, B, C; COMPND 4 SYNONYM: CCMV; COMPND 5 MOL_ID: 2; COMPND 6 MOLECULE: RNA (5'-(*AP*UP*AP*U)-3'); COMPND 7 CHAIN: D, F; COMPND 8 ENGINEERED: YES; COMPND 9 MOL_ID: 3; COMPND 10 MOLECULE: RNA (5'-(*AP*U)-3'); COMPND 11 CHAIN: E; COMPND 12 ENGINEERED: YES COMPND MOL_ID: 1; COMPND 2 MOLECULE: HEVAMINE A; COMPND 3 CHAIN: A; COMPND 4 EC: 3.2.1.14, 3.2.1.17; COMPND 5 OTHER_DETAILS: PLANT ENDOCHITINASE/LYSOZYME
Overview
The SOURCE record specifies the biological and/or chemical source of each biological molecule in the entry. Some cases where the entry contains a standalone drug or inhibitor, the source information of this molecule will appear in this record. Sources are described by both the common name and the scientific name, e.g., genus and species. Strain and/or cell-line for immortalized cells are given when they help to uniquely identify the biological entity studied.
Record Format
COLUMNS      DATA  TYPE     FIELD          DEFINITION                       
--------------------------------------------------------------------------------------
 1 -  6      Record name    "SOURCE"      
 8 - 10      Continuation   continuation   Allows concatenation of multiple records.
11 - 79      Specification  srcName        Identifies the source of the
             List                          macromolecule in a  token: value format.
Details
TOKEN                                VALUE  DEFINITION                       
--------------------------------------------------------------------------------------
MOL_ID                               Numbers each  molecule. Same as appears in COMPND.
SYNTHETIC                            Indicates a  chemically-synthesized source. 
FRAGMENT                             A domain or  fragment of the molecule may be
                                     specified.                                 
ORGANISM_SCIENTIFIC                  Scientific name of the  organism.           
ORGANISM_COMMON                      Common name of the  organism.
ORGANISM_TAXID                       NCBI Taxonomy ID number  of the organism.   
STRAIN                               Identifies the  strain.                     
VARIANT                              Identifies the  variant.                    
CELL_LINE                            The specific line of  cells used in the experiment.
ATCC                                 American Type  Culture Collection tissue    
                                     culture  number.                            
ORGAN                                Organized group of  tissues that carries on 
                                     a specialized function.                    
TISSUE                               Organized group  of cells with a common    
                                     function and  structure.                    
CELL                                 Identifies the  particular cell type.       
ORGANELLE                            Organized structure  within a cell.         
SECRETION                            Identifies the secretion, such as  saliva, urine,
                                     or venom,  from which the molecule was isolated.
CELLULAR_LOCATION                    Identifies the location  inside/outside the cell.
PLASMID                              Identifies the plasmid  containing the gene.
GENE                                 Identifies the  gene.                       
EXPRESSION_SYSTEM                    Scientific name of the organism in  which the
                                     molecule was expressed.
EXPRESSION_SYSTEM_COMMON             Common name of the organism in  which the molecule
                                     was  expressed.
EXPRESSION_SYSTEM_TAXID              NCBI Taxonomy ID of the organism  used as the
                                     expression  system.
EXPRESSION_SYSTEM_STRAIN             Strain of the organism in which  the molecule
                                     was  expressed.                             
EXPRESSION_SYSTEM_VARIANT            Variant of the organism used as the
                                     expression  system.
EXPRESSION_SYSTEM_CELL_LINE          The specific line of cells used as  the
                                     expression  system.
EXPRESSION_SYSTEM_ATCC_NUMBER        Identifies the ATCC number of the  expression system.
EXPRESSION_SYSTEM_ORGAN              Specific organ which expressed  the molecule.
EXPRESSION_SYSTEM_TISSUE             Specific tissue which expressed  the molecule.
EXPRESSION_SYSTEM_CELL               Specific cell type which  expressed the molecule.
EXPRESSION_SYSTEM_ORGANELLE          Specific organelle which expressed  the molecule.
EXPRESSION_SYSTEM_CELLULAR_LOCATION  Identifies the location inside or outside
                                     the cell  which expressed the molecule.
EXPRESSION_SYSTEM_VECTOR_TYPE        Identifies the type of vector used,  i.e.,
                                     plasmid,  virus, or cosmid.
EXPRESSION_SYSTEM_VECTOR             Identifies the vector used.
EXPRESSION_SYSTEM_PLASMID            Plasmid used in the recombinant experiment.
EXPRESSION_SYSTEM_GENE               Name of the gene used in  recombinant experiment.
OTHER_DETAILS                        Used to present  information on the source which
                                     is not  given elsewhere.             
- When necessary to fully describe hybrid molecules, tokens may appear more than once for a given MOL_ID. - All relevant token: value pairs that taken together fully describe each fragment are grouped following the appropriate FRAGMENT. - Descriptors relative to the full system appear before the FRAGMENT (see third example below).
- The expression system is described. - The organism and cell location given are for the source of the gene used in the cloning experiment. - Transgenic organisms, such as mouse producing human proteins, are treated as expression systems.
Verification/Validation/Value Authority Control
The biological source is compared to that found in the sequence databases. The Tax ID is identified and the corresponding scientific and common names for the organism is matched to a standard taxonomy database (such as NCBI).
Relationships to Other Record Types
Each macromolecule listed in COMPND must have a corresponding source.
Examples
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: AVIAN SARCOMA VIRUS; SOURCE 3 ORGANISM_TAXID: 11876 SOURCE 4 STRAIN: SCHMIDT-RUPPIN B; SOURCE 5 EXPRESSION_SYSTEM: ESCHERICHIA COLI SOURCE 6 EXPRESSION_SYSTEM_TAXID: 562 SOURCE 7 EXPRESSION_SYSTEM_PLASMID: PRC23IN SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: GALLUS GALLUS; SOURCE 3 ORGANISM_COMMON: CHICKEN; SOURCE 3 ORGANISM_TAXID: 9031 SOURCE 4 ORGAN: HEART; SOURCE 5 TISSUE: MUSCLE
For a Chimera protein:
SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: MUS MUSCULUS, HOMO SAPIENS; SOURCE 3 ORGANISM_COMMON: MOUSE, HUMAN; SOURCE 3 ORGANISM_TAXID: 10090, 9606 SOURCE 5 EXPRESSION_SYSTEM: ESCHERICHIA COLI; SOURCE 6 EXPRESSION_SYSTEM_TAXID: 344601 SOURCE 6 EXPRESSION_SYSTEM_STRAIN: B171; SOURCE 7 EXPRESSION_SYSTEM_VECTOR_TYPE: PLASMID; SOURCE 8 EXPRESSION_SYSTEM_PLASMID: P4XH-M13;
Overview
The KEYWDS record contains a set of terms relevant to the entry. Terms in the KEYWDS record provide a simple means of categorizing entries and may be used to generate index files. This record addresses some of the limitations found in the classification field of the HEADER record. It provides the opportunity to add further annotation to the entry in a concise and computer-searchable fashion.
Record Format
COLUMNS       DATA  TYPE     FIELD         DEFINITION 
---------------------------------------------------------------------------------
 1 -  6       Record name    "KEYWDS" 
 9 - 10       Continuation   continuation  Allows concatenation of records if necessary.
11 - 79       List           keywds        Comma-separated list of keywords relevant
                                           to the entry.      
Details
- Functional classification. - Metabolic role. - Known biological or chemical activity. - Structural classification.
Verification/Validation/Value Authority Control
Terms used in the KEYWDS record are subject to scientific and editorial review. A list of terms, definitions, and synonyms will be maintained by the wwPDB. Every attempt will be made to provide some level of consistency with keywords used in other biological databases.
Relationships to Other Record Types
HEADER records contain a classification term which must also appear in KEYWDS. Scientific judgment will dictate when terms used in one entry to describe a molecule should be included in other entries with the same or similar molecules.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 KEYWDS LYASE, TRICARBOXYLIC ACID CYCLE, MITOCHONDRION, OXIDATIVE KEYWDS 2 METABOLISM
Overview
The EXPDTA record presents information about the experiment.
The EXPDTA record identifies the experimental technique used. This may refer to the type of radiation and sample, or include the spectroscopic or modeling technique. Permitted values include:
    X-RAY  DIFFRACTION
    FIBER  DIFFRACTION
    NEUTRON  DIFFRACTION
    ELECTRON  CRYSTALLOGRAPHY
    ELECTRON  MICROSCOPY
    SOLID-STATE  NMR
    SOLUTION  NMR
    SOLUTION  SCATTERING
*Note:Since October 15, 2006, theoretical models are no longer accepted  for deposition.  Any theoretical models  deposited prior to this date are archived at https://files.wwpdb.org/pub/pdb/data/structures/models.  
    Please see the documentation from previous versions for the  related file format description. 
Record Format
COLUMNS       DATA TYPE      FIELD         DEFINITION   
------------------------------------------------------------------------------------
 1 -  6       Record name    "EXPDTA"  
 9 - 10       Continuation   continuation  Allows concatenation of multiple records.
11 - 79       SList          technique     The experimental technique(s) with 
                                           optional comment describing the
                                           sample or experiment.
Details
Verification/Validation/Value Authority Control
The verification program checks that the EXPDTA record appears in the entry and that the technique matches one of the allowed values. It also checks that the relevant standard REMARK is added, as in the cases of NMR or electron microscopy studies, that the appropriate CRYST1 and SCALE values are used.
Relationships to Other Record Types
If the experiment is an NMR or electron microscopy study, this may be stated in the TITLE, and the appropriate EXPDTA and REMARK records should appear. Specific details of the data collection and experiment appear in the REMARKs.
In the case of a polycrystalline fiber diffraction study, CRYST1 and SCALE contain the normal unit cell data.
Examples
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 EXPDTA X-RAY DIFFRACTION EXPDTA NEUTRON DIFFRACTION; X-RAY DIFFRACTION EXPDTA SOLUTION NMR EXPDTA ELECTRON MICROSCOPY
Overview
The NUMMDL record indicates total number of models in a PDB entry.
Record Format
COLUMNS DATA TYPE FIELD DEFINITION ------------------------------------------------------------------------------------ 1 - 6 Record name "NUMMDL" 11 - 14 Integer modelNumber Number of models.
Details
Verification/Validation/Value Authority Control
The verification program checks that the modelNumber field is correctly formatted.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 NUMMDL 20
Overview
The MDLTYP record contains additional annotation pertinent to the coordinates presented in the entry.
Record Format
COLUMNS      DATA TYPE      FIELD         DEFINITION                          
------------------------------------------------------------------------------------
 1 -  6      Record name    "MDLTYP"                                            
 9 - 10      Continuation   continuation  Allows concatenation of multiple records.
11 - 80      SList          comment       Free Text providing  additional structural
                                          annotation.   
Details
Verification/Validation/Value Authority Control
The chain_identifiers described in this record must be present in the COMPND, SEQRES and the coordinate section of the entry.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 MDLTYP MINIMIZED AVERAGE MDLTYP CA ATOMS ONLY, CHAIN A, B, C, D, E, F, G, H, I, J, K ; P ATOMS ONLY, MDLTYP 2 CHAIN X, Y, Z MDLTYP MINIMIZED AVERAGE ; CA ATOMS ONLY, CHAIN A, B
Overview
The AUTHOR record contains the names of the people responsible for the contents of the entry.
Record Format
COLUMNS      DATA  TYPE      FIELD         DEFINITION                          
------------------------------------------------------------------------------------
 1 -  6      Record name     "AUTHOR"                                            
 9 - 10      Continuation    continuation  Allows concatenation of multiple records.
11 - 79      List            authorList    List of the author names, separated   
                                           by commas.
Details
- First and middle names are indicated by initials, each followed by a period, and precede the surname. - Only the surname (family or last name) of the author is given in full. - Hyphens can be used if they are part of the author's name. - Apostrophes are allowed in surnames. - Umlauts and other character modifiers are not given.
- There is no space after any initial and its following period. - Blank spaces are used in a name only if properly part of the surname (e.g., J.VAN DORN), or between surname and Jr., II, or III - Abbreviations that are part of a surname, such as Jr., St. or Ste., are followed by a period and a space before the next part of the surname.
- Group names used for one or all of the authors should be spelled out in full. - The name of the larger group comes before the name of a subdivision, e.g., University of Somewhere, Department of Chemistry.
- Line breaks between multiple lines in the authorList occur only after a comma. - Personal names are not split across two lines.
- Names are given in English if there is an accepted English version; otherwise in the native language, transliterated if necessary.
Verification/Validation/Value Authority Control
The verification program checks that the authorList field is correctly formatted. It does not perform any spelling checks or name verification.
Relationships to Other Record Types
The format of the names in the AUTHOR record is the same as in JRNL and REMARK 1 references.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 AUTHOR M.B.BERRY,B.MEADOR,T.BILDERBACK,P.LIANG,M.GLASER, AUTHOR 2 G.N.PHILLIPS JR.,T.L.ST. STEVENS
Overview
REVDAT records contain a history of the modifications made to an entry since its release.
Record Format
COLUMNS       DATA  TYPE     FIELD         DEFINITION                            
-------------------------------------------------------------------------------------
 1 -  6       Record name    "REVDAT"                                            
 8 - 10       Integer        modNum        Modification number.                  
11 - 12       Continuation   continuation  Allows concatenation of multiple records.
14 - 22       Date           modDate       Date of modification (or release  for  
                                           new entries)  in DD-MMM-YY format. This is
                                           not repeated on continued lines.
24 - 27       IDCode         modId         ID code of this entry. This is not repeated on
                                           continuation lines.   
32            Integer        modType       An integer identifying the type of   
                                           modification. For all  revisions, the
                                           modification type is listed as 1
40 - 45       LString(6)     record        Modification detail.
47 - 52       LString(6)     record        Modification detail.
54 - 59       LString(6)     record        Modification detail.
61 - 66       LString(6)     record        Modification detail.
Details
0 Initial released entry. 1 Other modification.
Verification/Validation/Value Authority Control
The modType must be one of the defined types, and the given record type must be valid. If modType is 0, the modId must match the entry's ID code in the HEADER record.
Relationships to Other Record Types
In the case of a version revision, the current will be specified in REMARK 4.
Template
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 REVDAT 2 15-OCT-99 1ABC 1 REMARK REVDAT 1 09-JAN-89 1ABC 0
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 REVDAT 2 11-MAR-08 2ABC 1 JRNL VERSN REVDAT 1 09-DEC-03 2ABC 0
Overview
The SPRSDE records contain a list of the ID codes of entries that were made obsolete by the given coordinate entry and removed from the PDB release set. One entry may replace many.
It is wwPDB policy that only the  principal investigator of a structure has the authority to obsolete it.
  Record Format 
COLUMNS        DATA TYPE     FIELD         DEFINITION                          
-----------------------------------------------------------------------------------
 1 -  6        Record name   "SPRSDE"                                            
 9 - 10        Continuation  continuation  Allows for multiple ID codes.         
12 - 20        Date          sprsdeDate    Date this entry superseded the listed 
                                           entries. This field is not copied on  
                                           continuations.              
22 - 25        IDcode        idCode        ID code of this entry. This field is  not
                                           copied on continuations.       
32 - 35        IDcode        sIdCode       ID code of a superseded entry.        
37 - 40        IDcode        sIdCode       ID code of a superseded entry.        
42 - 45        IDcode        sIdCode       ID code of a superseded entry.        
47 - 50        IDcode        sIdCode       ID code of a superseded entry.        
52 - 55        IDcode        sIdCode       ID code of a superseded entry.        
57 - 60        IDcode        sIdCode       ID code of a superseded entry.        
62 - 65        IDcode        sIdCode       ID code of a superseded entry.        
67 - 70        IDcode        sIdCode       ID code of a superseded entry.
72 - 75        IDcode        sIdCode       ID code of a superseded entry.        
Details
Verification/Validation/Value Authority Control
wwPDB checks that the superseded entries have actually been removed from release.
Relationships to Other Record Types
The sprsdeDate is usually the date the entry is released, and therefore matches the date in the REVDAT 1 record. The ID code found in the idCode field must be the same as one found in the idCode field of the HEADER record.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 SPRSDE 17-JUL-84 4HHB 1HHB SPRSDE 27-FEB-95 1GDJ 1LH4 2LH4
Overview
The JRNL record contains the primary literature citation that describes the experiment which resulted in the deposited coordinate set. There is at most one JRNL reference per entry. If there is no primary reference, then there is no JRNL reference. Other references are given in REMARK 1.
Record Format
COLUMNS DATA TYPE FIELD DEFINITION ----------------------------------------------------------------------- 1 - 6 Record name "JRNL" 13 - 79 LString text See Details below.
Details
COLUMNS DATA TYPE FIELD DEFINITION ------------------------------------------------------------------------------- 1 - 6 Record name "REMARK" 10 LString(1) "1" 13 - 16 LString(4) "AUTH" Appears on all continuation records. 17 - 18 Continuation continuation Allows a long list of authors. 20 - 79 List authorList List of the authors.
COLUMNS DATA TYPE FIELD DEFINITION ----------------------------------------------------------------------------------- 1 - 6 Record name "REMARK" 10 LString(1) "1" 13 - 16 LString(4) "TITL" Appears on all continuation records. 17 - 18 Continuation continuation Permits long titles. 20 - 79 LString title Title of the article.
COLUMNS DATA TYPE FIELD DEFINITION ----------------------------------------------------------------------------------- 1 - 6 Record name "REMARK" 10 LString(1) "1" 13 - 16 LString(4) "TITL" Appears on all continuation records. 17 - 18 Continuation continuation Permits long titles. 20 - 79 LString title Title of the article.
4a. If the reference has not been published yet, the sub-record type group has the form:
COLUMNS DATA TYPE FIELD DEFINITION -------------------------------------------------------------------------------- 1 - 6 Record name "JRNL " 13 - 16 LString(3) "REF" 20 - 34 LString(15) "TO BE PUBLISHED"
If the publication is a serial (i.e., a journal, an annual, or other non-book or non-monographic item issued in parts and intended to be continued indefinitely), use the abbreviated name of the publication as listed in PubMed with periods.
If the publication is a book, monograph, or other non-serial item, use its full name according to the Anglo-American Cataloguing Rules, 2nd Revised Edition; (AACR2R). (Non-serial items include theses, videos, computer programs, and anything that is complete in one or a finite number of parts.) If there is a sub-title, verifiable in an online catalog, it will be included using the same punctuation as in the source of verification. Preference will be given to verification using cataloging of the Library of Congress, the National Library of Medicine, and the British Library, in that order.
If a book is part of a monographic series: the full name of the book (according to the AACR2R) is listed first, followed by the name of the series in which it was published. The series information is given within parentheses and the series name is preceded by "IN:" and a space. The series name should be listed in full unless the series has an accepted ISO abbreviation. If applicable, the series name should be followed, after a comma and a space, by a volume (V.) and/or number (NO.) and/or part (PT.) indicator and its number and/or letter in the series.
If a reference is in a supplement to the volume listed, or if information about a "part" is needed to distinguish multiple parts with the same page numbering, such information should be put in the REF sub-record.
A supplement indication should follow the name of the publication and should be preceded by a comma and a space. Supplement should be abbreviated as "SUPPL." If there is a supplement number or letter, it should follow "SUPPL." without an intervening space. A part indication should also follow the name of the publication and be preceded by a comma and a space. A part should be abbreviated as "PT.", and the number or letter should follow without an intervening space.
If there is both a supplement and a part, their order should reflect the order printed on the work itself.
If a book has a report designation, the report information should follow the title and precede series information. The name and number of the report is given in parentheses, and the name is preceded by "REPORT:" and a space.
The name of the publication is reconstructed by removing any trailing blanks in the pubName field, and concatenating all of the pubName fields from the continuation lines with an intervening space. There are two conditions where no intervening space is added between lines: when the pubName field on a line ends with a hyphen or a period, or when the line ends with a hyphen (-). When the line ends with a period (.), add a space if this is the only period in the entire pubName field; do not add a space if there are two or more periods throughout the pubName field, excluding any periods after the designations "SUPPL", "V", "NO", or "PT".
The REF sub-record type group also contains information about volume, page, and year when applicable.
In the case of a monograph with multiple volumes which is also in a numbered series, the number in the volume field represents the number of the book, not the series. (The volume number of the series is in parentheses with the name of the series, as described above under publication name.)
COLUMNS       DATA  TYPE     FIELD              DEFINITION
---------------------------------------------------------------------------------------
 1 -  6       Record name    "JRNL  "
13 - 16       LString(3)     "REF "
17 - 18       Continuation   continuation       Allows long publication names.
20 - 47       LString        pubName            Name  of the publication including section
                                                or series designation. This is the only
                                                field of this sub-record which may be
                                                continued on  successive sub-records.
50 - 51       LString(2)     "V."               Appears in the first sub-record only,
                                                and  only if column 55 is non-blank.
52 - 55       String         volume             Right-justified blank-filled volume
                                                information; appears in the first
                                                sub-record only.
57 - 61       String         page               First page of the article; appears in
                                                the first sub-record only.
63 - 66       Integer        year               Year of publication; first sub-record only.
Give the place of publication. If the name of the country, state, province, etc. is considered necessary to distinguish the place of publication from others of the same name, or for identification, then follow the city with a comma, a space, and the name of the larger geographic area.
If there is more than one place of publication, only the first listed will be used. If an online catalog record is used to verify the item, the first place listed there will be used, omitting any brackets. Preference will be given to the cataloging done by the Library of Congress, the National Library of Medicine, and the British Library, in that order.
Give the name of the publisher in the shortest form in which it can be understood and identified internationally, according to AACR2R rule 1.4D.
If there is more than one publisher listed in the publication, only the first will be used in the PDB file. If an online catalog record is used to verify the item, the first place listed there will be used for the name of the publisher. Preference will be given to the cataloging of the Library of Congress, the National Library of Medicine, and the British Library, in that order.
Theses are presented in the PUBL record if the degree has been granted and the thesis made available for public consultation by the degree-granting institution. The name of the degree-granting institution (the issuing agency) is followed by a space and "(THESIS)".
The PUBL sub-record type can be reconstructed by removing all trailing blanks in the pub field and concatenating all of the pub fields from the continuation lines with an intervening space. Continued lines do not begin with a space.
COLUMNS       DATA  TYPE     FIELD              DEFINITION
--------------------------------------------------------------------------------------
 1 -  6       Record name    "JRNL  "
13 - 16       LString(4)     "PUBL"
17 - 18       Continuation   continuation       Allows long publisher and place names.
20 - 70       LString        pub                City  of publication and name of the
                                                publisher/institution.
6a. This form of the REFN sub-record type group is used if the citation has not been published.
COLUMNS DATA TYPE FIELD DEFINITION -------------------------------------------------------------------------------- 1 - 6 Record name "JRNL " 13 - 16 LString(4) "REFN"
6b. This form of the REFN sub-record type group is used if the citation has been published.
COLUMNS       DATA TYPE      FIELD              DEFINITION
-------------------------------------------------------------------------------
 1 -  6       Record name    "JRNL  "
13 - 16       LString(4)     "REFN"
36 - 39       LString(4)     "ISSN" or          International Standard Serial Number or
                             "ESSN"             Electronic Standard Serial Number.
41 - 65       LString        issn               ISSN number (final digit may be a
                                                letter and may contain one or
                                                more dashes).
COLUMNS       DATA  TYPE     FIELD              DEFINITION
--------------------------------------------------------------------------------
 1 -  6       Record  name   "JRNL  "
13 - 16       LString(4)     "PMID"
20 – 79       Integer        continuation       unique PubMed identifier number assigned to
                                                the publication  describing the experiment.
                                                Allows  for a long PubMed ID number.
COLUMNS       DATA  TYPE     FIELD              DEFINITION
--------------------------------------------------------------------------------
 1 -  6       Record  name   "JRNL  "
13 - 16       LString(4)     "DOI "
20 – 79       LString        continuation       Unique DOI assigned to the publication
                                                describing the experiment.
                                                Allows for a long DOI string.
Verification/Validation/Value Authority Control
wwPDB verifies that this record is correctly formatted.
Citations appearing in JRNL may not also appear in REMARK 1.
Relationships to Other Record Types
The publication cited as the JRNL record may not be repeated in REMARK 1.
Example
1 2 3 4 5 6 7 8 12345678901234567890123456789012345678901234567890123456789012345678901234567890 JRNL AUTH G.FERMI,M.F.PERUTZ,B.SHAANAN,R.FOURME JRNL TITL THE CRYSTAL STRUCTURE OF HUMAN DEOXYHAEMOGLOBIN AT JRNL TITL 2 1.74 A RESOLUTION JRNL REF J.MOL.BIOL. V. 175 159 1984 JRNL REFN ISSN 0022-2836 JRNL PMID 6726807 JRNL DOI 10.1016/0022-2836(84)90472-8
Known Problems