[BioPython] residue sequence number length (no more than 4 digits)
    Wolfgang Meyer 
    wolfgang.meyer at gmail.com
       
    Tue Jan  1 12:33:41 EST 2008
    
    
  
Hi,
According to the (old) PDB format, the residue sequence number should be no
longer than 4 digits:
...
23 - 26    Integer     resSeq    Residue sequence number.
...
However, Bio.PDB.Residue.__init__(...) does not check the length of this
parameter, and neither does Bio.PDB.PDBIO, although Bio.PDB.PDBIO tries to
restrict the residue sequence number to 4 characters in its format string:
_ATOM_FORMAT_STRING="%s%5i %-4s%c%3s %c%4i%c   %8.3f%8.3f%8.3f%6.2f%6.2f      %4s%2s%2s\n"
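This does not prevent PDBIO from writing a residue sequence number longer
than 4 digits into a PDB file, because the 4 in %4i is only a minimum field
width: a longer number is written in full and shifts the columns after it.
Such a PDB file would be considered invalid by many PDB file parsers.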
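To illustrate the format string behaviour: a minimal sketch that uses only
the resSeq part of the format. The names RESSEQ_FORMAT, ok and too_long are
mine for illustration and are not part of Bio.PDB.

```python
# Only the resSeq field of the ATOM format. "%4i" pads short numbers
# to 4 characters but never truncates longer ones.
RESSEQ_FORMAT = "%4i"

ok = RESSEQ_FORMAT % 42           # padded on the left to 4 characters
too_long = RESSEQ_FORMAT % 12345  # 5 characters, overflows columns 23-26

print(repr(ok))
print(repr(too_long))
```

The second value is one character wider than the field, so everything after
it in the ATOM line would be pushed one column to the right.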
Of course, users are responsible for passing a residue sequence number of
valid length to a residue. However, wouldn't it be better for BioPython to
catch careless input of an invalid residue sequence number?
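As a rough sketch of the kind of check I have in mind (check_resseq is a
hypothetical helper, not an existing Bio.PDB function, and where it would be
called from is left open):

```python
def check_resseq(resseq):
    """Hypothetical helper: reject residue sequence numbers that do not
    fit in the 4-column resSeq field (columns 23-26) of an ATOM record."""
    # Measure the printed width, so a minus sign counts as a character.
    if len("%i" % resseq) > 4:
        raise ValueError(
            "residue sequence number %i does not fit in 4 columns" % resseq
        )
    return resseq
```

With this, check_resseq(9999) and check_resseq(-999) pass, while
check_resseq(10000) raises ValueError instead of silently producing a
misaligned line.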
Thanks!
-- 
Wolfgang Meyer
    
    