[BioPython] Extracting residue list from PDB

Boris Steipe boris.steipe at utoronto.ca
Sat Oct 29 00:45:38 EDT 2005


SEQRES and ATOM records have different semantics: the SEQRES is what  
the crystallographer puts into the experiment, the ATOM records is  
what she sees. Presumably you have covalent chain-breaks between  
residues, or parts of the polypeptide chain were not traceable in  
electron density and were omitted.

So even though they numbers are inconsistent, they are both right.

A related issue may be what the natural protein sequence is, as  
opposed to the perhaps truncated molecule in the crystal (presumably  
you are not interested in the propensity of fragments to crystallize,  
but in some biological property), or what the translated sequence is,  
that may have been posttranslationally processed, or even what the  
gene sequence is, that may have been translated with e.g.  
selenocysteine, etc. etc.

So, (as usual) "the way to go" is determined by where you want to go to.

HTH

Boris


On 28 Oct 2005, at 22:14, Gad Abraham wrote:



> On Fri, Oct 28, 2005 at 09:07:53AM -0700, Iddo Friedberg wrote:
>
>
>
>> Hi Gad,
>>
>> The reason the chains seem to be of the wrong length, is that they  
>> are
>> generated from the ATOM records, rather than the SEQRES records.  
>> Those
>> disagree often enough in PDB files.
>> This problem does not exists in mmCIF, I believe.
>>
>> Using your code, I got only six chains, so I cannot comment on the
>> second problem.
>>
>>
>>
>>
>
> I see. I'm consistently getting 10 chains (lengths 161, 804, 97, 176,
> 11, 71, 87, 662, 133, 286) for 1N62.
>
> It seems that parsing the SEQRES is the way go.
>
> Thanks,
> Gad
>
> -- 
> +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> gabraham at cs.rmit.edu.au
> http://yallara.cs.rmit.edu.au/~gabraham
> +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> _______________________________________________
> BioPython mailing list  -  BioPython at biopython.org
> http://biopython.org/mailman/listinfo/biopython
>
>
>





More information about the BioPython mailing list