[Biojava-l] PDBFileParser and Atom element symbol
Andy Thomas-Cramer
thomascramera at dnastar.com
Thu Apr 29 18:14:27 UTC 2010
Yes, I would like to have direct access to the element symbol data
that's in the file. Otherwise, anyone that needs the element type has to
create rules for interpreting it from the "atom name" field. It feels
wrong to attempt to deduce data when it is provided explicitly.
These PDB remediation project notes suggest using the element symbol
specified in 77-78
http://nar.oxfordjournals.org/cgi/content/full/36/suppl_1/D426#SEC3
"Atom types are provided for every atom (i.e. ATOM record columns
77-78), so prior atom name justification conventions should no longer be
assumed in reading atom names."
JMOL uses the PDB element symbol if present, else interprets from the
"atom name" field.
http://wiki.jmol.org/index.php/AtomSets
"On PDB format, Jmol will identify the element from columns 77-78
(element symbol, right-justified). If this is absent, then it will
interpret the "atom name" field (columns 13-14) to deduce the element
identity."
JMOL is LGPL. If it interpretation is desirable, could start with its
current approach. Personally, I would be happy just with access to the
data in the file.
________________________________
From: andreas.prlic at gmail.com [mailto:andreas.prlic at gmail.com] On Behalf
Of Andreas Prlic
Sent: Monday, April 26, 2010 8:08 PM
To: Andy Thomas-Cramer
Cc: biojava-l at lists.open-bio.org
Subject: Re: [Biojava-l] PDBFileParser and Atom element symbol
Hi Andy
Questions:
* Is this pattern documented in the PDB specification?
see here:
http://www.wwpdb.org/documentation/format23/sect9.html#ATOM
* If this pattern can be relied on, why are columns 77-78 also
dedicated to the element symbol?
That is the atom's element symbol (as given in the periodic table), in
contrast to the first name, which contains numbering information.
* Should reliance on the pattern be hidden behind a BioJava
method?
If you think that is important we could probably provide an enum for all
atom types. There are two categories though: the periodic table symbol
and the one that is related to the position in an amino acid....
Andreas
________________________________
From: andreas.prlic at gmail.com [mailto:andreas.prlic at gmail.com]
On Behalf Of Andreas Prlic
Sent: Friday, April 23, 2010 6:52 PM
To: Andy Thomas-Cramer
Cc: biojava-l at lists.open-bio.org
Subject: Re: [Biojava-l] PDBFileParser and Atom element symbol
Hi Andy,
you could check with Atom.getFullname(), which contains the
space characters from the PDB file:
e.g Calpha: " CA ", Calcium "CA "
in addition the parent group of a Calpha atom is usually an
AminoAcid and for Calciums it is a Hetatom group...
Andreas
On Fri, Apr 23, 2010 at 3:58 PM, Andy Thomas-Cramer
<thomascramera at dnastar.com> wrote:
Is there an easy way to identify the type of atom referenced by
an Atom
object?
For example, if Atom.getName() is "CA", is the element calcium
or the
atom carbon alpha?
If not, would it be feasible to add a method providing this in
Atom,
AtomImpl, and parsing it in PDBFileParser, using the columns
defined at
http://www.wwpdb.org/documentation/format32/sect9.html#ATOM?
_______________________________________________
Biojava-l mailing list - Biojava-l at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/biojava-l
--
-----------------------------------------------------------------------
Dr. Andreas Prlic
Senior Scientist, RCSB PDB Protein Data Bank
University of California, San Diego
(+1) 858.246.0526
-----------------------------------------------------------------------
--
-----------------------------------------------------------------------
Dr. Andreas Prlic
Senior Scientist, RCSB PDB Protein Data Bank
University of California, San Diego
(+1) 858.246.0526
-----------------------------------------------------------------------
More information about the Biojava-l
mailing list