[Biopython] Mining the PDB header

Lenna Peterson lenna.peterson at gmail.com
Thu May 11 18:28:02 UTC 2017


Well, your original question did ask how to mine the PDB header with Python.

"protein classification" is not a specific term (do you mean organism?
function? fold? etc.) - is this something that appears in the PDB header?
If so, what PDB header field is it in?

Lenna


On Thu, May 11, 2017 at 1:49 PM, Ahmad Abdelzaher <underoath006 at gmail.com>
wrote:

> I'm not trying to mine the actual header. I would definitely be
> interested in an option that retrieves the protein classification
> without having to write any additional code. Does such option exist?
>
> Regards.
>
> On Thu, May 11, 2017 at 7:06 AM, João Rodrigues
> <j.p.g.l.m.rodrigues at gmail.com> wrote:
> > You can do *some* mining. Look at parse_pdb_header.
> >
> > 2017-05-10 18:58 GMT-07:00 Ahmad Abdelzaher <underoath006 at gmail.com>:
> >>
> >> Hey guys,
> >>
> >> Unfortunately  I read this in the FAQ page:
> >>
> >> " If you are interested in data mining the PDB header, you might want
> >> to look elsewhere because there is only limited support for this."
> >>
> >> So if I can't do it with biopython, what other alternatives do I have?
> >> I'm doing some PDB mining and I'm interested to retrieve the
> >> classification of the structure, to do some clustering analysis later.
> >>
> >> Cheers.
> >> _______________________________________________
> >> Biopython mailing list  -  Biopython at mailman.open-bio.org
> >> http://mailman.open-bio.org/mailman/listinfo/biopython
> >
> >
>
> _______________________________________________
> Biopython mailing list  -  Biopython at mailman.open-bio.org
> http://mailman.open-bio.org/mailman/listinfo/biopython
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.open-bio.org/pipermail/biopython/attachments/20170511/813d8328/attachment.html>


More information about the Biopython mailing list