[Biojava-l] PDBFileReader vs MMCIFFileReader - Parsing chain data into BioJava data structure
Andreas Prlic
andreas at sdsc.edu
Tue Jul 2 16:55:46 UTC 2013
Hi Phele,
The mmcif parsing framework currently does not read some of the header
information, yet. I'll file it as a feature request. In the meanwhile, you
could use the PDB file parser to get that data loaded ...
Andreas
On Tue, Jul 2, 2013 at 1:13 AM, Phelelani Mpangase <pmpangase at gmail.com>wrote:
> Hi all
>
> I am using the MMCIFFileParser class to parse the mmCIF files, which I
> understand should parse the mmCIF files into the same BioJava data
> structure as the PDB files. However, it seems as if some of the data in the
> mmCIF files is not loaded into the BioJava data structure.
>
> I was looking to obtain some information for each the chains in mmCIF files
> (organism, expression system etc) but I noticed that when I use the class
> getChains() on the structre parsed using the MMCIFFileParser, I always get
> "null" results. Here is code below demonstrating the results of the same
> PDB ID parsed with the PDBFileReader as well as the MMCIFFileReader:
>
> String pdbid = "4hhb";
> String dataDownload = "/tmp/";
>
> //pdb parser
> PDBFileReader pdbreader = new PDBFileReader();
> pdbreader.setAutoFetch(true);
> pdbreader.setPath(dataDownload);
>
> //mmCIF parser
> StructureIOFile cifreader = new MMCIFFileReader();
> cifreader.setAutoFetch(true);
> cifreader.setPath(dataDownload);
>
> //Parse pdb and cif structures into BioJava data structure
> Structure pdbstruc = pdbreader.getStructureById(pdbid);
> Structure cifstruc = cifreader.getStructureById(pdbid);
>
> System.out.println("Chain information for PDB structure " + pdbid +
> ":");
>
> //Get the chain information
> List<Chain> pdbChains = pdbstruc.getChains();
> for (Chain chain : pdbChains){
> System.out.println(chain.getHeader());
> }
>
>
> System.out.println("----------------------------------------------------");
> System.out.println("\nChain information for mmCIF Structure " +
> pdbid + ":");
>
> //Get the chain information
> List<Chain> cifChains = cifstruc.getChains();
> for (Chain chain : cifChains){
> System.out.println(chain.getHeader());
> }
>
> System.out.println("----------------------------------------------------");
>
>
> Are there additional classes I need to be using in order for the chain
> information in mmCIF files to be parsed into the BioJava data structure at
> the PDB files?
>
> Regards,
> Phele
> _______________________________________________
> Biojava-l mailing list - Biojava-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biojava-l
>
More information about the Biojava-l
mailing list