[Bioperl-l] GenBank ASN.1 SeqIO parser

Chris Mungall cjm at fruitfly.org
Fri Feb 8 02:14:20 UTC 2008


Just to add to Barry's suggestions:

You can query the GO database directly with SQL through GOOSE:
http://www.berkeleybop.org/goose

You can also use the go-db-perl API and point it at the MySQL port of
one of the mirrors; see
http://www.geneontology.org/GO.database.shtml
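
For the flat-file route Barry describes below, here is a minimal sketch of the two parsing steps: filtering gene2go by Gene ID and pulling term names out of the OBO file. It is in Python rather than Perl, purely as an illustration. The column order in COLUMNS is an assumption about the gene2go layout, so check it against the header line of your copy; the function names and the sample rows are made up for the example and are not real annotations.

```python
# Assumed gene2go column order (tab-separated). Verify against the
# "#..." header line of the file you actually download.
COLUMNS = ["tax_id", "gene_id", "go_id", "evidence",
           "qualifier", "go_term", "pubmed", "category"]


def go_annotations(lines, wanted_gene_ids):
    """Yield one dict per gene2go row whose gene_id is in wanted_gene_ids."""
    wanted = {str(g) for g in wanted_gene_ids}
    for line in lines:
        if line.startswith("#"):          # skip the header/comment line
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < len(COLUMNS):    # skip short or blank rows
            continue
        record = dict(zip(COLUMNS, fields))
        if record["gene_id"] in wanted:
            yield record


def obo_term_names(lines):
    """Map term id -> name using only the [Term] stanzas of an OBO file."""
    names = {}
    in_term = False
    current_id = None
    for line in lines:
        line = line.strip()
        if line.startswith("["):          # a new stanza begins
            in_term = (line == "[Term]")
            current_id = None
        elif in_term and line.startswith("id: "):
            current_id = line[len("id: "):]
        elif in_term and line.startswith("name: ") and current_id:
            names[current_id] = line[len("name: "):]
    return names


if __name__ == "__main__":
    # Invented sample rows for illustration only.
    gene2go = [
        "#tax_id\tGeneID\tGO_ID\tEvidence\tQualifier\tGO_term\tPubMed\tCategory\n",
        "9606\t11258\tGO:0005515\tIPI\t-\tprotein binding\t-\tFunction\n",
        "9606\t1\tGO:0005576\tIDA\t-\textracellular region\t-\tComponent\n",
    ]
    obo = ["[Term]", "id: GO:0005515", "name: protein binding", "",
           "[Typedef]", "id: part_of", "name: part of"]
    names = obo_term_names(obo)
    for rec in go_annotations(gene2go, [11258]):
        print(rec["gene_id"], rec["go_id"], names.get(rec["go_id"], "?"))
```

With the real files you would pass open file handles instead of lists, for example gzip.open("gene2go.gz", "rt") for the compressed flat file. This only covers the "description on the fly" case; for graph traversal, go-perl or the database remain the better fit.
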

On Feb 7, 2008, at 4:31 PM, Barry Moore wrote:

> Ryan,
>
> If you have a list of NCBI Gene IDs, you can grab the gene2go flat file
> from NCBI's FTP site:
> ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2go.gz
> That will give you tax_id, gene_id, go_id, evidence code, qualifier,
> category, etc.  From there you can get the description from the GO OBO
> file:
> http://www.geneontology.org/ontology/gene_ontology_edit.obo
> If all you need is the description, the file is pretty easy to parse on
> the fly, but if you need to traverse the graphs or want an
> already-written parser, use go-perl:
> http://search.cpan.org/~cmungall/go-perl/go-perl.pod
>
> Barry
>
>
>
> On Feb 7, 2008, at 4:04 PM, Ryan Golhar wrote:
>
>> Let me re-phrase then - I want to parse an entry such as this:
>>
>> http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene&cmd=Retrieve&dopt=full_report&list_uids=11258
>>
>> to retrieve the text of the Gene Ontology entries and the associated GO
>> IDs for those entries.  Is this possible with BioPerl?  If so, how can I
>> do this with BioPerl?
>>
>> Ryan
>>
>>
>>
>> Jason Stajich wrote:
>>> ugh - why parse ASN.1? NCBI provides a converter application in the NCBI
>>> toolkit to many formats: genbank, XML, etc.
>>> On Feb 7, 2008, at 1:48 PM, Chris Fields wrote:
>>>
>>>> No.  The only ASN.1 parser is entrezgene.  You could probably try
>>>> building one using the same ASN.1 parser that SeqIO::entrezgene uses
>>>> (Bio::ASN1::EntrezGene); it includes a parser for sequences:
>>>>
>>>> http://search.cpan.org/~mingyiliu/Bio-ASN1-EntrezGene-1.091/lib/Bio/ASN1/Sequence.pm
>>>>
>>>>
>>>> chris
>>>>
>>>> On Feb 7, 2008, at 3:24 PM, Ryan Golhar wrote:
>>>>
>>>>> Is there a SeqIO parser module for GenBank ASN.1 format?  I thought
>>>>> it would have been genbank or entrezgene, but neither of them works.
>>>>>
>>>>> _______________________________________________
>>>>> Bioperl-l mailing list
>>>>> Bioperl-l at lists.open-bio.org
>>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>>>>
>>>> Christopher Fields
>>>> Postdoctoral Researcher
>>>> Lab of Dr. Robert Switzer
>>>> Dept of Biochemistry
>>>> University of Illinois Urbana-Champaign
>>>>
>>>>
>>>>
>>>
>>
>



