[Biojava-dev] retrieving species (common name)
mark.schreiber at novartis.com
mark.schreiber at novartis.com
Mon Jun 19 02:38:38 UTC 2006
You could try NCBI's e-utils webservice. This might enable you to get the
common name using a GI number.
- Mark
Hubert Prielinger <hubert.prielinger at gmx.at>
Sent by: biojava-dev-bounces at lists.open-bio.org
06/13/2006 12:36 AM
To: Richard Holland <richard.holland at ebi.ac.uk>,
biojava-dev at lists.open-bio.org
cc: (bcc: Mark Schreiber/GP/Novartis)
Subject: Re: [Biojava-dev] retrieving species (common name)
hi,
No, I'm not using BioSQL, it is an usual mySQL database and I have only
the genbank accession id available. I want to get the taxonomy with the
accession id, if that is possible.
regards
Hubert
Richard Holland wrote:
> I'm assuming your sequences and taxonomy data are stored in BioSQL. In
> which case, it's fairly straightforward to get this information out
> without having to drag all the features and annotations out as well, by
> using BioEntry instead of RichSequence to query the database. Code like
> this should work (hasn't been checked or anything, but it gives you an
> idea as to how things should go):
>
> // connect to BioSQL and establish a Hibernate Session
> Session sess = ...;
>
> // set up BioJavaX to use the session
> RichObjectFactory.connectToBioSQL(sess);
>
> // instantiate the class that gets BioEntries from
BioSQL.
> // use BioSQLRichSequenceDB instead if you want features
and
> // annotations included.
> BioEntryDB db = new BioSQLBioEntryDB(sess);
>
> // get BioEntry for accession (accession must be the
> // primary accession of the sequence, as found in the
> // 'name' column in the 'bioentry' table in the
database).
> BioEntry be = db.getBioEntry("YPOL_IBDVS");
>
> // get BioEntry's taxon object
> NCBITaxon tax = be.getTaxon();
>
> // print the names. Each name belongs to a name class.
> for (Iterator i = tax.getNameClasses().iterator();
> i.hasNext();
> ) {
> String nameClass = (String)i.next();
> for (Iterator k =
tax.getNames(nameClass).iterator();
> k.hasNext();
> ) {
> String name =
(String)k.next();
> System.out.println(nameClass+" : "+name);
> }
> }
>
>
> If your sequences and taxonomy data are not stored in BioSQL, then the
> only way to do this is to parse the taxonomy data on startup, parse the
> sequences on startup into a simple in-memory system such as
> HashRichSequenceDB, then use the methods on the RichSequenceDB interface
> to obtain sequences by accession before continuing as per the example
> above.
>
> cheers,
> Richard
>
>
> On Fri, 2006-06-09 at 14:51 -0600, Hubert Prielinger wrote:
>
>> hi,
>> sorry for replying that late,
>> I have XML blast outputs, which you can retrieve information like
>> accession id, protein name, length of sequnence aso....
>> but there is no possibility to retrieve the taxonomy (especially the
>> scientific name or common name)
>> I need the common and scientific name from each blast hit. I have found
>> in biojava-live/src/org/biojava/bibliography/taxa a few code examples
>> that could suit my
>> task (e.g: simpleTaxon.java)
>>
>> eg: I have the accession id: YPOL_IBDVS
>> and I want to get the taxonomy of that protein, not neccessarily the
>> entire taxonomy but mentioned above scientific and common name.
>> and I don't know exactly how to get the taxonomy, it seems that there
is
>> no directly way from the accession id, but over the taxon id, but I
>> don't know how to get that either.....
>> it must be possible to map the accession id to the taxon id and then
>> request with the taxon id the taxonomy, if I get it right.....
>>
>> thanks in advance
>> regards
>> Hubert
>>
>>
>> Richard Holland wrote:
>>
>>> I'm not sure what you're asking for here. Could you explain in a
little
>>> more detail? Maybe write some example program code that assumes
BioJava
>>> works the way you'd like it to work in this situation, making up the
>>> names of classes/methods that you might call in BioJava but don't yet
>>> exist, then we can help you fill in the gaps.
>>>
>>> cheers,
>>> Richard
>>>
>>> On Mon, 2006-06-05 at 16:49 -0600, Hubert Prielinger wrote:
>>>
>>>
>>>> hi,
>>>> Is it possible with biojava to retrieve the species not the entire
>>>> taxonomy, only the common name if I only have the accession id or the
>>>> name of the protein and if yes
>>>> how to start.....
>>>> In my case:
>>>> I would retrieve the accession id from my local database then assign
as
>>>> parameter to the program, retrieve common name and write the common
name
>>>> back into the database....
>>>> the thing I want to know is the retrieving possible with biojava?
>>>>
>>>> thanks for help
>>>>
>>>> Hubert
>>>> _______________________________________________
>>>> biojava-dev mailing list
>>>> biojava-dev at lists.open-bio.org
>>>> http://lists.open-bio.org/mailman/listinfo/biojava-dev
>>>>
>>>>
_______________________________________________
biojava-dev mailing list
biojava-dev at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/biojava-dev
More information about the biojava-dev
mailing list