[Biojava-l] DAS client: how to retrieve features for a sequence region
Bernd Jagla
bernd.jagla at pasteur.fr
Thu Apr 29 06:30:03 UTC 2010
Hi Jonathan,
Just to clarify, I need to write my own das client? I was hoping to be able
to use most of the functionality especially for the parsing of the XML and
creating the URLs by means of functions/methods that are already around.
I am now going into debug mode for the DAS package in biojava to look for
the XML parsing, if you any further pointers on specific methods I should be
looking at it would mean a lot to me.
In short, I think I can create the URLs from scratch with not much effort. I
don't currently know how to put the XML into a data structure and how this
data structure should look like.
Thanks for your kind help,
Bernd
_____
From: Jonathan Warren [mailto:jw12 at sanger.ac.uk]
Sent: Wednesday, April 28, 2010 10:21 PM
To: Bernd Jagla
Cc: biojava-l at lists.open-bio.org
Subject: Re: [Biojava-l] DAS client: how to retrieve features for a sequence
region
Hi Bernd
For the UCSC you need to filter on types. see
http://genome.ucsc.edu/FAQ/FAQdownloads.html#downloads there is a section
called "Downloading data from the UCSC DAS server"
for DAS libraries you can see a tutorial here
http://www.biodas.org/wiki/DASWorkshop2010#Day_2
the one you would be most interested in is the Dasobert tutorial
(http://www.ebi.ac.uk/~rafael/dokuwiki/doku.php?id=das:courses:dasobert) for
DAS client creation, but there is a also a good javascript library as well
called JSDas.
Any more info then don't hesitate to ask.
Jonathan.
On 28 Apr 2010, at 08:25, Bernd Jagla wrote:
Hi there,
I am trying to retrieve information (features) from the UCSC genome browser
using the DAS interface.
I am looking at the org.biojava.bio.program.das sources. I can retrieve all
top level entry points with
DASSequenceDB(dbURL)
(Apperently the last entry from the return XML object gives a
[Fatal Error] :1:1: Content is not allowed in prolog.
Which I am ignoring...)
and also the DSN entries using:
DAS das = new DAS();
das.addDasURL(new URL(dbURLString));
for(Iterator i = das.getReferenceServers().iterator(); i.hasNext(); )
{....
When I try to access features for a top level entry point, i.e. a reference
sequence I have the impression that first all features for a given reference
sequence are being downloaded.
My questions:
How can I access only the features of a specific region? I guess in DAS
terms I want to specify the segment part of the URL
(http://genome.ucsc.edu/cgi-bin/das/hg17/features?segment=22:15000000,160000
00).
I would also like to get the list of available features. How can I achieve
this? From a wireshark output I can see that this is being retrieved somehow
behind the scene. How can I access this information?
I am looking at TestDAS*.java; are there any other examples around that I
can use to learn from?
Thanks a lot for your kind support,
Best,
Bernd
_______________________________________________
Biojava-l mailing list - Biojava-l at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/biojava-l
Jonathan Warren
Senior Developer and DAS coordinator
jw12 at sanger.ac.uk
Ext: 2314
Telephone: 01223 492314
-- The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a company
registered in England with number 2742969, whose registered office is 215
Euston Road, London, NW1 2BE.
More information about the Biojava-l
mailing list