[DAS2] transitioning from DAS/1 to DAS/2
Andrew Dalke
dalke at dalkescientific.com
Tue Dec 6 23:43:10 UTC 2005
> Andreas Prlic wrote:
>> I would therefore like to discuss to rename the "genome" domain to
>> "sequence".
>>
>> The information of which type of sequence, "genomic" or "protein
>> sequence" should be provided via the
>> source description.
Steve:
> I like this idea. The only nucleotide specific stuff in the DAS/2
> retrieval
> spec is the region request. Strand designation in a location specifier
> is
> already optional.
>
> We'd may then want to change the 'sequence' request to something else,
> perhaps 'residues'?
I've been thinking the same thing.
Going one step further - what about dropping the name entirely?
Consider this, with some of the xml: attributes removed for clarity.
I've added a 'source_type' field in the <SOURCE> element.
This is what you get from a SOURCES request
HTTP GET http://www.example.com/das2/
(note lack of 'genome' in that URL)
<SOURCES>
<SOURCE id="volvox" description="Volvox Example Database"
taxon="http://www.ncbi.nlm.nih.gov/taxon-browser?id=29118"
source_type="genome">
<VERSION id="volvox/1" description="Build 1, October 2002">
<ASSEMBLY id="http://www.ensembl.org/das/genome/vv116" />
</VERSION>
</SOURCE>
<SOURCE id="yeast" description="S. cerevisae Genome via SGD"
taxon="http://www.ncbi.nlm.nih.gov/taxon-browser?id=2186"
source_type="genome">
<VERSION id="yeast/1Feb2001" description="February 2001 build" />
<VERSION id="yeast/1Mar2001" description="March 2001 build (minor
changes)" />
</SOURCE>
<SOURCE id="swissprot" description="SWISS-PROT sequence"
source_type="protein">
<VERSION id="swissprot/45" description="Some SProt version" />
</SOURCE>
<SOURCE id="pdb" description="PDB"
source_type="structure">
<VERSION id="pdb/2005q3" description="Fall 2005 release of all PDB
structures" />
</SOURCE>
</SOURCES>
That is, the SOURCES request returns information about genomic,
protein sequence and structure databases.
If this occurs then there will need to be a few changes to the spec.
For example, 'taxon' is probably only properly part of the genomic
sources and not in the others, so perhaps move the taxon information
into a subelement of those SOURCE elements with 'source_type' ==
'genome'.
<SOURCE id="volvox" description="Volvox Example Database"
source_type="genome">
<TAXON>http://www.ncbi.nlm.nih.gov/taxon-browser?id=29118</TAXON>
<VERSION id="volvox/1" description="Build 1, October 2002">
<ASSEMBLY id="http://www.ensembl.org/das/genome/vv116" />
</VERSION>
</SOURCE>
...
Andrew
dalke at dalkescientific.com
More information about the DAS2
mailing list