From rmb32 at cornell.edu Sun Aug 1 15:17:14 2010 From: rmb32 at cornell.edu (Robert Buels) Date: Sun, 01 Aug 2010 12:17:14 -0700 Subject: [DAS] GMOD Evo Hackathon Open Call for Participation Message-ID: <4C55C83A.3060700@cornell.edu> We are seeking participants for the GMOD Tools for Evolutionary Biology Hackathon, held November 8-12, 2010 at the US National Evolutionary Synthesis Center (NESCent) in Durham, NC. This hackathon targets three critical gaps in the capabilities of the GMOD toolbox that currently limit its utility for evolutionary research: 1. Visualization of comparative genomics data 2. Visualization of phylogenetic data and trees 3. Support for population diversity and phenotype data If you are interested in these areas and have relevant expertise, you are strongly encouraged to apply. Relevant areas of expertise include more than just software development: if you are a GMOD power user, visualization guru, domain expert (comparative, phylogenetics, population, ...), or documentation wizard, then your skills are needed! How To Apply: Fill out the online application form at http://bit.ly/gmodevohack. Applications are due August 25. About GMOD: GMOD is an intercompatible suite of open-source software components for storing, managing, analyzing, and visualizing genome-scale data. GMOD includes many widely-used software components: GBrowse and JBrowse, both genome viewers; GBrowse_syn, a comparative genomics viewer; Chado, a generic and modular database schema; CMap, a comparative map viewer; as well as many other components including Apollo, MAKER, BioMart, InterMine, and Galaxy. We hope to extend the functionality of existing GMOD components, and integrate new components as well. About Hackathons: A hackathon is an intense event at which a group of programmers with different backgrounds and skills collaborate hands-on and face-to-face to develop working code that is of utility to the community as a whole. The mix of people will include domain experts and computer-savvy end-users. More details about the event, its motivation, organization, procedures, and attendees, as well as URLs to the hackathon and related websites are included below. Sincerely, The GMOD EvoHack Organizing Committee (and project affiliations as relevant): Nicole Washington, Chair (LBNL, modENCODE, Phenote) Robert Buels (SGN, Chado NatDiv) Scott Cain (OICR, GMOD) Dave Clements (NESCent, GMOD) Hilmar Lapp (NESCent, Phenoscape, Chado NatDiv) Sheldon McKay (University of Arizona, iPlant, GBrowse_syn) ----------------------------- About the GMOD Evo Hackathon Overview We are organizing a hackathon to fill critical gaps in the capabilities of the Generic Model Organism Database (GMOD) toolbox that currently limit its utility for evolutionary research. Specifically, we will focus on tools for 1) viewing comparative genomics data; 2) visualizing phylogenomic data; and 3) supporting population diversity data and phenotype annotation. The event will be hosted at NESCent and bring together a group of about 20+ software developers, end-user representatives, and documentation experts who would otherwise not meet. The participants will include key developers of GMOD components that currently lack features critical for emerging evolutionary biology research, developers of informatics tools in evolutionary research that lack GMOD integration, and informatics-savvy biologists who can represent end-user requirements. The event will provide a unique opportunity to infuse the GMOD developer community with a heightened awareness of unmet needs in evolutionary biology that GMOD components have the potential to fill, and for tool developers in evolutionary biology to better understand how best to extend or integrate with already existing GMOD components. Before the Event Discussion of ideas and sometimes even design actually starts well before the hackathon, on mailing lists, wiki pages, and conference calls set up among accepted attendees. This advance work lays the foundation for participants to be productive from the very first day. This also means that participants should be willing to contribute some time in advance of the hackathon itself to participate in this preparatory discussion. During the Event Typically, hackathon participants use the morning of the first day of the event to organize themselves into working groups of between 3 and 6 people, each with a focused implementation objective. Ideas and objectives are discussed, and attendees coalesce around the projects in which they have the most experience or interest. Deliverables / Event Results The meeting's attendance, working groups, and outcomes will be fully logged and documented on the GMOD wiki (http://gmod.org). Each working group during the event will typically have its own wiki page, linked from the main EvoHack page, where it documents its minutes and design notes, and provides links to the code and documentation it produces. Also, since GMOD and NESCent are both committed to open source principles, all code and documentation produced by participants during the hackathon must be published under an OSI-approved open source license. As contributions to existing GMOD tools, all hackathon products will most likely satisfy this requirement automatically. NESCent This event is sponsored by the US National Evolutionary Synthesis Center (NESCent, http://www.nescent.org) through its Informatics Whitepapers program (http://www.nescent.org/informatics/whitepapers.php). NESCent promotes the synthesis of information, concepts and knowledge to address significant, emerging, or novel questions in evolutionary science and its applications. NESCent achieves this by supporting research and education across disciplinary, institutional, geographic, and demographic boundaries (see http://www.nescent.org/science/proposals.php). Links Main GMOD EvoHack page, and full proposal: http://gmod.org/wiki/GMOD_Evo_Hackathon NESCent: http://www.nescent.org/ GMOD: http://gmod.org Similar past NESCent events, see: http://hackathon.nescent.org/ GMOD hackathon application: http://bit.ly/gmodevohack -- http://gmod.org/wiki/GMOD_News http://gmod.org/wiki/GMOD_Europe_2010 http://gmod.org/wiki/Help_Desk_Feedback From jprocter at compbio.dundee.ac.uk Mon Aug 2 04:18:57 2010 From: jprocter at compbio.dundee.ac.uk (Jim Procter) Date: Mon, 02 Aug 2010 09:18:57 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: References: <9FD87196-C80D-4C59-BE54-374A6CD71D68@ebi.ac.uk> Message-ID: <4C567F71.7050203@compbio.dundee.ac.uk> Interesting thread this - it's something that hasn't been properly discussed at previous DAS developer meetings.. On 30/07/2010 20:33, Dave Messina wrote: > I too agree with Eugene. > > No magic numbers. You're too late here. 0 *is already* a magic number in the normalised protein sequence world, since it indicates the transcription start site for a coding sequence (i.e. the initial M). This is the cause for some ambiguity in the bio* bindings, and confusion on the part of more simple minded programmers like myself :) > Types can be used for filtering, and actually you get more fine-grained control than simply positional or non-positional. (I use this technique now in DASher.) * > > In my opinion, the current spec as written is correct. That is, non-positional features don't just apply to the whole sequence, they apply to any part of the sequence. Agreed. But read on... > As an example, consider a journal reference ? a particular protein was isolated by a lab, they wrote a paper about it, and deposited the protein sequence in a database. If you look at a subsequence of the protein sequence, that subsequence still derives from the paper, right? So therefore the feature containing that journal reference should still be attached to the subsequence. > > On that basis, I think the uniprot server is technically doing it wrong and should be changed, although I have to say that in practice it hasn't been an issue for me. It's a difficult call. The uniprot server's behaviour is almost certainly due to the ambiguity arising from non-positional annotation which have start/end attributes (where start==end && start==0), and those which do not (the annotation is then usually derived from some other table, viz. the BioSQL schema). Other DAS servers do similar things, and kludges are needed to fix them. My only worry with the expectation of 'proper behaviour' - is that currently, I frequently see IDs with more non-positional annotation than positional (notwithstanding histogram like continuous quantitative annotation such as running averages of predicted or observed local sequence properties). Enforcing compliance with the spec as written means that the average DAS metaserver (i.e. uniprot, or some server that aggregates sequence database info with other data) will send a huge non-positional header in response to every range qualified feature request, which is pretty inefficient. It may not scale well, either, since the amount of database cross references is (still) increasing. > * It might be nice, though, to add 'positional' and 'non-positional' types, which would be a way to grab all of the existing positional or non-positional types in one go. (currently it's necessary to specify multiple types to get the same functionality.) This is essential, I think. However, the only way you are going to be able to do this in a DAS type constraint currently is to ensure the feature annotation source is ontology aware (and said ontology includes a distinct positional/non-positional hierarchy)**. One route would be to introduce a DAS-specific type term that the server maps to its source's ontology, another simpler approach would be to introduce a new boolean constraint 'positional', which if specified, limits the response to positional annotation only. Jim. ** but this immediatly brings to mind a nasty potential gotcha: e.g. 'expression' in the context of a genome is positional, but is a non-positional feature in the context of a proteome. So terms will have to be fully qualified in the type constraint on a feature request. -- ------------------------------------------------------------------- J. B. Procter (JALVIEW/ENFIN) Barton Bioinformatics Research Group Phone/Fax:+44(0)1382 388734/345764 http://www.compbio.dundee.ac.uk The University of Dundee is a Scottish Registered Charity, No. SC015096. From ljgarcia at ebi.ac.uk Mon Aug 2 04:50:51 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Mon, 02 Aug 2010 09:50:51 +0100 Subject: [DAS] querying for nonpositional annotations Message-ID: <4C5686EB.1040800@ebi.ac.uk> *Hi list, *>No magic numbers. *According to discussions on this matter, I will change MyDas behaviour so 0 will be no "magic number" any more, *>Types can be used for filtering, and actually you get more fine-grained control than simply positional or non-positional. (I use this technique now in DASher.) * >In my opinion, the current spec as written is correct. That is, non-positional features don't just apply to the whole sequence, they apply to any part of the sequence. >As an example, consider a journal reference --- a particular protein was isolated by a lab, they wrote a paper about it, and deposited the protein sequence in a database. If you look at a subsequence of the protein sequence, that subsequence still derives from the paper, right? So therefore the feature containing that journal reference should still be attached to the subsequence. >On that basis, I think the uniprot server is technically doing it wrong and should be changed, although I have to say that in practice it hasn't been an issue for me. *and non-positional features will be always returned. Since UniProt is built upon MyDas, its behaviour will change as well. Thanks, Leyla * > >Dave From andy.jenkinson at ebi.ac.uk Mon Aug 2 05:36:07 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Mon, 2 Aug 2010 10:36:07 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <4C5686EB.1040800@ebi.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> Message-ID: <4965F4EF-E3F3-4F47-A37F-5C1D6A68E37A@ebi.ac.uk> I guess if being unable to filter out nonpositional features proves to be a problem, we can introduce a specific boolean or type-based filter in the next version of the specification. On 2 Aug 2010, at 09:50, Leyla Garcia wrote: > *Hi list, > > *>No magic numbers. > > *According to discussions on this matter, I will change MyDas behaviour so 0 will be no > "magic number" any more, > > *>Types can be used for filtering, and actually you get more fine-grained control than simply positional or non-positional. (I use this technique now in DASher.) * >> In my opinion, the current spec as written is correct. That is, non-positional features don't just apply to the whole sequence, they apply to any part of the sequence. >> As an example, consider a journal reference --- a particular protein was isolated by a lab, they wrote a paper about it, and deposited the protein sequence in a database. If you look at a subsequence of the protein sequence, that subsequence still derives from the paper, right? So therefore the feature containing that journal reference should still be attached to the subsequence. >> On that basis, I think the uniprot server is technically doing it wrong and should be changed, although I have to say that in practice it hasn't been an issue for me. > > *and non-positional features will be always returned. > Since UniProt is built upon MyDas, its behaviour will change as well. > > Thanks, > > Leyla > * > >> >> Dave > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From jw12 at sanger.ac.uk Mon Aug 2 05:45:50 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Mon, 2 Aug 2010 10:45:50 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <4C5686EB.1040800@ebi.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> Message-ID: <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> I'm not sure I agree with returning all annotations for every tiny part of a sequence requested. If you consider DAS to be used for visual display mainly - then it seems a bit ridiculous to return all publications related to a segment (take a case where you have many publications related to a protein sequence). If publications aren't asked for i.e. non-positional annotations then I don't think they should be given. So I guess I'm agreeing with Jim here. Given the history of DAS I would propose a non-positional parameter as apposed to "positional". I think we have to remember that the 1.6 spec is supposed to mainly be a consolidation of the way DAS is being used and DAS is supposed to be simple (or not overly technical and difficult to pick up anyway). However- obviously we don't want to propagate practices that really don't make sense. The 0,0 thing is a hack like we had hacks for ontologies which now for 1.6 we have cvIds (I think are a big improvement). So we need something that is simple and obvious for a big all encompassing thing like positional vs non-positional. On 2 Aug 2010, at 09:50, Leyla Garcia wrote: > *Hi list, > > *>No magic numbers. > > *According to discussions on this matter, I will change MyDas > behaviour so 0 will be no > "magic number" any more, > > *>Types can be used for filtering, and actually you get more fine- > grained control than simply positional or non-positional. (I use > this technique now in DASher.) * >> In my opinion, the current spec as written is correct. That is, non- >> positional features don't just apply to the whole sequence, they >> apply to any part of the sequence. >> As an example, consider a journal reference --- a particular >> protein was isolated by a lab, they wrote a paper about it, and >> deposited the protein sequence in a database. If you look at a >> subsequence of the protein sequence, that subsequence still derives >> from the paper, right? So therefore the feature containing that >> journal reference should still be attached to the subsequence. >> On that basis, I think the uniprot server is technically doing it >> wrong and should be changed, although I have to say that in >> practice it hasn't been an issue for me. > > *and non-positional features will be always returned. > Since UniProt is built upon MyDas, its behaviour will change as well. > > Thanks, > > Leyla > * > >> >> Dave > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From ljgarcia at ebi.ac.uk Mon Aug 2 06:29:53 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Mon, 02 Aug 2010 11:29:53 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> Message-ID: <4C569E21.1060302@ebi.ac.uk> On 02/08/2010 10:45, Jonathan Warren wrote: > I'm not sure I agree with returning all annotations for every tiny > part of a sequence requested. > If you consider DAS to be used for visual display mainly - then it > seems a bit ridiculous to return all publications related to a segment > (take a case where you have many publications related to a protein > sequence). If publications aren't asked for i.e. non-positional > annotations then I don't think they should be given. So I guess I'm > agreeing with Jim here. I think a specific boolean (positional or non-positional) is a better idea than a type-based filter > Given the history of DAS I would propose a non-positional parameter as > apposed to "positional". > > I think we have to remember that the 1.6 spec is supposed to mainly be > a consolidation of the way DAS is being used and DAS is supposed to be > simple (or not overly technical and difficult to pick up anyway). > However- obviously we don't want to propagate practices that really > don't make sense. The 0,0 thing is a hack like we had hacks for > ontologies which now for 1.6 we have cvIds (I think are a big > improvement). So we need something that is simple and obvious for a > big all encompassing thing like positional vs non-positional. > > > > > On 2 Aug 2010, at 09:50, Leyla Garcia wrote: > >> *Hi list, >> >> *>No magic numbers. >> >> *According to discussions on this matter, I will change MyDas >> behaviour so 0 will be no >> "magic number" any more, >> >> *>Types can be used for filtering, and actually you get more >> fine-grained control than simply positional or non-positional. (I use >> this technique now in DASher.) * >>> In my opinion, the current spec as written is correct. That is, >>> non-positional features don't just apply to the whole sequence, they >>> apply to any part of the sequence. >>> As an example, consider a journal reference --- a particular protein >>> was isolated by a lab, they wrote a paper about it, and deposited >>> the protein sequence in a database. If you look at a subsequence of >>> the protein sequence, that subsequence still derives from the paper, >>> right? So therefore the feature containing that journal reference >>> should still be attached to the subsequence. >>> On that basis, I think the uniprot server is technically doing it >>> wrong and should be changed, although I have to say that in practice >>> it hasn't been an issue for me. >> >> *and non-positional features will be always returned. >> Since UniProt is built upon MyDas, its behaviour will change as well. >> >> Thanks, >> >> Leyla >> * >> >>> >>> Dave >> >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > From andy.jenkinson at ebi.ac.uk Mon Aug 2 08:33:15 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Mon, 2 Aug 2010 13:33:15 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> Message-ID: <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> On 2 Aug 2010, at 10:45, Jonathan Warren wrote: > I'm not sure I agree with returning all annotations for every tiny part of a sequence requested. > If you consider DAS to be used for visual display mainly - then it seems a bit ridiculous to return all publications related to a segment (take a case where you have many publications related to a protein sequence). If publications aren't asked for i.e. non-positional annotations then I don't think they should be given. So I guess I'm agreeing with Jim here. > > Given the history of DAS I would propose a non-positional parameter as apposed to "positional". > > I think we have to remember that the 1.6 spec is supposed to mainly be a consolidation of the way DAS is being used and DAS is supposed to be simple (or not overly technical and difficult to pick up anyway). However- obviously we don't want to propagate practices that really don't make sense. The 0,0 thing is a hack like we had hacks for ontologies which now for 1.6 we have cvIds (I think are a big improvement). So we need something that is simple and obvious for a big all encompassing thing like positional vs non-positional. I'm not quite sure what you're suggesting we do in 1.6? > > > > On 2 Aug 2010, at 09:50, Leyla Garcia wrote: > >> *Hi list, >> >> *>No magic numbers. >> >> *According to discussions on this matter, I will change MyDas behaviour so 0 will be no >> "magic number" any more, >> >> *>Types can be used for filtering, and actually you get more fine-grained control than simply positional or non-positional. (I use this technique now in DASher.) * >>> In my opinion, the current spec as written is correct. That is, non-positional features don't just apply to the whole sequence, they apply to any part of the sequence. >>> As an example, consider a journal reference --- a particular protein was isolated by a lab, they wrote a paper about it, and deposited the protein sequence in a database. If you look at a subsequence of the protein sequence, that subsequence still derives from the paper, right? So therefore the feature containing that journal reference should still be attached to the subsequence. >>> On that basis, I think the uniprot server is technically doing it wrong and should be changed, although I have to say that in practice it hasn't been an issue for me. >> >> *and non-positional features will be always returned. >> Since UniProt is built upon MyDas, its behaviour will change as well. >> >> Thanks, >> >> Leyla >> * >> >>> >>> Dave >> >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > > -- > The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE._______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From jw12 at sanger.ac.uk Mon Aug 2 11:17:30 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Mon, 2 Aug 2010 16:17:30 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> Message-ID: <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> Currently having read all points and thought about it properly I agree with Eugenes summary of what the behavior should be. I was thinking that types filtering is not implemented and understood well by many casual users of DAS. But as the types command and filtering is something we want to encourage using the non-positional type should encourage this and as long as it's documented well and splattered everywhere I think it should be fine. But using the logic below as far as I can see this means you have to do 2 requests if you want features and non-positional annotations for a range /das/source/ features?segment=P99999:1,150 and /das/source/features? segment=P99999;type=annotation But presumably doing /das/source/features? segment=P99999:1,150;type=gene;type=annotation would actually return genes if in region but no non-positional results which isn't really logical on its own? On 2 Aug 2010, at 13:33, Andy Jenkinson wrote: > On 2 Aug 2010, at 10:45, Jonathan Warren wrote: > >> I'm not sure I agree with returning all annotations for every tiny >> part of a sequence requested. >> If you consider DAS to be used for visual display mainly - then it >> seems a bit ridiculous to return all publications related to a >> segment (take a case where you have many publications related to a >> protein sequence). If publications aren't asked for i.e. non- >> positional annotations then I don't think they should be given. So >> I guess I'm agreeing with Jim here. >> >> Given the history of DAS I would propose a non-positional parameter >> as apposed to "positional". >> >> I think we have to remember that the 1.6 spec is supposed to mainly >> be a consolidation of the way DAS is being used and DAS is supposed >> to be simple (or not overly technical and difficult to pick up >> anyway). However- obviously we don't want to propagate practices >> that really don't make sense. The 0,0 thing is a hack like we had >> hacks for ontologies which now for 1.6 we have cvIds (I think are a >> big improvement). So we need something that is simple and obvious >> for a big all encompassing thing like positional vs non-positional. > > I'm not quite sure what you're suggesting we do in 1.6? > >> >> >> >> On 2 Aug 2010, at 09:50, Leyla Garcia wrote: >> >>> *Hi list, >>> >>> *>No magic numbers. >>> >>> *According to discussions on this matter, I will change MyDas >>> behaviour so 0 will be no >>> "magic number" any more, >>> >>> *>Types can be used for filtering, and actually you get more fine- >>> grained control than simply positional or non-positional. (I use >>> this technique now in DASher.) * >>>> In my opinion, the current spec as written is correct. That is, >>>> non-positional features don't just apply to the whole sequence, >>>> they apply to any part of the sequence. >>>> As an example, consider a journal reference --- a particular >>>> protein was isolated by a lab, they wrote a paper about it, and >>>> deposited the protein sequence in a database. If you look at a >>>> subsequence of the protein sequence, that subsequence still >>>> derives from the paper, right? So therefore the feature >>>> containing that journal reference should still be attached to the >>>> subsequence. >>>> On that basis, I think the uniprot server is technically doing it >>>> wrong and should be changed, although I have to say that in >>>> practice it hasn't been an issue for me. >>> >>> *and non-positional features will be always returned. >>> Since UniProt is built upon MyDas, its behaviour will change as >>> well. >>> >>> Thanks, >>> >>> Leyla >>> * >>> >>>> >>>> Dave >>> >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> >> -- >> The Wellcome Trust Sanger Institute is operated by Genome >> ResearchLimited, a charity registered in England with number >> 1021457 and acompany registered in England with number 2742969, >> whose registeredoffice is 215 Euston Road, London, NW1 >> 2BE._______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das > Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From ljgarcia at ebi.ac.uk Mon Aug 2 12:15:42 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Mon, 02 Aug 2010 17:15:42 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> Message-ID: <4C56EF2E.7080800@ebi.ac.uk> On 02/08/2010 16:17, Jonathan Warren wrote: > Currently having read all points and thought about it properly I > agree with Eugenes summary of what the behavior should be. > I was thinking that types filtering is not implemented and understood > well by many casual users of DAS. But as the types command and > filtering is something we want to encourage using the non-positional > type should encourage this and as long as it's documented well and > splattered everywhere I think it should be fine. But using the logic > below as far as I can see this means you have to do 2 requests if you > want features and non-positional annotations for a range > /das/source/features?segment=P99999:1,150 and > /das/source/features?segment=P99999;type=annotation I think I am confused about the annotation type filter. Why do you need type=annotation to retrieve non-positional features when no range is specified? --> /das/source/features?segment=P99999;type=annotation If I am not mistaken, no range will retrieve both positional and non-positional features. > > But presumably doing > /das/source/features?segment=P99999:1,150;type=gene;type=annotation > would actually return genes if in region but no non-positional results > which isn't really logical on its own? So far, non-positional features will be always retrieve, with or without range. The idea behind using a type filter (type=annotation) is to specify whether or not non-positional features should be retrieve (or any other type). So the last query should actually retrieve the "annotation" type features, or not? Leyla > > > On 2 Aug 2010, at 13:33, Andy Jenkinson wrote: > >> On 2 Aug 2010, at 10:45, Jonathan Warren wrote: >> >>> I'm not sure I agree with returning all annotations for every tiny >>> part of a sequence requested. >>> If you consider DAS to be used for visual display mainly - then it >>> seems a bit ridiculous to return all publications related to a >>> segment (take a case where you have many publications related to a >>> protein sequence). If publications aren't asked for i.e. >>> non-positional annotations then I don't think they should be given. >>> So I guess I'm agreeing with Jim here. >>> >>> Given the history of DAS I would propose a non-positional parameter >>> as apposed to "positional". >>> >>> I think we have to remember that the 1.6 spec is supposed to mainly >>> be a consolidation of the way DAS is being used and DAS is supposed >>> to be simple (or not overly technical and difficult to pick up >>> anyway). However- obviously we don't want to propagate practices >>> that really don't make sense. The 0,0 thing is a hack like we had >>> hacks for ontologies which now for 1.6 we have cvIds (I think are a >>> big improvement). So we need something that is simple and obvious >>> for a big all encompassing thing like positional vs non-positional. >> >> I'm not quite sure what you're suggesting we do in 1.6? >> >>> >>> >>> >>> On 2 Aug 2010, at 09:50, Leyla Garcia wrote: >>> >>>> *Hi list, >>>> >>>> *>No magic numbers. >>>> >>>> *According to discussions on this matter, I will change MyDas >>>> behaviour so 0 will be no >>>> "magic number" any more, >>>> >>>> *>Types can be used for filtering, and actually you get more >>>> fine-grained control than simply positional or non-positional. (I >>>> use this technique now in DASher.) * >>>>> In my opinion, the current spec as written is correct. That is, >>>>> non-positional features don't just apply to the whole sequence, >>>>> they apply to any part of the sequence. >>>>> As an example, consider a journal reference --- a particular >>>>> protein was isolated by a lab, they wrote a paper about it, and >>>>> deposited the protein sequence in a database. If you look at a >>>>> subsequence of the protein sequence, that subsequence still >>>>> derives from the paper, right? So therefore the feature containing >>>>> that journal reference should still be attached to the subsequence. >>>>> On that basis, I think the uniprot server is technically doing it >>>>> wrong and should be changed, although I have to say that in >>>>> practice it hasn't been an issue for me. >>>> >>>> *and non-positional features will be always returned. >>>> Since UniProt is built upon MyDas, its behaviour will change as well. >>>> >>>> Thanks, >>>> >>>> Leyla >>>> * >>>> >>>>> >>>>> Dave >>>> >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> -- >>> The Wellcome Trust Sanger Institute is operated by Genome >>> ResearchLimited, a charity registered in England with number 1021457 >>> and acompany registered in England with number 2742969, whose >>> registeredoffice is 215 Euston Road, London, NW1 >>> 2BE._______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > From andy.jenkinson at ebi.ac.uk Mon Aug 2 12:17:46 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Mon, 2 Aug 2010 17:17:46 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> Message-ID: So just to clarify, is this what you mean? /das/source/features?segment=P99999 [returns all positional and all nonpositional] /das/source/features?segment=P99999:1,150 [returns only positional in the query range] /das/source/features?segment=P99999;type=somenonpositionaltype [returns any feature of the given type, which might be nonpositional] This is different to what the spec says, but is close to what UniProt does. It does the above, except it also allows this: /das/source/features?segment=P99999:0,0 [returns only nonpositional] I think you're saying we should remove the 0,0 capability from MyDas/uniprot, and change the spec wording to reflect that nonpositional features will not be returned if the client specifies a start/end position. Right? :) Sorry for being dumb, am just a bit confused about what you're saying is illogical, etc. On 2 Aug 2010, at 16:17, Jonathan Warren wrote: > Currently having read all points and thought about it properly I agree with Eugenes summary of what the behavior should be. > I was thinking that types filtering is not implemented and understood well by many casual users of DAS. But as the types command and filtering is something we want to encourage using the non-positional type should encourage this and as long as it's documented well and splattered everywhere I think it should be fine. But using the logic below as far as I can see this means you have to do 2 requests if you want features and non-positional annotations for a range /das/source/features?segment=P99999:1,150 and /das/source/features?segment=P99999;type=annotation > > But presumably doing /das/source/features?segment=P99999:1,150;type=gene;type=annotation would actually return genes if in region but no non-positional results which isn't really logical on its own? > > > On 2 Aug 2010, at 13:33, Andy Jenkinson wrote: > >> On 2 Aug 2010, at 10:45, Jonathan Warren wrote: >> >>> I'm not sure I agree with returning all annotations for every tiny part of a sequence requested. >>> If you consider DAS to be used for visual display mainly - then it seems a bit ridiculous to return all publications related to a segment (take a case where you have many publications related to a protein sequence). If publications aren't asked for i.e. non-positional annotations then I don't think they should be given. So I guess I'm agreeing with Jim here. >>> >>> Given the history of DAS I would propose a non-positional parameter as apposed to "positional". >>> >>> I think we have to remember that the 1.6 spec is supposed to mainly be a consolidation of the way DAS is being used and DAS is supposed to be simple (or not overly technical and difficult to pick up anyway). However- obviously we don't want to propagate practices that really don't make sense. The 0,0 thing is a hack like we had hacks for ontologies which now for 1.6 we have cvIds (I think are a big improvement). So we need something that is simple and obvious for a big all encompassing thing like positional vs non-positional. >> >> I'm not quite sure what you're suggesting we do in 1.6? >> >>> >>> >>> >>> On 2 Aug 2010, at 09:50, Leyla Garcia wrote: >>> >>>> *Hi list, >>>> >>>> *>No magic numbers. >>>> >>>> *According to discussions on this matter, I will change MyDas behaviour so 0 will be no >>>> "magic number" any more, >>>> >>>> *>Types can be used for filtering, and actually you get more fine-grained control than simply positional or non-positional. (I use this technique now in DASher.) * >>>>> In my opinion, the current spec as written is correct. That is, non-positional features don't just apply to the whole sequence, they apply to any part of the sequence. >>>>> As an example, consider a journal reference --- a particular protein was isolated by a lab, they wrote a paper about it, and deposited the protein sequence in a database. If you look at a subsequence of the protein sequence, that subsequence still derives from the paper, right? So therefore the feature containing that journal reference should still be attached to the subsequence. >>>>> On that basis, I think the uniprot server is technically doing it wrong and should be changed, although I have to say that in practice it hasn't been an issue for me. >>>> >>>> *and non-positional features will be always returned. >>>> Since UniProt is built upon MyDas, its behaviour will change as well. >>>> >>>> Thanks, >>>> >>>> Leyla >>>> * >>>> >>>>> >>>>> Dave >>>> >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> -- >>> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE._______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > > -- > The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE. From jw12 at sanger.ac.uk Mon Aug 2 12:47:37 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Mon, 2 Aug 2010 17:47:37 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> Message-ID: <0392DE2F-2809-4C5C-AD1C-F83D13678626@sanger.ac.uk> yes thats what my current thinking was and what I interpreted Eugenes email as suggesting. But what I would like to work is this : /das/source/features? segment=P99999:1,150;type=gene;type=annotation but this won't work if you have already specified a range - or at least it's not that clear as to whether it should work. I guess i'd like it to be a "I want non-positional annotations" rather than "give me everything and I'll filter out non-positional annotations by choosing all the other types except positional annotations". But this is only my current opinion - we are debating this right? On 2 Aug 2010, at 17:17, Andy Jenkinson wrote: > So just to clarify, is this what you mean? > > /das/source/features?segment=P99999 [returns all positional and > all nonpositional] > /das/source/features?segment=P99999:1,150 [returns only > positional in the query range] > /das/source/features?segment=P99999;type=somenonpositionaltype > [returns any feature of the given type, which might be nonpositional] > > This is different to what the spec says, but is close to what > UniProt does. It does the above, except it also allows this: > > /das/source/features?segment=P99999:0,0 [returns only > nonpositional] > > I think you're saying we should remove the 0,0 capability from MyDas/ > uniprot, and change the spec wording to reflect that nonpositional > features will not be returned if the client specifies a start/end > position. Right? :) Sorry for being dumb, am just a bit confused > about what you're saying is illogical, etc. > > On 2 Aug 2010, at 16:17, Jonathan Warren wrote: > >> Currently having read all points and thought about it properly I >> agree with Eugenes summary of what the behavior should be. >> I was thinking that types filtering is not implemented and >> understood well by many casual users of DAS. But as the types >> command and filtering is something we want to encourage using the >> non-positional type should encourage this and as long as it's >> documented well and splattered everywhere I think it should be >> fine. But using the logic below as far as I can see this means you >> have to do 2 requests if you want features and non-positional >> annotations for a range /das/source/features?segment=P99999:1,150 >> and /das/source/features?segment=P99999;type=annotation >> >> But presumably doing /das/source/features? >> segment=P99999:1,150;type=gene;type=annotation would actually >> return genes if in region but no non-positional results which isn't >> really logical on its own? >> >> >> On 2 Aug 2010, at 13:33, Andy Jenkinson wrote: >> >>> On 2 Aug 2010, at 10:45, Jonathan Warren wrote: >>> >>>> I'm not sure I agree with returning all annotations for every >>>> tiny part of a sequence requested. >>>> If you consider DAS to be used for visual display mainly - then >>>> it seems a bit ridiculous to return all publications related to a >>>> segment (take a case where you have many publications related to >>>> a protein sequence). If publications aren't asked for i.e. non- >>>> positional annotations then I don't think they should be given. >>>> So I guess I'm agreeing with Jim here. >>>> >>>> Given the history of DAS I would propose a non-positional >>>> parameter as apposed to "positional". >>>> >>>> I think we have to remember that the 1.6 spec is supposed to >>>> mainly be a consolidation of the way DAS is being used and DAS is >>>> supposed to be simple (or not overly technical and difficult to >>>> pick up anyway). However- obviously we don't want to propagate >>>> practices that really don't make sense. The 0,0 thing is a hack >>>> like we had hacks for ontologies which now for 1.6 we have cvIds >>>> (I think are a big improvement). So we need something that is >>>> simple and obvious for a big all encompassing thing like >>>> positional vs non-positional. >>> >>> I'm not quite sure what you're suggesting we do in 1.6? >>> >>>> >>>> >>>> >>>> On 2 Aug 2010, at 09:50, Leyla Garcia wrote: >>>> >>>>> *Hi list, >>>>> >>>>> *>No magic numbers. >>>>> >>>>> *According to discussions on this matter, I will change MyDas >>>>> behaviour so 0 will be no >>>>> "magic number" any more, >>>>> >>>>> *>Types can be used for filtering, and actually you get more >>>>> fine-grained control than simply positional or non-positional. >>>>> (I use this technique now in DASher.) * >>>>>> In my opinion, the current spec as written is correct. That is, >>>>>> non-positional features don't just apply to the whole sequence, >>>>>> they apply to any part of the sequence. >>>>>> As an example, consider a journal reference --- a particular >>>>>> protein was isolated by a lab, they wrote a paper about it, and >>>>>> deposited the protein sequence in a database. If you look at a >>>>>> subsequence of the protein sequence, that subsequence still >>>>>> derives from the paper, right? So therefore the feature >>>>>> containing that journal reference should still be attached to >>>>>> the subsequence. >>>>>> On that basis, I think the uniprot server is technically doing >>>>>> it wrong and should be changed, although I have to say that in >>>>>> practice it hasn't been an issue for me. >>>>> >>>>> *and non-positional features will be always returned. >>>>> Since UniProt is built upon MyDas, its behaviour will change as >>>>> well. >>>>> >>>>> Thanks, >>>>> >>>>> Leyla >>>>> * >>>>> >>>>>> >>>>>> Dave >>>>> >>>>> _______________________________________________ >>>>> DAS mailing list >>>>> DAS at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>>> Jonathan Warren >>>> Senior Developer and DAS coordinator >>>> blog: http://biodasman.wordpress.com/ >>>> jw12 at sanger.ac.uk >>>> Ext: 2314 >>>> Telephone: 01223 492314 >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> -- >>>> The Wellcome Trust Sanger Institute is operated by Genome >>>> ResearchLimited, a charity registered in England with number >>>> 1021457 and acompany registered in England with number 2742969, >>>> whose registeredoffice is 215 Euston Road, London, NW1 >>>> 2BE._______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>> >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> >> -- >> The Wellcome Trust Sanger Institute is operated by Genome >> ResearchLimited, a charity registered in England with number >> 1021457 and acompany registered in England with number 2742969, >> whose registeredoffice is 215 Euston Road, London, NW1 2BE. > Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From andy.jenkinson at ebi.ac.uk Mon Aug 2 12:52:28 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Mon, 2 Aug 2010 17:52:28 +0100 Subject: [DAS] querying for nonpositional annotations In-Reply-To: <0392DE2F-2809-4C5C-AD1C-F83D13678626@sanger.ac.uk> References: <4C5686EB.1040800@ebi.ac.uk> <4291A93C-6F52-4C84-A724-25AF59A70A1C@sanger.ac.uk> <3A234F8D-DB4C-4183-9994-6694E56C8F6A@ebi.ac.uk> <97C6B89F-F001-4AB6-88E0-1B742FE39DF7@sanger.ac.uk> <0392DE2F-2809-4C5C-AD1C-F83D13678626@sanger.ac.uk> Message-ID: What exactly are the 'annotation' and 'gene' feature types? On 2 Aug 2010, at 17:47, Jonathan Warren wrote: > yes thats what my current thinking was and what I interpreted Eugenes email as suggesting. > > But what I would like to work is this : /das/source/features?segment=P99999:1,150;type=gene;type=annotation but this won't work if you have already specified a range - or at least it's not that clear as to whether it should work. > > I guess i'd like it to be a "I want non-positional annotations" rather than "give me everything and I'll filter out non-positional annotations by choosing all the other types except positional annotations". > > But this is only my current opinion - we are debating this right? > > > > On 2 Aug 2010, at 17:17, Andy Jenkinson wrote: > >> So just to clarify, is this what you mean? >> >> /das/source/features?segment=P99999 [returns all positional and all nonpositional] >> /das/source/features?segment=P99999:1,150 [returns only positional in the query range] >> /das/source/features?segment=P99999;type=somenonpositionaltype [returns any feature of the given type, which might be nonpositional] >> >> This is different to what the spec says, but is close to what UniProt does. It does the above, except it also allows this: >> >> /das/source/features?segment=P99999:0,0 [returns only nonpositional] >> >> I think you're saying we should remove the 0,0 capability from MyDas/uniprot, and change the spec wording to reflect that nonpositional features will not be returned if the client specifies a start/end position. Right? :) Sorry for being dumb, am just a bit confused about what you're saying is illogical, etc. >> >> On 2 Aug 2010, at 16:17, Jonathan Warren wrote: >> >>> Currently having read all points and thought about it properly I agree with Eugenes summary of what the behavior should be. >>> I was thinking that types filtering is not implemented and understood well by many casual users of DAS. But as the types command and filtering is something we want to encourage using the non-positional type should encourage this and as long as it's documented well and splattered everywhere I think it should be fine. But using the logic below as far as I can see this means you have to do 2 requests if you want features and non-positional annotations for a range /das/source/features?segment=P99999:1,150 and /das/source/features?segment=P99999;type=annotation >>> >>> But presumably doing /das/source/features?segment=P99999:1,150;type=gene;type=annotation would actually return genes if in region but no non-positional results which isn't really logical on its own? >>> >>> >>> On 2 Aug 2010, at 13:33, Andy Jenkinson wrote: >>> >>>> On 2 Aug 2010, at 10:45, Jonathan Warren wrote: >>>> >>>>> I'm not sure I agree with returning all annotations for every tiny part of a sequence requested. >>>>> If you consider DAS to be used for visual display mainly - then it seems a bit ridiculous to return all publications related to a segment (take a case where you have many publications related to a protein sequence). If publications aren't asked for i.e. non-positional annotations then I don't think they should be given. So I guess I'm agreeing with Jim here. >>>>> >>>>> Given the history of DAS I would propose a non-positional parameter as apposed to "positional". >>>>> >>>>> I think we have to remember that the 1.6 spec is supposed to mainly be a consolidation of the way DAS is being used and DAS is supposed to be simple (or not overly technical and difficult to pick up anyway). However- obviously we don't want to propagate practices that really don't make sense. The 0,0 thing is a hack like we had hacks for ontologies which now for 1.6 we have cvIds (I think are a big improvement). So we need something that is simple and obvious for a big all encompassing thing like positional vs non-positional. >>>> >>>> I'm not quite sure what you're suggesting we do in 1.6? >>>> >>>>> >>>>> >>>>> >>>>> On 2 Aug 2010, at 09:50, Leyla Garcia wrote: >>>>> >>>>>> *Hi list, >>>>>> >>>>>> *>No magic numbers. >>>>>> >>>>>> *According to discussions on this matter, I will change MyDas behaviour so 0 will be no >>>>>> "magic number" any more, >>>>>> >>>>>> *>Types can be used for filtering, and actually you get more fine-grained control than simply positional or non-positional. (I use this technique now in DASher.) * >>>>>>> In my opinion, the current spec as written is correct. That is, non-positional features don't just apply to the whole sequence, they apply to any part of the sequence. >>>>>>> As an example, consider a journal reference --- a particular protein was isolated by a lab, they wrote a paper about it, and deposited the protein sequence in a database. If you look at a subsequence of the protein sequence, that subsequence still derives from the paper, right? So therefore the feature containing that journal reference should still be attached to the subsequence. >>>>>>> On that basis, I think the uniprot server is technically doing it wrong and should be changed, although I have to say that in practice it hasn't been an issue for me. >>>>>> >>>>>> *and non-positional features will be always returned. >>>>>> Since UniProt is built upon MyDas, its behaviour will change as well. >>>>>> >>>>>> Thanks, >>>>>> >>>>>> Leyla >>>>>> * >>>>>> >>>>>>> >>>>>>> Dave >>>>>> >>>>>> _______________________________________________ >>>>>> DAS mailing list >>>>>> DAS at lists.open-bio.org >>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>> >>>>> Jonathan Warren >>>>> Senior Developer and DAS coordinator >>>>> blog: http://biodasman.wordpress.com/ >>>>> jw12 at sanger.ac.uk >>>>> Ext: 2314 >>>>> Telephone: 01223 492314 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE._______________________________________________ >>>>> DAS mailing list >>>>> DAS at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> -- >>> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE. >> > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > > -- > The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE. From thomas.a.down at gmail.com Tue Aug 3 15:41:40 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Tue, 3 Aug 2010 20:41:40 +0100 Subject: [DAS] Dalliance: a new genome DAS client Message-ID: As some of you already know, I've been experimenting recently with a web-based DAS client for genomic data. It's still in a unashamedly prototypical state (in particular, some of the popups and configuration stuff is outright clunky, and we know it!), but we're starting to find it quite useful, and would be interested to receive more feedback. So if you're curious, you can try it here: http://www.biodalliance.org/human/ncbi36/ It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and hopefully rather more coming soon), but has one major caveat: since it's pure Javascript code running in your web browser, there are limitations to which servers it can connect to. Specifically, it will only work with DAS servers that implement the W3C cross-origin resource sharing model (which has been discussed on this list before, but drop me a line if you've got any questions). What does this mean in practice? If you're adding datasources from the registry, things are simple because Dalliance will only allow you to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding some support for this in the registry). If you run your own DAS servers and don't list them in the registry, you'll need to check for CORS compatibility yourself. The latest versions of Proserver and Dazzle should both be okay. All comments, suggestions, and bug reports are welcome! Thomas Down. From lincoln.stein at gmail.com Tue Aug 3 15:51:56 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Tue, 3 Aug 2010 15:51:56 -0400 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: Message-ID: That's really really nice! Lincoln On Tue, Aug 3, 2010 at 3:41 PM, Thomas Down wrote: > As some of you already know, I've been experimenting recently with a > web-based DAS client for genomic data. It's still in > a unashamedly prototypical state (in particular, some of the popups and > configuration stuff is outright clunky, and we know it!), but we're > starting > to find it quite useful, and would be interested to receive more feedback. > So if you're curious, you can try it here: > > http://www.biodalliance.org/human/ncbi36/ > > It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and > hopefully rather more coming soon), but has one major caveat: since it's > pure Javascript code running in your web browser, there are limitations to > which servers it can connect to. Specifically, it will only work with DAS > servers that implement the W3C cross-origin resource sharing model (which > has been discussed on this list before, but drop me a line if you've got > any > questions). What does this mean in practice? If you're adding datasources > from the registry, things are simple because Dalliance will only allow you > to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding > some support for this in the registry). If you run your own DAS servers > and > don't list them in the registry, you'll need to check for CORS > compatibility > yourself. The latest versions of Proserver and Dazzle should both be okay. > > All comments, suggestions, and bug reports are welcome! > > Thomas Down. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > -- Lincoln D. Stein Director, Informatics and Biocomputing Platform Ontario Institute for Cancer Research 101 College St., Suite 800 Toronto, ON, Canada M5G0A3 416 673-8514 Assistant: Renata Musa From thomas.a.down at gmail.com Tue Aug 3 17:01:24 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Tue, 3 Aug 2010 22:01:24 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> Message-ID: On Tue, Aug 3, 2010 at 9:52 PM, Jonathan Warren wrote: > This is very cool - I had a look the other day. Was wondering why some > sources could be attached and some can't.... That's a good point, I should add an explanation of why some sources aren't available actually inside the registry browser. Do you think I can get away with an "e-mail the server administrator on xxx at yyy.com and demand CORS today" message? :-). Thomas. From jw12 at sanger.ac.uk Tue Aug 3 16:52:07 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Tue, 3 Aug 2010 21:52:07 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: Message-ID: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> This is very cool - I had a look the other day. Was wondering why some sources could be attached and some can't.... Best browser experience yet by far I'd say. No problems about adding CORS support - for the record I'm very happy to implement new capabilities testing and other suggestions to the registry from anyone who cares to drop me a line. Especially if it's going to enhance and promote the use of the registry :) On 3 Aug 2010, at 20:41, Thomas Down wrote: > As some of you already know, I've been experimenting recently with a > web-based DAS client for genomic data. It's still in > a unashamedly prototypical state (in particular, some of the popups > and > configuration stuff is outright clunky, and we know it!), but we're > starting > to find it quite useful, and would be interested to receive more > feedback. > So if you're curious, you can try it here: > > http://www.biodalliance.org/human/ncbi36/ > > It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and > hopefully rather more coming soon), but has one major caveat: since > it's > pure Javascript code running in your web browser, there are > limitations to > which servers it can connect to. Specifically, it will only work > with DAS > servers that implement the W3C cross-origin resource sharing model > (which > has been discussed on this list before, but drop me a line if you've > got any > questions). What does this mean in practice? If you're adding > datasources > from the registry, things are simple because Dalliance will only > allow you > to add CORS-enabled sources (a huge thanks to Jonathan Warren for > adding > some support for this in the registry). If you run your own DAS > servers and > don't list them in the registry, you'll need to check for CORS > compatibility > yourself. The latest versions of Proserver and Dazzle should both > be okay. > > All comments, suggestions, and bug reports are welcome! > > Thomas Down. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From lincoln.stein at gmail.com Tue Aug 3 17:21:12 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Tue, 3 Aug 2010 17:21:12 -0400 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> Message-ID: Someone give me a quick summary of CORS support. I want to make sure that GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, or something new?) Lincoln On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren wrote: > This is very cool - I had a look the other day. Was wondering why some > sources could be attached and some can't.... > Best browser experience yet by far I'd say. > > No problems about adding CORS support - for the record I'm very happy to > implement new capabilities testing and other suggestions to the registry > from anyone who cares to drop me a line. Especially if it's going to enhance > and promote the use of the registry :) > > > > On 3 Aug 2010, at 20:41, Thomas Down wrote: > > As some of you already know, I've been experimenting recently with a >> web-based DAS client for genomic data. It's still in >> a unashamedly prototypical state (in particular, some of the popups and >> configuration stuff is outright clunky, and we know it!), but we're >> starting >> to find it quite useful, and would be interested to receive more feedback. >> So if you're curious, you can try it here: >> >> http://www.biodalliance.org/human/ncbi36/ >> >> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >> hopefully rather more coming soon), but has one major caveat: since it's >> pure Javascript code running in your web browser, there are limitations to >> which servers it can connect to. Specifically, it will only work with DAS >> servers that implement the W3C cross-origin resource sharing model (which >> has been discussed on this list before, but drop me a line if you've got >> any >> questions). What does this mean in practice? If you're adding >> datasources >> from the registry, things are simple because Dalliance will only allow you >> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding >> some support for this in the registry). If you run your own DAS servers >> and >> don't list them in the registry, you'll need to check for CORS >> compatibility >> yourself. The latest versions of Proserver and Dazzle should both be >> okay. >> >> All comments, suggestions, and bug reports are welcome! >> >> Thomas Down. >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das >> > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > > -- > The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, > a charity registered in England with number 1021457 and acompany registered > in England with number 2742969, whose registeredoffice is 215 Euston Road, > London, NW1 2BE._______________________________________________ > > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > -- Lincoln D. Stein Director, Informatics and Biocomputing Platform Ontario Institute for Cancer Research 101 College St., Suite 800 Toronto, ON, Canada M5G0A3 416 673-8514 Assistant: Renata Musa From thomas.a.down at gmail.com Tue Aug 3 17:28:40 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Tue, 3 Aug 2010 22:28:40 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> Message-ID: Jonathan's written a nice summary here: http://biodasman.wordpress.com/2010/07/20/cors/ But briefly... it's the "official" way to work around the same-origin policy (by default, browsers only allow unsigned javascript to trigger HTTP requests to the server from which it was originally downloaded). The specification is here: http://www.w3.org/TR/cors/ (Please don't be too alarmed by the datestamp! The core parts have been stable for > a year now, and it's well supported by Mozilla, WebKit, and -- via a slightly different API -- Internet Explorer). If you're running a public server and want it to be CORS accessible, all that is needed is to emit the header: Access-Control-Allow-Origin: * ...and you're done. (If you're running password-protected DAS servers, or DAS servers hosting sensitive information behind a firewall, you might want a slightly more sophisticated CORS implementation. Happy to discuss if anyone is interested). Thomas. On Tue, Aug 3, 2010 at 10:21 PM, Lincoln Stein wrote: > Someone give me a quick summary of CORS support. I want to make sure that > GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, or > something new?) > > Lincoln > > On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren wrote: > >> This is very cool - I had a look the other day. Was wondering why some >> sources could be attached and some can't.... >> Best browser experience yet by far I'd say. >> >> No problems about adding CORS support - for the record I'm very happy to >> implement new capabilities testing and other suggestions to the registry >> from anyone who cares to drop me a line. Especially if it's going to enhance >> and promote the use of the registry :) >> >> >> >> On 3 Aug 2010, at 20:41, Thomas Down wrote: >> >> As some of you already know, I've been experimenting recently with a >>> web-based DAS client for genomic data. It's still in >>> a unashamedly prototypical state (in particular, some of the popups and >>> configuration stuff is outright clunky, and we know it!), but we're >>> starting >>> to find it quite useful, and would be interested to receive more >>> feedback. >>> So if you're curious, you can try it here: >>> >>> http://www.biodalliance.org/human/ncbi36/ >>> >>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>> hopefully rather more coming soon), but has one major caveat: since it's >>> pure Javascript code running in your web browser, there are limitations >>> to >>> which servers it can connect to. Specifically, it will only work with >>> DAS >>> servers that implement the W3C cross-origin resource sharing model (which >>> has been discussed on this list before, but drop me a line if you've got >>> any >>> questions). What does this mean in practice? If you're adding >>> datasources >>> from the registry, things are simple because Dalliance will only allow >>> you >>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding >>> some support for this in the registry). If you run your own DAS servers >>> and >>> don't list them in the registry, you'll need to check for CORS >>> compatibility >>> yourself. The latest versions of Proserver and Dazzle should both be >>> okay. >>> >>> All comments, suggestions, and bug reports are welcome! >>> >>> Thomas Down. >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >>> >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> >> -- >> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, >> a charity registered in England with number 1021457 and acompany registered >> in England with number 2742969, whose registeredoffice is 215 Euston Road, >> London, NW1 2BE._______________________________________________ >> >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das >> > > > > -- > Lincoln D. Stein > Director, Informatics and Biocomputing Platform > Ontario Institute for Cancer Research > 101 College St., Suite 800 > Toronto, ON, Canada M5G0A3 > 416 673-8514 > Assistant: Renata Musa > From ljgarcia at ebi.ac.uk Wed Aug 4 10:03:41 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Wed, 04 Aug 2010 15:03:41 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> Message-ID: <4C59733D.4090304@ebi.ac.uk> About CORS, >If you run your own DAS servers and >don't list them in the registry, you'll need to check for CORS compatibility >yourself. The latest versions of Proserver and Dazzle should both be okay. If I am not mistaken, Proserver already implements CORS headers? Andy, could you please send me a link of a Proserver server that implements this? Thanks, Leyla On 03/08/2010 21:52, Jonathan Warren wrote: > This is very cool - I had a look the other day. Was wondering why some > sources could be attached and some can't.... > Best browser experience yet by far I'd say. > > No problems about adding CORS support - for the record I'm very happy > to implement new capabilities testing and other suggestions to the > registry from anyone who cares to drop me a line. Especially if it's > going to enhance and promote the use of the registry :) > > > On 3 Aug 2010, at 20:41, Thomas Down wrote: > >> As some of you already know, I've been experimenting recently with a >> web-based DAS client for genomic data. It's still in >> a unashamedly prototypical state (in particular, some of the popups and >> configuration stuff is outright clunky, and we know it!), but we're >> starting >> to find it quite useful, and would be interested to receive more >> feedback. >> So if you're curious, you can try it here: >> >> http://www.biodalliance.org/human/ncbi36/ >> >> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >> hopefully rather more coming soon), but has one major caveat: since it's >> pure Javascript code running in your web browser, there are >> limitations to >> which servers it can connect to. Specifically, it will only work >> with DAS >> servers that implement the W3C cross-origin resource sharing model >> (which >> has been discussed on this list before, but drop me a line if you've >> got any >> questions). What does this mean in practice? If you're adding >> datasources >> from the registry, things are simple because Dalliance will only >> allow you >> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding >> some support for this in the registry). If you run your own DAS >> servers and >> don't list them in the registry, you'll need to check for CORS >> compatibility >> yourself. The latest versions of Proserver and Dazzle should both be >> okay. >> >> All comments, suggestions, and bug reports are welcome! >> >> Thomas Down. >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > From jw12 at sanger.ac.uk Wed Aug 4 10:51:06 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Wed, 4 Aug 2010 15:51:06 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <4C59733D.4090304@ebi.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> Message-ID: This url list 150 das sources that implement cors - see traffic light on the end: http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find You can check the headers responses for any of their valid responses. If you want some java code that adds this functionality to MyDAS I can send it to you... On 4 Aug 2010, at 15:03, Leyla Garcia wrote: > About CORS, > > >If you run your own DAS servers and > >don't list them in the registry, you'll need to check for CORS > compatibility > >yourself. The latest versions of Proserver and Dazzle should both > be okay. > > If I am not mistaken, Proserver already implements CORS headers? > Andy, could you please send me a link of a Proserver server that > implements this? > > Thanks, > > Leyla > > On 03/08/2010 21:52, Jonathan Warren wrote: >> This is very cool - I had a look the other day. Was wondering why >> some sources could be attached and some can't.... >> Best browser experience yet by far I'd say. >> >> No problems about adding CORS support - for the record I'm very >> happy to implement new capabilities testing and other suggestions >> to the registry from anyone who cares to drop me a line. Especially >> if it's going to enhance and promote the use of the registry :) >> >> >> On 3 Aug 2010, at 20:41, Thomas Down wrote: >> >>> As some of you already know, I've been experimenting recently with a >>> web-based DAS client for genomic data. It's still in >>> a unashamedly prototypical state (in particular, some of the >>> popups and >>> configuration stuff is outright clunky, and we know it!), but >>> we're starting >>> to find it quite useful, and would be interested to receive more >>> feedback. >>> So if you're curious, you can try it here: >>> >>> http://www.biodalliance.org/human/ncbi36/ >>> >>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, >>> and >>> hopefully rather more coming soon), but has one major caveat: >>> since it's >>> pure Javascript code running in your web browser, there are >>> limitations to >>> which servers it can connect to. Specifically, it will only work >>> with DAS >>> servers that implement the W3C cross-origin resource sharing model >>> (which >>> has been discussed on this list before, but drop me a line if >>> you've got any >>> questions). What does this mean in practice? If you're adding >>> datasources >>> from the registry, things are simple because Dalliance will only >>> allow you >>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for >>> adding >>> some support for this in the registry). If you run your own DAS >>> servers and >>> don't list them in the registry, you'll need to check for CORS >>> compatibility >>> yourself. The latest versions of Proserver and Dazzle should both >>> be okay. >>> >>> All comments, suggestions, and bug reports are welcome! >>> >>> Thomas Down. >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From andy.jenkinson at ebi.ac.uk Wed Aug 4 10:53:05 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Wed, 4 Aug 2010 15:53:05 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <4C59733D.4090304@ebi.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> Message-ID: <6A5CF73C-4C54-4A45-B0C3-3D589C1C732D@ebi.ac.uk> Sure: http://www.ebi.ac.uk/das-srv/genomicdas/das/sources The implementation in ProServer is a bit more advanced in that it deals with preflight (HTTP "options") requests. These are necessary if the client sends custom headers (err, like X-DAS-Client in DAS 1.6) and non-simple requests (such as PUT). Cross-origin writeback would require preflight request support, for example. By the way, I just looked at the CORS spec and have made some changes to ProServer's implementation to support the 'expose headers' portion of the spec. Either this wasn't there last year or I missed it. I think this is necessary before the browser will get access to the custom DAS headers such as X-DAS-Status. Here's some code: if ($request->method eq 'options') { # For W3C Access Control preflight requests $response->header('Access-Control-Allow-Methods' => 'GET, POST, OPTIONS'); $response->header('Access-Control-Allow-Headers' => 'X-DAS-Version, X-DAS-Client'); $response->header('Access-Control-Max-Age', => '2592000'); # preflight can be cached for 30 days } # Add Access Control headers (for cross site requests): $response->header('Access-Control-Allow-Origin' => $request->header('Origin') || '*'); $response->header('Access-Control-Expose-Headers' => 'X-DAS-Version, X-DAS-Server, X-DAS-Status, X-DAS-Capabilities'); $response->header('Access-Control-Allow-Credentials' => 'true'); Cheers, Andy On 4 Aug 2010, at 15:03, Leyla Garcia wrote: > About CORS, > > >If you run your own DAS servers and > >don't list them in the registry, you'll need to check for CORS compatibility > >yourself. The latest versions of Proserver and Dazzle should both be okay. > > If I am not mistaken, Proserver already implements CORS headers? > Andy, could you please send me a link of a Proserver server that implements this? > > Thanks, > > Leyla > > On 03/08/2010 21:52, Jonathan Warren wrote: >> This is very cool - I had a look the other day. Was wondering why some sources could be attached and some can't.... >> Best browser experience yet by far I'd say. >> >> No problems about adding CORS support - for the record I'm very happy to implement new capabilities testing and other suggestions to the registry from anyone who cares to drop me a line. Especially if it's going to enhance and promote the use of the registry :) >> >> >> On 3 Aug 2010, at 20:41, Thomas Down wrote: >> >>> As some of you already know, I've been experimenting recently with a >>> web-based DAS client for genomic data. It's still in >>> a unashamedly prototypical state (in particular, some of the popups and >>> configuration stuff is outright clunky, and we know it!), but we're starting >>> to find it quite useful, and would be interested to receive more feedback. >>> So if you're curious, you can try it here: >>> >>> http://www.biodalliance.org/human/ncbi36/ >>> >>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>> hopefully rather more coming soon), but has one major caveat: since it's >>> pure Javascript code running in your web browser, there are limitations to >>> which servers it can connect to. Specifically, it will only work with DAS >>> servers that implement the W3C cross-origin resource sharing model (which >>> has been discussed on this list before, but drop me a line if you've got any >>> questions). What does this mean in practice? If you're adding datasources >>> from the registry, things are simple because Dalliance will only allow you >>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding >>> some support for this in the registry). If you run your own DAS servers and >>> don't list them in the registry, you'll need to check for CORS compatibility >>> yourself. The latest versions of Proserver and Dazzle should both be okay. >>> >>> All comments, suggestions, and bug reports are welcome! >>> >>> Thomas Down. >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From jw12 at sanger.ac.uk Wed Aug 4 10:54:04 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Wed, 4 Aug 2010 15:54:04 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> Message-ID: resp.setHeader("Access-Control-Allow-Origin", "*"); On 4 Aug 2010, at 15:51, Jonathan Warren wrote: > This url list 150 das sources that implement cors - see traffic > light on the end: > > http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find > > You can check the headers responses for any of their valid responses. > > If you want some java code that adds this functionality to MyDAS I > can send it to you... > > On 4 Aug 2010, at 15:03, Leyla Garcia wrote: > >> About CORS, >> >> >If you run your own DAS servers and >> >don't list them in the registry, you'll need to check for CORS >> compatibility >> >yourself. The latest versions of Proserver and Dazzle should both >> be okay. >> >> If I am not mistaken, Proserver already implements CORS headers? >> Andy, could you please send me a link of a Proserver server that >> implements this? >> >> Thanks, >> >> Leyla >> >> On 03/08/2010 21:52, Jonathan Warren wrote: >>> This is very cool - I had a look the other day. Was wondering why >>> some sources could be attached and some can't.... >>> Best browser experience yet by far I'd say. >>> >>> No problems about adding CORS support - for the record I'm very >>> happy to implement new capabilities testing and other suggestions >>> to the registry from anyone who cares to drop me a line. >>> Especially if it's going to enhance and promote the use of the >>> registry :) >>> >>> >>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>> >>>> As some of you already know, I've been experimenting recently >>>> with a >>>> web-based DAS client for genomic data. It's still in >>>> a unashamedly prototypical state (in particular, some of the >>>> popups and >>>> configuration stuff is outright clunky, and we know it!), but >>>> we're starting >>>> to find it quite useful, and would be interested to receive more >>>> feedback. >>>> So if you're curious, you can try it here: >>>> >>>> http://www.biodalliance.org/human/ncbi36/ >>>> >>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, >>>> and >>>> hopefully rather more coming soon), but has one major caveat: >>>> since it's >>>> pure Javascript code running in your web browser, there are >>>> limitations to >>>> which servers it can connect to. Specifically, it will only work >>>> with DAS >>>> servers that implement the W3C cross-origin resource sharing >>>> model (which >>>> has been discussed on this list before, but drop me a line if >>>> you've got any >>>> questions). What does this mean in practice? If you're adding >>>> datasources >>>> from the registry, things are simple because Dalliance will only >>>> allow you >>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for >>>> adding >>>> some support for this in the registry). If you run your own DAS >>>> servers and >>>> don't list them in the registry, you'll need to check for CORS >>>> compatibility >>>> yourself. The latest versions of Proserver and Dazzle should >>>> both be okay. >>>> >>>> All comments, suggestions, and bug reports are welcome! >>>> >>>> Thomas Down. >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> >> >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From andy.jenkinson at ebi.ac.uk Wed Aug 4 10:55:55 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Wed, 4 Aug 2010 15:55:55 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> Message-ID: If you want the application to be able to access the 1.53 X-DAS headers I think you also need: Access-Control-Expose-Headers: X-DAS-Version, X-DAS-Server, X-DAS-Status, X-DAS-Capabilities (see my previous mail to the list). In DAS 1.6, there are some client headers too, which would require handling of the CORS preflight request (again, see my previous mail). On 3 Aug 2010, at 22:28, Thomas Down wrote: > Jonathan's written a nice summary here: > > http://biodasman.wordpress.com/2010/07/20/cors/ > > But briefly... it's the "official" way to work around the same-origin > policy (by default, browsers only allow unsigned javascript to trigger HTTP > requests to the server from which it was originally downloaded). The > specification is here: > > http://www.w3.org/TR/cors/ > > (Please don't be too alarmed by the datestamp! The core parts have been > stable for > a year now, and it's well supported by Mozilla, WebKit, and -- > via a slightly different API -- Internet Explorer). > > If you're running a public server and want it to be CORS accessible, all > that is needed is to emit the header: > > Access-Control-Allow-Origin: * > > ...and you're done. > > (If you're running password-protected DAS servers, or DAS servers hosting > sensitive information behind a firewall, you might want a slightly more > sophisticated CORS implementation. Happy to discuss if anyone is > interested). > > Thomas. > > On Tue, Aug 3, 2010 at 10:21 PM, Lincoln Stein wrote: > >> Someone give me a quick summary of CORS support. I want to make sure that >> GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, or >> something new?) >> >> Lincoln >> >> On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren wrote: >> >>> This is very cool - I had a look the other day. Was wondering why some >>> sources could be attached and some can't.... >>> Best browser experience yet by far I'd say. >>> >>> No problems about adding CORS support - for the record I'm very happy to >>> implement new capabilities testing and other suggestions to the registry >>> from anyone who cares to drop me a line. Especially if it's going to enhance >>> and promote the use of the registry :) >>> >>> >>> >>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>> >>> As some of you already know, I've been experimenting recently with a >>>> web-based DAS client for genomic data. It's still in >>>> a unashamedly prototypical state (in particular, some of the popups and >>>> configuration stuff is outright clunky, and we know it!), but we're >>>> starting >>>> to find it quite useful, and would be interested to receive more >>>> feedback. >>>> So if you're curious, you can try it here: >>>> >>>> http://www.biodalliance.org/human/ncbi36/ >>>> >>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>>> hopefully rather more coming soon), but has one major caveat: since it's >>>> pure Javascript code running in your web browser, there are limitations >>>> to >>>> which servers it can connect to. Specifically, it will only work with >>>> DAS >>>> servers that implement the W3C cross-origin resource sharing model (which >>>> has been discussed on this list before, but drop me a line if you've got >>>> any >>>> questions). What does this mean in practice? If you're adding >>>> datasources >>>> from the registry, things are simple because Dalliance will only allow >>>> you >>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding >>>> some support for this in the registry). If you run your own DAS servers >>>> and >>>> don't list them in the registry, you'll need to check for CORS >>>> compatibility >>>> yourself. The latest versions of Proserver and Dazzle should both be >>>> okay. >>>> >>>> All comments, suggestions, and bug reports are welcome! >>>> >>>> Thomas Down. >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> -- >>> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, >>> a charity registered in England with number 1021457 and acompany registered >>> in England with number 2742969, whose registeredoffice is 215 Euston Road, >>> London, NW1 2BE._______________________________________________ >>> >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >>> >> >> >> >> -- >> Lincoln D. Stein >> Director, Informatics and Biocomputing Platform >> Ontario Institute for Cancer Research >> 101 College St., Suite 800 >> Toronto, ON, Canada M5G0A3 >> 416 673-8514 >> Assistant: Renata Musa >> > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From andy.jenkinson at ebi.ac.uk Wed Aug 4 10:58:45 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Wed, 4 Aug 2010 15:58:45 +0100 Subject: [DAS] CORS In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> Message-ID: <3378D8DA-614B-4D56-93AD-024DAFF7F61D@ebi.ac.uk> Since this seems to have been given the thumbs up, shall we make CORS support mandatory from 1.6 onwards? I suggested this when it first came up last year, but I got no replies so didn't put it in the spec. I suspect because it was in the middle of a flurry of emails about "maxbins" :) On 3 Aug 2010, at 22:28, Thomas Down wrote: > Jonathan's written a nice summary here: > > http://biodasman.wordpress.com/2010/07/20/cors/ > > But briefly... it's the "official" way to work around the same-origin > policy (by default, browsers only allow unsigned javascript to trigger HTTP > requests to the server from which it was originally downloaded). The > specification is here: > > http://www.w3.org/TR/cors/ > > (Please don't be too alarmed by the datestamp! The core parts have been > stable for > a year now, and it's well supported by Mozilla, WebKit, and -- > via a slightly different API -- Internet Explorer). > > If you're running a public server and want it to be CORS accessible, all > that is needed is to emit the header: > > Access-Control-Allow-Origin: * > > ...and you're done. > > (If you're running password-protected DAS servers, or DAS servers hosting > sensitive information behind a firewall, you might want a slightly more > sophisticated CORS implementation. Happy to discuss if anyone is > interested). > > Thomas. > > On Tue, Aug 3, 2010 at 10:21 PM, Lincoln Stein wrote: > >> Someone give me a quick summary of CORS support. I want to make sure that >> GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, or >> something new?) >> >> Lincoln >> >> On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren wrote: >> >>> This is very cool - I had a look the other day. Was wondering why some >>> sources could be attached and some can't.... >>> Best browser experience yet by far I'd say. >>> >>> No problems about adding CORS support - for the record I'm very happy to >>> implement new capabilities testing and other suggestions to the registry >>> from anyone who cares to drop me a line. Especially if it's going to enhance >>> and promote the use of the registry :) >>> >>> >>> >>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>> >>> As some of you already know, I've been experimenting recently with a >>>> web-based DAS client for genomic data. It's still in >>>> a unashamedly prototypical state (in particular, some of the popups and >>>> configuration stuff is outright clunky, and we know it!), but we're >>>> starting >>>> to find it quite useful, and would be interested to receive more >>>> feedback. >>>> So if you're curious, you can try it here: >>>> >>>> http://www.biodalliance.org/human/ncbi36/ >>>> >>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>>> hopefully rather more coming soon), but has one major caveat: since it's >>>> pure Javascript code running in your web browser, there are limitations >>>> to >>>> which servers it can connect to. Specifically, it will only work with >>>> DAS >>>> servers that implement the W3C cross-origin resource sharing model (which >>>> has been discussed on this list before, but drop me a line if you've got >>>> any >>>> questions). What does this mean in practice? If you're adding >>>> datasources >>>> from the registry, things are simple because Dalliance will only allow >>>> you >>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding >>>> some support for this in the registry). If you run your own DAS servers >>>> and >>>> don't list them in the registry, you'll need to check for CORS >>>> compatibility >>>> yourself. The latest versions of Proserver and Dazzle should both be >>>> okay. >>>> >>>> All comments, suggestions, and bug reports are welcome! >>>> >>>> Thomas Down. >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> -- >>> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, >>> a charity registered in England with number 1021457 and acompany registered >>> in England with number 2742969, whose registeredoffice is 215 Euston Road, >>> London, NW1 2BE._______________________________________________ >>> >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >>> >> >> >> >> -- >> Lincoln D. Stein >> Director, Informatics and Biocomputing Platform >> Ontario Institute for Cancer Research >> 101 College St., Suite 800 >> Toronto, ON, Canada M5G0A3 >> 416 673-8514 >> Assistant: Renata Musa >> > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From thomas.a.down at gmail.com Wed Aug 4 10:59:00 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 4 Aug 2010 15:59:00 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <6A5CF73C-4C54-4A45-B0C3-3D589C1C732D@ebi.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <6A5CF73C-4C54-4A45-B0C3-3D589C1C732D@ebi.ac.uk> Message-ID: On Wed, Aug 4, 2010 at 3:53 PM, Andy Jenkinson wrote: > Sure: > http://www.ebi.ac.uk/das-srv/genomicdas/das/sources > > The implementation in ProServer is a bit more advanced in that it deals > with preflight (HTTP "options") requests. These are necessary if the client > sends custom headers (err, like X-DAS-Client in DAS 1.6) and non-simple > requests (such as PUT). Cross-origin writeback would require preflight > request support, for example. > > By the way, I just looked at the CORS spec and have made some changes to > ProServer's implementation to support the 'expose headers' portion of the > spec. Either this wasn't there last year or I missed it. I think this is > necessary before the browser will get access to the custom DAS headers such > as X-DAS-Status. > Yes, Access-Control-Expose-Headers is a new addition in the latest draft -- and, as you say, kind-of useful for DAS (specifically: we'll be able to implement zooming rather more efficiently in Dalliance once we can detect which sources actually implement maxbins -- which is currently a problem). Unfortunately, the client support for this bit isn't there yet (see, for instance, https://bugs.webkit.org/show_bug.cgi?id=41210), but will hopefully appear soon! Thomas. From ljgarcia at ebi.ac.uk Wed Aug 4 11:03:33 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Wed, 04 Aug 2010 16:03:33 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> Message-ID: <4C598145.5090007@ebi.ac.uk> Thanks Andy and Jonathan, The CORS header is already in MyDAS and it will be available for the next release, which will be soon :) I will check the specification about Access-Control-Expose-Headers Jonathan, I am checking the response headers with an add-on in Firefox. I tried the first URL in the dasregistry link you send me, i.e. http://cathdb.info:9000/das/cath_pdb/features?segment=5pti but I do not see the Access-Control-Allow-Origin header there. So I am not sure what I missing... Leyla On 04/08/2010 15:54, Jonathan Warren wrote: > resp.setHeader("Access-Control-Allow-Origin", "*"); > > On 4 Aug 2010, at 15:51, Jonathan Warren wrote: > >> This url list 150 das sources that implement cors - see traffic light >> on the end: >> >> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >> >> >> You can check the headers responses for any of their valid responses. >> >> If you want some java code that adds this functionality to MyDAS I >> can send it to you... >> >> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >> >>> About CORS, >>> >>> >If you run your own DAS servers and >>> >don't list them in the registry, you'll need to check for CORS >>> compatibility >>> >yourself. The latest versions of Proserver and Dazzle should both >>> be okay. >>> >>> If I am not mistaken, Proserver already implements CORS headers? >>> Andy, could you please send me a link of a Proserver server that >>> implements this? >>> >>> Thanks, >>> >>> Leyla >>> >>> On 03/08/2010 21:52, Jonathan Warren wrote: >>>> This is very cool - I had a look the other day. Was wondering why >>>> some sources could be attached and some can't.... >>>> Best browser experience yet by far I'd say. >>>> >>>> No problems about adding CORS support - for the record I'm very >>>> happy to implement new capabilities testing and other suggestions >>>> to the registry from anyone who cares to drop me a line. Especially >>>> if it's going to enhance and promote the use of the registry :) >>>> >>>> >>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>>> >>>>> As some of you already know, I've been experimenting recently with a >>>>> web-based DAS client for genomic data. It's still in >>>>> a unashamedly prototypical state (in particular, some of the >>>>> popups and >>>>> configuration stuff is outright clunky, and we know it!), but >>>>> we're starting >>>>> to find it quite useful, and would be interested to receive more >>>>> feedback. >>>>> So if you're curious, you can try it here: >>>>> >>>>> http://www.biodalliance.org/human/ncbi36/ >>>>> >>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>>>> hopefully rather more coming soon), but has one major caveat: >>>>> since it's >>>>> pure Javascript code running in your web browser, there are >>>>> limitations to >>>>> which servers it can connect to. Specifically, it will only work >>>>> with DAS >>>>> servers that implement the W3C cross-origin resource sharing model >>>>> (which >>>>> has been discussed on this list before, but drop me a line if >>>>> you've got any >>>>> questions). What does this mean in practice? If you're adding >>>>> datasources >>>>> from the registry, things are simple because Dalliance will only >>>>> allow you >>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for >>>>> adding >>>>> some support for this in the registry). If you run your own DAS >>>>> servers and >>>>> don't list them in the registry, you'll need to check for CORS >>>>> compatibility >>>>> yourself. The latest versions of Proserver and Dazzle should both >>>>> be okay. >>>>> >>>>> All comments, suggestions, and bug reports are welcome! >>>>> >>>>> Thomas Down. >>>>> _______________________________________________ >>>>> DAS mailing list >>>>> DAS at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>>> Jonathan Warren >>>> Senior Developer and DAS coordinator >>>> blog: http://biodasman.wordpress.com/ >>>> jw12 at sanger.ac.uk >>>> Ext: 2314 >>>> Telephone: 01223 492314 >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>> >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > -- The Wellcome Trust Sanger Institute is operated by Genome Research > Limited, a charity registered in England with number 1021457 and a > company registered in England with number 2742969, whose registered > office is 215 Euston Road, London, NW1 2BE. From andy.jenkinson at ebi.ac.uk Wed Aug 4 11:06:27 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Wed, 4 Aug 2010 16:06:27 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <6A5CF73C-4C54-4A45-B0C3-3D589C1C732D@ebi.ac.uk> Message-ID: On 4 Aug 2010, at 15:59, Thomas Down wrote: > > Yes, Access-Control-Expose-Headers is a new addition in the latest draft -- and, as you say, kind-of useful for DAS (specifically: we'll be able to implement zooming rather more efficiently in Dalliance once we can detect which sources actually implement maxbins -- which is currently a problem). > > Unfortunately, the client support for this bit isn't there yet (see, for instance, https://bugs.webkit.org/show_bug.cgi?id=41210), but will hopefully appear soon! > > Thomas. OK, so clients (or safari/chrome at least) strip the headers from the response at the moment. There's always the sources command... if it's listed in X-DAS-Capabilities, it should be listed in the sources command ;) From jw12 at sanger.ac.uk Wed Aug 4 11:09:43 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Wed, 4 Aug 2010 16:09:43 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <4C598145.5090007@ebi.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> Message-ID: On 4 Aug 2010, at 16:03, Leyla Garcia wrote: > Thanks Andy and Jonathan, > > The CORS header is already in MyDAS and it will be available for the > next release, which will be soon :) > I will check the specification about Access-Control-Expose-Headers > Great stuff. > Jonathan, I am checking the response headers with an add-on in > Firefox. I tried the first URL in the dasregistry link you send me, > i.e. http://cathdb.info:9000/das/cath_pdb/features?segment=5pti > but I do not see the Access-Control-Allow-Origin header there. So I > am not sure what I missing... > Thats odd - the first in the list is "structure" source and is run by the sanger which has cors in the header ( I just checked using the firefox plugin)- did you use the whole url I sent? > Leyla > > > On 04/08/2010 15:54, Jonathan Warren wrote: >> >> resp.setHeader("Access-Control-Allow-Origin", "*"); >> >> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: >> >>> This url list 150 das sources that implement cors - see traffic >>> light on the end: >>> >>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >>> >>> You can check the headers responses for any of their valid >>> responses. >>> >>> If you want some java code that adds this functionality to MyDAS >>> I can send it to you... >>> >>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >>> >>>> About CORS, >>>> >>>> >If you run your own DAS servers and >>>> >don't list them in the registry, you'll need to check for CORS >>>> compatibility >>>> >yourself. The latest versions of Proserver and Dazzle should >>>> both be okay. >>>> >>>> If I am not mistaken, Proserver already implements CORS headers? >>>> Andy, could you please send me a link of a Proserver server that >>>> implements this? >>>> >>>> Thanks, >>>> >>>> Leyla >>>> >>>> On 03/08/2010 21:52, Jonathan Warren wrote: >>>>> This is very cool - I had a look the other day. Was wondering >>>>> why some sources could be attached and some can't.... >>>>> Best browser experience yet by far I'd say. >>>>> >>>>> No problems about adding CORS support - for the record I'm very >>>>> happy to implement new capabilities testing and other >>>>> suggestions to the registry from anyone who cares to drop me a >>>>> line. Especially if it's going to enhance and promote the use of >>>>> the registry :) >>>>> >>>>> >>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>>>> >>>>>> As some of you already know, I've been experimenting recently >>>>>> with a >>>>>> web-based DAS client for genomic data. It's still in >>>>>> a unashamedly prototypical state (in particular, some of the >>>>>> popups and >>>>>> configuration stuff is outright clunky, and we know it!), but >>>>>> we're starting >>>>>> to find it quite useful, and would be interested to receive >>>>>> more feedback. >>>>>> So if you're curious, you can try it here: >>>>>> >>>>>> http://www.biodalliance.org/human/ncbi36/ >>>>>> >>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/ >>>>>> 1.6, and >>>>>> hopefully rather more coming soon), but has one major caveat: >>>>>> since it's >>>>>> pure Javascript code running in your web browser, there are >>>>>> limitations to >>>>>> which servers it can connect to. Specifically, it will only >>>>>> work with DAS >>>>>> servers that implement the W3C cross-origin resource sharing >>>>>> model (which >>>>>> has been discussed on this list before, but drop me a line if >>>>>> you've got any >>>>>> questions). What does this mean in practice? If you're adding >>>>>> datasources >>>>>> from the registry, things are simple because Dalliance will >>>>>> only allow you >>>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren >>>>>> for adding >>>>>> some support for this in the registry). If you run your own >>>>>> DAS servers and >>>>>> don't list them in the registry, you'll need to check for CORS >>>>>> compatibility >>>>>> yourself. The latest versions of Proserver and Dazzle should >>>>>> both be okay. >>>>>> >>>>>> All comments, suggestions, and bug reports are welcome! >>>>>> >>>>>> Thomas Down. >>>>>> _______________________________________________ >>>>>> DAS mailing list >>>>>> DAS at lists.open-bio.org >>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>> >>>>> Jonathan Warren >>>>> Senior Developer and DAS coordinator >>>>> blog: http://biodasman.wordpress.com/ >>>>> jw12 at sanger.ac.uk >>>>> Ext: 2314 >>>>> Telephone: 01223 492314 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> -- The Wellcome Trust Sanger Institute is operated by Genome >> Research Limited, a charity registered in England with number >> 1021457 and a company registered in England with number 2742969, >> whose registered office is 215 Euston Road, London, NW1 2BE. > Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From ljgarcia at ebi.ac.uk Wed Aug 4 11:14:58 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Wed, 04 Aug 2010 16:14:58 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> Message-ID: <4C5983F2.7030803@ebi.ac.uk> Sorry, I do not know what link I used but I tried again and CORS headers are there Thanks On 04/08/2010 16:09, Jonathan Warren wrote: > > On 4 Aug 2010, at 16:03, Leyla Garcia wrote: > >> Thanks Andy and Jonathan, >> >> The CORS header is already in MyDAS and it will be available for the >> next release, which will be soon :) >> I will check the specification about Access-Control-Expose-Headers >> > Great stuff. > >> Jonathan, I am checking the response headers with an add-on in >> Firefox. I tried the first URL in the dasregistry link you send me, >> i.e. http://cathdb.info:9000/das/cath_pdb/features?segment=5pti >> but I do not see the Access-Control-Allow-Origin header there. So I >> am not sure what I missing... >> > > Thats odd - the first in the list is "structure" source and is run by > the sanger which has cors in the header ( I just checked using the > firefox plugin)- did you use the whole url I sent? > >> Leyla >> >> >> On 04/08/2010 15:54, Jonathan Warren wrote: >>> resp.setHeader("Access-Control-Allow-Origin", "*"); >>> >>> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: >>> >>>> This url list 150 das sources that implement cors - see traffic >>>> light on the end: >>>> >>>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >>>> >>>> >>>> You can check the headers responses for any of their valid responses. >>>> >>>> If you want some java code that adds this functionality to MyDAS I >>>> can send it to you... >>>> >>>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >>>> >>>>> About CORS, >>>>> >>>>> >If you run your own DAS servers and >>>>> >don't list them in the registry, you'll need to check for CORS >>>>> compatibility >>>>> >yourself. The latest versions of Proserver and Dazzle should >>>>> both be okay. >>>>> >>>>> If I am not mistaken, Proserver already implements CORS headers? >>>>> Andy, could you please send me a link of a Proserver server that >>>>> implements this? >>>>> >>>>> Thanks, >>>>> >>>>> Leyla >>>>> >>>>> On 03/08/2010 21:52, Jonathan Warren wrote: >>>>>> This is very cool - I had a look the other day. Was wondering why >>>>>> some sources could be attached and some can't.... >>>>>> Best browser experience yet by far I'd say. >>>>>> >>>>>> No problems about adding CORS support - for the record I'm very >>>>>> happy to implement new capabilities testing and other suggestions >>>>>> to the registry from anyone who cares to drop me a line. >>>>>> Especially if it's going to enhance and promote the use of the >>>>>> registry :) >>>>>> >>>>>> >>>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>>>>> >>>>>>> As some of you already know, I've been experimenting recently with a >>>>>>> web-based DAS client for genomic data. It's still in >>>>>>> a unashamedly prototypical state (in particular, some of the >>>>>>> popups and >>>>>>> configuration stuff is outright clunky, and we know it!), but >>>>>>> we're starting >>>>>>> to find it quite useful, and would be interested to receive more >>>>>>> feedback. >>>>>>> So if you're curious, you can try it here: >>>>>>> >>>>>>> http://www.biodalliance.org/human/ncbi36/ >>>>>>> >>>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of >>>>>>> DAS/1.6, and >>>>>>> hopefully rather more coming soon), but has one major caveat: >>>>>>> since it's >>>>>>> pure Javascript code running in your web browser, there are >>>>>>> limitations to >>>>>>> which servers it can connect to. Specifically, it will only >>>>>>> work with DAS >>>>>>> servers that implement the W3C cross-origin resource sharing >>>>>>> model (which >>>>>>> has been discussed on this list before, but drop me a line if >>>>>>> you've got any >>>>>>> questions). What does this mean in practice? If you're adding >>>>>>> datasources >>>>>>> from the registry, things are simple because Dalliance will only >>>>>>> allow you >>>>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren >>>>>>> for adding >>>>>>> some support for this in the registry). If you run your own DAS >>>>>>> servers and >>>>>>> don't list them in the registry, you'll need to check for CORS >>>>>>> compatibility >>>>>>> yourself. The latest versions of Proserver and Dazzle should >>>>>>> both be okay. >>>>>>> >>>>>>> All comments, suggestions, and bug reports are welcome! >>>>>>> >>>>>>> Thomas Down. >>>>>>> _______________________________________________ >>>>>>> DAS mailing list >>>>>>> DAS at lists.open-bio.org >>>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>>> >>>>>> Jonathan Warren >>>>>> Senior Developer and DAS coordinator >>>>>> blog: http://biodasman.wordpress.com/ >>>>>> jw12 at sanger.ac.uk >>>>>> Ext: 2314 >>>>>> Telephone: 01223 492314 >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>> >>>>> _______________________________________________ >>>>> DAS mailing list >>>>> DAS at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>>> Jonathan Warren >>>> Senior Developer and DAS coordinator >>>> blog: http://biodasman.wordpress.com/ >>>> jw12 at sanger.ac.uk >>>> Ext: 2314 >>>> Telephone: 01223 492314 >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >>> -- The Wellcome Trust Sanger Institute is operated by Genome >>> Research Limited, a charity registered in England with number >>> 1021457 and a company registered in England with number 2742969, >>> whose registered office is 215 Euston Road, London, NW1 2BE. >> > > Jonathan Warren > Senior Developer and DAS coordinator > blog: http://biodasman.wordpress.com/ > jw12 at sanger.ac.uk > Ext: 2314 > Telephone: 01223 492314 > > > > > > > > > -- The Wellcome Trust Sanger Institute is operated by Genome Research > Limited, a charity registered in England with number 1021457 and a > company registered in England with number 2742969, whose registered > office is 215 Euston Road, London, NW1 2BE. From thomas.a.down at gmail.com Wed Aug 4 11:26:25 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 4 Aug 2010 16:26:25 +0100 Subject: [DAS] CORS In-Reply-To: <3378D8DA-614B-4D56-93AD-024DAFF7F61D@ebi.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <3378D8DA-614B-4D56-93AD-024DAFF7F61D@ebi.ac.uk> Message-ID: Despite being a strong CORS advocate (and not just for DAS -- it'll be beneficial for a whole raft of services), I'm actually a bit reluctant to make it mandatory without some rather careful though. Unrestricted CORS is, as far as I can tell, always appropriate for public DAS servers offering data to the community. It's probably also good for password-protected-by-publically-routable servers (although the implementation gets a wee bit more complex in that case). However, if you're running a DAS server behind a firewall, CORS does potentially open you to possible security issues which wouldn't otherwise be present. In the most security-conscious environments, people might want to just whitelist the origins of specific clients. How about including a link to the CORS spec and saying "implementation is strongly encouraged", or something like that? thomas. On Wed, Aug 4, 2010 at 3:58 PM, Andy Jenkinson wrote: > Since this seems to have been given the thumbs up, shall we make CORS > support mandatory from 1.6 onwards? > > I suggested this when it first came up last year, but I got no replies so > didn't put it in the spec. I suspect because it was in the middle of a > flurry of emails about "maxbins" :) > > On 3 Aug 2010, at 22:28, Thomas Down wrote: > > > Jonathan's written a nice summary here: > > > > http://biodasman.wordpress.com/2010/07/20/cors/ > > > > But briefly... it's the "official" way to work around the same-origin > > policy (by default, browsers only allow unsigned javascript to trigger > HTTP > > requests to the server from which it was originally downloaded). The > > specification is here: > > > > http://www.w3.org/TR/cors/ > > > > (Please don't be too alarmed by the datestamp! The core parts have been > > stable for > a year now, and it's well supported by Mozilla, WebKit, and > -- > > via a slightly different API -- Internet Explorer). > > > > If you're running a public server and want it to be CORS accessible, all > > that is needed is to emit the header: > > > > Access-Control-Allow-Origin: * > > > > ...and you're done. > > > > (If you're running password-protected DAS servers, or DAS servers hosting > > sensitive information behind a firewall, you might want a slightly more > > sophisticated CORS implementation. Happy to discuss if anyone is > > interested). > > > > Thomas. > > > > On Tue, Aug 3, 2010 at 10:21 PM, Lincoln Stein >wrote: > > > >> Someone give me a quick summary of CORS support. I want to make sure > that > >> GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, or > >> something new?) > >> > >> Lincoln > >> > >> On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren > wrote: > >> > >>> This is very cool - I had a look the other day. Was wondering why some > >>> sources could be attached and some can't.... > >>> Best browser experience yet by far I'd say. > >>> > >>> No problems about adding CORS support - for the record I'm very happy > to > >>> implement new capabilities testing and other suggestions to the > registry > >>> from anyone who cares to drop me a line. Especially if it's going to > enhance > >>> and promote the use of the registry :) > >>> > >>> > >>> > >>> On 3 Aug 2010, at 20:41, Thomas Down wrote: > >>> > >>> As some of you already know, I've been experimenting recently with a > >>>> web-based DAS client for genomic data. It's still in > >>>> a unashamedly prototypical state (in particular, some of the popups > and > >>>> configuration stuff is outright clunky, and we know it!), but we're > >>>> starting > >>>> to find it quite useful, and would be interested to receive more > >>>> feedback. > >>>> So if you're curious, you can try it here: > >>>> > >>>> http://www.biodalliance.org/human/ncbi36/ > >>>> > >>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and > >>>> hopefully rather more coming soon), but has one major caveat: since > it's > >>>> pure Javascript code running in your web browser, there are > limitations > >>>> to > >>>> which servers it can connect to. Specifically, it will only work with > >>>> DAS > >>>> servers that implement the W3C cross-origin resource sharing model > (which > >>>> has been discussed on this list before, but drop me a line if you've > got > >>>> any > >>>> questions). What does this mean in practice? If you're adding > >>>> datasources > >>>> from the registry, things are simple because Dalliance will only allow > >>>> you > >>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for > adding > >>>> some support for this in the registry). If you run your own DAS > servers > >>>> and > >>>> don't list them in the registry, you'll need to check for CORS > >>>> compatibility > >>>> yourself. The latest versions of Proserver and Dazzle should both be > >>>> okay. > >>>> > >>>> All comments, suggestions, and bug reports are welcome! > >>>> > >>>> Thomas Down. > >>>> _______________________________________________ > >>>> DAS mailing list > >>>> DAS at lists.open-bio.org > >>>> http://lists.open-bio.org/mailman/listinfo/das > >>>> > >>> > >>> Jonathan Warren > >>> Senior Developer and DAS coordinator > >>> blog: http://biodasman.wordpress.com/ > >>> jw12 at sanger.ac.uk > >>> Ext: 2314 > >>> Telephone: 01223 492314 > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> -- > >>> The Wellcome Trust Sanger Institute is operated by Genome > ResearchLimited, > >>> a charity registered in England with number 1021457 and acompany > registered > >>> in England with number 2742969, whose registeredoffice is 215 Euston > Road, > >>> London, NW1 2BE._______________________________________________ > >>> > >>> DAS mailing list > >>> DAS at lists.open-bio.org > >>> http://lists.open-bio.org/mailman/listinfo/das > >>> > >> > >> > >> > >> -- > >> Lincoln D. Stein > >> Director, Informatics and Biocomputing Platform > >> Ontario Institute for Cancer Research > >> 101 College St., Suite 800 > >> Toronto, ON, Canada M5G0A3 > >> 416 673-8514 > >> Assistant: Renata Musa > >> > > _______________________________________________ > > DAS mailing list > > DAS at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/das > > From thomas.a.down at gmail.com Wed Aug 4 11:36:49 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 4 Aug 2010 16:36:49 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: Message-ID: One word of warning: I've not done a lot of testing with old web browsers, so I'd strongly recommend using Firefox 3.6, Safari 4 or 5, or an up-to-date version of Google Chrome (Chrome auto-updates by default, so I doubt there are many old versions in the wild). Internet Explorer support *is* on the way, but isn't quite there yet, and will probably be limited to Internet Explorer 9 (previous versions don't have SVG graphics, which we're using extensively). Thomas. On Tue, Aug 3, 2010 at 8:41 PM, Thomas Down wrote: > As some of you already know, I've been experimenting recently with a > web-based DAS client for genomic data. It's still in > a unashamedly prototypical state (in particular, some of the popups and > configuration stuff is outright clunky, and we know it!), but we're starting > to find it quite useful, and would be interested to receive more feedback. > So if you're curious, you can try it here: > > http://www.biodalliance.org/human/ncbi36/ > > It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and > hopefully rather more coming soon), but has one major caveat: since it's > pure Javascript code running in your web browser, there are limitations to > which servers it can connect to. Specifically, it will only work with DAS > servers that implement the W3C cross-origin resource sharing model (which > has been discussed on this list before, but drop me a line if you've got any > questions). What does this mean in practice? If you're adding datasources > from the registry, things are simple because Dalliance will only allow you > to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding > some support for this in the registry). If you run your own DAS servers and > don't list them in the registry, you'll need to check for CORS compatibility > yourself. The latest versions of Proserver and Dazzle should both be okay. > > All comments, suggestions, and bug reports are welcome! > > Thomas Down. > From andy.jenkinson at ebi.ac.uk Wed Aug 4 12:17:50 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Wed, 4 Aug 2010 17:17:50 +0100 Subject: [DAS] CORS In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <3378D8DA-614B-4D56-93AD-024DAFF7F61D@ebi.ac.uk> Message-ID: Well we could specify that software must implement the procedure, and not actually require servers to accept requests from all origins? On 4 Aug 2010, at 16:26, Thomas Down wrote: > Despite being a strong CORS advocate (and not just for DAS -- it'll be beneficial for a whole raft of services), I'm actually a bit reluctant to make it mandatory without some rather careful though. > > Unrestricted CORS is, as far as I can tell, always appropriate for public DAS servers offering data to the community. It's probably also good for password-protected-by-publically-routable servers (although the implementation gets a wee bit more complex in that case). > > However, if you're running a DAS server behind a firewall, CORS does potentially open you to possible security issues which wouldn't otherwise be present. In the most security-conscious environments, people might want to just whitelist the origins of specific clients. > > How about including a link to the CORS spec and saying "implementation is strongly encouraged", or something like that? > > thomas. > > On Wed, Aug 4, 2010 at 3:58 PM, Andy Jenkinson wrote: > Since this seems to have been given the thumbs up, shall we make CORS support mandatory from 1.6 onwards? > > I suggested this when it first came up last year, but I got no replies so didn't put it in the spec. I suspect because it was in the middle of a flurry of emails about "maxbins" :) > > On 3 Aug 2010, at 22:28, Thomas Down wrote: > > > Jonathan's written a nice summary here: > > > > http://biodasman.wordpress.com/2010/07/20/cors/ > > > > But briefly... it's the "official" way to work around the same-origin > > policy (by default, browsers only allow unsigned javascript to trigger HTTP > > requests to the server from which it was originally downloaded). The > > specification is here: > > > > http://www.w3.org/TR/cors/ > > > > (Please don't be too alarmed by the datestamp! The core parts have been > > stable for > a year now, and it's well supported by Mozilla, WebKit, and -- > > via a slightly different API -- Internet Explorer). > > > > If you're running a public server and want it to be CORS accessible, all > > that is needed is to emit the header: > > > > Access-Control-Allow-Origin: * > > > > ...and you're done. > > > > (If you're running password-protected DAS servers, or DAS servers hosting > > sensitive information behind a firewall, you might want a slightly more > > sophisticated CORS implementation. Happy to discuss if anyone is > > interested). > > > > Thomas. > > > > On Tue, Aug 3, 2010 at 10:21 PM, Lincoln Stein wrote: > > > >> Someone give me a quick summary of CORS support. I want to make sure that > >> GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, or > >> something new?) > >> > >> Lincoln > >> > >> On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren wrote: > >> > >>> This is very cool - I had a look the other day. Was wondering why some > >>> sources could be attached and some can't.... > >>> Best browser experience yet by far I'd say. > >>> > >>> No problems about adding CORS support - for the record I'm very happy to > >>> implement new capabilities testing and other suggestions to the registry > >>> from anyone who cares to drop me a line. Especially if it's going to enhance > >>> and promote the use of the registry :) > >>> > >>> > >>> > >>> On 3 Aug 2010, at 20:41, Thomas Down wrote: > >>> > >>> As some of you already know, I've been experimenting recently with a > >>>> web-based DAS client for genomic data. It's still in > >>>> a unashamedly prototypical state (in particular, some of the popups and > >>>> configuration stuff is outright clunky, and we know it!), but we're > >>>> starting > >>>> to find it quite useful, and would be interested to receive more > >>>> feedback. > >>>> So if you're curious, you can try it here: > >>>> > >>>> http://www.biodalliance.org/human/ncbi36/ > >>>> > >>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and > >>>> hopefully rather more coming soon), but has one major caveat: since it's > >>>> pure Javascript code running in your web browser, there are limitations > >>>> to > >>>> which servers it can connect to. Specifically, it will only work with > >>>> DAS > >>>> servers that implement the W3C cross-origin resource sharing model (which > >>>> has been discussed on this list before, but drop me a line if you've got > >>>> any > >>>> questions). What does this mean in practice? If you're adding > >>>> datasources > >>>> from the registry, things are simple because Dalliance will only allow > >>>> you > >>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for adding > >>>> some support for this in the registry). If you run your own DAS servers > >>>> and > >>>> don't list them in the registry, you'll need to check for CORS > >>>> compatibility > >>>> yourself. The latest versions of Proserver and Dazzle should both be > >>>> okay. > >>>> > >>>> All comments, suggestions, and bug reports are welcome! > >>>> > >>>> Thomas Down. > >>>> _______________________________________________ > >>>> DAS mailing list > >>>> DAS at lists.open-bio.org > >>>> http://lists.open-bio.org/mailman/listinfo/das > >>>> > >>> > >>> Jonathan Warren > >>> Senior Developer and DAS coordinator > >>> blog: http://biodasman.wordpress.com/ > >>> jw12 at sanger.ac.uk > >>> Ext: 2314 > >>> Telephone: 01223 492314 > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> -- > >>> The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, > >>> a charity registered in England with number 1021457 and acompany registered > >>> in England with number 2742969, whose registeredoffice is 215 Euston Road, > >>> London, NW1 2BE._______________________________________________ > >>> > >>> DAS mailing list > >>> DAS at lists.open-bio.org > >>> http://lists.open-bio.org/mailman/listinfo/das > >>> > >> > >> > >> > >> -- > >> Lincoln D. Stein > >> Director, Informatics and Biocomputing Platform > >> Ontario Institute for Cancer Research > >> 101 College St., Suite 800 > >> Toronto, ON, Canada M5G0A3 > >> 416 673-8514 > >> Assistant: Renata Musa > >> > > _______________________________________________ > > DAS mailing list > > DAS at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/das > > From thomas.a.down at gmail.com Wed Aug 4 13:39:08 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 4 Aug 2010 18:39:08 +0100 Subject: [DAS] CORS In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <3378D8DA-614B-4D56-93AD-024DAFF7F61D@ebi.ac.uk> Message-ID: That sounds good to me, Thomas. On Wed, Aug 4, 2010 at 5:17 PM, Andy Jenkinson wrote: > Well we could specify that software must implement the procedure, and not > actually require servers to accept requests from all origins? > > On 4 Aug 2010, at 16:26, Thomas Down wrote: > > > Despite being a strong CORS advocate (and not just for DAS -- it'll be > beneficial for a whole raft of services), I'm actually a bit reluctant to > make it mandatory without some rather careful though. > > > > Unrestricted CORS is, as far as I can tell, always appropriate for public > DAS servers offering data to the community. It's probably also good for > password-protected-by-publically-routable servers (although the > implementation gets a wee bit more complex in that case). > > > > However, if you're running a DAS server behind a firewall, CORS does > potentially open you to possible security issues which wouldn't otherwise be > present. In the most security-conscious environments, people might want to > just whitelist the origins of specific clients. > > > > How about including a link to the CORS spec and saying "implementation is > strongly encouraged", or something like that? > > > > thomas. > > > > On Wed, Aug 4, 2010 at 3:58 PM, Andy Jenkinson > wrote: > > Since this seems to have been given the thumbs up, shall we make CORS > support mandatory from 1.6 onwards? > > > > I suggested this when it first came up last year, but I got no replies so > didn't put it in the spec. I suspect because it was in the middle of a > flurry of emails about "maxbins" :) > > > > On 3 Aug 2010, at 22:28, Thomas Down wrote: > > > > > Jonathan's written a nice summary here: > > > > > > http://biodasman.wordpress.com/2010/07/20/cors/ > > > > > > But briefly... it's the "official" way to work around the same-origin > > > policy (by default, browsers only allow unsigned javascript to trigger > HTTP > > > requests to the server from which it was originally downloaded). The > > > specification is here: > > > > > > http://www.w3.org/TR/cors/ > > > > > > (Please don't be too alarmed by the datestamp! The core parts have > been > > > stable for > a year now, and it's well supported by Mozilla, WebKit, > and -- > > > via a slightly different API -- Internet Explorer). > > > > > > If you're running a public server and want it to be CORS accessible, > all > > > that is needed is to emit the header: > > > > > > Access-Control-Allow-Origin: * > > > > > > ...and you're done. > > > > > > (If you're running password-protected DAS servers, or DAS servers > hosting > > > sensitive information behind a firewall, you might want a slightly more > > > sophisticated CORS implementation. Happy to discuss if anyone is > > > interested). > > > > > > Thomas. > > > > > > On Tue, Aug 3, 2010 at 10:21 PM, Lincoln Stein < > lincoln.stein at gmail.com>wrote: > > > > > >> Someone give me a quick summary of CORS support. I want to make sure > that > > >> GBrowse exports DAS 1.53 with CORS (is it just the registry metadata, > or > > >> something new?) > > >> > > >> Lincoln > > >> > > >> On Tue, Aug 3, 2010 at 4:52 PM, Jonathan Warren > wrote: > > >> > > >>> This is very cool - I had a look the other day. Was wondering why > some > > >>> sources could be attached and some can't.... > > >>> Best browser experience yet by far I'd say. > > >>> > > >>> No problems about adding CORS support - for the record I'm very happy > to > > >>> implement new capabilities testing and other suggestions to the > registry > > >>> from anyone who cares to drop me a line. Especially if it's going to > enhance > > >>> and promote the use of the registry :) > > >>> > > >>> > > >>> > > >>> On 3 Aug 2010, at 20:41, Thomas Down wrote: > > >>> > > >>> As some of you already know, I've been experimenting recently with a > > >>>> web-based DAS client for genomic data. It's still in > > >>>> a unashamedly prototypical state (in particular, some of the popups > and > > >>>> configuration stuff is outright clunky, and we know it!), but we're > > >>>> starting > > >>>> to find it quite useful, and would be interested to receive more > > >>>> feedback. > > >>>> So if you're curious, you can try it here: > > >>>> > > >>>> http://www.biodalliance.org/human/ncbi36/ > > >>>> > > >>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, > and > > >>>> hopefully rather more coming soon), but has one major caveat: since > it's > > >>>> pure Javascript code running in your web browser, there are > limitations > > >>>> to > > >>>> which servers it can connect to. Specifically, it will only work > with > > >>>> DAS > > >>>> servers that implement the W3C cross-origin resource sharing model > (which > > >>>> has been discussed on this list before, but drop me a line if you've > got > > >>>> any > > >>>> questions). What does this mean in practice? If you're adding > > >>>> datasources > > >>>> from the registry, things are simple because Dalliance will only > allow > > >>>> you > > >>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for > adding > > >>>> some support for this in the registry). If you run your own DAS > servers > > >>>> and > > >>>> don't list them in the registry, you'll need to check for CORS > > >>>> compatibility > > >>>> yourself. The latest versions of Proserver and Dazzle should both > be > > >>>> okay. > > >>>> > > >>>> All comments, suggestions, and bug reports are welcome! > > >>>> > > >>>> Thomas Down. > > >>>> _______________________________________________ > > >>>> DAS mailing list > > >>>> DAS at lists.open-bio.org > > >>>> http://lists.open-bio.org/mailman/listinfo/das > > >>>> > > >>> > > >>> Jonathan Warren > > >>> Senior Developer and DAS coordinator > > >>> blog: http://biodasman.wordpress.com/ > > >>> jw12 at sanger.ac.uk > > >>> Ext: 2314 > > >>> Telephone: 01223 492314 > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> -- > > >>> The Wellcome Trust Sanger Institute is operated by Genome > ResearchLimited, > > >>> a charity registered in England with number 1021457 and acompany > registered > > >>> in England with number 2742969, whose registeredoffice is 215 Euston > Road, > > >>> London, NW1 2BE._______________________________________________ > > >>> > > >>> DAS mailing list > > >>> DAS at lists.open-bio.org > > >>> http://lists.open-bio.org/mailman/listinfo/das > > >>> > > >> > > >> > > >> > > >> -- > > >> Lincoln D. Stein > > >> Director, Informatics and Biocomputing Platform > > >> Ontario Institute for Cancer Research > > >> 101 College St., Suite 800 > > >> Toronto, ON, Canada M5G0A3 > > >> 416 673-8514 > > >> Assistant: Renata Musa > > >> > > > _______________________________________________ > > > DAS mailing list > > > DAS at lists.open-bio.org > > > http://lists.open-bio.org/mailman/listinfo/das > > > > > > From lincoln.stein at gmail.com Wed Aug 4 15:26:32 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Wed, 4 Aug 2010 15:26:32 -0400 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <4C598145.5090007@ebi.ac.uk> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> Message-ID: Thanks for the pointers. I've added CORS capabilities to the GBrowse DAS server. Lincoln On Wed, Aug 4, 2010 at 11:03 AM, Leyla Garcia wrote: > Thanks Andy and Jonathan, > > The CORS header is already in MyDAS and it will be available for the next > release, which will be soon :) > I will check the specification about Access-Control-Expose-Headers > > Jonathan, I am checking the response headers with an add-on in Firefox. I > tried the first URL in the dasregistry link you send me, i.e. > http://cathdb.info:9000/das/cath_pdb/features?segment=5pti > but I do not see the Access-Control-Allow-Origin header there. So I am not > sure what I missing... > > Leyla > > > > On 04/08/2010 15:54, Jonathan Warren wrote: > >> resp.setHeader("Access-Control-Allow-Origin", "*"); >> >> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: >> >> This url list 150 das sources that implement cors - see traffic light on >>> the end: >>> >>> >>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find< >>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >>> > >>> >>> >>> You can check the headers responses for any of their valid responses. >>> >>> If you want some java code that adds this functionality to MyDAS I can >>> send it to you... >>> >>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >>> >>> About CORS, >>>> >>>> >If you run your own DAS servers and >>>> >don't list them in the registry, you'll need to check for CORS >>>> compatibility >>>> >yourself. The latest versions of Proserver and Dazzle should both be >>>> okay. >>>> >>>> If I am not mistaken, Proserver already implements CORS headers? >>>> Andy, could you please send me a link of a Proserver server that >>>> implements this? >>>> >>>> Thanks, >>>> >>>> Leyla >>>> >>>> On 03/08/2010 21:52, Jonathan Warren wrote: >>>> >>>>> This is very cool - I had a look the other day. Was wondering why some >>>>> sources could be attached and some can't.... >>>>> Best browser experience yet by far I'd say. >>>>> >>>>> No problems about adding CORS support - for the record I'm very happy >>>>> to implement new capabilities testing and other suggestions to the registry >>>>> from anyone who cares to drop me a line. Especially if it's going to enhance >>>>> and promote the use of the registry :) >>>>> >>>>> >>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>>>> >>>>> As some of you already know, I've been experimenting recently with a >>>>>> web-based DAS client for genomic data. It's still in >>>>>> a unashamedly prototypical state (in particular, some of the popups >>>>>> and >>>>>> configuration stuff is outright clunky, and we know it!), but we're >>>>>> starting >>>>>> to find it quite useful, and would be interested to receive more >>>>>> feedback. >>>>>> So if you're curious, you can try it here: >>>>>> >>>>>> http://www.biodalliance.org/human/ncbi36/ >>>>>> >>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>>>>> hopefully rather more coming soon), but has one major caveat: since >>>>>> it's >>>>>> pure Javascript code running in your web browser, there are >>>>>> limitations to >>>>>> which servers it can connect to. Specifically, it will only work with >>>>>> DAS >>>>>> servers that implement the W3C cross-origin resource sharing model >>>>>> (which >>>>>> has been discussed on this list before, but drop me a line if you've >>>>>> got any >>>>>> questions). What does this mean in practice? If you're adding >>>>>> datasources >>>>>> from the registry, things are simple because Dalliance will only allow >>>>>> you >>>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for >>>>>> adding >>>>>> some support for this in the registry). If you run your own DAS >>>>>> servers and >>>>>> don't list them in the registry, you'll need to check for CORS >>>>>> compatibility >>>>>> yourself. The latest versions of Proserver and Dazzle should both be >>>>>> okay. >>>>>> >>>>>> All comments, suggestions, and bug reports are welcome! >>>>>> >>>>>> Thomas Down. >>>>>> _______________________________________________ >>>>>> DAS mailing list >>>>>> DAS at lists.open-bio.org >>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>>> >>>>> >>>>> Jonathan Warren >>>>> Senior Developer and DAS coordinator >>>>> blog: http://biodasman.wordpress.com/ >>>>> jw12 at sanger.ac.uk >>>>> Ext: 2314 >>>>> Telephone: 01223 492314 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>> >>> Jonathan Warren >>> Senior Developer and DAS coordinator >>> blog: http://biodasman.wordpress.com/ >>> jw12 at sanger.ac.uk >>> Ext: 2314 >>> Telephone: 01223 492314 >>> >>> >>> >>> >>> >>> >>> >>> >> Jonathan Warren >> Senior Developer and DAS coordinator >> blog: http://biodasman.wordpress.com/ >> jw12 at sanger.ac.uk >> >> Ext: 2314 >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> -- The Wellcome Trust Sanger Institute is operated by Genome Research >> Limited, a charity registered in England with number 1021457 and a company >> registered in England with number 2742969, whose registered office is 215 >> Euston Road, London, NW1 2BE. >> > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > -- Lincoln D. Stein Director, Informatics and Biocomputing Platform Ontario Institute for Cancer Research 101 College St., Suite 800 Toronto, ON, Canada M5G0A3 416 673-8514 Assistant: Renata Musa From thomas.a.down at gmail.com Wed Aug 4 15:35:58 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 4 Aug 2010 20:35:58 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> Message-ID: That's great Lincoln. I don't suppose there's an updated server up somewhere that I could test against? Thanks, Thomas. On Wed, Aug 4, 2010 at 8:26 PM, Lincoln Stein wrote: > Thanks for the pointers. I've added CORS capabilities to the GBrowse DAS > server. > > Lincoln > > On Wed, Aug 4, 2010 at 11:03 AM, Leyla Garcia wrote: > > > Thanks Andy and Jonathan, > > > > The CORS header is already in MyDAS and it will be available for the next > > release, which will be soon :) > > I will check the specification about Access-Control-Expose-Headers > > > > Jonathan, I am checking the response headers with an add-on in Firefox. I > > tried the first URL in the dasregistry link you send me, i.e. > > http://cathdb.info:9000/das/cath_pdb/features?segment=5pti > > but I do not see the Access-Control-Allow-Origin header there. So I am > not > > sure what I missing... > > > > Leyla > > > > > > > > On 04/08/2010 15:54, Jonathan Warren wrote: > > > >> resp.setHeader("Access-Control-Allow-Origin", "*"); > >> > >> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: > >> > >> This url list 150 das sources that implement cors - see traffic light > on > >>> the end: > >>> > >>> > >>> > http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find > < > >>> > http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find > >>> > > >>> > >>> > >>> You can check the headers responses for any of their valid responses. > >>> > >>> If you want some java code that adds this functionality to MyDAS I can > >>> send it to you... > >>> > >>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: > >>> > >>> About CORS, > >>>> > >>>> >If you run your own DAS servers and > >>>> >don't list them in the registry, you'll need to check for CORS > >>>> compatibility > >>>> >yourself. The latest versions of Proserver and Dazzle should both be > >>>> okay. > >>>> > >>>> If I am not mistaken, Proserver already implements CORS headers? > >>>> Andy, could you please send me a link of a Proserver server that > >>>> implements this? > >>>> > >>>> Thanks, > >>>> > >>>> Leyla > >>>> > >>>> On 03/08/2010 21:52, Jonathan Warren wrote: > >>>> > >>>>> This is very cool - I had a look the other day. Was wondering why > some > >>>>> sources could be attached and some can't.... > >>>>> Best browser experience yet by far I'd say. > >>>>> > >>>>> No problems about adding CORS support - for the record I'm very happy > >>>>> to implement new capabilities testing and other suggestions to the > registry > >>>>> from anyone who cares to drop me a line. Especially if it's going to > enhance > >>>>> and promote the use of the registry :) > >>>>> > >>>>> > >>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: > >>>>> > >>>>> As some of you already know, I've been experimenting recently with a > >>>>>> web-based DAS client for genomic data. It's still in > >>>>>> a unashamedly prototypical state (in particular, some of the popups > >>>>>> and > >>>>>> configuration stuff is outright clunky, and we know it!), but we're > >>>>>> starting > >>>>>> to find it quite useful, and would be interested to receive more > >>>>>> feedback. > >>>>>> So if you're curious, you can try it here: > >>>>>> > >>>>>> http://www.biodalliance.org/human/ncbi36/ > >>>>>> > >>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, > and > >>>>>> hopefully rather more coming soon), but has one major caveat: since > >>>>>> it's > >>>>>> pure Javascript code running in your web browser, there are > >>>>>> limitations to > >>>>>> which servers it can connect to. Specifically, it will only work > with > >>>>>> DAS > >>>>>> servers that implement the W3C cross-origin resource sharing model > >>>>>> (which > >>>>>> has been discussed on this list before, but drop me a line if you've > >>>>>> got any > >>>>>> questions). What does this mean in practice? If you're adding > >>>>>> datasources > >>>>>> from the registry, things are simple because Dalliance will only > allow > >>>>>> you > >>>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for > >>>>>> adding > >>>>>> some support for this in the registry). If you run your own DAS > >>>>>> servers and > >>>>>> don't list them in the registry, you'll need to check for CORS > >>>>>> compatibility > >>>>>> yourself. The latest versions of Proserver and Dazzle should both > be > >>>>>> okay. > >>>>>> > >>>>>> All comments, suggestions, and bug reports are welcome! > >>>>>> > >>>>>> Thomas Down. > >>>>>> _______________________________________________ > >>>>>> DAS mailing list > >>>>>> DAS at lists.open-bio.org > >>>>>> http://lists.open-bio.org/mailman/listinfo/das > >>>>>> > >>>>> > >>>>> Jonathan Warren > >>>>> Senior Developer and DAS coordinator > >>>>> blog: http://biodasman.wordpress.com/ > >>>>> jw12 at sanger.ac.uk > >>>>> Ext: 2314 > >>>>> Telephone: 01223 492314 > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>> _______________________________________________ > >>>> DAS mailing list > >>>> DAS at lists.open-bio.org > >>>> http://lists.open-bio.org/mailman/listinfo/das > >>>> > >>> > >>> Jonathan Warren > >>> Senior Developer and DAS coordinator > >>> blog: http://biodasman.wordpress.com/ > >>> jw12 at sanger.ac.uk > >>> Ext: 2314 > >>> Telephone: 01223 492314 > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >> Jonathan Warren > >> Senior Developer and DAS coordinator > >> blog: http://biodasman.wordpress.com/ > >> jw12 at sanger.ac.uk > >> > >> Ext: 2314 > >> Telephone: 01223 492314 > >> > >> > >> > >> > >> > >> > >> > >> > >> -- The Wellcome Trust Sanger Institute is operated by Genome Research > >> Limited, a charity registered in England with number 1021457 and a > company > >> registered in England with number 2742969, whose registered office is > 215 > >> Euston Road, London, NW1 2BE. > >> > > > > _______________________________________________ > > DAS mailing list > > DAS at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/das > > > > > > -- > Lincoln D. Stein > Director, Informatics and Biocomputing Platform > Ontario Institute for Cancer Research > 101 College St., Suite 800 > Toronto, ON, Canada M5G0A3 > 416 673-8514 > Assistant: Renata Musa > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > From lincoln.stein at gmail.com Wed Aug 4 16:14:08 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Wed, 4 Aug 2010 16:14:08 -0400 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> Message-ID: No, it will be a while before this version is up on a live site. Lincoln On Wed, Aug 4, 2010 at 3:35 PM, Thomas Down wrote: > That's great Lincoln. > > I don't suppose there's an updated server up somewhere that I could test > against? > > Thanks, > > Thomas. > > > On Wed, Aug 4, 2010 at 8:26 PM, Lincoln Stein wrote: > >> Thanks for the pointers. I've added CORS capabilities to the GBrowse DAS >> server. >> >> Lincoln >> >> On Wed, Aug 4, 2010 at 11:03 AM, Leyla Garcia wrote: >> >> > Thanks Andy and Jonathan, >> > >> > The CORS header is already in MyDAS and it will be available for the >> next >> > release, which will be soon :) >> > I will check the specification about Access-Control-Expose-Headers >> > >> > Jonathan, I am checking the response headers with an add-on in Firefox. >> I >> > tried the first URL in the dasregistry link you send me, i.e. >> > http://cathdb.info:9000/das/cath_pdb/features?segment=5pti >> > but I do not see the Access-Control-Allow-Origin header there. So I am >> not >> > sure what I missing... >> > >> > Leyla >> > >> > >> > >> > On 04/08/2010 15:54, Jonathan Warren wrote: >> > >> >> resp.setHeader("Access-Control-Allow-Origin", "*"); >> >> >> >> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: >> >> >> >> This url list 150 das sources that implement cors - see traffic light >> on >> >>> the end: >> >>> >> >>> >> >>> >> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >> < >> >>> >> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >> >>> > >> >>> >> >>> >> >>> You can check the headers responses for any of their valid responses. >> >>> >> >>> If you want some java code that adds this functionality to MyDAS I >> can >> >>> send it to you... >> >>> >> >>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >> >>> >> >>> About CORS, >> >>>> >> >>>> >If you run your own DAS servers and >> >>>> >don't list them in the registry, you'll need to check for CORS >> >>>> compatibility >> >>>> >yourself. The latest versions of Proserver and Dazzle should both >> be >> >>>> okay. >> >>>> >> >>>> If I am not mistaken, Proserver already implements CORS headers? >> >>>> Andy, could you please send me a link of a Proserver server that >> >>>> implements this? >> >>>> >> >>>> Thanks, >> >>>> >> >>>> Leyla >> >>>> >> >>>> On 03/08/2010 21:52, Jonathan Warren wrote: >> >>>> >> >>>>> This is very cool - I had a look the other day. Was wondering why >> some >> >>>>> sources could be attached and some can't.... >> >>>>> Best browser experience yet by far I'd say. >> >>>>> >> >>>>> No problems about adding CORS support - for the record I'm very >> happy >> >>>>> to implement new capabilities testing and other suggestions to the >> registry >> >>>>> from anyone who cares to drop me a line. Especially if it's going to >> enhance >> >>>>> and promote the use of the registry :) >> >>>>> >> >>>>> >> >>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >> >>>>> >> >>>>> As some of you already know, I've been experimenting recently with >> a >> >>>>>> web-based DAS client for genomic data. It's still in >> >>>>>> a unashamedly prototypical state (in particular, some of the popups >> >>>>>> and >> >>>>>> configuration stuff is outright clunky, and we know it!), but we're >> >>>>>> starting >> >>>>>> to find it quite useful, and would be interested to receive more >> >>>>>> feedback. >> >>>>>> So if you're curious, you can try it here: >> >>>>>> >> >>>>>> http://www.biodalliance.org/human/ncbi36/ >> >>>>>> >> >>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, >> and >> >>>>>> hopefully rather more coming soon), but has one major caveat: since >> >>>>>> it's >> >>>>>> pure Javascript code running in your web browser, there are >> >>>>>> limitations to >> >>>>>> which servers it can connect to. Specifically, it will only work >> with >> >>>>>> DAS >> >>>>>> servers that implement the W3C cross-origin resource sharing model >> >>>>>> (which >> >>>>>> has been discussed on this list before, but drop me a line if >> you've >> >>>>>> got any >> >>>>>> questions). What does this mean in practice? If you're adding >> >>>>>> datasources >> >>>>>> from the registry, things are simple because Dalliance will only >> allow >> >>>>>> you >> >>>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for >> >>>>>> adding >> >>>>>> some support for this in the registry). If you run your own DAS >> >>>>>> servers and >> >>>>>> don't list them in the registry, you'll need to check for CORS >> >>>>>> compatibility >> >>>>>> yourself. The latest versions of Proserver and Dazzle should both >> be >> >>>>>> okay. >> >>>>>> >> >>>>>> All comments, suggestions, and bug reports are welcome! >> >>>>>> >> >>>>>> Thomas Down. >> >>>>>> _______________________________________________ >> >>>>>> DAS mailing list >> >>>>>> DAS at lists.open-bio.org >> >>>>>> http://lists.open-bio.org/mailman/listinfo/das >> >>>>>> >> >>>>> >> >>>>> Jonathan Warren >> >>>>> Senior Developer and DAS coordinator >> >>>>> blog: http://biodasman.wordpress.com/ >> >>>>> jw12 at sanger.ac.uk >> >>>>> Ext: 2314 >> >>>>> Telephone: 01223 492314 >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>>> >> >>>> _______________________________________________ >> >>>> DAS mailing list >> >>>> DAS at lists.open-bio.org >> >>>> http://lists.open-bio.org/mailman/listinfo/das >> >>>> >> >>> >> >>> Jonathan Warren >> >>> Senior Developer and DAS coordinator >> >>> blog: http://biodasman.wordpress.com/ >> >>> jw12 at sanger.ac.uk >> >>> Ext: 2314 >> >>> Telephone: 01223 492314 >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >> Jonathan Warren >> >> Senior Developer and DAS coordinator >> >> blog: http://biodasman.wordpress.com/ >> >> jw12 at sanger.ac.uk >> >> >> >> Ext: 2314 >> >> Telephone: 01223 492314 >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> -- The Wellcome Trust Sanger Institute is operated by Genome Research >> >> Limited, a charity registered in England with number 1021457 and a >> company >> >> registered in England with number 2742969, whose registered office is >> 215 >> >> Euston Road, London, NW1 2BE. >> >> >> > >> > _______________________________________________ >> > DAS mailing list >> > DAS at lists.open-bio.org >> > http://lists.open-bio.org/mailman/listinfo/das >> > >> >> >> >> -- >> Lincoln D. Stein >> Director, Informatics and Biocomputing Platform >> Ontario Institute for Cancer Research >> 101 College St., Suite 800 >> Toronto, ON, Canada M5G0A3 >> 416 673-8514 >> Assistant: Renata Musa >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das >> > > -- Lincoln D. Stein Director, Informatics and Biocomputing Platform Ontario Institute for Cancer Research 101 College St., Suite 800 Toronto, ON, Canada M5G0A3 416 673-8514 Assistant: Renata Musa From ljgarcia at ebi.ac.uk Thu Aug 5 04:42:34 2010 From: ljgarcia at ebi.ac.uk (Leyla Garcia) Date: Thu, 05 Aug 2010 09:42:34 +0100 Subject: [DAS] Dalliance: a new genome DAS client In-Reply-To: <4c59bfba.2337720a.65b7.459a@mx.google.com> References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> <4c59bfba.2337720a.65b7.459a@mx.google.com> Message-ID: <4C5A797A.6080104@ebi.ac.uk> Hi Suzanna, Yes, I am in the DAS mailing list. Best, Leyla On 04/08/2010 20:29, Suzanna Lewis wrote: > Hi Leyla, > > Are you on the DAS mailing list? For some reason I only see your messages in someone else's reply, but nothing directly from you. > > On Aug 4, 2010, at 12:26 PM, Lincoln Stein wrote: > > >> Thanks for the pointers. I've added CORS capabilities to the GBrowse DAS >> server. >> >> Lincoln >> >> On Wed, Aug 4, 2010 at 11:03 AM, Leyla Garcia wrote: >> >> >>> Thanks Andy and Jonathan, >>> >>> The CORS header is already in MyDAS and it will be available for the next >>> release, which will be soon :) >>> I will check the specification about Access-Control-Expose-Headers >>> >>> Jonathan, I am checking the response headers with an add-on in Firefox. I >>> tried the first URL in the dasregistry link you send me, i.e. >>> http://cathdb.info:9000/das/cath_pdb/features?segment=5pti >>> but I do not see the Access-Control-Allow-Origin header there. So I am not >>> sure what I missing... >>> >>> Leyla >>> >>> >>> >>> On 04/08/2010 15:54, Jonathan Warren wrote: >>> >>> >>>> resp.setHeader("Access-Control-Allow-Origin", "*"); >>>> >>>> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: >>>> >>>> This url list 150 das sources that implement cors - see traffic light on >>>> >>>>> the end: >>>>> >>>>> >>>>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find< >>>>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >>>>> >>>>>> >>>>> >>>>> You can check the headers responses for any of their valid responses. >>>>> >>>>> If you want some java code that adds this functionality to MyDAS I can >>>>> send it to you... >>>>> >>>>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >>>>> >>>>> About CORS, >>>>> >>>>>> >>>>>>> If you run your own DAS servers and >>>>>>> don't list them in the registry, you'll need to check for CORS >>>>>>> >>>>>> compatibility >>>>>> >>>>>>> yourself. The latest versions of Proserver and Dazzle should both be >>>>>>> >>>>>> okay. >>>>>> >>>>>> If I am not mistaken, Proserver already implements CORS headers? >>>>>> Andy, could you please send me a link of a Proserver server that >>>>>> implements this? >>>>>> >>>>>> Thanks, >>>>>> >>>>>> Leyla >>>>>> >>>>>> On 03/08/2010 21:52, Jonathan Warren wrote: >>>>>> >>>>>> >>>>>>> This is very cool - I had a look the other day. Was wondering why some >>>>>>> sources could be attached and some can't.... >>>>>>> Best browser experience yet by far I'd say. >>>>>>> >>>>>>> No problems about adding CORS support - for the record I'm very happy >>>>>>> to implement new capabilities testing and other suggestions to the registry >>>>>>> from anyone who cares to drop me a line. Especially if it's going to enhance >>>>>>> and promote the use of the registry :) >>>>>>> >>>>>>> >>>>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>>>>>> >>>>>>> As some of you already know, I've been experimenting recently with a >>>>>>> >>>>>>>> web-based DAS client for genomic data. It's still in >>>>>>>> a unashamedly prototypical state (in particular, some of the popups >>>>>>>> and >>>>>>>> configuration stuff is outright clunky, and we know it!), but we're >>>>>>>> starting >>>>>>>> to find it quite useful, and would be interested to receive more >>>>>>>> feedback. >>>>>>>> So if you're curious, you can try it here: >>>>>>>> >>>>>>>> http://www.biodalliance.org/human/ncbi36/ >>>>>>>> >>>>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/1.6, and >>>>>>>> hopefully rather more coming soon), but has one major caveat: since >>>>>>>> it's >>>>>>>> pure Javascript code running in your web browser, there are >>>>>>>> limitations to >>>>>>>> which servers it can connect to. Specifically, it will only work with >>>>>>>> DAS >>>>>>>> servers that implement the W3C cross-origin resource sharing model >>>>>>>> (which >>>>>>>> has been discussed on this list before, but drop me a line if you've >>>>>>>> got any >>>>>>>> questions). What does this mean in practice? If you're adding >>>>>>>> datasources >>>>>>>> from the registry, things are simple because Dalliance will only allow >>>>>>>> you >>>>>>>> to add CORS-enabled sources (a huge thanks to Jonathan Warren for >>>>>>>> adding >>>>>>>> some support for this in the registry). If you run your own DAS >>>>>>>> servers and >>>>>>>> don't list them in the registry, you'll need to check for CORS >>>>>>>> compatibility >>>>>>>> yourself. The latest versions of Proserver and Dazzle should both be >>>>>>>> okay. >>>>>>>> >>>>>>>> All comments, suggestions, and bug reports are welcome! >>>>>>>> >>>>>>>> Thomas Down. >>>>>>>> _______________________________________________ >>>>>>>> DAS mailing list >>>>>>>> DAS at lists.open-bio.org >>>>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>>>>> >>>>>>>> >>>>>>> Jonathan Warren >>>>>>> Senior Developer and DAS coordinator >>>>>>> blog: http://biodasman.wordpress.com/ >>>>>>> jw12 at sanger.ac.uk >>>>>>> Ext: 2314 >>>>>>> Telephone: 01223 492314 >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>> _______________________________________________ >>>>>> DAS mailing list >>>>>> DAS at lists.open-bio.org >>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>>> >>>>>> >>>>> Jonathan Warren >>>>> Senior Developer and DAS coordinator >>>>> blog: http://biodasman.wordpress.com/ >>>>> jw12 at sanger.ac.uk >>>>> Ext: 2314 >>>>> Telephone: 01223 492314 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>> Jonathan Warren >>>> Senior Developer and DAS coordinator >>>> blog: http://biodasman.wordpress.com/ >>>> jw12 at sanger.ac.uk >>>> >>>> Ext: 2314 >>>> Telephone: 01223 492314 >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> -- The Wellcome Trust Sanger Institute is operated by Genome Research >>>> Limited, a charity registered in England with number 1021457 and a company >>>> registered in England with number 2742969, whose registered office is 215 >>>> Euston Road, London, NW1 2BE. >>>> >>>> >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >>> >>> >> >> >> -- >> Lincoln D. Stein >> Director, Informatics and Biocomputing Platform >> Ontario Institute for Cancer Research >> 101 College St., Suite 800 >> Toronto, ON, Canada M5G0A3 >> 416 673-8514 >> Assistant: Renata Musa >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das >> >> > From jw12 at sanger.ac.uk Thu Aug 5 04:47:20 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Thu, 5 Aug 2010 09:47:20 +0100 Subject: [DAS] GBrowse as a DAS client In-Reply-To: References: <3E124A50-F665-4B60-91C8-DF7F84C82BB0@sanger.ac.uk> <4C59733D.4090304@ebi.ac.uk> <4C598145.5090007@ebi.ac.uk> Message-ID: Hi Lincoln I noticed for the GBrowse clients that I've seen online that the option of a user adding a DAS track is not available, only the GBrowse provider can do so in the config? For the new GBrowse is there a plan/chance that it will use the registry and/or allow people to specify a url directly? There is some registry code available now that is written purely in javascript that would make interacting with the registry very easy to add. On 4 Aug 2010, at 21:14, Lincoln Stein wrote: > No, it will be a while before this version is up on a live site. > > Lincoln > > On Wed, Aug 4, 2010 at 3:35 PM, Thomas Down > wrote: > >> That's great Lincoln. >> >> I don't suppose there's an updated server up somewhere that I could >> test >> against? >> >> Thanks, >> >> Thomas. >> >> >> On Wed, Aug 4, 2010 at 8:26 PM, Lincoln Stein > >wrote: >> >>> Thanks for the pointers. I've added CORS capabilities to the >>> GBrowse DAS >>> server. >>> >>> Lincoln >>> >>> On Wed, Aug 4, 2010 at 11:03 AM, Leyla Garcia >>> wrote: >>> >>>> Thanks Andy and Jonathan, >>>> >>>> The CORS header is already in MyDAS and it will be available for >>>> the >>> next >>>> release, which will be soon :) >>>> I will check the specification about Access-Control-Expose-Headers >>>> >>>> Jonathan, I am checking the response headers with an add-on in >>>> Firefox. >>> I >>>> tried the first URL in the dasregistry link you send me, i.e. >>>> http://cathdb.info:9000/das/cath_pdb/features?segment=5pti >>>> but I do not see the Access-Control-Allow-Origin header there. So >>>> I am >>> not >>>> sure what I missing... >>>> >>>> Leyla >>>> >>>> >>>> >>>> On 04/08/2010 15:54, Jonathan Warren wrote: >>>> >>>>> resp.setHeader("Access-Control-Allow-Origin", "*"); >>>>> >>>>> On 4 Aug 2010, at 15:51, Jonathan Warren wrote: >>>>> >>>>> This url list 150 das sources that implement cors - see traffic >>>>> light >>> on >>>>>> the end: >>>>>> >>>>>> >>>>>> >>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >>> < >>>>>> >>> http://www.dasregistry.org/listServices.jsp?organism=any&CSName=any&CSTypes=any&capabilities=cors&labels=any&spec=any&cmd=find >>>>>>> >>>>>> >>>>>> >>>>>> You can check the headers responses for any of their valid >>>>>> responses. >>>>>> >>>>>> If you want some java code that adds this functionality to >>>>>> MyDAS I >>> can >>>>>> send it to you... >>>>>> >>>>>> On 4 Aug 2010, at 15:03, Leyla Garcia wrote: >>>>>> >>>>>> About CORS, >>>>>>> >>>>>>>> If you run your own DAS servers and >>>>>>>> don't list them in the registry, you'll need to check for CORS >>>>>>> compatibility >>>>>>>> yourself. The latest versions of Proserver and Dazzle should >>>>>>>> both >>> be >>>>>>> okay. >>>>>>> >>>>>>> If I am not mistaken, Proserver already implements CORS headers? >>>>>>> Andy, could you please send me a link of a Proserver server that >>>>>>> implements this? >>>>>>> >>>>>>> Thanks, >>>>>>> >>>>>>> Leyla >>>>>>> >>>>>>> On 03/08/2010 21:52, Jonathan Warren wrote: >>>>>>> >>>>>>>> This is very cool - I had a look the other day. Was wondering >>>>>>>> why >>> some >>>>>>>> sources could be attached and some can't.... >>>>>>>> Best browser experience yet by far I'd say. >>>>>>>> >>>>>>>> No problems about adding CORS support - for the record I'm very >>> happy >>>>>>>> to implement new capabilities testing and other suggestions >>>>>>>> to the >>> registry >>>>>>>> from anyone who cares to drop me a line. Especially if it's >>>>>>>> going to >>> enhance >>>>>>>> and promote the use of the registry :) >>>>>>>> >>>>>>>> >>>>>>>> On 3 Aug 2010, at 20:41, Thomas Down wrote: >>>>>>>> >>>>>>>> As some of you already know, I've been experimenting recently >>>>>>>> with >>> a >>>>>>>>> web-based DAS client for genomic data. It's still in >>>>>>>>> a unashamedly prototypical state (in particular, some of the >>>>>>>>> popups >>>>>>>>> and >>>>>>>>> configuration stuff is outright clunky, and we know it!), >>>>>>>>> but we're >>>>>>>>> starting >>>>>>>>> to find it quite useful, and would be interested to receive >>>>>>>>> more >>>>>>>>> feedback. >>>>>>>>> So if you're curious, you can try it here: >>>>>>>>> >>>>>>>>> http://www.biodalliance.org/human/ncbi36/ >>>>>>>>> >>>>>>>>> It's a fully-fledged DAS/1.53 client (with a few bits of DAS/ >>>>>>>>> 1.6, >>> and >>>>>>>>> hopefully rather more coming soon), but has one major >>>>>>>>> caveat: since >>>>>>>>> it's >>>>>>>>> pure Javascript code running in your web browser, there are >>>>>>>>> limitations to >>>>>>>>> which servers it can connect to. Specifically, it will only >>>>>>>>> work >>> with >>>>>>>>> DAS >>>>>>>>> servers that implement the W3C cross-origin resource sharing >>>>>>>>> model >>>>>>>>> (which >>>>>>>>> has been discussed on this list before, but drop me a line if >>> you've >>>>>>>>> got any >>>>>>>>> questions). What does this mean in practice? If you're >>>>>>>>> adding >>>>>>>>> datasources >>>>>>>>> from the registry, things are simple because Dalliance will >>>>>>>>> only >>> allow >>>>>>>>> you >>>>>>>>> to add CORS-enabled sources (a huge thanks to Jonathan >>>>>>>>> Warren for >>>>>>>>> adding >>>>>>>>> some support for this in the registry). If you run your own >>>>>>>>> DAS >>>>>>>>> servers and >>>>>>>>> don't list them in the registry, you'll need to check for CORS >>>>>>>>> compatibility >>>>>>>>> yourself. The latest versions of Proserver and Dazzle >>>>>>>>> should both >>> be >>>>>>>>> okay. >>>>>>>>> >>>>>>>>> All comments, suggestions, and bug reports are welcome! >>>>>>>>> >>>>>>>>> Thomas Down. >>>>>>>>> _______________________________________________ >>>>>>>>> DAS mailing list >>>>>>>>> DAS at lists.open-bio.org >>>>>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>>>>>> >>>>>>>> >>>>>>>> Jonathan Warren >>>>>>>> Senior Developer and DAS coordinator >>>>>>>> blog: http://biodasman.wordpress.com/ >>>>>>>> jw12 at sanger.ac.uk >>>>>>>> Ext: 2314 >>>>>>>> Telephone: 01223 492314 >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> _______________________________________________ >>>>>>> DAS mailing list >>>>>>> DAS at lists.open-bio.org >>>>>>> http://lists.open-bio.org/mailman/listinfo/das >>>>>>> >>>>>> >>>>>> Jonathan Warren >>>>>> Senior Developer and DAS coordinator >>>>>> blog: http://biodasman.wordpress.com/ >>>>>> jw12 at sanger.ac.uk >>>>>> Ext: 2314 >>>>>> Telephone: 01223 492314 >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>> Jonathan Warren >>>>> Senior Developer and DAS coordinator >>>>> blog: http://biodasman.wordpress.com/ >>>>> jw12 at sanger.ac.uk >>>>> >>>>> Ext: 2314 >>>>> Telephone: 01223 492314 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> -- The Wellcome Trust Sanger Institute is operated by Genome >>>>> Research >>>>> Limited, a charity registered in England with number 1021457 and a >>> company >>>>> registered in England with number 2742969, whose registered >>>>> office is >>> 215 >>>>> Euston Road, London, NW1 2BE. >>>>> >>>> >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>> >>> >>> >>> -- >>> Lincoln D. Stein >>> Director, Informatics and Biocomputing Platform >>> Ontario Institute for Cancer Research >>> 101 College St., Suite 800 >>> Toronto, ON, Canada M5G0A3 >>> 416 673-8514 >>> Assistant: Renata Musa >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >>> >> >> > > > -- > Lincoln D. Stein > Director, Informatics and Biocomputing Platform > Ontario Institute for Cancer Research > 101 College St., Suite 800 > Toronto, ON, Canada M5G0A3 > 416 673-8514 > Assistant: Renata Musa > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From andy.jenkinson at ebi.ac.uk Thu Aug 5 06:14:13 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 5 Aug 2010 11:14:13 +0100 Subject: [DAS] Fwd: [ensembl-dev] Job position - DAS developer, Ensembl web team References: <4CDF3A35-9790-4A4D-99BD-C70BFABD1313@sanger.ac.uk> Message-ID: <01AFA5AE-69AC-4E1B-AA37-45AEE50AD7E1@ebi.ac.uk> Perl people may be interested in the following Ensembl job. Begin forwarded message: > From: Anne Parker > Date: 5 August 2010 10:43:13 GMT+01:00 > To: dev at ensembl.org > Subject: [ensembl-dev] Job position - DAS developer, Ensembl web team > > Hi > > We currently have a vacancy for a Perl web developer, to work on both client and server DAS code for the main Ensembl website (www.ensembl.org). > > More details here: > > https://jobs.sanger.ac.uk/wd/plsql/wd_portal.show_job?p_web_site_id=1764&p_web_page_id=111984 > > Please feel free to contact me if you have any questions about the position. > > > Cheers > > Anne > > > Anne Parker > Ensembl Web Production Manager > http://www.ensembl.org > > > > > _______________________________________________ > Dev mailing list > Dev at ensembl.org > http://lists.ensembl.org/mailman/listinfo/dev From thomas.a.down at gmail.com Wed Aug 11 14:37:37 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 11 Aug 2010 19:37:37 +0100 Subject: [DAS] Restricting the range of an alignment query. Message-ID: Hi, I'm currently looking at the spec for the DAS1.6 alignment command: http://www.ebi.ac.uk/~aj/1.6_draft6/documents/spec.html#alignment Suppose I'm interested in a relatively small interval of a large sequence (ex. human NCBI36 chr22:30000000,30200000), and want to find the orthologous segments of the mouse genome... I can certainly do: .../das/align-oracle/alignments?query=xyzzy;subject=22 But that's potentially going to return a slew of data which could swamp lightweight clients and all but the best network connections! One potential solution would be to use the "cols=" filter. However, my reading of the spec is that this is working in "alignment" coordinates, rather than sequence coordinates. Great if you're writing a full-blown alignment viewer and want to do some lazy-fetching, but troublesome if you want to layer a limited about of alignment data onto a more conventional sequence display (or, for that matter, jump into a large alignment using a sequence feature -- for instance, a gene name -- as your reference point). My preferred solution would be to use a "normal" DAS segment identifier. So in my example above, I'd just query for subject?=chr22:30000000,30200000 and get the relevant alignment blocks back straight away. However, the current spec seems to attach a *different* meaning for subject=X:y,z (specifically, return alignments for X plus y sequences before and z sequences after it in a big multiple sequence alignment). Am I missing something here? Thomas. PS. Also, I'd be really interested to hear from anyone else who's used the alignment command for genome-genome alignments (or, indeed, anything bigger than a protein), so I can coordinate and make my implementation as close as possible to whatever's already out there. From thomas.a.down at gmail.com Wed Aug 11 15:14:41 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 11 Aug 2010 20:14:41 +0100 Subject: [DAS] DAS1.6: coordinate systems Message-ID: My reading of the current spec is a bit vague about how we should refer to coordinate systems. There seem to be three ways to represent a CS: - Comma-separated list, e.g. NCBI_36,Chromosome,Homo sapiens - URI, e.g. http://www.dasregistry.org/dasregistry/coordsys/CS_DS40 - XML, e.g.: NCBI_36,Chromosome,Homo sapiens The XML representation seems to be the most complete. The URIs don't really get discussed much in the spec. Should they resolve to anything in particular? Or should they just be treated as opaque strings? The example I've given resolves to an HTML document with a Vitruvian Man icon and some human-readable details, but probably isn't going to be any help to a client. If you restrict yourself to single-genome DAS (sequence, features, etc.), this all works out fine -- the only interaction you need with the coordinate system infrastructure is to filter out suitable sources from a registry, and in that case you can either filter on the XML COORDINATES elements -- which is fairly straightforward -- or you can ask the registry to filter for you (using a data model which is a reasonably close match to the XML). However, working with coordinate systems seems to be pretty much essential once you start working with alignements, and this is where things start to get complex. The returned alignment XML defines the CS of each sequence in the alignment using the comma-separated form. My assumption is that you're meant to treat this as an opaque string and correlate it with data from a registry, but this isn't 100% clear. On the other hand, if you want to specify a coordinate system in the alignment QUERY, you're supposed to provide a URI. It's not at all clear to me what a server is supposed to be doing with this. Again, opaque string? Is it too late to ask if there's any chance of rationalizing this (and maybe providing a few concrete examples in the spec) before 1.6-final? Thomas. From andy.jenkinson at ebi.ac.uk Thu Aug 12 04:25:33 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 12 Aug 2010 09:25:33 +0100 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: References: Message-ID: <97E57D65-3289-4DF8-8CB9-9A0C3A0E15A0@ebi.ac.uk> Hi Thomas, Unfortunately the existing use of the subject parameter was how it was designed as part of the eFamily project, very much focussed on proteins and with the alignments being their own semi-reference data type. Rob or Andreas can say more. The command is used in Pfam's alignment viewer and as mappings for SPICE, both using the full width of the alignment. The principal change between 1.53E and 1.6 is the 'cols' parameter, which at least makes it possible to retrieve sections of an alignment, but as you say this would be in the alignment's own coordinates (there's no other way to do it - which of the many sequences' coordinates would it use?). At the moment genomic alignments are purely theoretical, but 'cols' support is there in ProServer and MyDas. As regards to what would be necessary to do what you want, I don't think we can change the subject parameter unless the existing alignment servers and clients using it can be changed, i.e. Pfam (not sure if there are others). I can't really think of another way to do it off the top of my head - the 'query' parameter has space for start/end positions and can be a sequence identifier, but this is more like 'get all alignments containing this sequence' which is not quite the same thing. It would also be a bit clunky to describe - what would "?query=alignment42:30,40" do? Any suggestions? Cheers, Andy On 11 Aug 2010, at 19:37, Thomas Down wrote: > Hi, > > I'm currently looking at the spec for the DAS1.6 alignment command: > > http://www.ebi.ac.uk/~aj/1.6_draft6/documents/spec.html#alignment > > Suppose I'm interested in a relatively small interval of a large sequence > (ex. human NCBI36 chr22:30000000,30200000), and want to find the orthologous > segments of the mouse genome... I can certainly do: > > .../das/align-oracle/alignments?query=xyzzy;subject=22 > > But that's potentially going to return a slew of data which could swamp > lightweight clients and all but the best network connections! > > One potential solution would be to use the "cols=" filter. However, my > reading of the spec is that this is working in "alignment" coordinates, > rather than sequence coordinates. Great if you're writing a full-blown > alignment viewer and want to do some lazy-fetching, but troublesome if you > want to layer a limited about of alignment data onto a more conventional > sequence display (or, for that matter, jump into a large alignment using a > sequence feature -- for instance, a gene name -- as your reference point). > > My preferred solution would be to use a "normal" DAS segment identifier. So > in my example above, I'd just query for subject?=chr22:30000000,30200000 and > get the relevant alignment blocks back straight away. However, the current > spec seems to attach a *different* meaning for subject=X:y,z (specifically, > return alignments for X plus y sequences before and z sequences after it in > a big multiple sequence alignment). > > Am I missing something here? > > Thomas. > > > PS. Also, I'd be really interested to hear from anyone else who's used the > alignment command for genome-genome alignments (or, indeed, anything bigger > than a protein), so I can coordinate and make my implementation as close as > possible to whatever's already out there. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From andy.jenkinson at ebi.ac.uk Thu Aug 12 05:30:08 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 12 Aug 2010 10:30:08 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: Hi Thomas, Uh-oh, URIs... :) For coordinate systems, I think the definitions of the component pieces are fairly well described. It is a pity that the species name is not given its own parameter though. The sources documentation then says: "The uri (required) attribute is a globally unique identifier for the coordinate system. It should be a fully resolvable URL providing more information about the coordinate system." This could be misleading as although the URIs _are_ resolvable, the content is not particularly machine friendly. I am not willing to change the syntax of the coordinate system URIs out in the wild, but if you need the content returned to be machine readable we could replace the HTML content with an XML+XSLT combination. That is, "http://www.dasregistry.org/dasregistry/coordsys/CS_DS6" would look more like one of the entries in "http://www.dasregistry.org/das/coordinatesystem" to a machine, and the same as it currently does to a human. From a practical perspective though, if a client parses the XML elements from the registry's /das/coordinatesystem output, it can identify all the coordinate systems by both URI and text description. Changing the output wouldn't materially change what a client needs to do given either a URI or a comma separated string. It is always going to need to run a HTTP get and do some parsing of coordinatesystem XML. But it is certainly true that having the URI resolve to the XML is a more elegant and simple to explain system, and in any case the spec makes no mention of the fact that a client can even obtain the XML for all the coordinate systems together. Throughout writing the 1.6 spec, URIs have always been a big problem to describe, mainly because there are lots of complications for DAS (source vs version, server vs registry namespace). URIs simply weren't given a lot of thought and explanation from the start, and it's too late to change them. In 1.6 things are a little better in that source URIs have been formalised and are more useful, without breaking the assumptions clients currently make. But I have changed the wording describing URIs a few times. I did have a large section describing URIs in general, the rules for formulating them, relative URI references etc, but in the latest drafts this is simplified so as not to confuse people (as much). It only really refers to source URIs rather than coordinate systems though, so I'm happy to add something. Could you please provide the wording and examples? Nobody ever seems to want to :) With regards to the alignment command specifically, I wanted to use the URI for both the query and the content as they are more robust, but there was some practical reason for the existing servers that prevented us from doing so. Perhaps Rob or Andreas can comment? Again, technically it doesn't matter to the client if it has access to the coordinates XML, but it does make the spec not 'feel right' IMO. Also, if coordinate system descriptions (i.e. the comma separated string) were to change over time servers would drift and this would cause big problems for the client, but in truth plenty of stuff would break if that were to happen. Cheers, Andy On 11 Aug 2010, at 20:14, Thomas Down wrote: > My reading of the current spec is a bit vague about how we should refer to > coordinate systems. > > There seem to be three ways to represent a CS: > > - Comma-separated list, e.g. NCBI_36,Chromosome,Homo sapiens > - URI, e.g. > http://www.dasregistry.org/dasregistry/coordsys/CS_DS40 > - XML, e.g.: > > source="Chromosome" authority="NCBI" test_range="1:1,1000" > version="36">NCBI_36,Chromosome,Homo sapiens > > The XML representation seems to be the most complete. > > The URIs don't really get discussed much in the spec. Should they resolve > to anything in particular? Or should they just be treated as opaque > strings? The example I've given resolves to an HTML document with a > Vitruvian Man icon and some human-readable details, but probably isn't going > to be any help to a client. > > If you restrict yourself to single-genome DAS (sequence, features, etc.), > this all works out fine -- the only interaction you need with the coordinate > system infrastructure is to filter out suitable sources from a registry, and > in that case you can either filter on the XML COORDINATES elements -- which > is fairly straightforward -- or you can ask the registry to filter for you > (using a data model which is a reasonably close match to the XML). > > However, working with coordinate systems seems to be pretty much essential > once you start working with alignements, and this is where things start to > get complex. > > The returned alignment XML defines the CS of each sequence in the alignment > using the comma-separated form. My assumption is that you're meant to treat > this as an opaque string and correlate it with data from a registry, but > this isn't 100% clear. > > On the other hand, if you want to specify a coordinate system in the > alignment QUERY, you're supposed to provide a URI. It's not at all clear to > me what a server is supposed to be doing with this. Again, opaque string? > > Is it too late to ask if there's any chance of rationalizing this (and maybe > providing a few concrete examples in the spec) before 1.6-final? > > Thomas. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From andy.jenkinson at ebi.ac.uk Thu Aug 12 05:41:54 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 12 Aug 2010 10:41:54 +0100 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: References: <97E57D65-3289-4DF8-8CB9-9A0C3A0E15A0@ebi.ac.uk> Message-ID: <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> (ccing list, comments inline) On 12 Aug 2010, at 09:49, Thomas Down wrote: > > On Thu, Aug 12, 2010 at 9:25 AM, Andy Jenkinson wrote: > Hi Thomas, > > Unfortunately the existing use of the subject parameter was how it was designed as part of the eFamily project, very much focussed on proteins and with the alignments being their own semi-reference data type. Rob or Andreas can say more. The command is used in Pfam's alignment viewer and as mappings for SPICE, both using the full width of the alignment. The principal change between 1.53E and 1.6 is the 'cols' parameter, which at least makes it possible to retrieve sections of an alignment, but as you say this would be in the alignment's own coordinates (there's no other way to do it - which of the many sequences' coordinates would it use?). At the moment genomic alignments are purely theoretical, but 'cols' support is there in ProServer and MyDas. > > Okay, I see your point (and I'm certainly not proposing we break existing stuff), but not entirely true -- I'm running genome-genome alignment servers on my dev. machine and will be pushing at least a couple of them public once this issue is resolved. The next version of Dalliance will support alignment DAS (initially just for coordinate system mapping, but want to do a proper comparative genomics view in the future which will hopefully use much the same DAS code). Again, that's 90% working now and at this point mostly just waiting until I'm sure I'm using the alignment system properly, or at least abusing it in a somewhat-sensible manner. OK so you're a lot further along, sounds good. A while back we were aiming to get compara alignments as DAS too (which necessitated the cols parameter). > As regards to what would be necessary to do what you want, I don't think we can change the subject parameter unless the existing alignment servers and clients using it can be changed, i.e. Pfam (not sure if there are others). I can't really think of another way to do it off the top of my head - the 'query' parameter has space for start/end positions and can be a sequence identifier, but this is more like 'get all alignments containing this sequence' which is not quite the same thing. It would also be a bit clunky to describe - what would "?query=alignment42:30,40" do? > > Any suggestions? > > Well, the obvious thing is to couple the coordinate restrictions to the sequence to which they apply. > > Simplest solution I can think of would be to add: > > ?segment=seqName[:start,end] > > ...where: > > ?segment=P12345 > > ...is synonymous with: > > ?subject=P12345 (which would still be supported), > > ...but... > > ?segment=22:30000000,30200000 > > ...does what I want. Maybe not the cleanest solution, but I don't think it's going to horribly break anything (unless there are subtleties I'm missing here?) > > Thomas. I think you're right in that it is going to need another parameter to make it work. Any objections from anyone? What would happen if you specified multiple segments which did not correspond to the same section? Return multiple blocks representing multiple horizontal sections? From thomas.a.down at gmail.com Thu Aug 12 07:06:19 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Thu, 12 Aug 2010 12:06:19 +0100 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> References: <97E57D65-3289-4DF8-8CB9-9A0C3A0E15A0@ebi.ac.uk> <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> Message-ID: On Thu, Aug 12, 2010 at 10:41 AM, Andy Jenkinson wrote: > > > OK so you're a lot further along, sounds good. A while back we were aiming > to get compara alignments as DAS too (which necessitated the cols > parameter). By "compara alignments", you're talking about the MySQL ensembl-compara database, right? Is there still any interest in this on the Ensembl side? It's something I'm going to be needing soon, too (my current chain-file-based server doesn't handle all the cases I'm interested in). > > As regards to what would be necessary to do what you want, I don't think > we can change the subject parameter unless the existing alignment servers > and clients using it can be changed, i.e. Pfam (not sure if there are > others). I can't really think of another way to do it off the top of my head > - the 'query' parameter has space for start/end positions and can be a > sequence identifier, but this is more like 'get all alignments containing > this sequence' which is not quite the same thing. It would also be a bit > clunky to describe - what would "?query=alignment42:30,40" do? > > > > Any suggestions? > > > > Well, the obvious thing is to couple the coordinate restrictions to the > sequence to which they apply. > > > > Simplest solution I can think of would be to add: > > > > ?segment=seqName[:start,end] > > > > ...where: > > > > ?segment=P12345 > > > > ...is synonymous with: > > > > ?subject=P12345 (which would still be supported), > > > > ...but... > > > > ?segment=22:30000000,30200000 > > > > ...does what I want. Maybe not the cleanest solution, but I don't think > it's going to horribly break anything (unless there are subtleties I'm > missing here?) > > > > Thomas. > > I think you're right in that it is going to need another parameter to make > it work. Any objections from anyone? What would happen if you specified > multiple segments which did not correspond to the same section? Return > multiple blocks representing multiple horizontal sections? That's the idea. Use case for this: user is viewing 22:30000000,30200000, then zooms out. Trigger a fetch for: ?segment=22:29900000,29999999;segment=22:30200001,30300000 ...then merge the results into the working set. Thomas. From thomas.a.down at gmail.com Thu Aug 12 07:16:47 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Thu, 12 Aug 2010 12:16:47 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: On Thu, Aug 12, 2010 at 10:30 AM, Andy Jenkinson wrote: > Hi Thomas, > > Uh-oh, URIs... :) > > For coordinate systems, I think the definitions of the component pieces are > fairly well described. It is a pity that the species name is not given its > own parameter though. The sources documentation then says: "The uri > (required) attribute is a globally unique identifier for the coordinate > system. It should be a fully resolvable URL providing more information about > the coordinate system." This could be misleading as although the URIs _are_ > resolvable, the content is not particularly machine friendly. > > I am not willing to change the syntax of the coordinate system URIs out in > the wild, but if you need the content returned to be machine readable we > could replace the HTML content with an XML+XSLT combination. That is, " > http://www.dasregistry.org/dasregistry/coordsys/CS_DS6" would look more > like one of the entries in " > http://www.dasregistry.org/das/coordinatesystem" to a machine, and the > same as it currently does to a human. From a practical perspective though, > if a client parses the XML elements from the registry's > /das/coordinatesystem output, it can identify all the coordinate systems by > both URI and text description. Changing the output wouldn't materially > change what a client needs to do given either a URI or a comma separated > string. It is always going to need to run a HTTP get and do some parsing of > coordinatesystem XML. But it is certainly true that having the URI resolve > to the XML is a more elegant and simple to explain system, and in any case > the spec makes no mention of the fact that a client can even obtain the XML > for all the coordinate systems together. > CS URIs pointing to XML definitely seems more symmetrical. Mentioning /das/coordinatesystem in the spec would help, too -- that's currently rather opaque. A few other questions (don't really have "preferred" answers to any of these, just trying to test the boundaries): 1. What do you expect a server to do if it sees a CS URI that it hasn't seen before? 2. If my organization has sequenced a new genome and is running some internal DAS stuff on that while we finish annotating, etc., what URI do we use for the coordinate system? 3. If my organization is running an internal mirror of the central DAS registry, would I mirror the CS URIs (" http://das.bigpharma.com/dasregistry/coordsys/CS_DS40/"). Still point to dasregistry.org? Something else? Thomas. From jw12 at sanger.ac.uk Thu Aug 12 07:54:24 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Thu, 12 Aug 2010 12:54:24 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: On 12 Aug 2010, at 12:16, Thomas Down wrote: > On Thu, Aug 12, 2010 at 10:30 AM, Andy Jenkinson > wrote: > >> Hi Thomas, >> >> Uh-oh, URIs... :) >> >> For coordinate systems, I think the definitions of the component >> pieces are >> fairly well described. It is a pity that the species name is not >> given its >> own parameter though. The sources documentation then says: "The uri >> (required) attribute is a globally unique identifier for the >> coordinate >> system. It should be a fully resolvable URL providing more >> information about >> the coordinate system." This could be misleading as although the >> URIs _are_ >> resolvable, the content is not particularly machine friendly. >> >> I am not willing to change the syntax of the coordinate system URIs >> out in >> the wild, but if you need the content returned to be machine >> readable we >> could replace the HTML content with an XML+XSLT combination. That >> is, " >> http://www.dasregistry.org/dasregistry/coordsys/CS_DS6" would look >> more >> like one of the entries in " >> http://www.dasregistry.org/das/coordinatesystem" to a machine, and >> the >> same as it currently does to a human. From a practical perspective >> though, >> if a client parses the XML elements from the registry's >> /das/coordinatesystem output, it can identify all the coordinate >> systems by >> both URI and text description. Changing the output wouldn't >> materially >> change what a client needs to do given either a URI or a comma >> separated >> string. It is always going to need to run a HTTP get and do some >> parsing of >> coordinatesystem XML. But it is certainly true that having the URI >> resolve >> to the XML is a more elegant and simple to explain system, and in >> any case >> the spec makes no mention of the fact that a client can even obtain >> the XML >> for all the coordinate systems together. >> > > CS URIs pointing to XML definitely seems more symmetrical. > > Mentioning /das/coordinatesystem in the spec would help, too -- that's > currently rather opaque. > > A few other questions (don't really have "preferred" answers to any of > these, just trying to test the boundaries): > > 1. What do you expect a server to do if it sees a CS URI that > it > hasn't seen before? > > 2. If my organization has sequenced a new genome and is > running some > internal DAS stuff on that while we finish annotating, etc., what > URI do we > use for the coordinate system? If it were me I'd just add it to the known coordinate systems in the registry. You can ask us to do it or run a script that will add it if it doesn't exist already (can send you script if you want). Having a not widely available genome in the registry wouldn't be harmful in any way? especially if it was coming out in the future. The only issue would be advertising an institute as working on this if they put themselves down as the authority I guess? which maybe in issue.. > > 3. If my organization is running an internal mirror of the > central > DAS registry, would I mirror the CS URIs (" > http://das.bigpharma.com/dasregistry/coordsys/CS_DS40/"). Still > point to > dasregistry.org? Something else? If issue with authority advertising as above then mirror all sources plus private ones and change config/hardcode change of servers/clients to point at internal registry (which can just be a sources document hosted locally). ??? > > Thomas. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From thomas.a.down at gmail.com Thu Aug 12 08:05:38 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Thu, 12 Aug 2010 13:05:38 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: On Thu, Aug 12, 2010 at 12:54 PM, Jonathan Warren wrote: > > 2. If my organization has sequenced a new genome and is running some >> internal DAS stuff on that while we finish annotating, etc., what URI do >> we >> use for the coordinate system? >> > If it were me I'd just add it to the known coordinate systems in the > registry. You can ask us to do it or run a script that will add it if it > doesn't exist already (can send you script if you want). Yep, would be very interested to see that. Thomas. From jherrero at ebi.ac.uk Thu Aug 12 09:15:04 2010 From: jherrero at ebi.ac.uk (Javier Herrero) Date: Thu, 12 Aug 2010 14:15:04 +0100 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: References: <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> Message-ID: <201008121415.05406.jherrero@ebi.ac.uk> Hi Thomas On Thursday 12 Aug 2010 12:06:19 Thomas Down wrote: > On Thu, Aug 12, 2010 at 10:41 AM, Andy Jenkinson > > wrote: > > OK so you're a lot further along, sounds good. A while back we were > > aiming to get compara alignments as DAS too (which necessitated the cols > > parameter). > > By "compara alignments", you're talking about the MySQL ensembl-compara > database, right? Yes, that is what we discussed some time ago. > Is there still any interest in this on the Ensembl side? It's something > I'm going to be needing soon, too (my current chain-file-based server > doesn't handle all the cases I'm interested in). I guess the interest must come from "the other side". I am quite keen on providing alignments through DAS if people and/or DAS clients will use them and if that is not too heavy for our servers. You can imagine that things can go horribly wrong if one asked all 33-way EPO alignments on a chromosome at once. This can probably be controlled in the server. Another question is whether it is easy to fit genomic alignments into the current dasalignment structure. I am not sure how to interpret things like dbAccessionId, objectVersion, dbSource, etc for a genomic alignment. In other words, should protein and genomic alignments share the same query and response? Javier > > > As regards to what would be necessary to do what you want, I don't > > > think > > > > we can change the subject parameter unless the existing alignment servers > > and clients using it can be changed, i.e. Pfam (not sure if there are > > others). I can't really think of another way to do it off the top of my > > head - the 'query' parameter has space for start/end positions and can > > be a sequence identifier, but this is more like 'get all alignments > > containing this sequence' which is not quite the same thing. It would > > also be a bit clunky to describe - what would "?query=alignment42:30,40" > > do? > > > > > Any suggestions? > > > > > > Well, the obvious thing is to couple the coordinate restrictions to the > > > > sequence to which they apply. > > > > > Simplest solution I can think of would be to add: > > > ?segment=seqName[:start,end] > > > > > > ...where: > > > ?segment=P12345 > > > > > > ...is synonymous with: > > > ?subject=P12345 (which would still be supported), > > > > > > ...but... > > > > > > ?segment=22:30000000,30200000 > > > > > > ...does what I want. Maybe not the cleanest solution, but I don't > > > think > > > > it's going to horribly break anything (unless there are subtleties I'm > > missing here?) > > > > > Thomas. > > > > I think you're right in that it is going to need another parameter to > > make it work. Any objections from anyone? What would happen if you > > specified multiple segments which did not correspond to the same > > section? Return multiple blocks representing multiple horizontal > > sections? > > That's the idea. > > Use case for this: user is viewing 22:30000000,30200000, then zooms out. > Trigger a fetch for: > > ?segment=22:29900000,29999999;segment=22:30200001,30300000 > > ...then merge the results into the working set. > > Thomas. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das -- Javier Herrero, PhD Ensembl Compara Project Leader European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus, Hinxton Cambridge - CB10 1SD - UK From thomas.a.down at gmail.com Thu Aug 12 09:57:19 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Thu, 12 Aug 2010 14:57:19 +0100 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: <201008121415.05406.jherrero@ebi.ac.uk> References: <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> <201008121415.05406.jherrero@ebi.ac.uk> Message-ID: On Thu, Aug 12, 2010 at 2:15 PM, Javier Herrero wrote: > Hi Thomas > > > > Is there still any interest in this on the Ensembl side? It's something > > I'm going to be needing soon, too (my current chain-file-based server > > doesn't handle all the cases I'm interested in). > > I guess the interest must come from "the other side". I am quite keen on > providing alignments through DAS if people and/or DAS clients will use them > and if that is not too heavy for our servers. Well, I'm very keen to get comparative data into Dalliance ( http://www.biodalliance.org/human/ncbi36/) if you haven't seen it, and an ensembl-compara DAS server would be substantially the best way to do that. > You can imagine that things can > go horribly wrong if one asked all 33-way EPO alignments on a chromosome at > once. This can probably be controlled in the server. > That's an interesting general question. Historically, DAS has gone more for trusting clients to request "sensible" amounts of data (although personally, I'd like to see a richer way of hinting to clients what "sensible" might mean in a given context). You could just forbid fetching alignments >1Mb or something. Having said that, I actually think there *are* legitimate reasons for fetching a whole chromosome worth of alignments. Firstly for clients that want to do something other than pure data visualisation. Secondly, for a client which wants to show a "karyotype" type view, with syntenic blocks labeled, rather than a very detailed base-level alignment. The first one can probably only be addressed by chunky servers and/or responsible usage patterns, but the second one could be handled quite nicely with a very small extension to the current DAS protocol. The current format encourages you to represent the high-level structure of the alignment using BLOCK elements, then fill in the fine base-level structure with CIGAR strings. Given a server that follows this pattern, all that's needed is a flag to omit the CIGARs and you'd have pretty-much perfect data for use in a synteny browser. Given that the CIGAR is already optional, this ought to be pretty painless to add. > Another question is whether it is easy to fit genomic alignments into the > current dasalignment structure. I am not sure how to interpret things like > dbAccessionId, objectVersion, dbSource, etc for a genomic alignment. In > other > words, should protein and genomic alignments share the same query and > response? The response format actually fits pretty well as far as I can tell. To my mind: assembly name/version == dbSource (although this is kind-of redundant with coordinate system...) chromosome name/number == objectVersion. Concrete example (NB. experimental server, may move, change or disappear!): view-source: http://www.derkholm.net:8080/das/hg18ToHg19/alignment?query=22 Because of the block/segment approach, it would be easy to generalize this to >2 way alignments. There may be a few minor additions that are worthwhile, but overall I'm fairly certain this will work okay. Right now, I'm much more concerned about the query format. Thomas. From andy.jenkinson at ebi.ac.uk Thu Aug 12 10:35:42 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 12 Aug 2010 15:35:42 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: <98F09657-A419-40E5-AD78-A0D0A1877CE5@ebi.ac.uk> On 12 Aug 2010, at 12:16, Thomas Down wrote: > > A few other questions (don't really have "preferred" answers to any of these, just trying to test the boundaries): > > 1. What do you expect a server to do if it sees a CS URI that it hasn't seen before? I assume you mean an alignment server receiving such a URI as a parameter? If so I think there is only one realistic thing it can do right now, which is return nothing (i.e. no alignments). It is not an authority on which coordinate systems are real and which aren't, it just knows about the ones it is using. In essence it simply has no data for the requested subject because it doesn't recognise the subject. We could invent error situations for things like this, but that would make DAS more complex for little gain IMO.. > 2. If my organization has sequenced a new genome and is running some internal DAS stuff on that while we finish annotating, etc., what URI do we use for the coordinate system? > > 3. If my organization is running an internal mirror of the central DAS registry, would I mirror the CS URIs ("http://das.bigpharma.com/dasregistry/coordsys/CS_DS40/"). Still point to dasregistry.org? Something else? As Jon says, I think it's easier just to create the coordinate system in the registry. At the moment you need admin privileges to do that via the interface, but I implemented a programmatic version of this wherein the registry accepts POSTed coordinate system XML (sans URI) and replies with the same (with URI included). It is restricted to allow you to create new authority/versions, but not new "segment types": so you can create "GRCh_38,Chromosome,Homo sapiens" but not "GRCh_37,wibble,Homo sapiens". We were a bit wary of opening up the ability to do the latter in fear of ending up with different spellings of the same thing, or other similar issues. I suppose the same thing could be made available to registered users in the interface (possibly at the same time reworking it to use the web services underneath). > Thomas. From jw12 at sanger.ac.uk Thu Aug 12 10:45:38 2010 From: jw12 at sanger.ac.uk (Jonathan Warren) Date: Thu, 12 Aug 2010 15:45:38 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: <98F09657-A419-40E5-AD78-A0D0A1877CE5@ebi.ac.uk> References: <98F09657-A419-40E5-AD78-A0D0A1877CE5@ebi.ac.uk> Message-ID: On 12 Aug 2010, at 15:35, Andy Jenkinson wrote: > On 12 Aug 2010, at 12:16, Thomas Down wrote: >> >> A few other questions (don't really have "preferred" answers to any >> of these, just trying to test the boundaries): >> >> 1. What do you expect a server to do if it sees a CS URI >> that it hasn't seen before? > > I assume you mean an alignment server receiving such a URI as a > parameter? If so I think there is only one realistic thing it can do > right now, which is return nothing (i.e. no alignments). It is not > an authority on which coordinate systems are real and which aren't, > it just knows about the ones it is using. In essence it simply has > no data for the requested subject because it doesn't recognise the > subject. We could invent error situations for things like this, but > that would make DAS more complex for little gain IMO.. > >> 2. If my organization has sequenced a new genome and is >> running some internal DAS stuff on that while we finish annotating, >> etc., what URI do we use for the coordinate system? >> >> 3. If my organization is running an internal mirror of the >> central DAS registry, would I mirror the CS URIs ("http://das.bigpharma.com/dasregistry/coordsys/CS_DS40/ >> "). Still point to dasregistry.org? Something else? > > As Jon says, I think it's easier just to create the coordinate > system in the registry. At the moment you need admin privileges to > do that via the interface, but I implemented a programmatic version > of this wherein the registry accepts POSTed coordinate system XML > (sans URI) and replies with the same (with URI included). It is > restricted to allow you to create new authority/versions, but not > new "segment types": so you can create "GRCh_38,Chromosome,Homo > sapiens" but not "GRCh_37,wibble,Homo sapiens". We were a bit wary > of opening up the ability to do the latter in fear of ending up with > different spellings of the same thing, or other similar issues. > > I suppose the same thing could be made available to registered users > in the interface (possibly at the same time reworking it to use the > web services underneath). > This functionality is currently available in the interface for admin users. >> Thomas. > > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das Jonathan Warren Senior Developer and DAS coordinator blog: http://biodasman.wordpress.com/ jw12 at sanger.ac.uk Ext: 2314 Telephone: 01223 492314 -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. From andreas at sdsc.edu Thu Aug 12 11:22:59 2010 From: andreas at sdsc.edu (Andreas Prlic) Date: Thu, 12 Aug 2010 08:22:59 -0700 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: References: <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> <201008121415.05406.jherrero@ebi.ac.uk> Message-ID: Just to add my 0.02$: I have seen the spec always as a tool to write applications. If the spec does not support a new type of application it should get modified. I like pragmatic approach of coupling the coordinate restrictions to the sequence to which they apply... Andreas -- ----------------------------------------------------------------------- Dr. Andreas Prlic Senior Scientist, RCSB PDB Protein Data Bank University of California, San Diego (+1) 858.246.0526 ----------------------------------------------------------------------- From andreas at sdsc.edu Thu Aug 12 11:29:38 2010 From: andreas at sdsc.edu (Andreas Prlic) Date: Thu, 12 Aug 2010 08:29:38 -0700 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: > With regards to the alignment command specifically, I wanted to use the URI for both the query and the content as they are more robust, but there was some practical reason for the existing servers that prevented us from doing so. Perhaps Rob or Andreas can comment? Do you mean that if you have a URI as part of a URL parameter, you need to HTTP escape it? Andreas -- ----------------------------------------------------------------------- Dr. Andreas Prlic Senior Scientist, RCSB PDB Protein Data Bank University of California, San Diego (+1) 858.246.0526 ----------------------------------------------------------------------- From andy.jenkinson at ebi.ac.uk Thu Aug 12 12:28:27 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 12 Aug 2010 17:28:27 +0100 Subject: [DAS] DAS1.6: coordinate systems In-Reply-To: References: Message-ID: <39B1538B-947D-4CCB-9759-0EFFD438AB38@ebi.ac.uk> On 12 Aug 2010, at 16:29, Andreas Prlic wrote: >> With regards to the alignment command specifically, I wanted to use the URI for both the query and the content as they are more robust, but there was some practical reason for the existing servers that prevented us from doing so. Perhaps Rob or Andreas can comment? > > Do you mean that if you have a URI as part of a URL parameter, you > need to HTTP escape it? I'm not sure if you need to escape or not, I don't think so. But what I meant is that I wanted the XML content to contain the URI instead of the comma separated description, i.e.: instead of: I went through the alignment command with Rob when drawing up the 1.6 spec, and this was one of the issues we discussed. The 1.53E documentation does not say what should be used in this field. Rob preferred to use the comma separated version, probably because servers were already doing it that way. However, the coordinate system can also be specified in the "subjectcoordsys" query parameter (to help when subjects within an alignment have the same name but different coordinate systems): /das/mysource/alignment?query=alignid&subject=seqid&subjectcoordsys=http://www.dasregistry.org/dasregistry/coordsys/CS_DS123 I remember this being added to 1.53E, but again it is not fully described in the 1.53E documentation. In the 1.6 specification I've written that it should be the URI, but I can't remember why I wrote it this way - it could be a mistake, or there could be servers/clients doing it this way. From thomas.a.down at gmail.com Thu Aug 12 16:50:42 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Thu, 12 Aug 2010 21:50:42 +0100 Subject: [DAS] Restricting the range of an alignment query. In-Reply-To: <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> References: <97E57D65-3289-4DF8-8CB9-9A0C3A0E15A0@ebi.ac.uk> <88D5E515-89B1-4BF8-99AC-6DCD0B4A82B0@ebi.ac.uk> Message-ID: Just to follow up briefly, here's a somewhat usable example use of genome alignment DAS. (NB, requires Firefox 3.6+ or recent-ish version of Safari or Google Chrome). 1. Go to http://www.biodalliance.org/human/GRCh37/ 2. Click on the "Add tracks" button (left hand end of the toolbar). 3. Click over to the "NCBI36" tab. 4. Select some datasources from the registry and click "Add". Behind the scenes, your browser will be fetching relevant parts of the NCBI36 vs. GRCh37 alignment via DAS, then using this to re-map features on the fly. (Currently the alignment-fetching depends on what I fear is a slightly non-standard usage of the query= parameter. But once we've got a good alternative to that, we can do the whole process using pure DAS). Thomas. From awitney at sgul.ac.uk Tue Aug 17 11:49:31 2010 From: awitney at sgul.ac.uk (Adam Witney) Date: Tue, 17 Aug 2010 16:49:31 +0100 Subject: [DAS] DAS and bacterial genomes Message-ID: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> Hi, What would be the best approach to use DAS with bacterial genomes? I can't seem to find any coordinate systems for these organisms in the Registry. Thanks for any advice Adam From andy.jenkinson at ebi.ac.uk Tue Aug 17 13:07:48 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Tue, 17 Aug 2010 18:07:48 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> Message-ID: <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> Hi Adam, There are no coordinate systems yet as nobody has yet been brave enough to start using DAS with bacteria in anger. Eugene at Ensembl Genomes will have an interest in doing this, but they have issues with matching up their species/strain names with the NCBI taxonomy upon which DAS's coordinates are based. In essence if you will need to name the coordinate systems after which they will need to be added to the registry. For example when Ensembl Genomes manage to do this, the coordinate systems might end up looking like: EB_1,Chromosome,Shigella flexneri 2a str. 301 EB_1,Plasmid,Shigella flexneri 2a str. 301 This is for a specific shigella strain with taxonomy ID 198214. The authority and version parts of the DAS coordinate system are somewhat arbitrarily named, ideally they would be a standard that is used by the rest of the community for interoperability purposes. What exactly is it you'd like to be able to do? How many species' are we talking about? The reason I ask is that getting these coordinate systems into the DAS registry does require some work. Some of this is on the registry's side, but depending where your data come from there may be issues with identifying the correct coordinate system details such that others can reuse them meaningfully. To use the example above, Ensembl Genomes give the "301" strain a different name from NCBI and use the taxonomy ID not for the strain but for the parent species (Shigella flexneri). In fact the 2457T strain also uses the same taxonomy ID, which isn't helpful. Given the number of species', this adds up to a major headache. Cheers, Andy On 17 Aug 2010, at 16:49, Adam Witney wrote: > Hi, > > What would be the best approach to use DAS with bacterial genomes? I can't seem to find any coordinate systems for these organisms in the Registry. > > Thanks for any advice > > Adam > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From awitney at sgul.ac.uk Wed Aug 18 04:52:35 2010 From: awitney at sgul.ac.uk (Adam Witney) Date: Wed, 18 Aug 2010 09:52:35 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> Message-ID: Hi Andy, Yes I am aware of the some of the idiosyncrasies of the Ensembl Genomes naming conventions. But is there a reason that the DAS registry should be constrained by Ensembl Genomes? Could the Registry entry refer to a specific taxonomy iD and its corresponding entry in EG, despite EG using a different taxonomy ID? I'd like to be able to export our microarray designs and data via DAS for others to use (including EnsemblBacteria). This if for 16 or so species with multiple strains thereof. cheers Adam On 17 Aug 2010, at 18:07, Andy Jenkinson wrote: > Hi Adam, > > There are no coordinate systems yet as nobody has yet been brave enough to start using DAS with bacteria in anger. Eugene at Ensembl Genomes will have an interest in doing this, but they have issues with matching up their species/strain names with the NCBI taxonomy upon which DAS's coordinates are based. In essence if you will need to name the coordinate systems after which they will need to be added to the registry. > > For example when Ensembl Genomes manage to do this, the coordinate systems might end up looking like: > EB_1,Chromosome,Shigella flexneri 2a str. 301 > EB_1,Plasmid,Shigella flexneri 2a str. 301 > > This is for a specific shigella strain with taxonomy ID 198214. The authority and version parts of the DAS coordinate system are somewhat arbitrarily named, ideally they would be a standard that is used by the rest of the community for interoperability purposes. > > What exactly is it you'd like to be able to do? How many species' are we talking about? > > The reason I ask is that getting these coordinate systems into the DAS registry does require some work. Some of this is on the registry's side, but depending where your data come from there may be issues with identifying the correct coordinate system details such that others can reuse them meaningfully. To use the example above, Ensembl Genomes give the "301" strain a different name from NCBI and use the taxonomy ID not for the strain but for the parent species (Shigella flexneri). In fact the 2457T strain also uses the same taxonomy ID, which isn't helpful. Given the number of species', this adds up to a major headache. > > Cheers, > Andy > > On 17 Aug 2010, at 16:49, Adam Witney wrote: > >> Hi, >> >> What would be the best approach to use DAS with bacterial genomes? I can't seem to find any coordinate systems for these organisms in the Registry. >> >> Thanks for any advice >> >> Adam >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das > From birney at ebi.ac.uk Wed Aug 18 05:30:32 2010 From: birney at ebi.ac.uk (Ewan Birney) Date: Wed, 18 Aug 2010 10:30:32 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> Message-ID: <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> On 18 Aug 2010, at 09:52, Adam Witney wrote: > > Hi Andy, > > Yes I am aware of the some of the idiosyncrasies of the Ensembl > Genomes naming conventions. But is there a reason that the DAS > registry should be constrained by Ensembl Genomes? Could the > Registry entry refer to a specific taxonomy iD and its corresponding > entry in EG, despite EG using a different taxonomy ID? > > I'd like to be able to export our microarray designs and data via > DAS for others to use (including EnsemblBacteria). This if for 16 or > so species with multiple strains thereof. > Just to say that I think we should get this as straight as we can; just to state the obvious - EG is not trying to be deliberately complex here, it is just that the concept of "one taxid == one species == one assembly series" just breaks down in bacteria. I've brought in the three key people here on the EG side - Eugene (does the web side of this); Dan (main data production manager) and Paul Kersey (the EG PI) - some of them are on holiday now, but I suggest perhaps setting up a phone conference (and/or Adam could you come for a visit?) to get this as straight as we can - I suspect there will both be short term fixes and more longer term infrastructural fixes here. > cheers > > Adam > > > On 17 Aug 2010, at 18:07, Andy Jenkinson wrote: > >> Hi Adam, >> >> There are no coordinate systems yet as nobody has yet been brave >> enough to start using DAS with bacteria in anger. Eugene at Ensembl >> Genomes will have an interest in doing this, but they have issues >> with matching up their species/strain names with the NCBI taxonomy >> upon which DAS's coordinates are based. In essence if you will need >> to name the coordinate systems after which they will need to be >> added to the registry. >> >> For example when Ensembl Genomes manage to do this, the coordinate >> systems might end up looking like: >> EB_1,Chromosome,Shigella flexneri 2a str. 301 >> EB_1,Plasmid,Shigella flexneri 2a str. 301 >> >> This is for a specific shigella strain with taxonomy ID 198214. The >> authority and version parts of the DAS coordinate system are >> somewhat arbitrarily named, ideally they would be a standard that >> is used by the rest of the community for interoperability purposes. >> >> What exactly is it you'd like to be able to do? How many species' >> are we talking about? >> >> The reason I ask is that getting these coordinate systems into the >> DAS registry does require some work. Some of this is on the >> registry's side, but depending where your data come from there may >> be issues with identifying the correct coordinate system details >> such that others can reuse them meaningfully. To use the example >> above, Ensembl Genomes give the "301" strain a different name from >> NCBI and use the taxonomy ID not for the strain but for the parent >> species (Shigella flexneri). In fact the 2457T strain also uses the >> same taxonomy ID, which isn't helpful. Given the number of >> species', this adds up to a major headache. >> >> Cheers, >> Andy >> >> On 17 Aug 2010, at 16:49, Adam Witney wrote: >> >>> Hi, >>> >>> What would be the best approach to use DAS with bacterial genomes? >>> I can't seem to find any coordinate systems for these organisms in >>> the Registry. >>> >>> Thanks for any advice >>> >>> Adam >>> _______________________________________________ >>> DAS mailing list >>> DAS at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/das >> > > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From thomas.a.down at gmail.com Wed Aug 18 05:35:36 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Wed, 18 Aug 2010 10:35:36 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> Message-ID: On Wed, Aug 18, 2010 at 10:30 AM, Ewan Birney wrote: > > On 18 Aug 2010, at 09:52, Adam Witney wrote: > > >> Hi Andy, >> >> Yes I am aware of the some of the idiosyncrasies of the Ensembl Genomes >> naming conventions. But is there a reason that the DAS registry should be >> constrained by Ensembl Genomes? Could the Registry entry refer to a specific >> taxonomy iD and its corresponding entry in EG, despite EG using a different >> taxonomy ID? >> >> I'd like to be able to export our microarray designs and data via DAS for >> others to use (including EnsemblBacteria). This if for 16 or so species with >> multiple strains thereof. >> >> > Just to say that I think we should get this as straight as we can; just to > state the obvious - EG is not trying to be deliberately complex here, it is > just that the concept of "one taxid == one species == one assembly series" > just > breaks down in bacteria. > > > I've brought in the three key people here on the EG side - Eugene (does the > web > side of this); Dan (main data production manager) and Paul Kersey (the EG > PI) - > some of them are on holiday now, but I suggest perhaps setting up a phone > conference (and/or Adam could you come for a visit?) to get this as > straight as > we can - I suspect there will both be short term fixes and more longer term > infrastructural fixes here. The DAS coordinate system scheme already handles multiple assemblies for a given taxon fairly well (via the "authority" part). So it sounds like the main thing that's missing is a sensible way of distinguishing between strains. Given that the CSs are already defined by XML elements with multiple attributes (many of them optional), would adding an (also optional, of course) strain attribute get things 95% of the way there? Thomas. From birney at ebi.ac.uk Wed Aug 18 05:52:06 2010 From: birney at ebi.ac.uk (Ewan Birney) Date: Wed, 18 Aug 2010 10:52:06 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> Message-ID: <5CDE0B00-9FC5-481F-AD70-D25467D0C38A@ebi.ac.uk> >> >> >> I've brought in the three key people here on the EG side - Eugene >> (does the >> web >> side of this); Dan (main data production manager) and Paul Kersey >> (the EG >> PI) - >> some of them are on holiday now, but I suggest perhaps setting up a >> phone >> conference (and/or Adam could you come for a visit?) to get this as >> straight as >> we can - I suspect there will both be short term fixes and more >> longer term >> infrastructural fixes here. > > > The DAS coordinate system scheme already handles multiple assemblies > for a > given taxon fairly well (via the "authority" part). So it sounds > like the > main thing that's missing is a sensible way of distinguishing between > strains. Given that the CSs are already defined by XML elements with > multiple attributes (many of them optional), would adding an (also > optional, > of course) strain attribute get things 95% of the way there? > I dont think this quite works; the whole thing goes fractal here in that multiple groups assign "their" assemblies to the same strain id so you have hte same problem, just one level lower (I think). At the last EBI/NCBI meeting we got to more clarity on there being tracking identifiers for assembly series. I think this is the long term solution. But it needs to be pushed out to bacteria, systems in place, and populated. In the meantime we should do something practical to at least allow DAS servers to be mounted against EG web sites (I think this is Adam's use case). ...but I am not the expert here. Each time I think this is simple it .... doesn't end up that way. > Thomas. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From awitney at sgul.ac.uk Wed Aug 18 06:02:44 2010 From: awitney at sgul.ac.uk (Adam Witney) Date: Wed, 18 Aug 2010 11:02:44 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> Message-ID: <6EC863C7-F241-4CAD-9613-CDCC35E8074A@sgul.ac.uk> >> Yes I am aware of the some of the idiosyncrasies of the Ensembl Genomes naming conventions. But is there a reason that the DAS registry should be constrained by Ensembl Genomes? Could the Registry entry refer to a specific taxonomy iD and its corresponding entry in EG, despite EG using a different taxonomy ID? >> >> I'd like to be able to export our microarray designs and data via DAS for others to use (including EnsemblBacteria). This if for 16 or so species with multiple strains thereof. >> > > Just to say that I think we should get this as straight as we can; just to > state the obvious - EG is not trying to be deliberately complex here, it is > just that the concept of "one taxid == one species == one assembly series" just > breaks down in bacteria. Could it be treated as "one taxid == one strain == one assembly"? > I've brought in the three key people here on the EG side - Eugene (does the web > side of this); Dan (main data production manager) and Paul Kersey (the EG PI) - > some of them are on holiday now, but I suggest perhaps setting up a phone > conference (and/or Adam could you come for a visit?) to get this as straight as > we can - I suspect there will both be short term fixes and more longer term > infrastructural fixes here. I'm happy to help any way I can. I am registered on the Ensembl developers workshop next month so could chat to you guys then? thanks adam From biopython at maubp.freeserve.co.uk Wed Aug 18 07:23:56 2010 From: biopython at maubp.freeserve.co.uk (Peter) Date: Wed, 18 Aug 2010 12:23:56 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <6EC863C7-F241-4CAD-9613-CDCC35E8074A@sgul.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> <6EC863C7-F241-4CAD-9613-CDCC35E8074A@sgul.ac.uk> Message-ID: On Wed, Aug 18, 2010 at 11:02 AM, Adam Witney wrote: > >> >> Just to say that I think we should get this as straight as we can; just to >> state the obvious - EG is not trying to be deliberately complex here, it is >> just that the concept of "one taxid == one species == one assembly series" just >> breaks down in bacteria. > > Could it be treated as "one taxid == one strain == one assembly"? I'm sticking my oar in here, and probably this is obvious, but anyway... Surely it has to be "one taxid == one strain == multiple assemblies"? And this isn't just for Bacteria, but all organisms. After all, "the" human sequence has already gone through several revisions (hg18 and hg19 being the latest). Even things like the E coli K12 sequence have gone through multiple revisions after publication. What about multiple individual genomes from the same taxid? What I am thinking of here is different human genomes - do we call Craig Ventor a strain of human with his own taxid? That doesn't seem like it will scale well ;) Would we have an extra level here, i.e. one taxid == one strain == multiple individuals, one individual == multiple assemblies Peter From andy.jenkinson at ebi.ac.uk Wed Aug 18 08:23:54 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Wed, 18 Aug 2010 13:23:54 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <6EC863C7-F241-4CAD-9613-CDCC35E8074A@sgul.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> <6EC863C7-F241-4CAD-9613-CDCC35E8074A@sgul.ac.uk> Message-ID: >> I've brought in the three key people here on the EG side - Eugene (does the web >> side of this); Dan (main data production manager) and Paul Kersey (the EG PI) - >> some of them are on holiday now, but I suggest perhaps setting up a phone >> conference (and/or Adam could you come for a visit?) to get this as straight as >> we can - I suspect there will both be short term fixes and more longer term >> infrastructural fixes here. > > I'm happy to help any way I can. I am registered on the Ensembl developers workshop next month so could chat to you guys then? > > thanks > > adam I think this is sensible. I've been through this topic a few times with Eugene and I know it is not simple, but I would really like to see this resolved and I do have my own ideas of options to do it. I feel we need to get clear on what the issues are and formulate a plan, and it's much easier to do in person. From lincoln.stein at gmail.com Wed Aug 18 15:25:29 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Wed, 18 Aug 2010 15:25:29 -0400 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> Message-ID: It sounds to me as though the DAS source metadata needs just one additional field to indicate the strain, isolate or individual, making the hierarchy: taxid -> strain/isolate/individual id -> assembly (the taxid and the assembly are already in DAS) Is there a definitive repository of isolate IDs that could be used for this purpose? As mentioned elsewhere in this thread, the problem of distinguishing an individual from its taxon is not limited to bacteria. Does the 1000 genomes project use the assembly as a surrogate for isolate/individual? Lincoln On Wed, Aug 18, 2010 at 5:30 AM, Ewan Birney wrote: > > On 18 Aug 2010, at 09:52, Adam Witney wrote: > > >> Hi Andy, >> >> Yes I am aware of the some of the idiosyncrasies of the Ensembl Genomes >> naming conventions. But is there a reason that the DAS registry should be >> constrained by Ensembl Genomes? Could the Registry entry refer to a specific >> taxonomy iD and its corresponding entry in EG, despite EG using a different >> taxonomy ID? >> >> I'd like to be able to export our microarray designs and data via DAS for >> others to use (including EnsemblBacteria). This if for 16 or so species with >> multiple strains thereof. >> >> > Just to say that I think we should get this as straight as we can; just to > state the obvious - EG is not trying to be deliberately complex here, it is > just that the concept of "one taxid == one species == one assembly series" > just > breaks down in bacteria. > > > I've brought in the three key people here on the EG side - Eugene (does the > web > side of this); Dan (main data production manager) and Paul Kersey (the EG > PI) - > some of them are on holiday now, but I suggest perhaps setting up a phone > conference (and/or Adam could you come for a visit?) to get this as > straight as > we can - I suspect there will both be short term fixes and more longer term > infrastructural fixes here. > > > > cheers >> >> Adam >> >> >> On 17 Aug 2010, at 18:07, Andy Jenkinson wrote: >> >> Hi Adam, >>> >>> There are no coordinate systems yet as nobody has yet been brave enough >>> to start using DAS with bacteria in anger. Eugene at Ensembl Genomes will >>> have an interest in doing this, but they have issues with matching up their >>> species/strain names with the NCBI taxonomy upon which DAS's coordinates are >>> based. In essence if you will need to name the coordinate systems after >>> which they will need to be added to the registry. >>> >>> For example when Ensembl Genomes manage to do this, the coordinate >>> systems might end up looking like: >>> EB_1,Chromosome,Shigella flexneri 2a str. 301 >>> EB_1,Plasmid,Shigella flexneri 2a str. 301 >>> >>> This is for a specific shigella strain with taxonomy ID 198214. The >>> authority and version parts of the DAS coordinate system are somewhat >>> arbitrarily named, ideally they would be a standard that is used by the rest >>> of the community for interoperability purposes. >>> >>> What exactly is it you'd like to be able to do? How many species' are we >>> talking about? >>> >>> The reason I ask is that getting these coordinate systems into the DAS >>> registry does require some work. Some of this is on the registry's side, but >>> depending where your data come from there may be issues with identifying the >>> correct coordinate system details such that others can reuse them >>> meaningfully. To use the example above, Ensembl Genomes give the "301" >>> strain a different name from NCBI and use the taxonomy ID not for the strain >>> but for the parent species (Shigella flexneri). In fact the 2457T strain >>> also uses the same taxonomy ID, which isn't helpful. Given the number of >>> species', this adds up to a major headache. >>> >>> Cheers, >>> Andy >>> >>> On 17 Aug 2010, at 16:49, Adam Witney wrote: >>> >>> Hi, >>>> >>>> What would be the best approach to use DAS with bacterial genomes? I >>>> can't seem to find any coordinate systems for these organisms in the >>>> Registry. >>>> >>>> Thanks for any advice >>>> >>>> Adam >>>> _______________________________________________ >>>> DAS mailing list >>>> DAS at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/das >>>> >>> >>> >> >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das >> > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > -- Lincoln D. Stein Director, Informatics and Biocomputing Platform Ontario Institute for Cancer Research 101 College St., Suite 800 Toronto, ON, Canada M5G0A3 416 673-8514 Assistant: Renata Musa From awitney at sgul.ac.uk Wed Aug 18 15:47:30 2010 From: awitney at sgul.ac.uk (Adam Witney) Date: Wed, 18 Aug 2010 20:47:30 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> Message-ID: On 18 Aug 2010, at 20:25, Lincoln Stein wrote: > It sounds to me as though the DAS source metadata needs just one additional field to indicate the strain, isolate or individual, making the hierarchy: > > taxid -> strain/isolate/individual id -> assembly > > (the taxid and the assembly are already in DAS) Is there a definitive repository of isolate IDs that could be used for this purpose? most of the sequenced strains already have strain specific taxid's. Although I don't know how NCBI add new taxid's so not sure how this will scale with the number of genomes that are currently being sequenced. > As mentioned elsewhere in this thread, the problem of distinguishing an individual from its taxon is not limited to bacteria. Does the 1000 genomes project use the assembly as a surrogate for isolate/individual? No it's not unique to bacteria, indeed I notice that Plasmodium vivax is already in the DAS registry, does this not suffer the same problems? i.e which strain are the coordinates referring to? Adam > Lincoln > > On Wed, Aug 18, 2010 at 5:30 AM, Ewan Birney wrote: > > On 18 Aug 2010, at 09:52, Adam Witney wrote: > > > Hi Andy, > > Yes I am aware of the some of the idiosyncrasies of the Ensembl Genomes naming conventions. But is there a reason that the DAS registry should be constrained by Ensembl Genomes? Could the Registry entry refer to a specific taxonomy iD and its corresponding entry in EG, despite EG using a different taxonomy ID? > > I'd like to be able to export our microarray designs and data via DAS for others to use (including EnsemblBacteria). This if for 16 or so species with multiple strains thereof. > > > Just to say that I think we should get this as straight as we can; just to > state the obvious - EG is not trying to be deliberately complex here, it is > just that the concept of "one taxid == one species == one assembly series" just > breaks down in bacteria. > > > I've brought in the three key people here on the EG side - Eugene (does the web > side of this); Dan (main data production manager) and Paul Kersey (the EG PI) - > some of them are on holiday now, but I suggest perhaps setting up a phone > conference (and/or Adam could you come for a visit?) to get this as straight as > we can - I suspect there will both be short term fixes and more longer term > infrastructural fixes here. > > > > cheers > > Adam > > > On 17 Aug 2010, at 18:07, Andy Jenkinson wrote: > > Hi Adam, > > There are no coordinate systems yet as nobody has yet been brave enough to start using DAS with bacteria in anger. Eugene at Ensembl Genomes will have an interest in doing this, but they have issues with matching up their species/strain names with the NCBI taxonomy upon which DAS's coordinates are based. In essence if you will need to name the coordinate systems after which they will need to be added to the registry. > > For example when Ensembl Genomes manage to do this, the coordinate systems might end up looking like: > EB_1,Chromosome,Shigella flexneri 2a str. 301 > EB_1,Plasmid,Shigella flexneri 2a str. 301 > > This is for a specific shigella strain with taxonomy ID 198214. The authority and version parts of the DAS coordinate system are somewhat arbitrarily named, ideally they would be a standard that is used by the rest of the community for interoperability purposes. > > What exactly is it you'd like to be able to do? How many species' are we talking about? > > The reason I ask is that getting these coordinate systems into the DAS registry does require some work. Some of this is on the registry's side, but depending where your data come from there may be issues with identifying the correct coordinate system details such that others can reuse them meaningfully. To use the example above, Ensembl Genomes give the "301" strain a different name from NCBI and use the taxonomy ID not for the strain but for the parent species (Shigella flexneri). In fact the 2457T strain also uses the same taxonomy ID, which isn't helpful. Given the number of species', this adds up to a major headache. > > Cheers, > Andy > > On 17 Aug 2010, at 16:49, Adam Witney wrote: > > Hi, > > What would be the best approach to use DAS with bacterial genomes? I can't seem to find any coordinate systems for these organisms in the Registry. > > Thanks for any advice > > Adam > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > > > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > > > > -- > Lincoln D. Stein > Director, Informatics and Biocomputing Platform > Ontario Institute for Cancer Research > 101 College St., Suite 800 > Toronto, ON, Canada M5G0A3 > 416 673-8514 > Assistant: Renata Musa From andy.jenkinson at ebi.ac.uk Wed Aug 18 19:10:27 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 19 Aug 2010 00:10:27 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> Message-ID: <6EC71B69-E8DE-4BCA-AE64-E25B147884EA@ebi.ac.uk> On 18 Aug 2010, at 20:47, Adam Witney wrote: >> As mentioned elsewhere in this thread, the problem of distinguishing an individual from its taxon is not limited to bacteria. Does the 1000 genomes project use the assembly as a surrogate for isolate/individual? > > No it's not unique to bacteria, indeed I notice that Plasmodium vivax is already in the DAS registry, does this not suffer the same problems? i.e which strain are the coordinates referring to? So far a coordinate system always refers to the species or strain identified by its taxonomy ID. As you say, strains DO have their own NCBI taxonomy ID. It may be that this is not the case for a strain that someone wants to annotate, but I have yet to see an actual example. There is the wider question of how to handle individuals though. I can't comment on how 1000 genomes do this as I've only seen these data expressed as variations annotated upon the reference assembly, but my feeling is that if annotations of an individual were needed then it could/would be done using the assembly paradigm as a surrogate. From birney at ebi.ac.uk Thu Aug 19 05:56:49 2010 From: birney at ebi.ac.uk (Ewan Birney) Date: Thu, 19 Aug 2010 10:56:49 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <6EC71B69-E8DE-4BCA-AE64-E25B147884EA@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> <6EC71B69-E8DE-4BCA-AE64-E25B147884EA@ebi.ac.uk> Message-ID: Just to repeat : I always think this should be easy and then I get educated by Paul: I thikn each time one thinks about "just moving it down a level" (eg, to strain) there are submitted cases in which two people have submitted assemblies with the same "strain tax id" but actually clearly arent (eg, there is a big insertion of something). The whole thing keeps moving down a notch. The right thing here is to assign tracking idenitifers to assembly series independently of the strain assignments, and track assemblies separately (but obviously with relationships) to strains. Paul has met most (?all) of the use cases and understands this better than me. I think we should wait for Paul to weigh in here - it's just always a bit more complicated than you think ;) On 19 Aug 2010, at 00:10, Andy Jenkinson wrote: > On 18 Aug 2010, at 20:47, Adam Witney wrote: >>> As mentioned elsewhere in this thread, the problem of >>> distinguishing an individual from its taxon is not limited to >>> bacteria. Does the 1000 genomes project use the assembly as a >>> surrogate for isolate/individual? >> >> No it's not unique to bacteria, indeed I notice that Plasmodium >> vivax is already in the DAS registry, does this not suffer the same >> problems? i.e which strain are the coordinates referring to? > > So far a coordinate system always refers to the species or strain > identified by its taxonomy ID. As you say, strains DO have their own > NCBI taxonomy ID. It may be that this is not the case for a strain > that someone wants to annotate, but I have yet to see an actual > example. There is the wider question of how to handle individuals > though. I can't comment on how 1000 genomes do this as I've only > seen these data expressed as variations annotated upon the reference > assembly, but my feeling is that if annotations of an individual > were needed then it could/would be done using the assembly paradigm > as a surrogate. > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das From jherrero at ebi.ac.uk Thu Aug 19 12:00:55 2010 From: jherrero at ebi.ac.uk (Javier Herrero) Date: Thu, 19 Aug 2010 17:00:55 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <6EC71B69-E8DE-4BCA-AE64-E25B147884EA@ebi.ac.uk> Message-ID: <201008191700.55590.jherrero@ebi.ac.uk> Yes, this can be fun. I can also foresee similar problems in vertebrate genomes. On one hand, some assemblies correspond to more than one single individual: chimeric assemblies, haplotypes, etc. On the other hand, you can get more than one genome per individual (i.e. cancer genomes). Javier On Thursday 19 Aug 2010 10:56:49 Ewan Birney wrote: > Just to repeat : > > I always think this should be easy and then I get educated by Paul: > > I thikn each time one thinks about "just moving it down a > level" (eg, to strain) there are submitted > cases in which two people have submitted assemblies with the same > "strain tax id" but actually > clearly arent (eg, there is a big insertion of something). The whole > thing keeps moving down > a notch. > > The right thing here is to assign tracking idenitifers to assembly > series independently of > the strain assignments, and track assemblies separately (but obviously > with relationships) > to strains. > > Paul has met most (?all) of the use cases and understands this > better than me. I think > we should wait for Paul to weigh in here - it's just always a bit more > complicated than you > think ;) > > On 19 Aug 2010, at 00:10, Andy Jenkinson wrote: > > On 18 Aug 2010, at 20:47, Adam Witney wrote: > >>> As mentioned elsewhere in this thread, the problem of > >>> distinguishing an individual from its taxon is not limited to > >>> bacteria. Does the 1000 genomes project use the assembly as a > >>> surrogate for isolate/individual? > >> > >> No it's not unique to bacteria, indeed I notice that Plasmodium > >> vivax is already in the DAS registry, does this not suffer the same > >> problems? i.e which strain are the coordinates referring to? > > > > So far a coordinate system always refers to the species or strain > > identified by its taxonomy ID. As you say, strains DO have their own > > NCBI taxonomy ID. It may be that this is not the case for a strain > > that someone wants to annotate, but I have yet to see an actual > > example. There is the wider question of how to handle individuals > > though. I can't comment on how 1000 genomes do this as I've only > > seen these data expressed as variations annotated upon the reference > > assembly, but my feeling is that if annotations of an individual > > were needed then it could/would be done using the assembly paradigm > > as a surrogate. > > _______________________________________________ > > DAS mailing list > > DAS at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/das > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das -- Javier Herrero, PhD Ensembl Compara Project Leader European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus, Hinxton Cambridge - CB10 1SD - UK From rmb32 at cornell.edu Thu Aug 19 13:09:45 2010 From: rmb32 at cornell.edu (Robert Buels) Date: Thu, 19 Aug 2010 10:09:45 -0700 Subject: [DAS] reminder: Aug 25 deadline for GMOD Hackathon application Message-ID: <4C6D6559.3080809@cornell.edu> Hi all, This is your one-week reminder: the deadline for open applications to the GMOD Evo hackathon is Wednesday, August 25th. Rob ======================================== We are seeking participants for the GMOD Tools for Evolutionary Biology Hackathon, held November 8-12, 2010 at the US National Evolutionary Synthesis Center (NESCent) in Durham, NC. This hackathon targets three critical gaps in the capabilities of the GMOD toolbox that currently limit its utility for evolutionary research: 1. Visualization of comparative genomics data 2. Visualization of phylogenetic data and trees 3. Support for population diversity and phenotype data If you are interested in these areas and have relevant expertise, you are strongly encouraged to apply. Relevant areas of expertise include more than just software development: if you are a GMOD power user, visualization guru, domain expert (comparative, phylogenetics, population, ...), or documentation wizard, then your skills are needed! How To Apply: Fill out the online application form at http://bit.ly/gmodevohack. Applications are due August 25. About GMOD: GMOD is an intercompatible suite of open-source software components for storing, managing, analyzing, and visualizing genome-scale data. GMOD includes many widely-used software components: GBrowse and JBrowse, both genome viewers; GBrowse_syn, a comparative genomics viewer; Chado, a generic and modular database schema; CMap, a comparative map viewer; as well as many other components including Apollo, MAKER, BioMart, InterMine, and Galaxy. We hope to extend the functionality of existing GMOD components, and integrate new components as well. About Hackathons: A hackathon is an intense event at which a group of programmers with different backgrounds and skills collaborate hands-on and face-to-face to develop working code that is of utility to the community as a whole. The mix of people will include domain experts and computer-savvy end-users. More details about the event, its motivation, organization, procedures, and attendees, as well as URLs to the hackathon and related websites are included below. Sincerely, The GMOD EvoHack Organizing Committee (and project affiliations as relevant): Nicole Washington, Chair (LBNL, modENCODE, Phenote) Robert Buels (SGN, Chado NatDiv) Scott Cain (OICR, GMOD) Dave Clements (NESCent, GMOD) Hilmar Lapp (NESCent, Phenoscape, Chado NatDiv) Sheldon McKay (University of Arizona, iPlant, GBrowse_syn) ----------------------------- About the GMOD Evo Hackathon Overview We are organizing a hackathon to fill critical gaps in the capabilities of the Generic Model Organism Database (GMOD) toolbox that currently limit its utility for evolutionary research. Specifically, we will focus on tools for 1) viewing comparative genomics data; 2) visualizing phylogenomic data; and 3) supporting population diversity data and phenotype annotation. The event will be hosted at NESCent and bring together a group of about 20+ software developers, end-user representatives, and documentation experts who would otherwise not meet. The participants will include key developers of GMOD components that currently lack features critical for emerging evolutionary biology research, developers of informatics tools in evolutionary research that lack GMOD integration, and informatics-savvy biologists who can represent end-user requirements. The event will provide a unique opportunity to infuse the GMOD developer community with a heightened awareness of unmet needs in evolutionary biology that GMOD components have the potential to fill, and for tool developers in evolutionary biology to better understand how best to extend or integrate with already existing GMOD components. Before the Event Discussion of ideas and sometimes even design actually starts well before the hackathon, on mailing lists, wiki pages, and conference calls set up among accepted attendees. This advance work lays the foundation for participants to be productive from the very first day. This also means that participants should be willing to contribute some time in advance of the hackathon itself to participate in this preparatory discussion. During the Event Typically, hackathon participants use the morning of the first day of the event to organize themselves into working groups of between 3 and 6 people, each with a focused implementation objective. Ideas and objectives are discussed, and attendees coalesce around the projects in which they have the most experience or interest. Deliverables / Event Results The meeting's attendance, working groups, and outcomes will be fully logged and documented on the GMOD wiki (http://gmod.org). Each working group during the event will typically have its own wiki page, linked from the main EvoHack page, where it documents its minutes and design notes, and provides links to the code and documentation it produces. Also, since GMOD and NESCent are both committed to open source principles, all code and documentation produced by participants during the hackathon must be published under an OSI-approved open source license. As contributions to existing GMOD tools, all hackathon products will most likely satisfy this requirement automatically. NESCent This event is sponsored by the US National Evolutionary Synthesis Center (NESCent, http://www.nescent.org) through its Informatics Whitepapers program (http://www.nescent.org/informatics/whitepapers.php). NESCent promotes the synthesis of information, concepts and knowledge to address significant, emerging, or novel questions in evolutionary science and its applications. NESCent achieves this by supporting research and education across disciplinary, institutional, geographic, and demographic boundaries (see http://www.nescent.org/science/proposals.php). Links Main GMOD EvoHack page, and full proposal: http://gmod.org/wiki/GMOD_Evo_Hackathon NESCent: http://www.nescent.org/ GMOD: http://gmod.org Similar past NESCent events, see: http://hackathon.nescent.org/ GMOD hackathon application: http://bit.ly/gmodevohack -- http://gmod.org/wiki/GMOD_News http://gmod.org/wiki/GMOD_Europe_2010 http://gmod.org/wiki/Help_Desk_Feedback From thomas.a.down at gmail.com Thu Aug 19 14:32:31 2010 From: thomas.a.down at gmail.com (Thomas Down) Date: Thu, 19 Aug 2010 19:32:31 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> <6EC71B69-E8DE-4BCA-AE64-E25B147884EA@ebi.ac.uk> Message-ID: I do wonder if there are two somewhat-disjoint problems here: 1. Defining exactly what was sequenced by a given project. 2. Coming up with a scheme of globally-unique identifiers for genome assemblies which can be applied reasonably consistently. (1) is hard. I absolutely agree that this needs serious thought (and is maybe a moving target right now anyway). I'm going to suggest that (2) is sufficient to get DAS working fairly well, and might be a rather easier situation. Currently a coordinate system is defined by up to four properties (quoting from Andy's draft spec): - The *category* or type of object. For example a chromosome, contig or protein sequence. - The *authority* responsible for defining the coordinate system. For example NCBI or UniProt. - The *version*, for coordinate systems containing entities that are not versioned (e.g. genomic assemblies). - The *species*, for coordinate systems containing only entities from a single organism. The authority is the name of the organization "responsible" for the assembly. So if two different groups sequence the same organism, we can already unambiguously identify them. One might hope that it would be reasonably straightforward to project annotations between closely related strains via alignments? Not saying that the full semantic "what did they sequence" isn't worth solving... but I'm not convinced it's needed to get DAS working. Thomas. On Thu, Aug 19, 2010 at 10:56 AM, Ewan Birney wrote: > > Just to repeat : > > I always think this should be easy and then I get educated by Paul: > > I thikn each time one thinks about "just moving it down a level" (eg, to > strain) there are submitted > cases in which two people have submitted assemblies with the same "strain > tax id" but actually > clearly arent (eg, there is a big insertion of something). The whole thing > keeps moving down > a notch. > > The right thing here is to assign tracking idenitifers to assembly series > independently of > the strain assignments, and track assemblies separately (but obviously with > relationships) > to strains. > > Paul has met most (?all) of the use cases and understands this better than > me. I think > we should wait for Paul to weigh in here - it's just always a bit more > complicated than you > think ;) > > > > > On 19 Aug 2010, at 00:10, Andy Jenkinson wrote: > > On 18 Aug 2010, at 20:47, Adam Witney wrote: >> >>> As mentioned elsewhere in this thread, the problem of distinguishing an >>>> individual from its taxon is not limited to bacteria. Does the 1000 genomes >>>> project use the assembly as a surrogate for isolate/individual? >>>> >>> >>> No it's not unique to bacteria, indeed I notice that Plasmodium vivax is >>> already in the DAS registry, does this not suffer the same problems? i.e >>> which strain are the coordinates referring to? >>> >> >> So far a coordinate system always refers to the species or strain >> identified by its taxonomy ID. As you say, strains DO have their own NCBI >> taxonomy ID. It may be that this is not the case for a strain that someone >> wants to annotate, but I have yet to see an actual example. There is the >> wider question of how to handle individuals though. I can't comment on how >> 1000 genomes do this as I've only seen these data expressed as variations >> annotated upon the reference assembly, but my feeling is that if annotations >> of an individual were needed then it could/would be done using the assembly >> paradigm as a surrogate. >> _______________________________________________ >> DAS mailing list >> DAS at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/das >> > > _______________________________________________ > DAS mailing list > DAS at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/das > From andy.jenkinson at ebi.ac.uk Thu Aug 26 10:59:30 2010 From: andy.jenkinson at ebi.ac.uk (Andy Jenkinson) Date: Thu, 26 Aug 2010 15:59:30 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <4C6E4FDC.9050601@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> <4C6E4FDC.9050601@ebi.ac.uk> Message-ID: <15F7435E-930A-4C98-BD5F-507DE401A3CD@ebi.ac.uk> On 20 Aug 2010, at 10:50, Paul Kersey wrote: > Adam - I should be about on the 13th and the morning of the 14th. I'm aware that this currently doesn't work well - some of what we are discussing here for DAS also applies (in part) to the API/registry. > > best wishes > > Paul Can we arrange to get together to discuss this one one of these days then? I don't know the schedule of the Ensembl workshop or Genome Informatics meeting but am currently available all that week. Can anyone suggest a time? Cheers, Andy From awitney at sgul.ac.uk Fri Aug 27 04:08:31 2010 From: awitney at sgul.ac.uk (Adam Witney) Date: Fri, 27 Aug 2010 09:08:31 +0100 Subject: [DAS] DAS and bacterial genomes In-Reply-To: <4C768AF6.5070506@ebi.ac.uk> References: <1C04AD6A-43A2-42E7-8C32-AC2DE6918C1D@sgul.ac.uk> <486250E2-BA0E-4321-8E4D-283B5A5F4881@ebi.ac.uk> <4A57341B-BB29-4291-A61B-D6C314F846C4@ebi.ac.uk> <4C6E4FDC.9050601@ebi.ac.uk> <15F7435E-930A-4C98-BD5F-507DE401A3CD@ebi.ac.uk> <4C768AF6.5070506@ebi.ac.uk> Message-ID: Just looking at the workshop schedule, Tuesday morning looks fine for me also. adam On 26 Aug 2010, at 16:40, Dan Staines wrote: > I'm away on the 13th and in meetings on the afternoon of the 14th, and then attending Genome Informatics, so the morning of the 14th looks best for me. > > On 08/26/2010 03:59 PM, Andy Jenkinson wrote: >> On 20 Aug 2010, at 10:50, Paul Kersey wrote: >>> Adam - I should be about on the 13th and the morning of the 14th. I'm aware that this currently doesn't work well - some of what we are discussing here for DAS also applies (in part) to the API/registry. >>> >>> best wishes >>> >>> Paul >> >> Can we arrange to get together to discuss this one one of these days then? I don't know the schedule of the Ensembl workshop or Genome Informatics meeting but am currently available all that week. Can anyone suggest a time? >> >> Cheers, >> Andy > > -- > Dan Staines, PhD Ensembl Genomes Technical Coordinator > EMBL-EBI Tel: +44-(0)1223-492507 > Wellcome Trust Genome Campus Fax: +44-(0)1223-494468 > Cambridge CB10 1SD, UK http://www.ensemblgenomes.org/ From David.Messina at sbc.su.se Fri Aug 27 10:07:12 2010 From: David.Messina at sbc.su.se (Dave Messina) Date: Fri, 27 Aug 2010 16:07:12 +0200 Subject: [DAS] 1.53E validation errors Message-ID: Hi everybody, After receiving an automated notice that my DAS 1.53E servers are scheduled for deletion, I'm trying to bring them into compliance. I'm not sure exactly how to fix the validation issues, though, so I'm here to ask for help. 1. sources When validating my server, the DAS registry reports this error: capability is stated for this server but is not valid or provided.validating http://das.sbc.su.se:9000//das/sources RegReportErrorHandler.Error Line Number:4 column number:467 attribute "type" has a bad value. Possible values are: das1:alignment,das1:component,das1:cors,das1:dna,das1:entry_points,das1:error-segment,das1:feature-by-id,das1:features,das1:interaction,das1:maxbins,das1:sequence,das1:sources,das1:structure,das1:stylesheet,das1:supercomponent,das1:types,das1:unknown-feature,das1:unknown-segment My DAS server does provide a sources response, including capabilities: http://das.sbc.su.se:9000//das/sources But I'm not sure why it's invalid. The capability types my server returns are: das1:dsn das1:features das1:stylesheet das1:dsn isn't in the list possible values; is it no longer valid? Otherwise, I wonder if this is because I'm running an ancient version of ProServer (v463). 2. stylesheet When validating, the registry reports this error: capability was stated for this server but is either not present or valid.validating http://das.sbc.su.se:9000//das/hmmtop/stylesheet RegReportErrorHandler.Error Line Number:9 column number:17 tag name "FONT" is not allowed. Possible tag names are: ,,,