[BioRuby] Fwd: Re: BioSQL development
Julian Nordt
ju at ncoffee.de
Sun Aug 22 11:17:44 EDT 2010
One more thing in regard to the mapping between BioSQL and GFF3:
I tried to follow the mapping given by the biosql wiki and blue collar
bioinformatics. The mapping is acceptable in the sense that you can store
*most* or even all (?) of the features that GFF3 offers. The further I got
though within the development the unclearer things got me, especially in
terms of the "attribute" column.
If you compare the table at the biosql wiki (for the attribute column)
with the one at blue collar bioinformatics, one will notice that the there
are keywords that occour in one, but not at the other table. That not
mentioning the todos on the wiki regarding the "standard" columns. I
havn't looked in that detail though through blue collars code, maybe the
answer is given there.
However I wrote a small library that managed to store most - but not all
the given information of the GFF3-files - correctly to BioSQL. There were
some points where the mapping has been unclear to me and where I stored
the given information where I thought it would fit best.
Considering that I chose a standard db schema to avoid any ambiguously and
the fact that I experienced performance issues with MYSQL+Rails (not
related to BioSQL) at the project made it enough for me to switch to CHADO
backed by POSTGRES.
The documentation regarding CHADO is in my opinion richer and most
importantly one can follow gmod_bulk_load_gff3.pl for the mapping
relatively easy, since it is well documented.
I would very much welcome other opinions on the topic, especially in
combination with the use of web applications.
-- Julian
On Sun, 22 Aug 2010 16:17:45 +0200, Rob Syme <rob.syme at gmail.com> wrote:
> I've had a look around and a pretty solid mapping seems to be available:
> http://www.biosql.org/wiki/Annotation_Mapping#GFF3
>
> Blue collar bioinformatics gave it a shot here:
> http://bcbio.wordpress.com/2009/02/22/exploring-bioperl-genbank-to-gff-mapping/
>
> -r
>
> On 22 Aug 2010 22:02, "Hilmar Lapp" <hlapp at drycafe.net> wrote:
> Is the issue with GFF3 in the Bioruby to BioSQL mapping, or is somehow in
> the BioSQL schema?
>
> I recall there was a thread on GFF recently which I wasn't able to
> follow,
> so if the answer is in that thread and isn't easy to sum up here, just
> point
> me there.
>
> -hilmar
>
>
>
> On Aug 22, 2010, at 6:30 AM, Julian Nordt wrote:
>
>> Hi Rob,
>>
>> I just wanted to point that there ...
--
Using Opera's revolutionary email client: http://www.opera.com/mail/
More information about the BioRuby
mailing list