[Biopython-dev] [BioSQL-l] SwissProt DE lines and bioentry.description field in BioSQL
Peter
biopython at maubp.freeserve.co.uk
Sat May 16 19:28:43 EDT 2009
On 5/17/09, Chris Fields <cjfields at illinois.edu> wrote:
>
> On May 16, 2009, at 5:34 PM, Hilmar Lapp wrote:
> > My inclination from a BioPerl perspective is to extract the part following
> > 'RecName: Full=' as the description, and attach the rest as annotation. We
> > could in fact use the TagTree class for this. I'm cross-posting to BioPerl
> > too to gather what other BioPerl'ers think about this.
> >
> > -hilmar
> >
>
> This is much like the GN issues we've run into before, and we *could* set
> this up using TagTree or similar. In the latter case of gene name the data
> is stored in a text tree as follows:
>
> gene_names:
> gene_name:
> Name: GC1QBP
> Synonyms: HABP1
> Synonyms: SF2P32
> Synonyms: C1QBP
>
> That could be changed to an XML string:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <gene_names>
> <gene_name>
> <Name>GC1QBP</Name>
> <Synonyms>HABP1</Synonyms>
> <Synonyms>SF2P32</Synonyms>
> <Synonyms>C1QBP</Synonyms>
> </gene_name>
> </gene_names>
>
> Thinking about this we should attempt to coalesce around a standard instead
> of forcing the other Bio* to a specific format.
How would you record this in BioSQL? As an XML string for an annotation value?
Brad has suggested JSON might be useful for this kind of thing (see
also per-letter-annotation discussion).
Peter
More information about the Biopython-dev
mailing list