[BioSQL-l] Treating GenBank source features as top level	annotation
    Peter 
    biopython at maubp.freeserve.co.uk
       
    Wed Nov 18 12:27:12 UTC 2009
    
    
  
On Wed, Nov 18, 2009 at 12:08 PM, Richard Holland
<holland at eaglegenomics.com> wrote:
>
> BioJava's latest parsers do the following:
> ...
Without checking all the details, that is broadly what Biopython does
at the moment.
> The main reason why we still use the source feature and don't go to sequence
> level is because when converting between formats it's hard to tell which
> sequence-level qualifier_values are from the source feature and which are
> from other places.
Makes sense.
> The main reason why we rely entirely on the source feature for organism
> and taxon ID info is because it's much easier to parse than the SOURCE
> and ORGANISM tags.
>From memory, Biopython also uses the taxon table here too.
Peter
    
    
More information about the BioSQL-l
mailing list