EMBL and Ensembl
    Stefanie Lager 
    stefanielager at fastmail.ca
       
    Tue Mar 18 06:12:05 UTC 2003
    
    
  
Hi,
I have a problem with the EMBL-format output from Ensembl. If I
retrieve a LARGE piece of DNA from Ensembl in EMBL format, the SQ line
gets so long that it is split into two SQ lines, and this is NOT handled
correctly by EMBOSS programs! Some EMBOSS programs give a warning
about illegal characters; others just incorporate the second SQ line
into the sequence. It's easy to fix the problem by manual editing, but
it would be nice to know if this IS standard EMBL format or if
Ensembl has made a mistake.
Here is the relevant part of the entry:
ID   1.77242832-92443803    ENSEMBL; DNA; PLN; 15200972 BP.
XX
.....
.....
.....
FT   misc_feature    14757170..15200972
FT                   /note="contig 1.92000001-93000000 1..443803(1)"
XX
SQ   Sequence 15200972 BP; 4106479 A; 3111667 C; 3123445 G; 4136833 T; 722548
SQ   other;
     TAGAACTTGC AAATGAGAAA ACAGAGTTCT GTCAAGCTGT GTTAGTGTTT GCCCAACACA        60
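
The manual edit mentioned above could also be scripted. What follows is only a rough sketch in Python, not anything shipped with EMBOSS or Ensembl. It assumes that the reader expects a single SQ header line, that the second SQ line is just a continuation of the first, and that the count 722548 belongs at the end of the first SQ line. The function names merge_split_sq_header and sq_counts are made up for this illustration.

```python
import re

def merge_split_sq_header(lines):
    """Join an SQ header that was split onto a second 'SQ' line.

    Assumption: a line starting with 'SQ' that directly follows another
    'SQ' line is a continuation of that header, not a new record.
    """
    merged = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("SQ") and merged and merged[-1].startswith("SQ"):
            # Drop the repeated 'SQ' code and append the rest to the header.
            merged[-1] = merged[-1].rstrip() + " " + line[2:].strip()
        else:
            merged.append(line)
    return merged

def sq_counts(sq_line):
    """Return {label: count} for every 'N label;' field on an SQ line."""
    return {label: int(n) for n, label in re.findall(r"(\d+) (\w+);", sq_line)}

# The header and first sequence line from the entry above.
sample = [
    "SQ   Sequence 15200972 BP; 4106479 A; 3111667 C; 3123445 G; 4136833 T; 722548",
    "SQ   other;",
    "     TAGAACTTGC AAATGAGAAA ACAGAGTTCT GTCAAGCTGT GTTAGTGTTT GCCCAACACA        60",
]

fixed = merge_split_sq_header(sample)
counts = sq_counts(fixed[0])
total = counts.pop("BP")

print(fixed[0])
# Sanity check: the base counts should add up to the stated length.
print(sum(counts.values()) == total)
```

After merging, the A, C, G, T and "other" counts add up to the 15200972 BP given in the header, which suggests that "722548 other;" is one field that was broken across the two SQ lines.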