[EMBOSS] Database too lare for dbifasta
    Peter Rice 
    pmr at ebi.ac.uk
       
    Thu Jul 19 06:47:28 UTC 2007
    
    
  
George Magklaras wrote:
> Question to the developers:
> 
> Why INT_MAX (signed)? Why not unsigned UINT_MAX (to raise it a bit) or 
> another raised limit? It is a bit of an overhead to have to go through 
> the file split stage.
The index file format was originally defined that way by the Staden 
package, and also used by EMBL/EBI CD-ROM indexing and by utilities at 
the Sanger Centre/Institute.
The dbi* index files have two problems - they cannot store file 
positions larger than 2Gb aqnd they do not allow duplicate primary 
identifiers.
We may remove them in a future release - but for smaller databases many 
users seem to find them useful still.
regards,
Peter
    
    
More information about the EMBOSS
mailing list