[EMBOSS] FW: Reducing a FASTA repository, new user
    Peter Rice 
    pmr at ebi.ac.uk
       
    Tue Feb 15 08:59:20 UTC 2011
    
    
  
On 14/02/2011 23:35, Marvin Stodolsky wrote:
> This is elementary, I'm sure, but I've been unable to work out the
> syntax from the documentation.
> More minor issue.
>
> When using infoseq to extract all the FASTA headers from a sequence
> repository, the GeneBegin..GeneEnd (like 234466..234589) often fails to
> come as a uniform field/fields in a resultant spreadsheet. Is there a fix
> for this?
I don't see the genebegin and geneend in EMBOSS infoseq output. Are they 
part of the sequence ID in the FASTA file?
By default infoseq pads its output into aligned columns. To separate
the items with a delimiter character instead, use:
  -nocolumn
on the command line.
For import into a spreadsheet you can set the delimiter to be tab with:
  -nocolumn -delimiter "\t"
on the command line. That should then import nicely into a spreadsheet.
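If the begin..end coordinates turn out to be embedded in one of the text fields (for example the description), a small post-processing step can pull them out into their own columns. The following is only a minimal sketch in Python: the function name `split_coords`, the sample rows, and the assumption that coordinates always look like two integers joined by ".." are illustrative and are not taken from real infoseq output.

```python
import csv
import io
import re

# Assumed coordinate pattern: two integers joined by "..", e.g. 234466..234589
COORD_RE = re.compile(r"(\d+)\.\.(\d+)")


def split_coords(tsv_text):
    """Append begin and end columns to each tab-delimited row.

    Rows without a begin..end pattern get two empty cells, so every
    row gains the same number of extra columns.
    """
    rows = []
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        match = None
        for field in row:
            match = COORD_RE.search(field)
            if match:
                break  # use the first field that contains coordinates
        begin, end = match.groups() if match else ("", "")
        rows.append(row + [begin, end])
    return rows


# Hypothetical sample rows (name, length, description), not real infoseq output
sample = (
    "geneA\t124\thypothetical protein 234466..234589\n"
    "geneB\t90\tno coordinates here\n"
)

for row in split_coords(sample):
    print("\t".join(row))
```

Because every row gets the same two extra cells, the begin and end values should line up as uniform columns when the result is pasted or imported into a spreadsheet.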
Hope that helps
Peter Rice
EMBOSS Team
    
    
More information about the EMBOSS
mailing list