[EMBOSS] difficulty using databases and "water"
    Scott Markel 
    smarkel at scitegic.com
       
    Fri Sep 10 17:23:24 UTC 2004
    
    
  
I'm using EMBOSS 2.9.0, built under Cygwin and running on Windows XP.
I'm having problems creating databases for use with "water".
I indexed a FASTA file with dbifasta so that it can be addressed as a database by USA.
133 > dbifasta -idformat idacc -directory /C/SandBox/EMBOSS -filenames NRDB_protein_10.* -dbname NRDB_protein_10 -release 1.0 -date "10/09/04" -fields acnum -fields des -fields seqvn
Index a fasta database
I edited ~/.embossrc to include the new database.
134 > cat ~/.embossrc
DB NRDB_protein_10 [
    type: P
    format: fasta
    method: emblcd
    fields: "id acc sv des"
    directory: /C/SandBox/EMBOSS
    indexdirectory: /C/SandBox/EMBOSS
    file: NRDB_protein_10.fa
]
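Before going further, it may be worth confirming that the files this entry points to are actually present. The sketch below is only a generic existence check: check_files is a hypothetical helper, and the index file names division.lkp and entrynam.idx are my assumption about what dbifasta writes for the emblcd method.

```shell
#!/bin/sh
# Report which of the named files are missing from a directory.
# check_files is a hypothetical helper, not part of EMBOSS.
check_files() {
    dir=$1
    shift
    for name in "$@"; do
        [ -e "$dir/$name" ] || echo "missing: $dir/$name"
    done
}

# The directory and data file come from the .embossrc entry above;
# the two index file names are assumed, not confirmed.
check_files /C/SandBox/EMBOSS NRDB_protein_10.fa division.lkp entrynam.idx
```

No output would mean every listed file was found.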
showdb lists the new database as expected.
135 > showdb
Displays information on the currently available databases
# Name        Type ID  Qry All Comment
# ====        ==== ==  === === =======
NRDB_protein_10 P    OK  OK  OK  -
seqret can retrieve every entry in the database.
136 > seqret NRDB_protein_10:\* stdout
Reads and writes (returns) sequences
 >XP_357594.1 XP_357594.1 similar to KIAA0960 protein [Mus musculus]
MCFPGEEVDRQLCRDAIFPIPVACDAPCPKDCVLSAWSSWSSCSHTCSGKTTEGKQTRAR
<shows all 10 sequences just fine>
However, I can't retrieve an individual sequence by ID.
137 > seqret NRDB_protein_10:XP_357594
Reads and writes (returns) sequences
Error: Unable to read sequence 'NRDB_protein_10:XP_357594'
Died: seqret terminated: Bad value for '-sequence' and no prompt
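One check that might help narrow this down is to list the IDs exactly as they appear in the FASTA headers and compare them with the ID used in the query. The sketch below assumes the entry ID is the first whitespace-delimited token after '>' (an assumption on my part, not something confirmed from the dbifasta documentation). fasta_ids and the sample file are hypothetical names used only for illustration.

```shell
#!/bin/sh
# Print the first whitespace-delimited token of each FASTA header line,
# with the leading '>' removed.
# Assumption: this token is what the index treats as the entry ID.
fasta_ids() {
    awk '/^>/ { sub(/^>/, "", $1); print $1 }' "$1"
}

# Hypothetical sample file standing in for NRDB_protein_10.fa.
sample=$(mktemp)
printf '>XP_357594.1 similar to KIAA0960 protein [Mus musculus]\nMCFPGEEVDRQLCRDAIFPIPVACDAPCPK\n' > "$sample"
fasta_ids "$sample"   # prints: XP_357594.1
rm -f "$sample"
```

In the seqret output above the header carries a version suffix (XP_357594.1), so querying with the full token rather than XP_357594 may be worth trying.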
What I'm really trying to do is run "water", but I expect
my database problems are impacting that.
161 > water -asequence query.fa -bsequence NRDB_protein_10:\*
Smith-Waterman local alignment.
    EMBOSS An error in ajarr.c at line 1805:
Token missing
Any pointers to documentation or examples would be greatly
appreciated.
Scott
-- 
Scott Markel, Ph.D.
Principal Bioinformatics Architect  email:  smarkel at scitegic.com
SciTegic Inc.                       mobile: +1 858 205 3653
9665 Chesapeake Drive, Suite 401    voice:  +1 858 279 8800, ext. 253
San Diego, CA 92123                 fax:    +1 858 279 8804
USA                                 web:    http://www.scitegic.com
    
    