[EMBOSS] Why tfscan generate duplicate results?
    Tao Zhu 
    tzhu at mail.bnu.edu.cn
       
    Fri Aug  3 00:53:19 UTC 2012
    
    
  
tfscan from emboss 6.5.7.0
My input sequence is an intron sequence from Arabidopsis thaliana(in
attached files: test.fasta)
I run:
$ tfscan -sequence test.fasta -menu P -mismatch 0 -outfile test.out
the result is:
########################################
# Program: tfscan
# Rundate: Fri  3 Aug 2012 08:55:50
# Commandline: tfscan
#    -sequence test.fasta
#    -menu P
#    -mismatch 0
#    -outfile test.out
# Report_format: seqtable
# Report_file: test.out
########################################
#=======================================
#
# Sequence: Atha     from: 1   to: 3388
# HitCount: 20
#=======================================
  Start     End  Strand Accession Factor
               Sequence
   3108    3114       + R03715
               ggttaat
   3108    3114       + R03715
               ggttaat
   2482    2488       + R03710
               gaaagaa
   3354    3359       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   2185    2190       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   1008    1013       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   1261    1266       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. gagata
   3354    3359       + R02729    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   2185    2190       + R02729    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   1008    1013       + R02729    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   2726    2731       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    989     994       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    223     228       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
   2726    2731       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    989     994       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    223     228       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
   2876    2879       + R01203
               ctcc
   2449    2452       + R01203
               ctcc
   2141    2144       + R01203
               ctcc
   2877    2883       + R01202
               tccacct
#---------------------------------------
#---------------------------------------
#---------------------------------------
# Reported_sequences: 1
# Reported_hitcount: 20
#---------------------------------------
It could be seen that there exists duplicate items: for example,
3108-3114, +, appear and be counted twice. Why so?
-- 
Tao Zhu, College of Life Sciences, Beijing Normal University, Beijing
100875, China
Email: tzhu at mail.bnu.edu.cn
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.fasta
Type: application/x-wine-extension-fasta
Size: 3394 bytes
Desc: not available
URL: <http://lists.open-bio.org/pipermail/emboss/attachments/20120803/0ecf718a/attachment-0002.bin>
    
    
More information about the EMBOSS
mailing list