[Bioperl-l] BUG
fangl
fangl at big.ac.cn
Mon Apr 15 04:13:05 EDT 2013
Hi,
I find a bug when I use Bio::SearchIO to parse Hmmscan.
The result of my Hmmscan has 2 hits against 1 query ,but the module of Bio::SearchIO can only parse 1 hit in the output file.
eg:
hmmscan result:
# hmmscan :: search sequence(s) against a profile database
# HMMER 3.0 (March 2010); http://hmmer.org/
# Copyright (C) 2010 Howard Hughes Medical Institute.
# Freely distributed under the GNU General Public License (GPLv3).
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# query sequence file: pro_2.fa
# target HMM database: /leofs/biodenovo/fangl/data/Pfam/Pfam-A.hmm
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Query: PEG03688 [L=200]
Scores for complete sequence (score includes all domains):
--- full sequence --- --- best 1 domain --- -#dom-
E-value score bias E-value score bias exp N Model Description
------- ------ ----- ------- ------ ----- ---- -- -------- -----------
0.00038 20.7 0.1 0.00054 20.2 0.1 1.3 1 SnoaL_2 SnoaL-like domain
------ inclusion threshold ------
0.068 12.9 0.0 0.43 10.3 0.0 2.0 1 FAD_binding_8 FAD-binding domain
Domain annotation for each model (and alignments):
>> SnoaL_2 SnoaL-like domain
# score bias c-Evalue i-Evalue hmmfrom hmm to alifrom ali to envfrom env to acc
--- ------ ----- --------- --------- ------- ------- ------- ------- ------- ------- ----
1 ! 20.2 0.1 7.9e-08 0.00054 11 86 .. 59 157 .. 46 172 .. 0.72
Alignments for each domain:
== domain 1 score: 20.2 bits; conditional E-value: 7.9e-08
T-HHHHHHTEEEEEEEE............CT.S--..E---HHHHHHHTTHHHCEECEEEEEEEEESSTTEEEEEEEEE-E........ESBS--EEEE CS
SnoaL_2 11 gdldalaallapdvvwe............dp.fge..lrGrealraffrallaafpdlrfevedviadgdrvvvrwtvtgt........ipptgrgvtv 86
d l +l++ d+++ +p +g+ + G++a+ + f ++ + f+++++++d+ v++++ +t+t i p ++ +v
PEG03688 59 PDYNLLKELVTYDCTYIsltfdnptlhgiMPwAGThtHVGPQAFIDIFTRVGLYWDRGPFSIDHIFGDDGNVTAWGSFTATsrtlgktvISPWAARARV 157
3666888999999999877777555545545544488******************************************98555555554444444444 PP
>> FAD_binding_8 FAD-binding domain
# score bias c-Evalue i-Evalue hmmfrom hmm to alifrom ali to envfrom env to acc
--- ------ ----- --------- --------- ------- ------- ------- ------- ------- ------- ----
1 ? 10.3 0.0 6.3e-05 0.43 19 63 .. 77 128 .. 63 197 .. 0.69
Alignments for each domain:
== domain 1 score: 10.3 bits; conditional E-value: 6.3e-05
FAD_binding_8 19 lklkkpkks.......lkykpGqyvfini.pslsklflqsHPFtiasapeddk 63
l++ p+ ++ + G +fi i ++ l+++ PF+i dd
PEG03688 77 LTFDNPTLHgimpwagTHTHVGPQAFIDIfTRVG-LYWDRGPFSIDHIFGDDG 128
555555444444444468899999*****66666.99********88777664 PP
Internal pipeline statistics summary:
-------------------------------------
Query sequence(s): 1 (200 residues)
Target model(s): 13672 (2396357 nodes)
Passed MSV filter: 244 (0.0178467); expected 273.4 (0.02)
Passed bias filter: 219 (0.0160181); expected 273.4 (0.02)
Passed Vit filter: 14 (0.00102399); expected 13.7 (0.001)
Passed Fwd filter: 2 (0.000146284); expected 0.1 (1e-05)
Initial search space (Z): 13672 [actual number of targets]
Domain search space (domZ): 2 [number of targets reported over threshold]
# CPU time: 0.15u 0.13s 00:00:00.28 Elapsed: 00:00:06.72
# Mc/sec: 71.32
//
my parse result:
query len hit len query_start query_end query_match_len hit_start hit_end hit_match_len evalue cov
PEG03688 200 SnoaL_2 0 59 157 99 11 86 76 7.9e-08 0.49
pleast to solve this problem!
Thanks!
Best wishes!
fangl
More information about the Bioperl-l
mailing list