[Bioperl-l] BUG
Fields, Christopher J
cjfields at illinois.edu
Tue Apr 16 10:29:44 EDT 2013
fangl,
It would help if you can post a bug report on this, just to make sure we can reproduce it. Attach an example data set and code.
https://redmine.open-bio.org/
chris
On Apr 15, 2013, at 3:13 AM, fangl <fangl at big.ac.cn>
wrote:
> Hi,
> I find a bug when I use Bio::SearchIO to parse Hmmscan.
> The result of my Hmmscan has 2 hits against 1 query ,but the module of Bio::SearchIO can only parse 1 hit in the output file.
> eg:
> hmmscan result:
> # hmmscan :: search sequence(s) against a profile database
> # HMMER 3.0 (March 2010); http://hmmer.org/
> # Copyright (C) 2010 Howard Hughes Medical Institute.
> # Freely distributed under the GNU General Public License (GPLv3).
> # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> # query sequence file: pro_2.fa
> # target HMM database: /leofs/biodenovo/fangl/data/Pfam/Pfam-A.hmm
> # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
>
> Query: PEG03688 [L=200]
> Scores for complete sequence (score includes all domains):
> --- full sequence --- --- best 1 domain --- -#dom-
> E-value score bias E-value score bias exp N Model Description
> ------- ------ ----- ------- ------ ----- ---- -- -------- -----------
> 0.00038 20.7 0.1 0.00054 20.2 0.1 1.3 1 SnoaL_2 SnoaL-like domain
> ------ inclusion threshold ------
> 0.068 12.9 0.0 0.43 10.3 0.0 2.0 1 FAD_binding_8 FAD-binding domain
>
>
> Domain annotation for each model (and alignments):
>>> SnoaL_2 SnoaL-like domain
> # score bias c-Evalue i-Evalue hmmfrom hmm to alifrom ali to envfrom env to acc
> --- ------ ----- --------- --------- ------- ------- ------- ------- ------- ------- ----
> 1 ! 20.2 0.1 7.9e-08 0.00054 11 86 .. 59 157 .. 46 172 .. 0.72
>
> Alignments for each domain:
> == domain 1 score: 20.2 bits; conditional E-value: 7.9e-08
> T-HHHHHHTEEEEEEEE............CT.S--..E---HHHHHHHTTHHHCEECEEEEEEEEESSTTEEEEEEEEE-E........ESBS--EEEE CS
> SnoaL_2 11 gdldalaallapdvvwe............dp.fge..lrGrealraffrallaafpdlrfevedviadgdrvvvrwtvtgt........ipptgrgvtv 86
> d l +l++ d+++ +p +g+ + G++a+ + f ++ + f+++++++d+ v++++ +t+t i p ++ +v
> PEG03688 59 PDYNLLKELVTYDCTYIsltfdnptlhgiMPwAGThtHVGPQAFIDIFTRVGLYWDRGPFSIDHIFGDDGNVTAWGSFTATsrtlgktvISPWAARARV 157
> 3666888999999999877777555545545544488******************************************98555555554444444444 PP
>
>>> FAD_binding_8 FAD-binding domain
> # score bias c-Evalue i-Evalue hmmfrom hmm to alifrom ali to envfrom env to acc
> --- ------ ----- --------- --------- ------- ------- ------- ------- ------- ------- ----
> 1 ? 10.3 0.0 6.3e-05 0.43 19 63 .. 77 128 .. 63 197 .. 0.69
>
> Alignments for each domain:
> == domain 1 score: 10.3 bits; conditional E-value: 6.3e-05
> FAD_binding_8 19 lklkkpkks.......lkykpGqyvfini.pslsklflqsHPFtiasapeddk 63
> l++ p+ ++ + G +fi i ++ l+++ PF+i dd
> PEG03688 77 LTFDNPTLHgimpwagTHTHVGPQAFIDIfTRVG-LYWDRGPFSIDHIFGDDG 128
> 555555444444444468899999*****66666.99********88777664 PP
>
>
>
> Internal pipeline statistics summary:
> -------------------------------------
> Query sequence(s): 1 (200 residues)
> Target model(s): 13672 (2396357 nodes)
> Passed MSV filter: 244 (0.0178467); expected 273.4 (0.02)
> Passed bias filter: 219 (0.0160181); expected 273.4 (0.02)
> Passed Vit filter: 14 (0.00102399); expected 13.7 (0.001)
> Passed Fwd filter: 2 (0.000146284); expected 0.1 (1e-05)
> Initial search space (Z): 13672 [actual number of targets]
> Domain search space (domZ): 2 [number of targets reported over threshold]
> # CPU time: 0.15u 0.13s 00:00:00.28 Elapsed: 00:00:06.72
> # Mc/sec: 71.32
> //
>
> my parse result:
> query len hit len query_start query_end query_match_len hit_start hit_end hit_match_len evalue cov
> PEG03688 200 SnoaL_2 0 59 157 99 11 86 76 7.9e-08 0.49
>
>
> pleast to solve this problem!
> Thanks!
>
> Best wishes!
>
>
>
> fangl
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
More information about the Bioperl-l
mailing list