[Bioperl-l] Trouble in converting abi files to phd files by Bio::SeqIO
陳彥璋
ycchen1981 at gmail.com
Mon Jul 19 07:11:39 UTC 2010
Hi,
I'm processing chromate files (.ab1 format) with BioPerl-1.6.1,
bioperl-ext-1.5.1, and io-lib-1.8.11;
my OS was 32-bit, Centos 5.2.
When I convert abi-format files to phd-format files,
the value of quality of all bases are '0', and the value of
trace_index of all bases are undefined.
Is there any possible reason leading to this problem?
Besides, it claims that CALL_METHOD is phred and PHRED_VERSION is
0.980904.e in phd files;
if this is the case, where can I find phred executers?
The following is the code utilizing BioPerl and results from BioPerl
and Phred respectively:
==============================================
#!/usr/bin/perl -w
unshift(@INC, "/opt/BioPerl-1.6.1/");
unshift(@INC, "/opt/bioperl-ext");
use strict;
use Bio::SeqIO;
my $inputfilename='./HMA040184H12.T3.ab1';
my $outputfilename='test11.phd';
my $in = Bio::SeqIO->new(-file => "$inputfilename",
-format => 'abi');
my $out = Bio::SeqIO->new(-file => ">$outputfilename",
-format => 'phd');
while ( my $seq = $in->next_seq() ) {
for my $curr(1 .. $seq->length()) {
print "[",$seq->qualat($curr),"]:";
print "[",$seq->trace_index_at($curr),"]\t";
}
$out->write_seq($seq);
last;
}
================== test11.phd (by Bio::SeqIO) =================
BEGIN_SEQUENCE (null)
BEGIN_COMMENT
CHROMAT_FILE: unknown
ABI_THUMBPRINT: 0
PHRED_VERSION: 0.980904.e
CALL_METHOD: phred
QUALITY_LEVELS: 99
TIME: Mon Jul 19 11:59:09 2010
TRACE_ARRAY_MIN_INDEX: 0
TRACE_ARRAY_MAX_INDEX: unknown
CHEM: unknown
DYE: unknown
END_COMMENT
BEGIN_DNA
A 0
G 0
G 0
G 0
G 0
.
.
.
.
G 0
T 0
T 0
G 0
T 0
C 0
G 0
C 0
T 0
A 0
C 0
END_DNA
END_SEQUENCE
================== .phd (by phred 0.020425.c) =======================
BEGIN_SEQUENCE HMA040184H12.T3.ab1
BEGIN_COMMENT
CHROMAT_FILE: HMA040184H12.T3.ab1
ABI_THUMBPRINT: 0
PHRED_VERSION: 0.020425.c
CALL_METHOD: phred
QUALITY_LEVELS: 99
TIME: Fri Jul 9 16:20:59 2010
TRACE_ARRAY_MIN_INDEX: 0
TRACE_ARRAY_MAX_INDEX: 11905
TRIM: 38 900 0.0500
TRACE_PEAK_AREA_RATIO: 0.0630
CHEM: term
DYE: big
END_COMMENT
BEGIN_DNA
a 11 2
g 11 13
g 11 22
a 11 37
a 11 52
g 8 65
c 6 76
t 6 91
c 6 97
.
.
.
a 14 11840
c 16 11849
a 16 11861
a 12 11873
c 12 11885
g 12 11897
END_DNA
END_SEQUENCE
================================================
Sincerely yours,
Yen-Chang Chen
More information about the Bioperl-l
mailing list