[Bioperl-l] Bio::Tools::GFF parsing error

Scott Cain cain at cshl.edu
Wed Feb 15 16:07:28 UTC 2006


Hi Robert,

No column should ever be padded with spaces; GFF columns should always
be separated by a single tab.  Therefore, I don't thing Bio::Tools::GFF
is at fault here.

Scott


On Wed, 2006-02-15 at 11:01 -0500, Robert Buels wrote:
> Hi all,
> 
> I'm parsing a GFF2 file with Bio::Tools::GFF (I would be using 
> FeatureIO, except it purports not to support gff 2), and the file looks 
> like:
> 
> ##gff-version 2
> ##date 2006-02-13
> ##sequence-region C01HBa0088L02.seq 1 120525
> C01HBa0088L02   RepeatMasker    similarity      3537    4267     3.3    
> -       .       Target "Motif:bac_end_repeat_family_345" 1 740
> C01HBa0088L02   RepeatMasker    similarity      4172    4279     2.9    
> +       .       Target "Motif:HRSiTERT00100141" 1 104
> C01HBa0088L02   RepeatMasker    similarity      4267    4323     0.0    
> -       .       Target "Motif:k_29" 150 206
> C01HBa0088L02   RepeatMasker    similarity      4322    4492    26.6    
> +       .       Target "Motif:PRSiTERT00300001" 1960 2129
> C01HBa0088L02   RepeatMasker    similarity      4557    5124    29.5    
> +       .       Target "Motif:PRSiTERT00300001" 2142 2711
> 
> Notice the score column is padded with spaces.
> 
> Bio::Tools::GFF does not like this, and says that ' 3.3' is not a valid 
> score.  My question is, who is wrong here, my input file or 
> Bio::Tools::GFF?  Should Bio::Tools::GFF be able to read this file?
> 
> Rob
> 
-- 
------------------------------------------------------------------------
Scott Cain, Ph. D.                                         cain at cshl.edu
GMOD Coordinator (http://www.gmod.org/)                     216-392-3087
Cold Spring Harbor Laboratory




More information about the Bioperl-l mailing list