[BioRuby] EMBL parsing

Anthony Underwood email2ants at gmail.com
Wed May 2 12:38:31 EDT 2007


Dear all,

I am having a problem when parsing EMBL genome files.

embl = open(filename)
entry = embl.gets(Bio::EMBL::DELIMITER)
seq_obj = Bio::EMBL.new(entry)
puts seq_obj.sequence_length


This takes a long time (minutes) and reports a sequence_length of 0

When taking an equivalent genbank file and changing the code  
appropriately it parses the file in seconds and reports the correct  
length. I am new to bioruby having used bioperl until now. Please can  
anybody let me know if they have had similar problems and any  
possible solutions.

Many thanks

Anthony


More information about the BioRuby mailing list