[Biopython] Planning Biopython 1.72

Peter Cock p.j.a.cock at googlemail.com
Tue Jun 12 22:25:23 UTC 2018


Hi Martin,

Can you open an issue on this please.

My feeling is a strong negative one - filenames are fragile.
However, when given a filename and we own the handle
and can therefore seek or reopen it as needed, I would
be more neutral on sniffing the filetype instead. If we are
going to do it, we should do it centrally for Bio.SeqIO,
Bio.AlignIO and Bio.SearchIO in one place - but I don't
want to rush that given my intended time frame for getting
Biopython 1.72 out before GCC BOSC 2018.

Thanks,

Peter

On Tue, Jun 12, 2018 at 6:04 PM, Martin Mokrejs
<mmokrejs at fold.natur.cuni.cz> wrote:
> Hi,
>   I just realized that SearchIO does not automatically
> detect compressed files on its input using file extension.
> Could .gz, .bz2 be supported?
>
>>>> for record in SearchIO.parse('blah.xml.gz', 'blast-xml'):
> ...     print record.id
> ...
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "/usr/lib64/python2.7/site-packages/Bio/SearchIO/__init__.py", line 312, in parse
>     generator = iterator(source_file, **kwargs)
>   File "/usr/lib64/python2.7/site-packages/Bio/SearchIO/BlastIO/blast_xml.py", line 211, in __init__
>     self._meta, self._fallback = self._parse_preamble()
>   File "/usr/lib64/python2.7/site-packages/Bio/SearchIO/BlastIO/blast_xml.py", line 225, in _parse_preamble
>     for event, elem in self.xml_iter:
>   File "<string>", line 91, in next
> cElementTree.ParseError: not well-formed (invalid token): line 1, column 0
>>>>
>
> Martin


More information about the Biopython mailing list