[BioPython] BioSQL problems on load

Peter biopython at maubp.freeserve.co.uk
Fri Dec 21 17:56:08 UTC 2007


> This is possibly a simple configuration problem, but I'm having some
> problems loading files into BioSQL.
>
> ...
>
>   warnings.warn("Bio.crc is deprecated; use crc32 and crc64 in
> Bio.SeqUtils.CheckSum instead", DeprecationWarning)

We should fix that - I hadn't seen that warning being triggered on my machine.
>   File
> "/home/dtomso/repository/biopython/build/lib.linux-i686-2.5/BioSQL/BioSe
> qDatabase.py", line 277, in execute
>
>     self.cursor.execute(sql, args or ())
>
>   File "/usr/lib/python2.5/site-packages/MySQLdb/cursors.py", line 149,
> in execute
>
>     query = query % db.literal(args)
>
> TypeError: not all arguments converted during string formatting
>
> This happens with standard genbank and fasta files pulled off of NCBI.
> Any suggestions?

That looks like Bug 2390, http://bugzilla.open-bio.org/show_bug.cgi?id=2390

You didn't say what version of Biopython you are using, but that has
been fixed in CVS.  Could you update to CVS, or wait for the next
release (hopefully in January?).

> There's another issue regarding standard parsing of accession numbers to
> get version IDs (the code doesn't like non-NCBI fasta headers, e.g.
> those produced by phrap), but that is pretty minor and doesn't seem to
> be related to this.

Perhaps the parser or BioSQL can be a bit more robust here.  Could you
file a bug on this issue, and then attach an example input file with
some code showing the problem?

Thanks

Peter



More information about the Biopython mailing list