[Biopython-dev] [Biopython (old issues only) - Bug #3395] (Closed) Biopython trie implementation can't load large data sets

redmine at redmine.open-bio.org redmine at redmine.open-bio.org
Sun Jul 24 01:07:43 UTC 2016


Issue #3395 has been updated by Travis Wrightsman.

Status changed from New to Closed
% Done changed from 0 to 100
URL set to https://github.com/biopython/biopython/issues/892

Migrated to GitHub, see URL.

----------------------------------------
Bug #3395: Biopython trie implementation can't load large data sets
https://redmine.open-bio.org/issues/3395#change-15311

* Author: Michał Nowotka
* Status: Closed
* Priority: Normal
* Assignee: Biopython Dev Mailing List
* Category: Main Distribution
* Target version: 
* URL: https://github.com/biopython/biopython/issues/892
----------------------------------------
Imagine I have a Biopython trie:

from Bio import trie
import gzip

f = gzip.open('/tmp/trie.dat.gz', 'w')
tr = trie.trie()
# fill in the trie
trie.save(f, tr)  # pass the trie instance, not the module
f.close()  # flush the gzip stream so the file is complete

Now /tmp/trie.dat.gz is about 50 MB. Let's try to read it:

from Bio import trie
import gzip

f = gzip.open('/tmp/trie.dat.gz', 'r')
tr = trie.load(f)

Unfortunately, I'm getting an uninformative error that says only:
"loading failed for some reason"

Any hints?
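
One check that does not depend on Bio.trie at all is whether the gzip file itself decompresses cleanly, since a truncated or unflushed file could plausibly produce a generic load failure. The sketch below is only an illustration using the standard library; the helper name decompressed_size and the chunk size are assumptions made for this example, not part of Biopython.

```python
import gzip


def decompressed_size(path, chunk_size=1 << 20):
    """Fully decompress `path` and return the number of bytes read.

    Hypothetical helper for illustration. A truncated or corrupt file
    raises EOFError or OSError while it is being read.
    """
    total = 0
    with gzip.open(path, "rb") as handle:
        while True:
            chunk = handle.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    return total


if __name__ == "__main__":
    # Path taken from the report above.
    print(decompressed_size("/tmp/trie.dat.gz"))
```

If this completes without an exception, the gzip layer is probably not the cause, and the failure is more likely in trie.load itself.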


---Files--------------------------------
trie_debug.patch (1.63 KB)
minimal_data.pkl (1.23 MB)
minimal_data.pkl (1.24 MB)

