[Biopython] how Entrez.parse() internally work

c.buhtz at posteo.jp c.buhtz at posteo.jp
Wed Dec 9 07:53:58 UTC 2015


Can someone please explain how Entrez.parse() internally really work,
from the viewpoint of the NCBI-servers.

e.g.
# returning e.g. 100.000 recods
handle = Entrez.efetch(retstart=0, retmax=999999, ...)
record = Entrez.parse(handle)

In that case "record" doesn't hold all records in RAM at the same time.
This would cause a out-of-RAM-error on my small system if it would hold
that all in RAM.

The question is?
Where are all these records (physicaly)?
And how often is the NCBI-Server really requested (with eFetch) and
with which values of retmax and retstart?
-- 
GnuPGP-Key ID 0751A8EC


More information about the Biopython mailing list