[Biopython] Bio.SeqIO.index() - gzip support and/or index stored on disk?

Kevin Lam kevin at aitbiotech.com
Fri Jun 4 20:21:05 UTC 2010


Just thinking out loud.
Would generating a fake region ID (unique for each read ID), plus the
corresponding index, when creating the BAM file be a good quick fix for
using the BAM format for ID-based retrieval?
Or would the double mapping slow things down considerably?
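
To make the "double mapping" concrete, here is a rough sketch in plain Python. It assumes nothing about the actual BAM tooling: `assign_fake_regions`, `fetch_by_id`, the `fake_ref` reference name, and the `slot_size` parameter are all made up for illustration, and the region lookup is a stand-in dictionary rather than a real BAM index.

```python
# Hypothetical sketch only: these names are not part of Biopython or samtools.

def assign_fake_regions(read_ids, ref_name="fake_ref", slot_size=1000):
    """Give each read ID its own non-overlapping synthetic region."""
    regions = {}
    for n, read_id in enumerate(read_ids):
        if read_id in regions:
            raise ValueError("duplicate read ID: %s" % read_id)
        start = n * slot_size
        regions[read_id] = (ref_name, start, start + slot_size)
    return regions


def fetch_by_id(read_id, id_to_region, fetch_region):
    """Look up a record by ID via two steps: ID -> region -> record."""
    ref_name, start, end = id_to_region[read_id]  # first mapping
    return fetch_region(ref_name, start, end)     # second mapping


# Stand-in for a region-indexed file: a dict keyed by region tuple.
read_ids = ["read_a", "read_b", "read_c"]
id_to_region = assign_fake_regions(read_ids)
region_store = {region: "record for " + rid for rid, region in id_to_region.items()}


def fetch_region(ref_name, start, end):
    return region_store[(ref_name, start, end)]


record = fetch_by_id("read_b", id_to_region, fetch_region)
```

In this toy version the ID-to-region table is itself a dictionary keyed by read ID, so the extra cost is one more lookup per read. With a real BAM file the second step would be a region query against the existing index, and that table would still have to be stored somewhere.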
Kevin

>
> What do you think about co-opting the SAM/BAM format for this? This
> would make it more specific for things that can go into BAM (so no
> GenBank and what not), but would have the advantage of working with
> existing workflows.
>
> Region-based indexing is already implemented for BAM, but it would
> be really useful to also have ID-based retrieval along the lines of
> what you are proposing.
>
> Brad


