[BioSQL-l] Indexing of (seqfeature) locations?
Peter
biopython at maubp.freeserve.co.uk
Wed Aug 26 10:53:40 UTC 2009
Hi BioSQL folks,
The BioSQL schema includes a few indexes on the location table
(e.g. quoting the MySQL schema, but it looks the same on pg too):
CREATE INDEX seqfeatureloc_start ON location(start_pos, end_pos);
CREATE INDEX seqfeatureloc_dbx ON location(dbxref_id);
CREATE INDEX seqfeatureloc_trm ON location(term_id);
Will these facilitate searches like this?:
"SELECT ... WHERE 2000 <= location.start_pos
AND location.end_pos <= 5000 AND ..."
Or, for this would it help to include:
CREATE INDEX seqfeatureloc_start ON location(start_pos);
CREATE INDEX seqfeatureloc_start ON location(end_pos);
A motivational use case would be to pull out an operon, or a
region of a record as part of a genome browser.
Thanks,
Peter
More information about the BioSQL-l
mailing list