[Bioperl-l] Getting sequences by base pair locations

Sean Davis sdavis2 at mail.nih.gov
Fri Jul 28 13:30:52 UTC 2006


Yuval Itan wrote:
> Hello all,
> 
> I was BLATing a few hundred human genes against the chimp genome, and 
> kept the best chimp hits for every human gene.
> I have the base pair start and end location for every chimp hit, and I 
> need to get the sequence for each of these chimp hits. Here is an 
> example for a few chimp hits bp locations:
> 
> Start End*
> *142854 144504
> 154479 155198
> 153066 167370
> 163146 163559
> 
> I have one chimp genome file (about 3GB) including all chromosomes, but 
> I could also get one file per chromosome if that would make things 
> easier. Does anyone have a script or a link for an interface that can do 
> the job?

See this module:

http://doc.bioperl.org/releases/bioperl-current/bioperl-live/Bio/DB/Fasta.html

Sean



More information about the Bioperl-l mailing list