[Bioperl-l] Getting sequences by base pair locations
    Cui, Wenwu (NIH/NLM/NCBI) [C] 
    cuiw at ncbi.nlm.nih.gov
       
    Fri Jul 28 13:46:50 UTC 2006
    
    
  
Maybe the easiest way is to use LWP to get the webpage. Here is an
example for CHIMP1A:10:12345678:12348888:
 
http://www.ensembl.org/Pan_troglodytes/exportview?format=fasta&l=10%3A12
345678-12348888&action=export&_format=Text&output=txt&submit=Continue+%3
E%3E
 
Wenwu Cui 
________________________________
From: Yuval Itan [mailto:y.itan at ucl.ac.uk] 
Sent: Friday, July 28, 2006 8:08 AM
To: bioperl-l at lists.open-bio.org
Subject: [Bioperl-l] Getting sequences by base pair locations
 
Hello all, 
 
I was BLATing a few hundred human genes against the chimp genome, and
kept the best chimp hits for every human gene. 
I have the base pair start and end location for every chimp hit, and I
need to get the sequence for each of these chimp hits. Here is an
example for a few chimp hits bp locations: 
 
Start End 
142854 144504 
154479 155198 
153066 167370 
163146 163559 
I have one chimp genome file (about 3GB) including all chromosomes, but
I could also get one file per chromosome if that would make things
easier. Does anyone have a script or a link for an interface that can do
the job? 
 
Thank you very much. 
    
    
More information about the Bioperl-l
mailing list