[Biopython] parsing a fasta with multiple entries

Nick Leake nick_leake77 at hotmail.com
Mon Apr 26 15:36:28 UTC 2010


Hello,
 
I'm having trouble parsing a fasta file with multiple sequences - it is a fasta that has most of the transposable elements in fruit flies found at http://www.fruitfly.org/p_disrupt/TE.html#NAT right side, third box down.  I want to be able to access the DNA sequences for manipulation and later removal from a chromosomal region.  I originally thought that I could follow the same fasta format example shown in the biopython tutorial.  However, that failed to work.  I think it might be because there are multiple entries.  
 
Basically, I just want parse the information and have dictionaries hold the transposon elements name and sequence for later use.  Can I do that with biopython or should I make my own parser? Any help would be greatly appreciated.  I'm still very much a python novice and get frustrated by not knowing how to ask my questions appropriately. 		 	   		  
_________________________________________________________________
The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with Hotmail. 
http://www.windowslive.com/campaign/thenewbusy?tile=multicalendar&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_5



More information about the Biopython mailing list