[Biopython] how to validate fasta format

Peter biopython at maubp.freeserve.co.uk
Tue Oct 27 12:36:52 UTC 2009


On Tue, Oct 27, 2009 at 12:03 PM, Yvan Strahm <yvan.strahm at bccs.uib.no> wrote:
> Yes by validating I mainly meant check for the correct alphabet in the Seq
> object but also the correct header's format. So I guess, I have to trust the
> user.... ;-)

The FASTA header is basically free format - almost anything is valid,
although some tools object to things like pipes and underscores.
You will need to test the data in terms of your own criteria.
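
As a rough illustration of testing against your own criteria, here is a
minimal sketch in plain Python. The allowed letters (ACGTN) and the header
pattern are assumptions made up for this example, not FASTA rules, and
read_fasta and check_record are hypothetical helper names rather than part
of Biopython.

```python
import re

# Hypothetical criteria: adjust both to your own needs.
ALLOWED_LETTERS = set("ACGTN")  # assumed DNA alphabet
# Assumed rule: the ID is letters, digits and dots, optionally followed by
# a space and a free-text description.
HEADER_PATTERN = re.compile(r"^[A-Za-z0-9.]+( .*)?$")


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is None:
            raise ValueError("sequence data found before any '>' header line")
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def check_record(header, sequence):
    """Return a list of problems found in one record (empty if none)."""
    problems = []
    if not HEADER_PATTERN.match(header):
        problems.append("header does not match expected pattern")
    if not sequence:
        problems.append("empty sequence")
    bad = sorted(set(sequence.upper()) - ALLOWED_LETTERS)
    if bad:
        problems.append("unexpected letters: " + "".join(bad))
    return problems


example = """>seq1 first test record
ACGTNNACGT
>seq|2 header with a pipe
ACGTXYZ
""".splitlines()

for header, sequence in read_fasta(example):
    print(header, "->", check_record(header, sequence) or "OK")
```

If you are already parsing with Biopython, the same check_record idea
should carry over to the records returned by SeqIO.parse(handle, "fasta"),
passing record.description and str(record.seq) instead of using the
hand-written reader.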

Peter


