[Biopython] Writing fasta+qual files and adjusting adapter clip positions in sff files

Martin Mokrejs mmokrejs at fold.natur.cuni.cz
Tue Apr 5 13:21:10 UTC 2011


Hi Peter,
  I was looking into the Tutorial for a way to write fasta+qual files
but couldn't find it. I wanted to trim my objects assembled through
SeqIO.QualityIO.PairedFastaQualIterator. Could
_record[_start:_stop]
be used?



  Anyways, I found that there is a way to convert sff files into re-trimmed
sff files which is even closer to my goal. Here is the help text from SffIO:

        >>> from Bio import SeqIO
        >>> def filter_and_trim(records, primer):
        ...     for record in records:
        ...         if record.seq[record.annotations["clip_qual_left"]:].startswith(primer):
        ...             record.annotations["clip_qual_left"] += len(primer)
        ...             yield record
        >>> records = SeqIO.parse("Roche/E3MFGYR02_random_10_reads.sff", "sff")
        >>> count = SeqIO.write(filter_and_trim(records,"AAAGA"),
        ...                     "temp_filtered.sff", "sff")
        >>> print "Selected %i records" % count
        Selected 2 records



And this code from the Tutorial:

>>> from Bio import SeqIO
>>> SeqIO.convert("E3MFGYR02_random_10_reads.sff", "sff-trim", "trimmed.fasta", "fasta")
10
>>> SeqIO.convert("E3MFGYR02_random_10_reads.sff", "sff-trim", "trimmed.qual", "qual")
10
>>> SeqIO.convert("E3MFGYR02_random_10_reads.sff", "sff-trim", "trimmed.fastq", "fastq")
10



  My questions is: could I provide readnames with clip_adapter_right directly to
SeqIO.convert()? Well I will probably stick to 'sfffile -t trimpositions.txt myfile.sff'
anyways hoping it will be faster. ;)

Martin




More information about the Biopython mailing list