[Bioperl-l] mummer3 output format

Albert Vilella avilella at gmail.com
Thu Mar 1 10:45:19 EST 2012


Hi,

I am trying to understand how to transform Mummer3's output format
into something I can pipe into another program, like MAF or similar.
How can I parse the results so that I can then do a write_aln into MAF
o similar?

Details:

If I run nucmer v.3.23 with the options below, I get an out.delta like this:

~/MUMmer3.23/nucmer -maxgap $g -l $l $ref $qry

------------------
Leishmania_major.LM2.12.dna.toplevel.fa
LtarParrotTarIIGenomic_TriTrypDB-4.0.fasta
NUCMER
>LmjF.34 ULAVAL|LtaPseq521 1866748 641
959335 959806 169 640 91 91 0
20
17
-3
-2
-183
5
0
>LmjF.12 ULAVAL|LtaPseq501 675346 1438
322990 324081 1436 342 178 178 0
-45
-1
-1
-1

This doesn't look like any of the formats in t/AlignIO/mummer.t to me.

I can also run:

~/MUMmer3.23/show-aligns out.delta $region1 $region2

Which gives me something that looks like a blast or exonerate output, like so:

------
Leishmania_major.LM2.12.dna.toplevel.fa
LtarParrotTarIIGenomic_TriTrypDB-4.0.fasta

============================================================
-- Alignments between LmjF.34 and ULAVAL|LtaPseq521

-- BEGIN alignment [ +1 959335 - 959806 | +1 169 - 640 ]


959335     cacacgcctcgtagaggtctccttgctttcgcgcggtgc.c.tcacttg
169        cacacgcctcgtagagatc.ccctgccttcgcgcgg.gctcttcacttg
                           ^  ^  ^   ^         ^  ^ ^

959382     cgcatgcggtagtagaagagaatgctgtgggcccacccagcgtagttgc
216        cgcatgcggtagtagaagagaatgctgtgtgcccacccagcgtagttgc
                                        ^

959431     caaacagcttccggaaggcctcctgaatgacgttatgatgccgctcgta
265        caaacagtttccagaaggcatcctggataacattatgatgccgttcgta
                  ^    ^      ^     ^  ^  ^           ^

959480     caagggtgggacaggcgtttttcgtgaggcgcgcagcggggctgctgca
314        caggggcggcacaggtgttttccgtaaggcacgtgaagaggtcgttgca
             ^   ^  ^     ^     ^   ^    ^  ^^^^ ^  ^^ ^

959529     gagcttccaccttcctctatcgccttta.cggtcgctggcgacacgcct
363        gagcctccgtttcccttcaccgcccgcagcgat.gatgatgtcactcct
               ^   ^^^ ^   ^^ ^    ^^^ ^  ^ ^ ^  ^^ ^   ^

959577     ttcttaaccttgagaacctccgcctgcttcctccactccagcagcagat
411        ttcttcaccttgagagcctccgcctggttcttccactccaggagaagat
                ^         ^          ^   ^          ^  ^

959626     tatcccgtgagcgggcttcctcttcgggcaacggacaccctggacgaga
460        cagtgggtgcgcagacttcttcttcgcgcagtagagaccctgagcgaga
           ^ ^^^^   ^  ^ ^    ^      ^   ^^^  ^      ^^

959675     gcgcttacgacccaccgccgtcgcggcgcttggtgcggcaaggtactcc
509        acgctttcgacccgccgatgtcacggtgcttgcggtggcaagatactcc
           ^     ^      ^   ^^   ^   ^     ^^ ^      ^

959724     accgcaacttgcgccatgtgcgtgtccacggggacaatgtgggtgcggt
558        accgaaacctgcgccatgtgtgtgtccacggggacgatgtgggtgcggt
               ^   ^           ^              ^

959773     tgagcgcgaagagcgccacgcagtcagcaacttt
607        tgagagcaaagagcgccacgcaatccgccacttt
               ^  ^              ^  ^  ^


--   END alignment [ +1 959335 - 959806 | +1 169 - 640 ]

============================================================



More information about the Bioperl-l mailing list