[Bioperl-l] Bioperl clustalw Vs Online clustalw

Anil Kumar anikng at gmail.com
Mon May 26 08:15:34 EDT 2014


Hi Forum  Members,

I have a doubt regarding Bioperl clustalw and it is as follows. If someone
know the solution, kindly suggest me.

Using Bioperl (clustalw 1.8V), I successfully aligned multiple sequences
from a .fasta file and saved it in clustalw format. However, the alignment
result is different from the  online clustalw result of the same sequences.

I guess that the change is due to some input parameters, but i failed to
provide the exact input parameter in order to get the alignment as online
clustalw.



Ani Lee
Seoul, ROK

Here is my code, (Also sending the input file (seq1.fasta), output file
(out.aln) and online output (online.aln.txt)

#!/usr/bin/perl
BEGIN { $ENV{CLUSTALDIR} = 'C:/clustalw'; }


use Bio::Tools::Run::Alignment::Clustalw;
use Bio::AlignIO;


$inputfilename = "seq1.fasta";
$in  = Bio::AlignIO->new(-file   => $inputfilename , ktuple => 2, matrix =>
BLOSUM, type => dna, extend => 0.1, gapopen => 10, gapext => 0.05,
                             format => 'fasta');



$out = Bio::AlignIO->new(-file   => ">out.aln" ,
                             -format => 'clustalw');



while ( my $aln = $in->next_aln() ) {
        $out->write_aln($aln);

                                   }
-------------- next part --------------
CLUSTAL W (1.83) multiple sequence alignment


1               GTTGGTAATGAAGAAGACGAGACGACGACTTCCCCACTAGGAAACACGACGGAGGCGGAG
2               ------------------------------------------------------------
3               ------------------------------------------------------------
                                                                            

1               ATGATCGACGGCGGAGAGAGCTACAGAAACATCGATGCCTCCTGTCCAATCCCCCCATCC
2               ATGATCGACGGCGGAGAGAGCTACAGAAACATCGATGCCTCCTGTCCAATCCCCCCATCC
3               ------------------------------------------------------------
                                                                            

1               CATTCGGTAGTTGGATTGAAGACTACCGAATAAGAGAAGCAGGCAGGCAGACAAACCCTT
2               CATTCGGTAGTTGGATTGAAGACTACCGAATAAGAGAAGCAGGCAGGCAGACAAACCCTT
3               ------------------------------------------------------------
                                                                            

1               GAACCAAGGAGTCCTCGCTGAGGAAGCTTTGGATCCACGACGCAGCTATGGCCTCCCCGC
2               GAACCAAGGAGTCCTCGCTGAGGAAGCTTTGGATCCACGACGCAGCTATGGCCTCCCCGC
3               ------------------------------------------------------------
                                                                            

1               CCACCAGGCCGCCAGCCACAACCAGCTGACTAGGTAGGCTTCCTAGGTAGGGATCCCATC
2               CCACCAGGCCGCCAGCCACAACCAGCTGACTAGGTAGGCTTCCTAGGTAGGGATCCCATC
3               CCACCAGGCCGCCAGCCACAACCAGCTGACTAGGTAGGCTTCCTAGGTAGGGATCCCATC
                ************************************************************

1               CCTTCGATTCCCTACTCCCTCCCCCGATTGATTTGATTTGATTTGATTTAATTCGATTGC
2               CCTTCGATTCCCTACTCCCTCCCCCGATTGATTTGATTTGATTTGATTTAATTCGATTGC
3               CCTTCGATTCCCTACTCCCTCCCCCGATTGATTTGATTTGATTTGATTTAATTCGATTGC
                ************************************************************

1               CTGCTTTTCAGGTCGCATGCATCATCAGATTTCAATCTCCCTTCGTTCCCTGTCCCTAAT
2               CTGCTTTTCAGGTCGCATGCATCATCAGATTTCAATCTCCCTTCGTTCCCTGTCCCTAAT
3               CTGCTTTTCAGGTCGCATGCATCATCAGATTTCAATCTCCCTTCGTTCCCTGTCCCTAAT
                ************************************************************

1               CCAATACCAATAGGGAGCAATCAGCTGCTCCTCGACGGCGAGGGAGATGTCGTCGGCCGC
2               CCAATACCAATAGGGAGCAATCAGCTGCTCCTCGACGGCGAGGGAGATGTCGTCGGCCGC
3               CCAATACCAATAGGGAGCAATCAGCTGCTCCTCGACGGCGAGGGAGATGTCGTCGGCCGC
                ************************************************************

1               GGGCCAAGACAACGGAGATACCGCTGGGGACTACATCAAGTGGATGTGCGGCGCCGGTGG
2               GGGCCAAGACAACGGAGATACCGCTGGGGACTACATCAAGTGGATGTGCGGCGCCGGTGG
3               GGGCCAAGACAACGGAGATACCGCTGGGGACTACATCAAGTGGATGTGCGGCGCCGGTGG
                ************************************************************

1               CCGTGCGGGCGGCGCCATGGCCAACCTCCAGCGCGGCGTTGGCTCCCTCGTCCGTGACAT
2               CCGTGCGGGCGGCGCCATGGCCAACCTCCAGCGCGGCGTTGGCTCCCTCGTCCGTGACAT
3               CCGTGCGGGCGGCGCCATGGCCAACCTCCAGCGCGGCGTTGGCTCCCTCGTCCGTGACAT
                ************************************************************

1               TGGCGACCCCTGCCTCAACCCATCCCCCGTTAAGGTTCGCCTACTCCTACTCTAGCTCCA
2               TGGCGACCCCTGCCTCAACCCATCCCCCGTTAAGGTTCGCCTACTCCTACTCTAGCTCCA
3               TGGCGACCCCTGCCTCAACCCATCCCCCGTTAAGGTTCGCCTACTCCTACTCTAGCTCCA
                ************************************************************

1               TATGGATATGGATTCGATTAGCTGTCTACATTCTATGCCATCAATTTGTTTTCCATCACT
2               TATGGATATGGATTCGATTAGCTGTCTACATTCTATGCCATCAATTTGTTTTCCATCACT
3               TATGGATATGGATTCGATTAGCTGTCTACATTCTATGCCATCAATTTGTTTTCCATCACT
                ************************************************************

1               TCTCATTATACATCTCCATTGCTCTCTAAATTAAGGCTTCTCTAGCTCTATCCTATATAT
2               TCTCATTATACATCTCCATTGCTCTCTAAATTAAGGCTTCTCTAGCTCTATCCTATATAT
3               TCTCATTATACATCTCCATTGCTCTCTAAATTAAGGCTTCTCTAGCTCTATCCTATATAT
                ************************************************************

1               ATACTAGTACTCCGTATATGATTCTGCTTCATCACTTATTTATTCATCATCATACCGTGA
2               ATACTAGTACTCCGTATATGATTCTGCTTCATCACTTATTTATTCATCATCATACCGTGA
3               ATACTAGTACTCCGTATATGATTCTGCTTCATCACTTATTTATTCATCATCATACCGTGA
                ************************************************************

1               AGCTGTATAAGTCCTGTTATTTGTCATTTGGCATATGTTGTATAGATAACACTTTCAGCC
2               AGCTGTATAAGTCCTGTTATTTGTCATTTGGCATATGTTGTATAGATAACACTTTCAGCC
3               AGCTGTATAAGTCCTGTTATTTGTCATTTGGCATATGTTGTATAGATAACACTTTCAGCC
                ************************************************************

1               CAGGGACTTTCTATTGCCACTTTTATACATATACAATCAACATTTAGCATATGTATATCC
2               CAGGGACTTTCTATTGCCACTTTTATACATATACAATCAACATTTAGCATATGTATATCC
3               CAGGGACTTTCTATTGCCACTTTTATACATATACAATCAACATTTAGCATATGTATATCC
                ************************************************************

1               TGTACCGCTCAGTTGTACTGCTGCTTTGAAGATAATGTTTGCAAAACATATGAGCTCACA
2               TGTACCGCTCAGTTGTACTGCTGCTTTGAAGATAATGTTTGCAAAACATATGAGCTCACA
3               TGTACCGCTCAGTTGTACTGCTGCTTTGAAGATAATGTTTGCAAAACATATGAGCTCACA
                ************************************************************

1               CACGAGGCTAGATGCTGCATAGGCTGCACAATGGGGTAGCCATTTTGATACTTCTTAATG
2               CACGAGGCTAGATGCTGCATAGGCTGCACAATGGGGTAGCCATTTTGATACTTCTTAATG
3               CACGAGGCTAGATGCTGCATAGGCTGCACAATGGGGTAGCCATTTTGATACTTCTTAATG
                ************************************************************

1               CTATTTATGCTGGTCCAAATGTTCATTGGTAGCCTCAGGCAGAGAATTTTAGGGAGATAT
2               CTATTTATGCTGGTCCAAATGTTCATTGGTAGCCTCAGGCAGAGAATTTTAGGGAGATAT
3               CTATTTATGCTGGTCCAAATGTTCATTGGTAGCCTCAGGCAGAGAATTTTAGGGAGATAT
                ************************************************************

1               TATGTCAAACTCCCTGGATTCACTATGATTATACAATATTACCCATTCAATCCATTCAAA
2               TATGTCAAACTCCCTGGATTCACTATGATTATACAATATTACCCATTCAATCCATTCAAA
3               TATGTCAAACTCCCTGGATTCACTATGATTATACAATATTACCCATTCAATCCATTCAAA
                ************************************************************

1               TAAATTTCCAGGTCAACTCACATATTGTACTGAAAGAAGACTAATTTGCACTAATCATAT
2               TAAATTTCCAGGTCAACTCACATATTGTACTGAAAGAAGACTAATTTGCACTAATCATAT
3               TAAATTTCCAGGTCAACTCACATATTGTACTGAAAGAAGACTAATTTGCACTAATCATAT
                ************************************************************

1               CCTTTTATCCACCAGAGTATATGAGATTGTCTTCTATATGCAATTTACTTCATATCGACA
2               CCTTTTATCCACCAGAGTATATGAGATTGTCTTCTATATGCAATTTACTTCATATCGACA
3               CCTTTTATCCACCAGAGTATATGAGATTGTCTTCTATATGCAATTTACTTCATATCGACA
                ************************************************************

1               TTTTTCTGCAGAATTGCAAAACTGACGCCTTGTTTTCCATCTGGAATTGTGCAGGGGAGC
2               TTTTTCTGCAGAATTGCAAAACTGACGCCTTGTTTTCCATCTGGAATTGTGCAGGGGAGC
3               TTTTTCTGCAGAATTGCAAAACTGACGCCTTGTTTTCCATCTGGAATTGTGCAGGGGAGC
                ************************************************************

1               AAAATGCTCAAACCGGAAAAATGGCACACATGTTTTGATAATGATGGAAAGGTCATAGGT
2               AAAATGCTCAAACCGGAAAAATGGCACACATGTTTTGATAATGATGGAAAGGTCATAGGT
3               AAAATGCTCAAACCGGAAAAATGGCACACATGTTTTGATAATGATGGAAAGGTCATAGGT
                ************************************************************

1               TTCCGTAAAGCCCTAAAATTCATTGTCTTAGGGGTGAGTTAATTGTTTCTTTTGTGCTTC
2               TTCCGTAAAGCCCTAAAATTCATTGTCTTAGGGGTGAGTTAATTGTTTCTTTTGTGCTTC
3               TTCCGTAAAGCCCTAAAATTCATTGTCTTAGGGGTGAGTTAATTGTTTCTTTTGTGCTTC
                ************************************************************

1               AAAAACTTGTTTTTTCATGTATTTTAGCTGTTGAAGGGTGTGGATTTTTTTTTCAGTCTT
2               AAAAACTTGTTTTTTCATGTATTTTAGCTGTTGAAGGGTGTGGATTTTTTTTTCAGTCTT
3               AAAAACTTGTTTTTTCATGTATTTTAGCTGTTGAAGGGTGTGGATTTTTTTTTCAGTCTT
                ************************************************************

1               AATACATTACCTTTTAGAGCATGATGCCCCACGCGCCATTTGTGAATGTGTAAAATAGGG
2               AATACATTACCTTTTAGAGCATGATGCCCCACGCGCCATTTGTGAATGTGTAAAATAGGG
3               AATACATTACCTTTTAGAGCATGATGCCCCACGCGCCATTTGTGAATGTGTAAAATAGGG
                ************************************************************

1               AAGATCAAATAGAACGTGACAACTGTTTATTTTATTACCATGATCATTGTTCTCTTTGAG
2               AAGATCAAATAGAACGTGACAACTGTTTATTTTATTACCATGATCATTGTTCTCTTTGAG
3               AAGATCAAATAGAACGTGACAACTGTTTATTTTATTACCATGATCATTGTTCTCTTTGAG
                ************************************************************

1               AGTGTCTGCTTTGGAAATTTCAAAGAAATTAATGCATATGATTGCGTAGCTCTAGTTTTT
2               AGTGTCTGCTTTGGAAATTTCAAAGAAATTAATGCATATGATTGCGTAGCTCTAGTTTTT
3               AGTGTCTGCTTTGGAAATTTCAAAGAAATTAATGCATATGATTGCGTAGCTCTAGTTTTT
                ************************************************************

1               CTTAAAAAAATAAGCGGCTAGGCTCTTTATACTATTATGTCACAGAGTGTTTGCCTTTTA
2               CTTAAAAAAATAAGCGGCTAGGCTCTTTATACTATTATGTCACAGAGTGTTTGCCTTTTA
3               CTTAAAAAAATAAGCGGCTAGGCTCTTTATACTATTATGTCACAGAGTGTTTGCCTTTTA
                ************************************************************

1               CCTGTGTATGTAGCTAGGCATAAACATTTGTTTGTGGGTAAATACAAAAAAAAATCCAAT
2               CCTGTGTATGTAGCTAGGCATAAACATTTGTTTGTGGGTAAATACAAAAAAAAATCCAAT
3               CCTGTGTATGTAGCTAGGCATAAACATTTGTTTGTGGGTAAATACAAAAAAAAATCCAAT
                ************************************************************

1               GTGTAGAAACTGCTTATAAATACATAAAAGTTACATAAATACATAACTGCTTCAAATAAG
2               GTGTAGAAACTGCTTATAAATACATAAAAGTTACATAAATACATAACTGCTTCAAATAAG
3               GTGTAGAAACTGCTTATAAATACATAAAAGTTACATAAATACATAACTGCTTCAAATAAG
                ************************************************************

1               CAAGATTTGAACATACCTTCACATAAGCTTTCCCCTAACTGAAAAGCCATTTGGTATAAC
2               CAAGATTTGAACATACCTTCACATAAGCTTTCCCCTAACTGAAAAGCCATTTGGTATAAC
3               CAAGATTTGAACATACCTTCACATAAGCTTTCCCCTAACTGAAAAGCCATTTGGTATAAC
                ************************************************************

1               CTTATAGAAATTAGATATAGTGGAGATGTGGAATGAGCTGCTATCAAGTCTTGTTTCGTG
2               CTTATAGAAATTAGATATAGTGGAGATGTGGAATGAGCTGCTATCAAGTCTTGTTTCGTG
3               CTTATAGAAATTAGATATAGTGGAGATGTGGAATGAGCTGCTATCAAGTCTTGTTTCGTG
                ************************************************************

1               AAAGCAAACTGACACGGTCTCCACAGCAAACACGTCCACAGCTTTGCAATTAGTATCTTT
2               AAAGCAAACTGACACGGTCTCCACAGCAAACACGTCCACAGCTTTGCAATTAGTATCTTT
3               AAAGCAAACTGACACGGTCTCCACAGCAAACACGTCCACAGCTTTGCAATTAGTATCTTT
                ************************************************************

1               ATAGGAAGGTCAATCAGAGTTCATTGGCCATATCTTGGAAAAAAAAGGCATCAGATTTTA
2               ATAGGAAGGTCAATCAGAGTTCATTGGCCATATCTTGGAAAAAAAAGGCATCAGATTTTA
3               ATAGGAAGGTCAATCAGAGTTCATTGGCCATATCTTGGAAAAAAAAGGCATCAGATTTTA
                ************************************************************

1               GTTTAATTATGTCTATATTTTAGGCATGGATTATTGATCATTTTTGCAGCAGACTAAGTT
2               GTTTAATTATGTCTATATTTTAGGCATGGATTATTGATCATTTTTGCAGCAGACTAAGTT
3               GTTTAATTATGTCTATATTTTAGGCATGGATTATTGATCATTTTTGCAGCAGACTAAGTT
                ************************************************************

1               TAATGCATGTAGTTATTTTCCTTGATTAGTGATTAGTGTTGTTATTTTCTATACACCACT
2               TAATGCATGTAGTTATTTTCCTTGATTAGTGATTAGTGTTGTTATTTTCTATACACCACT
3               TAATGCATGTAGTTATTTTCCTTGATTAGTGATTAGTGTTGTTATTTTCTATACACCACT
                ************************************************************

1               GGCACTGATAGATGTGCTCCATGCTTTGCAATGCATCAAAATTATCATATCGTGCCTAAC
2               GGCACTGATAGATGTGCTCCATGCTTTGCAATGCATCAAAATTATCATATCGTGCCTAAC
3               GGCACTGATAGATGTGCTCCATGCTTTGCAATGCATCAAAATTATCATATCGTGCCTAAC
                ************************************************************
-------------- next part --------------
A non-text attachment was scrubbed...
Name: out.aln
Type: application/octet-stream
Size: 13465 bytes
Desc: not available
URL: <http://lists.open-bio.org/pipermail/bioperl-l/attachments/20140526/0e9d35d8/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: seq1.fasta
Type: application/octet-stream
Size: 6962 bytes
Desc: not available
URL: <http://lists.open-bio.org/pipermail/bioperl-l/attachments/20140526/0e9d35d8/attachment-0001.obj>


More information about the Bioperl-l mailing list