[Bioperl-l] $hsp->seq_inds and axt file

Jason Stajich jason.stajich at duke.edu
Thu Jul 21 15:12:39 EDT 2005


There's no midline/homology line in the axt format, so there is no way to know
which columns are identical, and I don't see how seq_inds can work here.
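
Since the two aligned strings are still available from the HSP, one possible
workaround is to compare them column by column yourself. What follows is only
a minimal sketch under stated assumptions: identical_query_positions is a
hypothetical helper, not part of BioPerl; it assumes '-' is the gap character,
that upper/lower case differences should be ignored, and that the query is on
the plus strand.

```perl
use strict;
use warnings;

# Hypothetical helper (not a BioPerl method): return the query-sequence
# positions of alignment columns where query and hit carry the same residue.
sub identical_query_positions {
    my ($query_aln, $hit_aln, $query_start) = @_;
    die "aligned strings differ in length\n"
        unless length($query_aln) == length($hit_aln);

    my @positions;
    my $qpos = $query_start - 1;    # position of the last query residue seen
    for my $col (0 .. length($query_aln) - 1) {
        # Assumption: case is not significant, so compare in upper case.
        my $q = uc substr($query_aln, $col, 1);
        my $h = uc substr($hit_aln,   $col, 1);
        next if $q eq '-';          # gap in the query: no query residue here
        $qpos++;
        push @positions, $qpos if $q eq $h;   # a hit gap never equals a residue
    }
    return @positions;
}

# Toy alignment, not taken from the axt sample.
my @ids = identical_query_positions('AC-GT', 'ACTGA', 10);
print "@ids\n";    # prints "10 11 12"
```

Inside the HSP loop this could be called with $hsp->query_string,
$hsp->hit_string and $hsp->start('query'); a minus-strand query would need the
position counter to run downwards instead.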

-jason
-- 
Jason Stajich
jason.stajich at duke.edu
http://www.duke.edu/~jes12/


Quoting Alynne Chiromatzo <chiromatzo at gmail.com>:

> Hi!
> 
> I'm having trouble getting $hsp->seq_inds to work with an axt file (a whole
> genome alignment from the UCSC Genome Browser). The code is below, along
> with a sample of the input file. It doesn't show the indices it is supposed
> to contain. Can anyone help me?
> 
> Thanks very much!
> 
> Alynne Oya.
> 
> #!/usr/bin/perl
> 
> use strict;
> use warnings;
> use Bio::SearchIO;
> 
> my $parser = Bio::SearchIO->new(-format => 'axt',
>                                 -file   => '/work/project/align/testeaxt');
> while ( my $result = $parser->next_result ) {
>   while ( my $hit = $result->next_hit ) {
>     while ( my $hsp = $hit->next_hsp ) {
>       print "Rank: ".$hsp->rank." Strand: ".$hsp->strand('hit')."\n";
>       print "Query Name: ".$result->query_name." Hit Name: ".$hit->name."\n";
>       # finds the start/end values, but they come back offset by +1
>       my ($query_beg, $query_end) = $hsp->range('query');
>       my ($hit_beg, $hit_end)     = $hsp->range('hit');
>       print "Range: ".($query_beg-1)."-".($query_end-1)." ".($hit_beg-1)."-".($hit_end-1)."\n";
>       print $hsp->query_string."\n".$hsp->hit_string."\n";
>       my @h_ind = $hsp->seq_inds('query', 'identical', 1);
> 
>       # The indices are not printed here the way they are supposed to be
>       foreach (@h_ind) {
>         print "==> ".$_." ";
>       }
>       print "\n";
>     }
>   }
> }
> 
> This is a sample of the input file:
> 
> 1 SCAFFOLD1 1535 1688 chrX 44389546 44389697 + 6498
> TACAATAGGTCAAGGGTCTGCAAACTATAGGTTTAAAAATTAAAAAGAA-GAAAAATATATGGTGGAGACTGGTTGGGATCATAAAGCCCAATATATTTATTGTATGGTCtgtgt-tagccaggagtcttcagagaaacagaaccaataagataCA
> TACAATAAATCAGAGGTCAGCAAGCTATAGGTTTT----TTAAACAGGACAAAAAATATACAACAGAGAAAATGTAGGACCAGAAAACCCAACATATTTATTATATGGGCTTTTTGTGgtcagggttctcctgtgaaacaggaccaataggatgta
> 
> 3 SCAFFOLD1 3665 3845 chrX 44391563 44391740 + 7187
> CCCTAAAAAGTCA-GTTTTTCA------AGAAGCATAAGCATAGTGTAAATGTAGGAGTTCATAGATCCATAGCAGGGAGAGCTGTTTAGCCTACTTATAGCTTATTTCCAGCTTATATCATCTGTTTGGGGCACGGTCATCCCTAGAGGCAGAGGAA-GAGATTTGGAATGAGGTTTTAGCATGATAT
> TCCTGAAAATTTATATTTTTCACCAAGAAGAAACATAAACATCTTGCACA---AGGA---CATAAATCTATAGCTGGGGGTGCTGTT-AGTCTAGTTCTAGCATATTTCTAGCCTACATCATCTGTTTGGGGCATAATCATGTCTGGAAGAAAAGGAATGAGGTTTG----GGGATTTTAGCATGGTAT
> 
> 17 SCAFFOLD2 22789 22919 chrX 44409117 44409239 - 5180
> AGAATACACATCATAGTTATCATAGGGGAAT-GTTTAGGTGGCAGGATAAGGCATATTT--TTTTCTTTTCTCTGGTCTGTAAATTCTCTAACATAACTATATTGCTTTTAAATTTTAAATTGATTTTCAATTA
> agaaaacacacc-cacttataatagtggatttgtccaggtggcaggactatacatctttgttttctttttttcttgtTTATAAATGTTCTAATATAACTATATTGCCtttaaa----------atttttaatta
> 