[Bioperl-l] store variations, generate sequences

Marian Thieme marian.thieme at klinik.uni-regensburg.de
Tue Jan 2 16:48:18 UTC 2007


Hi all,

I am quite new to bioperl and I have a question about sequence data: I
am working on a resequencing project. Here we have resequenced 1000
genes of a certain gene. My question: What is easiest way to store each
discovered variation of each individual and get a fasta sequence for an
arbitrary individual.

I would expect that there is some way to set up a reference sequence and
store all variationsm relative to this reference sequence. Afterward it
should be possible to genereate sequences for each indiviudal in
question, right ?

My approach was the following:

I have created an seqdiff object:

$seqDiff = Bio::Variation::SeqDiff->new (...)


and I have assigned the reference sequence to that object via:

$seqDiff->dna_ori('atgcgtatatg');


Now I thought, I can create some variations via DNAMutation object:

$dnamut = Bio::Variation::DNAMutation->new (
  -start => 6,
  -end => 6,
  -length => 1,
  -isMutation => 1,
  -upStreamSeq => 'atgcg',
  -dnStreamSeq => 'atatg'
);

$a1 = Bio::Variation::Allele->new;
$a1->seq('t');
$dnamut->allele_ori($a1);

my $a2 = Bio::Variation::Allele->new;
$a2->seq('a');
$dnamut->add_Allele($a2);



Is that the correct way to describe the reference sequence, describe a
variation and attach this to seqdiff object ?
Probably I didnt understand the api right. (I did assume start/end means
start/endposition of the mutation). Is it possible to get a complete
sequence print (fast format) of each variation/indiviudal ?

Regards,
Marian

-- 
Marian Thieme
University Regensburg
Institute of Functional Genomics
Josef-Engert-Str. 9
93053
Regensburg
Germany
P: 0049 (0)941 943 5055
F: 0049 (0)941 943 5020
E: marian.thieme at klinik.uni-regensburg.de
W: http://www-cgi.uni-regensburg.de/Klinik/FunktionelleGenomik




More information about the Bioperl-l mailing list