[Bioperl-l] Retrieve FASTA seqs with NCBI definition line

Mikaela Ilinca Gabrielli MILG at lundbeck.com
Mon Apr 7 15:45:08 EDT 2003


Thanks for the clarification!

I'm afraid my programming skills are still in their infancy so I make no
promise of a new function. But I'll give it a try.

Meanwhile, if anyone feels keen and capable to undertake this task and code
a seq2NCBIfasta function, it would be appreciated.

Cheers,

/Mikaela

-----Original Message-----
From: Heikki Lehvaslaiho [mailto:heikki at ebi.ac.uk]
Sent: 07-apr-2003 14:26
To: Mikaela Ilinca Gabrielli
Cc: Bioperl
Subject: Re: [Bioperl-l] Retrieve FASTA seqs with NCBI definition line


Mikaela,

I do not think there is a shortcut, and I do not think modifying the
way fasta is processed from the sequence information is a good idea. So
many things depend on it. You have to manually modify the return values
of the methods display_id() and desc().

Assuming you go ahead and do it, could you put the code into a function
(seq2NCBIfasta  ? ) which could be added into Bio::SeqUtils?

Then anyone needing to do the same thing could:

use Bio::SeqIO;
my $out = Bio::SeqIO->new(-format => 'fasta');
# seq2NCBIfasta is the proposed function; it is not yet in Bio::SeqUtils
$out->write_seq(Bio::SeqUtils->seq2NCBIfasta($seq));
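
A minimal sketch of what such a function might look like, written as a
plain subroutine rather than as part of Bio::SeqUtils. It assumes that,
for a record fetched from GenPept, primary_id() holds the gi number,
accession_number() holds the accession without its version, and
seq_version() (when the object has it) holds the version. The database
tag ("ref", "gb", ...) is assumed to be supplied by the caller, and the
helper name ncbi_display_id is made up for illustration.

```perl
use strict;
use warnings;

# Build an NCBI-style identifier such as "gi|4504379|ref|NP_003658.1|".
# The database tag ("ref", "gb", "emb", ...) is passed in by the caller.
sub ncbi_display_id {
    my ($gi, $db, $acc, $version) = @_;
    my $acc_ver = (defined $version && length $version) ? "$acc.$version" : $acc;
    return join('|', 'gi', $gi, $db, $acc_ver) . '|';
}

# Hypothetical seq2NCBIfasta: overwrite display_id() on the given sequence
# object so that a fasta writer prints the NCBI-style definition line.
# desc() is left untouched because it already holds the description text.
sub seq2NCBIfasta {
    my ($seq, $db) = @_;
    $db = 'ref' unless defined $db;    # assumed default: RefSeq
    # Only call seq_version() if the object provides it.
    my $version = $seq->can('seq_version') ? $seq->seq_version : undef;
    $seq->display_id(
        ncbi_display_id($seq->primary_id, $db, $seq->accession_number, $version)
    );
    return $seq;
}
```

With this sketch the call would be $out->write_seq(seq2NCBIfasta($seq))
instead of the class-method form above. Note that it changes the sequence
object in place; copy the object first if the original display_id is
still needed.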

Cheers,
	-Heikki


On Mon, 2003-04-07 at 12:06, Mikaela Ilinca Gabrielli wrote:
> Dear all, 
> 
> I'd like to retrieve sequences from GenPept that are in fasta format AND
> include the NCBI definition line. I thought this was easy but as I apply
> Bio::DB::GenPept I get only a part of the NCBI definition line - missing
> gi and accession number information.
> 
> ex def-line from NCBI:
> >gi|4504379|ref|NP_003658.1| G protein-coupled receptor 49; orphan G
> protein-coupled receptor HG38; G protein-coupled receptor 67 [Homo sapiens]
> 
> ex defline retrieved through Bio:
> 
> >GPR49 G protein-coupled receptor 49; orphan G protein-coupled receptor
> HG38; G protein-coupled receptor 67 [Homo sapiens]
> 
> Is there any easy way to get around this or do I have to use
> '$seq->primary_id' and '$seq->accession_number' to "cut&paste" my own
> fasta records that look like those in NCBI?
> 
> Best regards,
> 
> Mikaela
> 
-- 
______ _/      _/_____________________________________________________
      _/      _/                      http://www.ebi.ac.uk/mutations/
     _/  _/  _/  Heikki Lehvaslaiho          heikki at ebi.ac.uk
    _/_/_/_/_/  EMBL Outstation, European Bioinformatics Institute
   _/  _/  _/  Wellcome Trust Genome Campus, Hinxton
  _/  _/  _/  Cambs. CB10 1SD, United Kingdom
     _/      Phone: +44 (0)1223 494 644   FAX: +44 (0)1223 494 468
___ _/_/_/_/_/________________________________________________________