From bugzilla-daemon at portal.open-bio.org Mon May 2 10:53:07 2005
From: bugzilla-daemon at portal.open-bio.org (bugzilla-daemon@portal.open-bio.org)
Date: Mon May 2 11:21:04 2005
Subject: [Biopython-dev] [Bug 1777] Bio.Emboss.Primer patch for
hybridization probes
Message-ID: <200505021453.j42Er7rG011078@portal.open-bio.org>
http://bugzilla.open-bio.org/show_bug.cgi?id=1777
------- Additional Comments From jhessel@gmail.com 2005-05-02 10:53 -------
Created an attachment (id=205)
--> (http://bugzilla.open-bio.org/attachment.cgi?id=205&action=view)
Patches and Tests for Bio.Emboss.Primer
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
From bugzilla-daemon at portal.open-bio.org Mon May 2 10:55:03 2005
From: bugzilla-daemon at portal.open-bio.org (bugzilla-daemon@portal.open-bio.org)
Date: Mon May 2 11:21:05 2005
Subject: [Biopython-dev] [Bug 1777] Bio.Emboss.Primer patch for
hybridization probes
Message-ID: <200505021455.j42Et3gK011100@portal.open-bio.org>
http://bugzilla.open-bio.org/show_bug.cgi?id=1777
------- Additional Comments From jhessel@gmail.com 2005-05-02 10:55 -------
Created an attachment (id=206)
--> (http://bugzilla.open-bio.org/attachment.cgi?id=206&action=view)
Flat file with patches, test and sample file concatenated
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
From bugzilla-daemon at portal.open-bio.org Mon May 2 10:50:38 2005
From: bugzilla-daemon at portal.open-bio.org (bugzilla-daemon@portal.open-bio.org)
Date: Mon May 2 11:21:06 2005
Subject: [Biopython-dev] [Bug 1777] New: Bio.Emboss.Primer patch for
hybridization probes
Message-ID: <200505021450.j42Eoc4W011056@portal.open-bio.org>
http://bugzilla.open-bio.org/show_bug.cgi?id=1777
Summary: Bio.Emboss.Primer patch for hybridization probes
Product: Biopython
Version: Not Applicable
Platform: All
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: Main Distribution
AssignedTo: biopython-dev@biopython.org
ReportedBy: jhessel@gmail.com
I've patched Bio.Emboss.Primer to correctly parse results from a search for
hybridization probes. The patch for Primer.py and primer3_format.py is pasted
below. I've also updated Tests/test_Emboss.py and included a sample file, but
I'm not sure how to submit those (the test runs without errors).
Thanks,
Jay Hesselberth
*** Primer.py Fri Feb 18 13:14:18 2005
--- /usr/lib/python2.3/site-packages/Bio/Emboss/Primer.py Sun May 1
09:57:08 2005
***************
*** 227,232 ****
--- 227,247 ----
def reverse_seq(self, seq):
self._cur_primer.reverse_seq = seq
+ def internal_start(self, start):
+ self._cur_primer.internal_start = int(start)
+
+ def internal_length(self, length):
+ self._cur_primer.internal_length = int(length)
+
+ def internal_tm(self, tm):
+ self._cur_primer.internal_tm = float(tm)
+
+ def internal_gc(self, gc):
+ self._cur_primer.internal_gc = float(gc)
+
+ def internal_seq(self, seq):
+ self._cur_primer.internal_seq = seq
+
def end_record(self):
self._add_last_primer()
***************
*** 240,245 ****
--- 255,262 ----
"forward_tm", "forward_gc", "forward_seq",
"reverse_start", "reverse_length",
"reverse_tm", "reverse_gc", "reverse_seq",
+ "internal_start", "internal_length",
+ "internal_tm", "internal_gc", "internal_seq",
"end_record"]
expression = Martel.select_names(primer3_format.record,
*** primer3_format.py Sun May 11 12:27:46 2003
--- /usr/lib/python2.3/site-packages/Bio/Emboss/primer3_format.py Sun May
1 09:59:04 2005
***************
*** 71,76 ****
--- 71,95 ----
any_space + reverse_primer_gc + any_space +
reverse_primer_seq + Martel.AnyEol())
+ # INTERNAL OLIGO 197 20 59.96 40.00 AATTGCACATAACGGATGCA
+
+ internal_oligo_start = Martel.Group("internal_start",
+ any_integer)
+ internal_oligo_length = Martel.Group("internal_length",
+ any_integer)
+ internal_oligo_tm = Martel.Group("internal_tm",
+ any_float)
+ internal_oligo_gc = Martel.Group("internal_gc",
+ any_float)
+ internal_oligo_seq = Martel.Group("internal_seq",
+ sequence)
+ internal_line = Martel.Group("internal_line",
+ primer_space + Martel.Str("INTERNAL OLIGO") +
+ any_space + internal_oligo_start + any_space +
+ internal_oligo_length + any_space + internal_oligo_tm +
+ any_space + internal_oligo_gc + any_space +
+ internal_oligo_seq + Martel.AnyEol())
+
# XXX This record definition is ugly. But it works :-)
record = Martel.Group("primer3_record",
***************
*** 90,97 ****
reverse_line + blank_line,
# case 2b1. Reverse primer
reverse_line + blank_line,
! # case 2b2, Forward primer
! forward_line + blank_line)) +
blank_line + blank_line + Martel.Rep(blank_line)))
--- 109,118 ----
reverse_line + blank_line,
# case 2b1. Reverse primer
reverse_line + blank_line,
! # case 2b2, Forward primer
! forward_line + blank_line,
! # case 2b3, Internal oligo
! internal_line + blank_line)) +
blank_line + blank_line + Martel.Rep(blank_line)))
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
From Frederic.Sohm at iaf.cnrs-gif.fr Wed May 11 07:20:50 2005
From: Frederic.Sohm at iaf.cnrs-gif.fr (=?iso-8859-1?b?RnLpZOlyaWM=?= Sohm)
Date: Wed May 11 07:16:09 2005
Subject: [Biopython-dev] Rethinking Seq objects
Message-ID: <1115810450.4281ea922610d@mail.iaf.cnrs-gif.fr>
Hi Michiel and everyone,
I have no problem with suppressing the mutable Seq object. The solution proposed
should be OK.
But last time I posted something and forgot the subject so it got lost in the
mailing list instead of coming to the right place, here is the proposition :
Just a thought, don't flame me for that.
Since you will be making a new Seq object, will it be worth making it behave
more like a typical object :
But first a disclaimer, I realise the proposed change could mean breaking a
lot of code, so it might a very bad idea in the end.
When I did first used Biopython, I have been surprised by the behaviour of
Seq object, in regards of the use of the built-in str() and repr() functions
(I should have read the manual first, but hey...) :
Ok here is a the Seq behaviour :
>>> from Bio.Seq import Seq
>>> a = 'a'*80
>>> a
'aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaa'
>>> s = Seq(a)
>>> s
Seq('aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaa', Alphabet())
>>> str(s)
"Seq('aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaa ...', Alphabet())"
>>> repr(s)
"Seq('aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaa', Alphabet())"
>>> s.tostring()
'aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaa'
Now here is what I was expecting at the time following the respective
meaning of str and repr
>>> a = 'a'*80
>>> a
'aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaa'
>>> s = Seq(a)
>>> s
Seq('aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaa', Alphabet())
>>> str(s)
'aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaa'
>>> repr(s)
"Seq('aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaa', Alphabet())"
So what I would propose is to :
- change str(seq) to return the actual sequence as do seq.tostring() right
now. leave repr(seq) as it is,
- make seq.tostring() return str(seq) for backward compatibity. (Would be
eventually removed).
- add a new function Seq.short() for example which would behave like the
actual str(Seq).
I don't have any idea how much code this would break. And the feasability of
it will as well depends on the way the new Seq will be release (I mean do you
plan to have the actual Seq and the new one co-existing for a while or to
directly replace the old Seq?).
If the later is the way we go this change is certainly not desirable,
otherwise it might be something to consider.
Personally I have mix filling about it, but I think it is worth discussing
the matter now.
This change would make the Seq objects behave more like a Python programmer
would expect, on the other hand Biopython have been built on the current
model and this might be a bad idea to change after so much time.
Since the only real problem with this is the replacement of the str() method
all boiled down to how frequently people use the actual string method of Seq
in their code?
I do not have the impression it is very frequent but ...
What do you think ?
Fred
--
Fr?d?ric Sohm
Equipe INRA U1126 "Morphogen?se du syst?me nerveux des Chord?s"
UPR 2197 DEPSN, CNRS
Institut de Neurosciences A. Fessard
1 Avenue de la Terrasse
91 198 GIF-SUR-YVETTE
FRANCE
Phone: +33 (0) 1 69 82 34 12
Fax:+33 (0) 1 69 82 34 47
From hoffman at ebi.ac.uk Wed May 11 07:42:04 2005
From: hoffman at ebi.ac.uk (Michael Hoffman)
Date: Wed May 11 07:36:43 2005
Subject: [Biopython-dev] Rethinking Seq objects
In-Reply-To: <1115810450.4281ea922610d@mail.iaf.cnrs-gif.fr>
References: <1115810450.4281ea922610d@mail.iaf.cnrs-gif.fr>
Message-ID:
On Wed, 11 May 2005, Fr?d?ric Sohm wrote:
> - change str(seq) to return the actual sequence as do seq.tostring() right
> now.
+1
> - make seq.tostring() return str(seq) for backward compatibity. (Would be
> eventually removed).
+1
> - add a new function Seq.short() for example which would behave like the
> actual str(Seq).
Actually, if this is just for programmer/debugging use, it would be
much better for this behavior to be in repr.
--
Michael Hoffman
European Bioinformatics Institute
From Frederic.Sohm at iaf.cnrs-gif.fr Wed May 11 09:04:57 2005
From: Frederic.Sohm at iaf.cnrs-gif.fr (=?iso-8859-1?b?RnLpZOlyaWM=?= Sohm)
Date: Wed May 11 08:57:53 2005
Subject: [Biopython-dev] Rethinking Seq objects
Message-ID: <1115816697.428202f9c79f5@mail.iaf.cnrs-gif.fr>
>> - add a new function Seq.short() for example which would behave like the
>> actual str(Seq).
>
>Actually, if this is just for programmer/debugging use, it would be
>much better for this behavior to be in repr.
personally I would not change repr, as it should returns a string which can be
used with the built-in eval(). But it is true this can make the repr string very
long.
--
Fr?d?ric Sohm
Equipe INRA U1126 "Morphogen?se du syst?me nerveux des Chord?s"
UPR 2197 DEPSN, CNRS
Institut de Neurosciences A. Fessard
1 Avenue de la Terrasse
91 198 GIF-SUR-YVETTE
FRANCE
Phone: +33 (0) 1 69 82 34 12
Fax:+33 (0) 1 69 82 34 47
From bugzilla-daemon at portal.open-bio.org Sat May 21 02:08:17 2005
From: bugzilla-daemon at portal.open-bio.org (bugzilla-daemon@portal.open-bio.org)
Date: Sat May 21 02:26:04 2005
Subject: [Biopython-dev] [Bug 1801] New: Enhanced speed for the isoelectric
point calculation in the ProteinAnalysis module
Message-ID: <200505210608.j4L68Gxn011937@portal.open-bio.org>
http://bugzilla.open-bio.org/show_bug.cgi?id=1801
Summary: Enhanced speed for the isoelectric point calculation in
the ProteinAnalysis module
Product: Biopython
Version: Not Applicable
Platform: PC
OS/Version: All
Status: NEW
Severity: enhancement
Priority: P2
Component: Main Distribution
AssignedTo: biopython-dev@biopython.org
ReportedBy: victor.hr@gmail.com
I've reworked the algorithm to calculate the isoelectric point to make it
faster. I'll include a test. The reworked algorithm twice as fast as the old one.
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
From bugzilla-daemon at portal.open-bio.org Sat May 21 02:18:30 2005
From: bugzilla-daemon at portal.open-bio.org (bugzilla-daemon@portal.open-bio.org)
Date: Sat May 21 02:26:07 2005
Subject: [Biopython-dev] [Bug 1801] Enhanced speed for the isoelectric point
calculation in the ProteinAnalysis module
Message-ID: <200505210618.j4L6IUKJ013095@portal.open-bio.org>
http://bugzilla.open-bio.org/show_bug.cgi?id=1801
------- Additional Comments From victor.hr@gmail.com 2005-05-21 02:18 -------
Created an attachment (id=213)
--> (http://bugzilla.open-bio.org/attachment.cgi?id=213&action=view)
Benchmark for the new algorithm and the old one
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
From bugzilla-daemon at portal.open-bio.org Sat May 21 02:16:59 2005
From: bugzilla-daemon at portal.open-bio.org (bugzilla-daemon@portal.open-bio.org)
Date: Sat May 21 02:26:08 2005
Subject: [Biopython-dev] [Bug 1801] Enhanced speed for the isoelectric point
calculation in the ProteinAnalysis module
Message-ID: <200505210616.j4L6Gxkg013014@portal.open-bio.org>
http://bugzilla.open-bio.org/show_bug.cgi?id=1801
------- Additional Comments From victor.hr@gmail.com 2005-05-21 02:16 -------
Created an attachment (id=212)
--> (http://bugzilla.open-bio.org/attachment.cgi?id=212&action=view)
The re-worked IsoelectricPoint module
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
From idoerg at burnham.org Sun May 29 02:13:37 2005
From: idoerg at burnham.org (Iddo Friedberg)
Date: Sun May 29 02:06:38 2005
Subject: [Biopython-dev] Re: [BioPython] Tool for Biomolecular data edition
and analysis in Python
In-Reply-To:
Message-ID:
Frederico,
This looks like a really useful tool, thanks for sharing.
One way I can see this fitting into Biopython is as an addition to the
Align module. We'll have to think exactly how.
However, I am wondering if this really fits within biopython: you have
what seems to me as a standalone program. Biopython's goal is to provide
buiding blocks for such tools. So your code is the next step: you have
constructed a house (a very fine one, may I add), Biopython provides
lumber, bricks, etc. etc.
Like I said, I'll have to look at it. But with ISMB coming up, it will
probably not be for a month. If someone else has any thoughts on the
matter, please share.
Cheers, and thanks again,
Iddo
--
Iddo Friedberg, Ph.D.
The Burnham Institute
10901 N. Torrey Pines Rd.
La Jolla, CA 92037, USA
Tel: +1 (858) 646 3100 x3516
Fax: +1 (858) 646 3171
http://ffas.ljcrf.edu/~iddo
-------------------------------------
Automated Protein Function Prediction Meeting, June 24, 2005
http://ffas.burnham.org/AFP
On Sat, 28 May 2005, Frederico Arnoldi wrote:
> Hello,
>
> My name is Frederico G. Colombo Arnoldi, I am Phd student (subject:
> Molecular Evolution) in Brazil.
> I have been developing a software for Biomolecular data edition and
> analysis in python, mainly for gene analysis. It started as a little
> tool to help me and became a more serious work. Actually this tool is
> formed by a GTK interface that allows the user to align sequences with
> Malign and Clustalw; color sequences previously aligned according
> conservation and residues characteristics; create reverse,
> reverse/complement and consensus sequences; search for conserved
> regions and determined sequences inside a bigger (like restriction
> enzymes sites and ORF's); generate alignments colored reports and
> others.
> The program and Its portal has been written in English, although,
> as anyone can see in my bad english, I'm not a native english speaker.
> I would like to know about the possibility and the interest of
> integrate it to biopython. The portal is :
> http://mpalign.incubadora.fapesp.br/portal
>
> Thanks a lot.
> Frederico
>
> _________________________________________________________________
> MSN Messenger: converse online com seus amigos .
> http://messenger.msn.com.br
>
> _______________________________________________
> BioPython mailing list - BioPython@biopython.org
> http://biopython.org/mailman/listinfo/biopython
>