[Biopython-dev] PATCH: NumPy support for BioPython

Michiel Jan Laurens de Hoon mdehoon at c2b2.columbia.edu
Tue Mar 20 16:46:52 EDT 2007


Sebastian Bassi wrote:
> On 3/14/07, Michiel Jan Laurens de Hoon <mdehoon at c2b2.columbia.edu> wrote:
>> OK, thanks. I have fixed clustermodule.c in Biopython's CVS to use
>> PyArray_ISCONTIGUOUS instead of (flags & CONTIGUOUS).
>> I'll look at this patch in more detail after the upcoming release is out.
> 
> Did the numpy instead of numeric made it into 1.43?
> 
Yes and no.

Yes):

The patch above did make it in, which means that it is now fairly easy 
to compile Biopython with Numpy instead of Numeric. All you would have 
to do is to change the #include <Numeric/arrayobject.h> statements in 
the C-code to #include <whereever/numpy/puts/its/arrayobject.h>. (Why 
the Numpy folks put arrayobject.h in a different location, I don't know; 
if they hadn't, a transition to Numpy would have been a lot easier).

But, if you're not on a 32-bits platform, Bio.Cluster will not work 
correctly. Also, some import statements will fail (e.g. from Numeric 
import * should become from numpy import *).

So, in summary, you can compile the code but you'll probably have to 
tinker with the Python code. On the bright side, most of Biopython does 
not need Numeric or Numpy, so 90% of Biopython will work.

No):

Adding numpy support is a major undertaking; it's not something I'd want 
to add one week before a release is coming out. Some tests fail with 
Numpy. While these don't seem to be major issues, it's something that 
needs to be fixed first. An additional problem is that we cannot just 
drop Numeric support, so we'll have to support both for now. Numeric is 
still needed because (a) Numpy does not compile cleanly on all major 
platforms (for example on Cygwin); (b) Other Python software relevant to 
computational biology uses Numeric; (c) Numpy's documentation costs $40; 
Numeric's free.

--Michiel.


-- 
Michiel de Hoon
Center for Computational Biology and Bioinformatics
Columbia University
1130 St Nicholas Avenue
New York, NY 10032


More information about the Biopython-dev mailing list