[Bioperl-l] Script taxonomy2tree version 1.4 crashed on 110 species

Sendu Bala bix at sendu.me.uk
Mon Dec 18 22:50:11 UTC 2006


Chris Fields wrote:
> 
> On Dec 18, 2006, at 1:15 PM, Sendu Bala wrote:
> 
>> For example, the first 'drop' happens for the 101st species which is
>> 'Leptospira interrogans serovar Copenhageni'. This is a variation
>> (descendant) of species 24: 'Leptospira interrogans'. So when the
>> variation is added it becomes a leaf and 'Leptospira interrogans' is no
>> longer a leaf, so the overall number of leaves does not increase.
>
> Makes sense now.  I personally would consider this a bug since the 
> results are unexpected (so the docs need to be modified in order to 
> clarify).  Some say tomato...
> 
> I suppose this is one of the issues one might run into when using NCBI 
> taxonomy to build trees.

No, the tree produced is perfectly fine. After building it, the 
taxonomy2tree.pl script deliberately does:

# simple paths are contracted by removing degree one nodes
$tree->contract_linear_paths;

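It does this because that is what Gabriel's script originally did. An 
input species whose only child is another input taxon (as with 
'Leptospira interrogans' and its serovar) is a degree one node, so it 
gets spliced out of the tree.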
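
To make that concrete, here is a toy sketch in plain Perl. It is not the 
real Bio::Tree code: node(), names(), leaves() and contract() are made-up 
helpers that only mimic the behaviour described above, and the sibling 
species is a placeholder rather than one of the actual input taxa.

```perl
use strict;
use warnings;

# Toy node: { name => ..., children => [ ... ] }. NOT a Bio::Tree::Node.
sub node {
    my ($name, @children) = @_;
    return { name => $name, children => [@children] };
}

# Names of every node in the tree, parent before children.
sub names {
    my ($n) = @_;
    return ($n->{name}, map { names($_) } @{ $n->{children} });
}

# Names of the leaves only.
sub leaves {
    my ($n) = @_;
    return ($n->{name}) unless @{ $n->{children} };
    return map { leaves($_) } @{ $n->{children} };
}

# Splice out every non-root node that has exactly one child
# (a simplified imitation of contracting linear paths).
sub contract {
    my ($n, $is_root) = @_;
    my @kids = map { contract($_, 0) } @{ $n->{children} };
    return $kids[0] if !$is_root && @kids == 1;
    $n->{children} = \@kids;
    return $n;
}

my $tree = node('Leptospira',
    node('Leptospira interrogans',
        node('Leptospira interrogans serovar Copenhageni')),
    node('sibling species (placeholder)'),
);

my @before = names($tree);
contract($tree, 1);
my @after  = names($tree);
my @leaves = leaves($tree);

print "before: $_\n" for @before;
print "after:  $_\n" for @after;   # 'Leptospira interrogans' is gone
print "leaf:   $_\n" for @leaves;
```

The species is still a perfectly valid internal node before the 
contraction; it only disappears because it sits on a linear path.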


>> Gabriel, please confirm this isn't a bug, or suggest how you propose to
>> see your taxa when they are not all leaves of the tree.
> 
> Having the nodes appear internally seems semantically correct to me.  Is 
> there any other way?

I suppose if we want every input species to appear in the output, we 
have to make contract_linear_paths() aware of the nodes we want to keep, 
even when they are degree one nodes. Gabriel, is that what you want to see?
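
Roughly what I have in mind, again as a toy sketch in plain Perl rather 
than a patch: contract_keeping() and its keep-list argument are 
hypothetical, not an existing option of contract_linear_paths(), and the 
'intermediate' and 'sibling' nodes are placeholders.

```perl
use strict;
use warnings;

# Toy node: { name => ..., children => [ ... ] }. NOT a Bio::Tree::Node.
sub node {
    my ($name, @children) = @_;
    return { name => $name, children => [@children] };
}

# Names of every node in the tree, parent before children.
sub names {
    my ($n) = @_;
    return ($n->{name}, map { names($_) } @{ $n->{children} });
}

# Splice out non-root nodes with exactly one child, unless the node's
# name is in the %$keep set (hypothetical "nodes we want to keep").
sub contract_keeping {
    my ($n, $is_root, $keep) = @_;
    my @kids = map { contract_keeping($_, 0, $keep) } @{ $n->{children} };
    return $kids[0]
        if !$is_root && @kids == 1 && !exists $keep->{ $n->{name} };
    $n->{children} = \@kids;
    return $n;
}

my $tree = node('Leptospira',
    node('Leptospira interrogans',
        node('Leptospira interrogans serovar Copenhageni')),
    node('intermediate (placeholder)',
        node('sibling species (placeholder)')),
);

# The taxa the user asked for; these must survive contraction.
my %keep = map { $_ => 1 } (
    'Leptospira interrogans',
    'Leptospira interrogans serovar Copenhageni',
    'sibling species (placeholder)',
);

contract_keeping($tree, 1, \%keep);
my @kept = names($tree);
print "$_\n" for @kept;
```

With this, the requested species stays in the tree as an internal node, 
while degree one nodes nobody asked for are still removed.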



