[Bioperl-l] swiss parsing

gert thijs gert.thijs@esat.kuleuven.ac.be
Thu, 19 Sep 2002 10:56:59 +0200


Hi,

'eurosids II' wouldn't match /^[\sa-z]+$/ either.
I gues it should be
   /^[\sa-zA-Z]+$/
or is this not stringent enough?

Gert


Elia Stupka wrote:
> Hello there,
> 
> before we go ahead and commit this, just want to double-check. There are
> tons of swissprot entries that have as part of the big @class array in
> classification string names that don't start with a capital letter like
> 'eurosids II'. 
> 
> Am I right to say that the consensus still is that if there are tons of
> records breaking a rule we fix our parsers rather than say "it ain't
> correct"?
> 
> We would simply change
> 
> /^[A-Z][\sa-z]+$/
> 
> to /^[\sa-z]+$/ 
> 
> Ok?
> 
> Elia
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@bioperl.org
> http://bioperl.org/mailman/listinfo/bioperl-l
> 


-- 
+ Gert Thijs
+  K.U.Leuven
+  ESAT-SCD
+  Kasteelpark Arenberg 10
+  B-3001 Leuven-Heverlee
+  Belgium
+
+ Tel  : +32 16 32 85 88
+ Fax  : +32 16 32 19 70
+ email: gert.thijs@esat.kuleuven.ac.be
+
+  http://www.esat.kuleuven.ac.be/~thijs
+  http://www.esat.kuleuven.ac.be/~dna/BioI/
+