[Bioperl-l] Bio::SeqFeature::Annotated API change
Chris Fields
cjfields at uiuc.edu
Thu Oct 11 10:19:51 EDT 2007
On Oct 11, 2007, at 5:40 AM, Sendu Bala wrote:
> Following Chris's changes to SF::Annotated et al., lots of existing
> user
> code breaks. Also, at least some Bioperl code breaks, notably
> Bio::DB::SeqFeature::Store, which in mysql mode calls
> _get_location_and_bin() which calls $feature->seq_id which ends up
> storing something like 'Bio::Annotation::SimpleValue=HASH
> (0x1f435d0)' in
> the database, instead of an actual sequence id (which completely
> breaks
> searching by seq_id).
>
> I propose its API be changed to be more consistent with
> Bio::SeqFeatureI, eg. instead of:
>
> seq_id()
> Usage : $obj->seq_id($newval)
> Function: holds a string corresponding to the unique
> seq_id of the sequence underlying the feature
> (e.g. database accession or primary key).
> Returns : a Bio::Annotation::SimpleValue object representing the
> seq_id.
> Args : on set, some string or a Bio::Annotation::SimpleValue
> object.
>
> we have:
>
> seq_id()
> Usage : $obj->seq_id($newval)
> Function: holds a string corresponding to the unique
> seq_id of the sequence underlying the feature
> (e.g. database accession or primary key).
> Returns : string representing the seq_id.
> Args : on set, some string or a Bio::Annotation::SimpleValue
> object.
>
> This would apply to seq_id(), name(), type(), source(), phase() and
> frame(). Internally the implementation could store the string value
> in a
> SimpleValue object.
Agreed. It would be easy to change over but we need to also make
sure FeatureIO fixes are in place. In reality all FeatureIO methods
should be changed over to recognize any SeqFeatureI or (if we retain
it) the stricter TypedSeqFeatureI. Using only Bio::SF::Annotated
limits other more lightweight implementations. I simply haven't had
time to work on it yet due to $job; if you want to make the necessary
changes you are more than welcome; the few tests I found I moved into
SeqFeatAnnotated.t, which likely expects the wrong behavior.
> However, I'm obviously missing something, because I have no idea what
> the justification for returning SimpleValue objects was in the first
> place (what other module needs them?), nor even what the point of
> SimpleValue objects is in the first place.
I believe it was to ensure any data stored or retrieved was strongly
typed (i.e. scalars in SimpleValue, dbxrefs in DBLink, comments in
Comment, etc). Since B::SF::Generic is also AnnotatableI, it can
store a mix of scalars in methods as well as Bio::Annotation data;
this class attempts to lump them all together as Bio::Annotation in a
Collection in a strongly typed, uniform way. Hilmar's
Bio::SF::AnnotationAdaptor frankly does a better job of describing
the reasoning behind this and is more flexible; I use that now for
typing via feature(), though I just realized it should be changed to
be a singleton instance per class (oops!).
B::SF::A violated the SeqFeatureI interface from the get-go by
returning objects. To trick it's way around the issue it used
overloading so that calling it in some contexts (print, comparison)
returned a string or value; removing the overloads unmasked that
behavior. To me an object returned (regardless of overloading) is
still an object and not a scalar, and still violates the interface
methods where scalars are expected. I can't fault the authors
involved completely since the idea was to radically change the way
SFs/Annotation worked together, but the implementation was never
completed so I rolled it back and limited typing to B::SF::A until
something else can be worked out.
Personally I think it's too 'heavy' and other options should be
explored, such as abstracting out the type checking into a separate
utility class which FeatureIO can use on any SeqFeatureI (TypeMapper
does something like this for the primary_tag()).
chris
More information about the Bioperl-l
mailing list