<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

<head>

  <title>MOBY Use Case Panel Information</title>

  <meta http-equiv="content-type"

 content="text/html; charset=ISO-8859-1">

  <meta name="author" content="Fiona Cunningham">

</head>

  <body>

<h2>Scenario: &nbsp;</h2>

<br>

A researcher is doing a microarray experiment and want to both analyse his

data as well as compare it to similar experiments in the same or other species.<br>

<br>

<hr width="10%" align="left">  

<h2>   Background Knowledge: </h2>

<p>Non-affy microarray spot data, in its most simplistic form, could be represented 

as three fields - Spot ID ("Reporter"), the label1 signal intensity, and

the label2 signal intensity. &nbsp;Generally speaking, a series of different

spots are observed in a single experiment and experiments are done over a

series (e.g. time course, or treatment regime). &nbsp;Thus a full microarray

experiment is, in its most simple case,&nbsp; a series of sets of spot data.

&nbsp;These data are generally clustered in some way, such that spots that

behave similarly, oppositely, or spots with functional similarity are grouped

together for visual inspection.<br>

  </p>

<hr width="10%" align="left"><br>

<h1>Microarray Project Use Cases </h1>

<h2>Name: &nbsp;ClusterByFunction</h2>

<p><b>Scenario Reference: above</b></p>

<p><b>Problem: </b>Send a set of microarray data to be clustered by biological

function of the reporter spot<br>

</p>

<hr width="10%" align="left">     

<h2> Primary&nbsp; Actors: </h2>

<ol>

         <li>A biologist</li>

         <li>A microaray data clustering service</li>

</ol>

<hr width="10%" align="left"> 

<h2>Other&nbsp; Actors: </h2>

 &nbsp;a client program capable of visualizing microarray data<br>

<br>

<hr width="10%" align="left"> 

<h2>Initial State and Preconditions: &nbsp;</h2>

<p>The microarray data is unclustered. </p>

<hr width="10%" align="left"> 

<h2>    End Result: &nbsp;</h2>

<p>The biologist recieves clustered data </p>

<hr width="10%" align="left"> 

<h2>  Current Workflow: </h2>

<ol>

         <li>data is imported into a local clustering tool, usually not web-based.</li>

  <li>clustered data is visualized<br>

  </li>

</ol>

          <br>

<hr width="10%" align="left"> 

<h2>    Existing Workflow Limitations: </h2>

<ol>

  <li>not integrated with any other tool</li>

  <li>available tool may not have all clustering options, or may have outdated

functional mapping</li>

  <li>existing infrastructure (often the users desktop machine) might be

slow</li>

</ol>

<hr width="10%" align="left"> 

<h2>Existing Workflow Exemplars: </h2>

<ol>

  <li>privacy - data is not transmitted over net</li>

  <li>user has access to full documentation of the algorithm or program so

they know what is happening.</li>

</ol>

<br>

<hr width="10%" align="left">

<h2>MOBY Workflow: </h2>

<ol>

  <li>available clustering services present themselves as soon as you have

microarray data in-hand</li>

  <li>clustered data is returned</li>

</ol>

<ol>

</ol>

<hr width="10%" align="left">             

<h2>MOBY Workflow Limitations: &nbsp;</h2>

<ol>

  <li>data is passed over the net and might be proprietary or e.g. medical

and thus private</li>

  <li>don't necessarily exactly know how the clustering is being done</li>

  <li>generally large volumes of data are clustered, computationally intensive,

unlikely to find "free" services<br>

  </li>

</ol>

<p><br>

</p>

<hr width="10%" align="left"> 

<h2>    MOBY Workflow Exemplars: &nbsp;</h2>

<ol>

  <li>service might have most up-to-date functional mapping</li>

  <li>several services might present themselves who have greater computational

resources than the user himself<br>

  </li>

</ol>

<p><br>

</p>

<hr width="10%" align="left"> 

<h2>     Discussion: &nbsp;</h2>

<hr width="10%" align="left"> 

<h2>     Priority:</h2>

<p>Desirable - if we can find anyone who is generous with their compute cycles

;-)<br>

<br>

</p>

<hr width="10%" align="left"> 

<h2>   Key References:        </h2>

<p><br>

<a href="%20http://www.genome.gov/page.cfm?pageID=10005336"></a></p>

<hr><br>

<br>

<h1>Microarray Project Use Cases </h1>

<h2>Name: &nbsp;FindSimilarExperiments</h2>

<p><b>Scenario Reference: above</b></p>

<p><b>Problem: </b>Discover if there are any experimental data publically

available from a similar experiment in another organism<br>

 </p>

<hr width="10%" align="left">     

<h2> Primary&nbsp; Actors: </h2>

<ol>

  <li>A biologist</li>

  <li>a public microarray data repository (e.g. SMD)</li>

</ol>

<hr width="10%" align="left"> 

<h2>Other&nbsp; Actors: </h2>

<br>

 <br>

<hr width="10%" align="left"> 

<h2>Initial State and Preconditions: &nbsp;</h2>

<p>biologist has defined (somehow) the experimental conditions of interest

- I believe much of this type of info is stored in MAGE-ML, but I also think

there are other more explicit ontologies being built to describe experimental

conditions.<br>

<br>

 </p>

<hr width="10%" align="left"> 

<h2>    End Result: &nbsp;</h2>

<p>Biologist is informed of the availability of experimental data from studies

conducted under identical or similar experimental conditions </p>

<hr width="10%" align="left"> 

<h2>  Current Workflow: </h2>

<ol>

  <li>not possible?&nbsp;</li>

</ol>

<br>

<hr width="10%" align="left"> 

<h2>    Existing Workflow Limitations: </h2>

<ol>

  <li><br>

  </li>

</ol>

<hr width="10%" align="left">

<h2>Existing Workflow Exemplars: </h2>

<ol>

  <li><br>

  </li>

</ol>

 <br>

<hr width="10%" align="left"> 

<h2>MOBY Workflow: </h2>

<ol>

  <li>an experiment-type matching service presents itself as`soon as you

have a MAGE-ML or other ontology descriptor</li>

  <li>experiment ID's are presented to the user</li>

  <li>microarray data associated with that experiment can be downloaded using

that ID</li>

  <li><br>

  </li>

</ol>

<hr width="10%" align="left">             

<h2>MOBY Workflow Limitations: &nbsp;</h2>

<ol>

  <li><br>

  </li>

</ol>

<p><br>

 </p>

<hr width="10%" align="left"> 

<h2>    MOBY Workflow Exemplars: &nbsp;</h2>

<ol>

  <li><br>

  </li>

</ol>

<p><br>

 </p>

<hr width="10%" align="left"> 

<h2>     Discussion: &nbsp;</h2>

experimental type mapping will be tricky to get right... but useful!<br>

<br>

<hr width="10%" align="left"> 

<h2>     Priority:</h2>

<p>Desirable<br>

 <br>

 </p>

<hr width="10%" align="left"> 

<h2>   Key References:        </h2>

<p><a href="http://genome-www5.stanford.edu/MicroArray/SMD/">Stanford Microarray

repository</a><br>

 </p>

<hr>

<h1><br>

</h1>

<h1>Microarray Project Use Cases </h1>

<h2>Name: &nbsp;Retrieve Results of Homologous Loci</h2>

<p><b>Scenario Reference: above</b></p>

<p><b>Problem: </b>You have clustered your genes and have found a cluster

of interest where the members of the cluster appear to behave similarly to

the experimental conditions. &nbsp;Hypothesizing that these genes are co-regulated,

you wish to know if these genes in your organism, or the homologues in other

organisms, generally behave the same way under any other experimental conditions.<br>

 </p>

<hr width="10%" align="left">     

<h2> Primary&nbsp; Actors: </h2>

<ol>

  <li>A biologist</li>

  <li>a public microarray data repository (e.g. SMD)</li>

</ol>

<hr width="10%" align="left"> 

<h2>Other&nbsp; Actors: </h2>

<ol>

  <li>a way to determine putative homologous genes - perhaps Blast, combined

with GO?<br>

  </li>

</ol>

  <br>

<hr width="10%" align="left"> 

<h2>Initial State and Preconditions: &nbsp;</h2>

<p>a set of loci names in-hand<br>

 <br>

  </p>

<hr width="10%" align="left"> 

<h2>    End Result: &nbsp;</h2>

<p>step 1: &nbsp;a set of original &amp; homologous loci, grouped by organism<br>

step 2: &nbsp;microarray data for any/selected experiments that have involved

those loci </p>

<hr width="10%" align="left"> 

<h2>  Current Workflow: </h2>

<ol>

  <li>know a priori or determine by sequence homology searching a list of

putative homologous genes from other organisms</li>

  <li>go to http://genome-www5.stanford.edu/cgi-bin/SMD/cluster/QuerySetup.pl</li>

  <li>select, one by one, various experiments and various organisms, or select

"all"</li>

  <li>at the subsequent page http://genome-www5.stanford.edu/cgi-bin/SMD/cluster/QuerySetup.pl

select arrays one by one</li>

  <li>at the subsequent page enter the gene list appropriate for that organism</li>

  <li>download the microarray data<br>

  </li>

</ol>

 <br>

<hr width="10%" align="left"> 

<h2>    Existing Workflow Limitations: </h2>

<ol>

  <li>extensive manual clicking, cutting, and pasting<br>

 </li>

</ol>

<hr width="10%" align="left"> 

<h2>Existing Workflow Exemplars: </h2>

<ol>

  <li><br>

 </li>

</ol>

  <br>

<hr width="10%" align="left"> 

<h2>MOBY Workflow: </h2>

<ol>

  <li>[numerous ways to retieve a list of homologues, either en masse, or

by organism]</li>

  <li>submit gene list to microarray service</li>

  <li>retrieve [all] microarray results for the submitted genes<br>

  </li>

  <li><br>

   </li>

</ol>

<hr width="10%" align="left">             

<h2>MOBY Workflow Limitations: &nbsp;</h2>

<ol>

  <li>potential to get more data than you bargained for if we don't have

parameters!<br>

 </li>

</ol>

<p><br>

 </p>

<hr width="10%" align="left"> 

<h2>    MOBY Workflow Exemplars: &nbsp;</h2>

<ol>

  <li><br>

 </li>

</ol>

<p><br>

 </p>

<hr width="10%" align="left"> 

<h2>     Discussion: &nbsp;</h2>

<br>

 <br>

<hr width="10%" align="left"> 

<h2>     Priority:</h2>

<p>Highly Desirable<br>

 <br>

 </p>

<hr width="10%" align="left"> 

<h2>   Key References:        </h2>

<p><a href="http://genome-www5.stanford.edu/MicroArray/SMD/">Stanford Microarray

repository</a><br>

 </p>

<hr><br>

</body>

</html>