<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8399195079684236447</id><updated>2012-01-16T20:01:43.775-08:00</updated><category term='semantic  web'/><category term='hpc'/><category term='provenance'/><category term='tools'/><category term='reviews'/><category term='bio2rdf'/><category term='life sciences'/><category term='cloud computing'/><category term='bug'/><category term='semantic web'/><category term='editorial'/><category term='use case'/><category term='representation'/><category term='principles'/><category term='storage bio2rdf'/><category term='ontology'/><category term='reasoning'/><category term='application'/><category term='snorql'/><category term='bioinformatics'/><category term='trip'/><category term='industry'/><category term='sadi'/><category term='high performance computing'/><category term='owl'/><category term='health care'/><category term='community effort'/><category term='rdf'/><category term='interview'/><category term='travel'/><category term='sparql'/><category term='evaluation'/><category term='task'/><category term='pharmaceuticals'/><category term='design'/><category term='publication'/><category term='open access'/><category term='OBO Foundry'/><category term='hcls'/><category term='maps'/><category term='sabbatical'/><category term='open review'/><category term='w3c'/><category term='training'/><category term='utility'/><category term='science'/><category term='open process'/><category term='google'/><title type='text'>Scientific Ramblings</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>18</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-1910666117999188216</id><published>2011-10-14T19:06:00.000-07:00</published><updated>2011-10-20T17:03:46.450-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='maps'/><category scheme='http://www.blogger.com/atom/ns#' term='trip'/><category scheme='http://www.blogger.com/atom/ns#' term='bug'/><category scheme='http://www.blogger.com/atom/ns#' term='google'/><title type='text'>Google Maps: break in the road bug!</title><content type='html'>&lt;br /&gt;Dear Google Maps,&lt;br /&gt;&amp;nbsp; I'm trying to plan a trip [1] in and around the Acatama Desert while i'm here in Chile, but have found an interesting bug.&amp;nbsp;Basically, there is a break in the road (between A and B) which prevents maps from using that road: see here to reproduce&amp;nbsp;http://g.co/maps/zwq8y&lt;br /&gt;&lt;br /&gt;There's also a break on the road directly east of that break point.&lt;br /&gt;&lt;br /&gt;I'd report the problem to you, if only you had the "Report a Problem" link anywhere on this page, despite your claims [2] to the contrary.&lt;br /&gt;&lt;br /&gt;Best,&lt;br /&gt;&lt;br /&gt;m.&lt;br /&gt;&lt;br /&gt;[1]&amp;nbsp;http://g.co/maps/3zwqt&lt;br /&gt;[2]&amp;nbsp;&lt;a href="http://maps.google.com/support/bin/answer.py?hl=en&amp;amp;answer=162873&amp;amp;topic=1687362"&gt;http://maps.google.com/support/bin/answer.py?hl=en&amp;amp;answer=162873&amp;amp;topic=1687362&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-1910666117999188216?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/1910666117999188216/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=1910666117999188216' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1910666117999188216'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1910666117999188216'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/10/google-maps-break-in-road-bug.html' title='Google Maps: break in the road bug!'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total><georss:featurename>Bernardo O`Higgins, 501-581, Concepción, Biobío Region, Chile</georss:featurename><georss:point>-36.8281944 -73.0513694</georss:point><georss:box>-37.6405529 -74.3147969 -36.0158359 -71.78794189999999</georss:box></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-5825132351367176396</id><published>2011-09-25T03:09:00.000-07:00</published><updated>2011-09-25T14:31:26.840-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='w3c'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='provenance'/><category scheme='http://www.blogger.com/atom/ns#' term='hcls'/><category scheme='http://www.blogger.com/atom/ns#' term='representation'/><title type='text'>Provenance: what is it and how should we formalize it?</title><content type='html'>&lt;span class="Apple-style-span" style="background-color: white; font-family: arial, sans-serif; font-size: 13px;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;div&gt;As a testament to the growing recognition of provenance for (e-)science, i'm glad to see that the &lt;a href="http://www.w3.org/2005/Incubator/prov/wiki/W3C_Provenance_Incubator_Group_Wiki"&gt;W3C incubator group&lt;/a&gt; worked hard to think about the issues and make it possible to establish a &lt;a href="http://www.w3.org/2011/01/prov-wg-charter"&gt;W3C provenance interchange Working Group&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;a good starting point:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;"provenance is often represented as metadata, but not all metadata is necessarily provenance"&lt;/div&gt;&lt;div&gt;&lt;a href="http://www.w3.org/2005/Incubator/prov/XGR-prov-20101214/#Provenance_and_Metadata" style="color: #1c51a8;" target="_blank"&gt;http://www.w3.org/2005/&lt;wbr&gt;&lt;/wbr&gt;Incubator/prov/XGR-prov-&lt;wbr&gt;&lt;/wbr&gt;20101214/#Provenance_and_&lt;wbr&gt;&lt;/wbr&gt;Metadata&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;but&lt;/div&gt;&lt;div&gt;"Descriptive metadata of a resource only becomes part of its provenance when one also specifies its relationship to deriving the resource."&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;does not provide adequate description for identifying the conditions. &amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;and&amp;nbsp;&lt;/div&gt;&lt;div class="im" style="color: #500050;"&gt;&lt;div&gt;"&lt;span style="color: darkgreen;"&gt;Provenance of a resource is a record that describes entities and processes involved in producing and delivering or otherwise influencing that resource"&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;contains elements that are undefined (record), uncertain (are processes not also entities?), narrow (producing/delivering) and broad (influencing).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Of course, I appreciate the difficulty in crafting a good definition, and I understand that this is a definition from which useful work can be achieved.&amp;nbsp;&amp;nbsp;I will take the opportunity to express my thoughts on the matter.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;i think there are two key aspects to provenance (not unlike what is suggested here:&amp;nbsp;&lt;a href="http://www.springerlink.com/content/edf0k68ccw3a22hu/" style="color: #1c51a8;" target="_blank"&gt;http://www.springerlink.&lt;wbr&gt;&lt;/wbr&gt;com/content/edf0k68ccw3a22hu/&lt;/a&gt;)&lt;/div&gt;&lt;div&gt;1. how did the resource come about? (relates to creation and justification)&lt;/div&gt;&lt;div&gt;&amp;nbsp;- important for reproducibility (which is an element of science)&lt;/div&gt;&lt;div&gt;&amp;nbsp;- includes&amp;nbsp;attribution (who created the resource),&amp;nbsp;creation (process that generated the resource),&amp;nbsp;reproduction (process in which a copy was made),&amp;nbsp;derivation (process in which the resource was generated from some resource or portion of a resource), versioning (process of keeping count of sequential derivations)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;2. what is the history of the resource (from the point of creation)&lt;/div&gt;&lt;div&gt;&amp;nbsp;- important for authenticity&amp;nbsp;&lt;/div&gt;&lt;div&gt;&amp;nbsp;- includes origin,&amp;nbsp;possession and the acts of transfer&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Both have implications for trust, and can be used for accountability, among other things.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I find this part on recommendations of a provenance framework quite nice:&lt;/div&gt;&lt;div&gt;&lt;a href="http://www.w3.org/2005/Incubator/prov/XGR-prov-20101214/#A_Roadmap_for_Provenance_on_the_Web" style="color: #1c51a8;" target="_blank"&gt;http://www.w3.org/2005/&lt;wbr&gt;&lt;/wbr&gt;Incubator/prov/XGR-prov-&lt;wbr&gt;&lt;/wbr&gt;20101214/#A_Roadmap_for_&lt;wbr&gt;&lt;/wbr&gt;Provenance_on_the_Web&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;but get less excited when i see the collection of "provenance concepts"&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;a href="http://www.w3.org/2005/Incubator/prov/XGR-prov-20101214/#Recommendations" style="color: #1c51a8;" target="_blank"&gt;http://www.w3.org/2005/&lt;wbr&gt;&lt;/wbr&gt;Incubator/prov/XGR-prov-&lt;wbr&gt;&lt;/wbr&gt;20101214/#Recommendations&lt;/a&gt;&amp;nbsp;(&lt;wbr&gt;&lt;/wbr&gt;section 4)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;particularly because we need to simplify the discourse such that we consider&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;an event (for 1 above)&amp;nbsp;&lt;/div&gt;&lt;div&gt;&amp;nbsp;- participants (and their roles; e.g. agents, targets, products)&lt;/div&gt;&lt;div&gt;&amp;nbsp;- locations&lt;/div&gt;&lt;div&gt;&amp;nbsp;- time instants (e.g. action timestamps) and durations (processual attributes)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;and a sequence of events (for both 1 and 2 above)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;this would certainly help to generate a specification with a minimal set of classes and relations to express this kind of information.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;now, i'm writing this late at night, and I appreciate that I may not have considered all the issues that the provenance group has (along with others that have written about the subject), but perhaps there is still some good discussions to be had wrt provenance and how we formally represent it, as it is of strategic importance to the HCLSIG in our current and future efforts.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-5825132351367176396?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/5825132351367176396/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=5825132351367176396' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/5825132351367176396'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/5825132351367176396'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/09/provenance-what-is-it-and-how-should-we.html' title='Provenance: what is it and how should we formalize it?'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-269726038842122247</id><published>2011-09-22T15:14:00.001-07:00</published><updated>2011-09-22T15:15:11.885-07:00</updated><title type='text'>A letter to gmail: attachments</title><content type='html'>Dear Gmail team.&lt;br /&gt;&lt;br /&gt;First, thank you for making it possible for me to see my unread mail - i sent you this idea some time ago, and i'm glad that you listened. &lt;br /&gt;&lt;br /&gt;Second, I'm now somehow at 50% of my allocated capacity, and what i need is a way to filter my mails by attachment (which i can do!), but also order them from largest to smallest (which I can't do). Once i can order attachments by size, i can start deleting the big ones and free up more room! YAY!&lt;br /&gt;&lt;br /&gt;m.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-269726038842122247?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/269726038842122247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=269726038842122247' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/269726038842122247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/269726038842122247'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/09/letter-to-gmail-attachements.html' title='A letter to gmail: attachments'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-2629337268455458370</id><published>2011-09-20T01:41:00.000-07:00</published><updated>2011-09-20T01:41:59.309-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='use case'/><category scheme='http://www.blogger.com/atom/ns#' term='ontology'/><category scheme='http://www.blogger.com/atom/ns#' term='evaluation'/><category scheme='http://www.blogger.com/atom/ns#' term='application'/><category scheme='http://www.blogger.com/atom/ns#' term='tools'/><category scheme='http://www.blogger.com/atom/ns#' term='task'/><category scheme='http://www.blogger.com/atom/ns#' term='utility'/><title type='text'>On the topic of ontology evaluation</title><content type='html'>&lt;br /&gt;&lt;div class="MsoPlainText"&gt;&amp;nbsp;With only a cursory look at the literature pertaining to the evaluation of ontologies, I already get the feeling that the current measures completely miss the point. The answer doesn't lie in the syntax (format) or structure of the ontology (the number of classes and properties, subsumed classes, axioms, etc), but rather theeffectiveness of an ontology ( a representation of knowledge ) is in whether the semantics can be used for some task. So what we really want, is to focus on the nature of the task, and whether ontology provides a competitive advantage over other technologies. &lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;Off the top of my head, here are some tasks:&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;search/browse&lt;/b&gt; (most sites using GO, etc)&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;text annotation &lt;/b&gt;(&lt;a href="http://www.gopubmed.org/"&gt;gopubmed&lt;/a&gt;)&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;data normalization &lt;/b&gt;and &lt;b&gt;structured queries&amp;nbsp;&lt;/b&gt;- &lt;a href="http://bio2rdf.org/"&gt;bio2rdf&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;answering questions &lt;/b&gt;that require background knowledge&lt;b&gt;&amp;nbsp;&lt;/b&gt;e.g. across a &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18562252"&gt;yeast database&lt;/a&gt;.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;data integration&lt;/b&gt; (heterogeneous types of data; datafrom different sources, of differing granular detail)&amp;nbsp; (see my &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21624155"&gt;translational medicine paper&lt;/a&gt;)&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;classification &lt;/b&gt;e.g. &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/15558746"&gt;domains &lt;/a&gt;or chemicals&amp;nbsp;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;- &lt;b&gt;prediction&amp;nbsp;&lt;/b&gt;e.g. predicting &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21737429"&gt;phenotypes&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoPlainText"&gt;Perhaps others can suggest some?&lt;/div&gt;&lt;div class="MsoPlainText"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-2629337268455458370?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/2629337268455458370/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=2629337268455458370' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/2629337268455458370'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/2629337268455458370'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/09/on-topic-of-ontology-evaluation.html' title='On the topic of ontology evaluation'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-6441328833758200362</id><published>2011-09-11T09:14:00.000-07:00</published><updated>2011-10-15T14:41:27.801-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='w3c'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='health care'/><category scheme='http://www.blogger.com/atom/ns#' term='rdf'/><category scheme='http://www.blogger.com/atom/ns#' term='pharmaceuticals'/><category scheme='http://www.blogger.com/atom/ns#' term='industry'/><category scheme='http://www.blogger.com/atom/ns#' term='owl'/><category scheme='http://www.blogger.com/atom/ns#' term='hcls'/><category scheme='http://www.blogger.com/atom/ns#' term='life sciences'/><title type='text'>New Charter for the W3C Health Care and Life Sciences Interest Group</title><content type='html'>Last week, the &lt;a href="http://www.w3.org/"&gt;World Wide Web Consortium (W3C)&lt;/a&gt; approved a &lt;a href="http://www.w3.org/2011/09/HCLSIGCharter"&gt;new charter&lt;/a&gt; for its &lt;a href="http://www.w3.org/2001/sw/hcls/"&gt;Health Care and Life Sciences Interest Group (HCLSIG)&lt;/a&gt;, in which &lt;a href="http://dumontierlab.com/"&gt;I&lt;/a&gt;&amp;nbsp;(&lt;a href="http://www.carleton.ca/"&gt;Carleton University&lt;/a&gt;) along with Charlie Mead (&lt;a href="https://cabig.nci.nih.gov/"&gt;NIH CBIIT&lt;/a&gt;) and Vijay Bulusu (&lt;a href="http://www.pfizer.com/"&gt;Pfizer&lt;/a&gt;) were&amp;nbsp;selected as co-chairs. This new charter directs us to develop, advocate and support the use of Semantic Web technologies for translational medicine and its three enabling domains: life sciences, clinical research and health care. While the core HCLSIG values - simplicity, pragmatism, effectiveness - remain firmly in place, Charlie, Vijay and I hope to make subtle changes to the operational strategy such that our efforts become increasingly recognized as critical in conferences and boardrooms across the globe.&lt;br /&gt;&lt;br /&gt;As always, the HCLSIG will create both prototype implementations&amp;nbsp;that demonstrate the value of formalizing and sharing knowledge using Semantic Web technologies. We will&amp;nbsp;marshal&amp;nbsp;our efforts towards fulfilling compelling &lt;a href="http://www.w3.org/2011/05/HCLSIGUseCases"&gt;use cases&lt;/a&gt; that have intrinsic value to not just &lt;a href="http://www.w3.org/Consortium/Member/List"&gt;W3C members&lt;/a&gt;, but ideally to a larger number of outside benefactors. Thus, our experts will now develop these use cases such that&amp;nbsp;&lt;i&gt;a priori&lt;/i&gt;&amp;nbsp;we have a clearer picture of the rationale of the project, its resources, milestones and deliverables, and ultimately, which organizations and communities will directly and indirectly benefit. Coupled with an effective dissemination strategy including leverage our combined social networks, we hope to maximize the impact of the work of our members in this emerging area of knowledge management.&lt;br /&gt;&lt;br /&gt;&amp;nbsp;As part of our dissemination strategy, we also intend to produce more&amp;nbsp;member&amp;nbsp;&lt;i&gt;contributions &lt;/i&gt;that describe methods&lt;i&gt;&amp;nbsp;&lt;/i&gt;for basic and advanced tasks&lt;i&gt;,&lt;/i&gt;&amp;nbsp;in addition to&amp;nbsp;publishing &lt;i&gt;recommendations&amp;nbsp;&lt;/i&gt;arising from&amp;nbsp;&lt;i&gt;consensus &lt;/i&gt;among our members. Such recommendations will endorse and specify the use of terminological resources in the long term context of semantic interoperability across the three core domains. Thus, participation in the HCLSIG will be critical for those wanting to advocate RDF-representations of data, OWL representations of ontologies, for the purposes of semantic annotation and large scale, semantic integration of biomedical data.&lt;br /&gt;&lt;br /&gt;&amp;nbsp;With that, we invite non-members to join the W3C and work with our strong compliment of experts in what will surely be an exciting and productive time over the next few years for the W3C HCLSIG.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-6441328833758200362?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/6441328833758200362/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=6441328833758200362' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/6441328833758200362'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/6441328833758200362'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/09/new-charter-for-w3c-health-care-and.html' title='New Charter for the W3C Health Care and Life Sciences Interest Group'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-8647697073328311982</id><published>2011-09-10T08:32:00.000-07:00</published><updated>2011-09-10T08:32:31.131-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='bio2rdf'/><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><category scheme='http://www.blogger.com/atom/ns#' term='community effort'/><title type='text'>Bio2RDF: moving forward</title><content type='html'>&lt;span class="Apple-style-span" style="background-color: white; font-family: arial, sans-serif; font-size: 13px;"&gt;&amp;nbsp; Last week we held our first virtual&amp;nbsp;&lt;a href="https://docs.google.com/document/d/1-1ytBhdkViCVJLfx42jaJNu0W4XVxiQqpOy-5rKT5fo/edit?hl=en_US" style="color: #1c51a8;" target="_blank"&gt;meeting&lt;/a&gt;&amp;nbsp;towards re-invigorating the Bio2RDF project with a significantly larger and vested&amp;nbsp;&lt;a href="https://docs.google.com/document/d/1XjTpUvwvBZaEJ_0zQOctj1bjx9YaSavOdPDd7oXaXD4/edit?hl=en_US" style="color: #1c51a8;" target="_blank"&gt;community&lt;/a&gt;. From discussions, we plan to establish&amp;nbsp;&lt;a href="https://docs.google.com/document/d/1s1TagsANHRnutnJ-9EsDxkG6HqFt4jfXi7SYpHlswys/edit?hl=en_US" style="color: #1c51a8;" target="_blank"&gt;3 focus groups&amp;nbsp;&lt;/a&gt;around :&lt;/span&gt;&lt;span class="Apple-style-span" style="background-color: white; font-family: arial, sans-serif; font-size: 13px;"&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A. policy (information, governance, sustainability, outreach)&lt;/div&gt;&lt;div&gt;B. technical (architecture, infrastructure and RDFization)&lt;/div&gt;&lt;div&gt;C. social (user experience and social networking)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The next step then is for groups to:&lt;/div&gt;&lt;div&gt;1. identify and certify discussion leads (responsibilities: set meeting times and agenda, facilitate and encourage discussion among members, draft reports)&lt;/div&gt;&lt;div&gt;2. identify additional people to recruit from the wider community that would provide additional expertise (interested, but didn't attend the first? sign up now !)&lt;/div&gt;&lt;div&gt;3. extend and prioritize discussion items (what exactly will this group focus its efforts on in the short and long term)&lt;/div&gt;&lt;div&gt;4. identify and assign bite-sized tasks (so we can get things done one step at a time :)&lt;/div&gt;&lt;div&gt;5. collate results and present to the wider community&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I suggest that groups self-organize a first meeting in the next two weeks to deal with items 1-4, and either meet again or use the Google documents to collaboratively report findings.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Finally, I'd like for us to hold another meeting with times that are much more accommodating for Europe + North America ;) &amp;nbsp;Please fill the doodle poll (&lt;a href="http://www.doodle.com/fsuz6mgs5cztf2e2" style="color: #1c51a8;" target="_blank"&gt;http://www.doodle.com/&lt;wbr&gt;&lt;/wbr&gt;fsuz6mgs5cztf2e2&lt;/a&gt;)&lt;/div&gt;&lt;div&gt;&amp;nbsp;&lt;/div&gt;&lt;div&gt;As always, feel free to contact me if you have any questions, and please sign up to the&amp;nbsp;&lt;a href="https://groups.google.com/group/bio2rdf/" style="color: #1c51a8;" target="_blank"&gt;Bio2RDF mailing list&lt;/a&gt;&amp;nbsp;for all future discussions.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-8647697073328311982?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/8647697073328311982/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=8647697073328311982' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/8647697073328311982'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/8647697073328311982'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/09/bio2rdf-moving-forward.html' title='Bio2RDF: moving forward'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-4078447138305612092</id><published>2011-09-04T03:32:00.000-07:00</published><updated>2011-11-03T07:38:07.048-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reviews'/><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><category scheme='http://www.blogger.com/atom/ns#' term='publication'/><category scheme='http://www.blogger.com/atom/ns#' term='open process'/><category scheme='http://www.blogger.com/atom/ns#' term='editorial'/><category scheme='http://www.blogger.com/atom/ns#' term='science'/><category scheme='http://www.blogger.com/atom/ns#' term='open review'/><title type='text'>Scientific Publishing: We're not quite there yet...</title><content type='html'>I recall how discussions with &lt;a href="http://www.pascal-hitzler.de/"&gt;Pascal Hitzler&lt;/a&gt; at &lt;a href="http://iswc2008.semanticweb.org/"&gt;ISWC2008 in Karlsruhe&lt;/a&gt; eventually led to the creation of a new journal: &lt;a href="http://www.semantic-web-journal.net/"&gt;Semantic Web&lt;/a&gt;&amp;nbsp;in 2009. It had most of the elements I wanted :&lt;br /&gt;&lt;div&gt;&amp;nbsp;1- immediately availability of the manuscript&lt;/div&gt;&lt;div&gt;&amp;nbsp;2- a transparent review process&lt;/div&gt;&lt;div&gt;&amp;nbsp;3- a high standard of quality.&lt;/div&gt;&lt;div&gt;&amp;nbsp;4- low cost of publication&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&amp;nbsp;The immediately availability of the manuscript is important because publishing in a peer-reviewed journals can often take more than 1 year - so this is problematic when making claims about who did what and when. A transparent review process is important to create an incentive to carefully formulate constructive reviews and weed out spurious reviews dealt under the veil of&amp;nbsp;anonymity. In addition, we can establish the reviewer quality and reliability - important when asking for future review requests. Another other other aspect of open review is to enable the acknowledgement of reviewer contributions to improving the quality of the manuscript. In this respect, Semantic Web is stellar - it acknowledges the editors and the reviewers on the front page of the published manuscript.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&amp;nbsp; One concern was whether reviewers would be sufficiently forthcoming about the failings of a paper such that it lead to the strong rejection of the manuscript. From all indications (see the paper that Pascal and Krzysztof wrote&amp;nbsp;&lt;a href="http://t.co/twDASAN9"&gt;http://t.co/twDASAN9&lt;/a&gt;), the journal not only &lt;a href="http://www.semantic-web-journal.net/reviewed"&gt;rejects a significant number of manuscripts&lt;/a&gt; that don't meet expectations for publication, but also does so in a constructive manner such that it invites authors to submit a revised manuscript with indicating how they have addressed the reviewer's comments. &amp;nbsp;Indeed, this aspect of open review also means that it authors attempt to shop the failed manuscript to other journals, a simple web search should uncover the reviews associated with it. This saves precious editor and reviewer time and really pushes the authors to make substantial changes.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&amp;nbsp;While this is all good, the major point of content is the business model. From any which way I look at it, I get the feeling that the tables are turned upside down. First you have the authors creating intellectual content, which they submit to a journal, and is reviewed by people who don't get paid to do so. The journal then turns around and tells the authors that if they want it freely available to the public, then the authors should cough up the money. This is ridiculous. Even a simple advertising based model could easily make a return on investment for articles of increasing impact. The other aspect is that the publisher needs to cover the costs of a print publication, but really, who reads print? I haven't for over 10 years now! &amp;nbsp;In the case of the Journal, they exact a cost in typesetting, but we're used to doing this for our workshop and conference submissions - so we really don't need people to do this anymore. It would be nice to see a break down of costs for digital publishing systems today (perhaps you can point us to one).&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&amp;nbsp;So what is the business model of the modern scientific publisher? &amp;nbsp;Well, i think it lies in aggregating and creating new content, whether editorials or comprehensive reviews, which give readers a viewpoint or summary of where the field is heading. &amp;nbsp;Perhaps low-cost subscriptions (e.g. $100/yr) or pay per view ($5), one could recoup costs if the work is sufficiently meritorious. It's definitely time to think about the next evolution in scientific publishing.&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-4078447138305612092?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/4078447138305612092/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=4078447138305612092' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/4078447138305612092'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/4078447138305612092'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/09/scientific-publishing-were-not-quite.html' title='Scientific Publishing: We&apos;re not quite there yet...'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-5231820765654552384</id><published>2011-07-13T16:01:00.000-07:00</published><updated>2011-07-13T16:01:18.398-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='interview'/><category scheme='http://www.blogger.com/atom/ns#' term='sabbatical'/><title type='text'>Sabbatical Interview</title><content type='html'>Earlier this year I was interviewed about my sabbatical plans, and we did a photo shoot as well. Have a look at the article, which includes my rabbits featured on the monitor screen :)&lt;br /&gt;&lt;br /&gt;&lt;a href="http://cualumni.carleton.ca/magazine/summer-2011/parting-shots/"&gt;http://cualumni.carleton.ca/magazine/summer-2011/parting-shots/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-5231820765654552384?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/5231820765654552384/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=5231820765654552384' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/5231820765654552384'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/5231820765654552384'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/07/sabbatical-interview.html' title='Sabbatical Interview'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-2924820680611975086</id><published>2011-07-04T03:29:00.000-07:00</published><updated>2012-01-15T15:49:34.180-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='travel'/><category scheme='http://www.blogger.com/atom/ns#' term='sabbatical'/><title type='text'>Breaking Down the Sabbatical</title><content type='html'>It's hard to believe that it's already been 7 years since I started at Carleton University. I remember thinking about going on sabbatical - a major feature of becoming a University professor. Well, now the time has come, and along with the &lt;a href="http://micheldumontier.blogspot.com/2011/07/sabbatical-2011-2012-formalizing.html"&gt;proposal&amp;nbsp;&lt;/a&gt;, I've scheduled quite a bit of travel to visit with colleagues and to take some well deserved holiday time. So here's the tentative schedule&lt;br /&gt;&lt;br /&gt;July 3-14: Cambridge, UK (&lt;a href="http://www.ebi.ac.uk/"&gt;EBI &lt;/a&gt;- &lt;a href="http://www.ebi.ac.uk/Rebholz/"&gt;Dietrich Rebholz-Schuhmann&lt;/a&gt;, &lt;a href="http://www.cam.ac.uk/"&gt;University of Cambridge&lt;/a&gt; &amp;nbsp;- &lt;a href="http://leechuck.de/"&gt;Robert Hoehndorf&lt;/a&gt;)&lt;br /&gt;July 14-18: Vienna, Austria (&lt;a href="http://www.bio-ontologies.org.uk/"&gt;Bio-ontologies&lt;/a&gt;, &lt;a href="http://www.iscb.org/ismbeccb2011-program/tutorials#am1"&gt;ISMB Tutorial&lt;/a&gt;)&lt;br /&gt;July 18-August 7: Malta+Italy (Valletta,&amp;nbsp;Catania, Salerno,&amp;nbsp;Pompeii, Rome,&amp;nbsp;Florence,&amp;nbsp;Venice,&amp;nbsp;Bologna,&amp;nbsp;Pisa)&lt;br /&gt;August 7-10: Finland (Tempere, Helsinki)&lt;br /&gt;August 10-18: Iceland&lt;br /&gt;August 18-29: Kyoto + Tokyo, Japan (&lt;a href="http://2011.biohackathon.org/"&gt;Biohackathon 2011&lt;/a&gt;)&lt;br /&gt;August 29-September 3: Madrid (&lt;a href="http://www.oeg-upm.net/"&gt;Ontology Engineering Group&lt;/a&gt; : &lt;a href="http://alexdeleon.name/"&gt;Alexander De Leon&lt;/a&gt;)&lt;br /&gt;September 3-7: Heidelberg, Germany (&lt;a href="http://co.mbine.org/events/COMBINE_2011"&gt;COMBINE&lt;/a&gt;)&lt;br /&gt;September 7-12: Ottawa (defense: Leonid Chepelev -&amp;gt; success!)&lt;br /&gt;September 13-17: Nancy, France (INRIA/LORIA: &lt;a href="http://www.loria.fr/~coulet/Welcome.html"&gt;Adrien Coulet&lt;/a&gt;)&lt;br /&gt;September 19: Volendam (&lt;a href="http://www.openphacts.org/ops_workshops.htm"&gt;OpenPHACTS/Gen2Phen meeting on open data&lt;/a&gt;)&lt;br /&gt;September 17-October 1: Paris, St Malo, Bordeaux, London (travel with parents)&lt;br /&gt;October 1-11: Ottawa, Canada (defense: Natalia Villaneuva-Rosales -&amp;gt; success!)&lt;br /&gt;October 11-December 1: Conception, Chile (&lt;a href="http://www.udec.cl/pexterno/"&gt;Universite de Conception &lt;/a&gt;:&amp;nbsp;&lt;a href="http://www.inf.udec.cl/~leo/"&gt;Leo Ferres&lt;/a&gt;)&lt;br /&gt;December 1-December 23: Santiago, San Pedro de Atacama, Mendoza, Buenos Aires, Colonia, Punta del Diablo and Montevideo)&lt;br /&gt;December 24-January 31: Toronto, Ottawa&lt;br /&gt;February - March: Stanford university (&lt;a href="http://bmir.stanford.edu/people/view.php/nigam_h_shah"&gt;Nigam Shah&lt;/a&gt;, &lt;a href="http://bmir.stanford.edu/people/view.php/mark_a_musen/"&gt;Mark Musen&lt;/a&gt;)&lt;br /&gt;April - June : India, Nepal, Bhutan, Thailand, Singapore ?&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-2924820680611975086?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/2924820680611975086/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=2924820680611975086' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/2924820680611975086'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/2924820680611975086'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/07/breaking-down-sabbatical.html' title='Breaking Down the Sabbatical'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-3579884643368799078</id><published>2011-07-04T02:59:00.000-07:00</published><updated>2011-07-04T02:59:20.443-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ontology'/><category scheme='http://www.blogger.com/atom/ns#' term='reasoning'/><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='sadi'/><category scheme='http://www.blogger.com/atom/ns#' term='sabbatical'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic  web'/><title type='text'>Sabbatical 2011-2012: Formalizing Scientific Discourse</title><content type='html'>&lt;b&gt;Objective: &lt;/b&gt;The goal of my research program is to enable biologists to compose and evaluate scientific hypotheses using a diverse set of informational sources (ontology, database, text, equations, and web services). The purpose of my 2011-2012 sabbatical is to develop expertise in formalizing scientific discourse, with a particular focus on formalizing textual descriptions and mathematical equations such that they interoperate with knowledge represented in databases and structured documents. In particular, I am interested in using&amp;nbsp;high quality facts derived from text and dynamic computation from formalized equations to answer questions and provide evidence for scientific hypotheses.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Background:&amp;nbsp;&lt;/b&gt;Advancing knowledge in the life sciences involves experimentally testing hypotheses and interpreting the results based on prior scientific work. In generating a valid hypothesis, biologists face the overwhelming challenge of collecting, evaluating and integrating large and increasing amounts of different kinds of information about organisms, cells, genes and proteins from thousands of articles, hundreds of databases and dozens of tools. A biologist’s ability to efficiently construct and evaluate a hypothesis over current knowledge requires that i) knowledge, data and hypotheses are formally represented so they may be reasoned about, ii) adequate software exists to manage and query formal knowledge, and iii) data can be obtained by searching relevant databases and invoking the right analytical tools. The inability to efficiently discover relevant information can negatively impact scientific research directions and proposed activities. Methods for facilitating the construction and evaluation of hypotheses against the current state of knowledge could translate into greater scientific insight and increased productivity. Innovative approaches for knowledge discovery could be applied to data on the emerging Semantic Web and be transformative on a global scale.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Proposed Activities&lt;/b&gt;&lt;br /&gt;1.&amp;nbsp;&lt;b&gt;Text to triples&lt;/b&gt;: The purpose of this leg of the sabbatical is to gain an understanding of the current state of the art in natural language processing and develop skill in producing high quality triples from parsing scientific text.&lt;br /&gt;Time Frame: July 2011-September 2011&lt;br /&gt;Location: European Bioinformatics Institute, Hinxton, UK.&lt;br /&gt;Host: Dr. Rebholz-Schuhmann&lt;br /&gt;&lt;br /&gt;2.&amp;nbsp;&lt;b&gt;Formalizing equations&lt;/b&gt;: The purpose of this leg of the sabbatical is to investigate the ontology of equations, represent scientific equations using Semantic Web technologies (principally the Rule Interchange Format), and implement semantic web services that serve to compute over formalized scientific equations.&lt;br /&gt;Time Frame: October 2011-December 2011&lt;br /&gt;Location: Universidad de Concepcion, Concepcion, Chile.&lt;br /&gt;Host: Dr. Leo Ferres&lt;br /&gt;&lt;br /&gt;3.&amp;nbsp;&lt;b&gt;Formalizing Research Hypotheses&lt;/b&gt;: The purpose of this leg of the sabbatical is to explore the formalization of hypotheses concerning disease. Specifically, I will extract meaningful facts from AlzForum and integrate these with resources from the National Centre for BioOntology (NCBO) and Bio2RDF, our large scale Semantic Web project.&lt;br /&gt;&lt;br /&gt;Time Frame: January 2012-March 2012&lt;br /&gt;Location: Stanford University, Palo Alto, California, USA.&lt;br /&gt;Host: Dr. Mark Musen&lt;br /&gt;&lt;br /&gt;4. &lt;b&gt;Integrated&amp;nbsp;Framework for&amp;nbsp;Knowledge Discovery: &lt;/b&gt;The last leg of the sabbatical will be focused towards maximizing interoperability between text, equations, ontologies and database-derived facts. I will use SADI, our platform semantic web services framework, towards achieving this objective.&lt;br /&gt;Time Frame: April 2012-June 2012&lt;br /&gt;Location: India, Thailand, Singapore&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Scientific Value and Broader Beneficial Impacts&amp;nbsp;&lt;/b&gt;&lt;br /&gt;The development and application of efficient strategies for knowledge discovery is a major goal in bioinformatics. My research into new strategies for the representation and evaluation of scientific hypotheses using ontologies, scientific text, data and bioinformatic services will create a novel platform that will significantly contribute to scientific productivity and ultimately improve our understanding of biology. The proposed sabbatical will provide me with new skills that will be used to train a future training of young scientists in the areas of formal knowledge representation, text mining and the Semantic Web. Ultimately, it is expected that the sabbatical will cultivate new partnerships with leading scientists and open new doors to work with industry and government agencies.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-3579884643368799078?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/3579884643368799078/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=3579884643368799078' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/3579884643368799078'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/3579884643368799078'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2011/07/sabbatical-2011-2012-formalizing.html' title='Sabbatical 2011-2012: Formalizing Scientific Discourse'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-1196145317256664554</id><published>2010-06-04T06:59:00.000-07:00</published><updated>2010-06-04T07:03:40.662-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='sadi'/><title type='text'>SADI</title><content type='html'>&lt;p class="MsoPlainText"&gt;(modified from an email that Mark Wilkinson sent)&lt;/p&gt;&lt;p class="MsoPlainText"&gt;SADI is a very lightweight "standard" (set of best-practices, really) for modeling and providing Web Services.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It uses standards from the W3C Semantic Web initiative - in particular, it uses OWL for types, and RDF for instance data.&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;&lt;o:p&gt;SADI is used to expose "resources" to the world in a manner that can be discovered automatically, and accessed automatically, by software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Those resources might be data inside databases (where SADI replaces the traditional Web Query page), or they might be analytical algorithms that consume data, chug away on it, and return output data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In both cases, the interfaces are structurally identical, so from the perspective of the client software, it doesn't have to know or care whether it is trying to get data out of a database or out of an analytical tool - the question/query structure is the same, and moreover, it is completely predictable.&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;This is critical advantage #1 for SADI over traditional Web Services frameworks - in traditional XML-based Web Services, you still must code your client software to access each service, since the service interfaces cannot be interpreted by the machine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In SADI, we can design ONE piece of software to access all resources exposed as SADI services - "one ring to rule them all!". &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;(and we already have several different "rings" that expose SADI data in different ways)&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;Critical advantage #2 is a bit more obscure and hard to describe, but is likely to be the more important in the long-run.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In SADI, data is "grounded" in explicit semantics.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This means that all data in SADI carries with it information about what TYPE of data it is, and how that data relates to other data (e.g. genes transcribed into transcripts translated into proteins which regulate genes:&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Gene, Transcript, Protein are all data types, and "transcribed", "translated", "regulate" are relationships between them).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With this explicit (and extensive!) grounding in semantics, we can start asking our machines to do a lot of the interpretation for us.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For example, "what gene regulates gene X" is a nonsensical question biologically, but it's a question that biologists ask all the time!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With a solid grounding in semantics, the machine would be able to follow the logical pathway above and say "well, to answer that question, I am going to have to go through transcripts and proteins to get there" and then automatically construct the pipeline of services that get to the answer.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is just one example of how Semantics can be used to facilitate question-answering.&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;There are several tutorials available.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt; for what it can do:&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;a href="http://www.slideshare.net/markmoby/sadi-swsip-09"&gt;http://www.slideshare.net/markmoby/sadi-swsip-09&lt;/a&gt;&lt;span style="mso-spacerun:yes"&gt;     &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt; then go to &lt;a href="http://sadiframework.org"&gt;http://sadiframework.org&lt;/a&gt; to find the more specific tutorials on how to deploy services.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoPlainText"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;The current list of available services is at &lt;a href="http://sadiframework.org/registry/services/"&gt;http://sadiframework.org/registry/services/&lt;/a&gt;&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;and that list will be growing rapidly over the next year (we have committed to having at least 400 more services, but I suspect that we'll go far beyond that number!)&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-1196145317256664554?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/1196145317256664554/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=1196145317256664554' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1196145317256664554'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1196145317256664554'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2010/06/sadi.html' title='SADI'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-1721986136307918047</id><published>2010-05-07T12:40:00.000-07:00</published><updated>2010-05-07T13:03:03.570-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sparql'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='snorql'/><title type='text'>Getting SNORQL to work with Virtuoso</title><content type='html'>&lt;div&gt;&lt;div&gt;&lt;div&gt;&lt;div&gt;SNORQL is an AJAX SPARQL browser that makes it easy to i) see if your queries work and ii) navigate your linked data. SNORQL comes packaged with &lt;a href="http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/"&gt;D2R server&lt;/a&gt;, but one has to make a few modifications to make it work when a) installed in a directory or port that is different than the SPARQL endpoint.&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;You need 3 things to do the following to make SNORQL work with some host-located endpoint:&lt;/div&gt;&lt;div&gt;1. Download and install SNORQL. SNORQL comes as part of the &lt;a href="http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/"&gt;D2R distribution&lt;/a&gt;. Download this and extract the snorql folder from the webapps director into some folder on your host, preferably one that is already accessible by the web server (e.g. in the htdocs directory). If you want to put SNORQL in a folder different than that, you must add an entry to the http.conf file.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;Alias /snorql /usr/var/snorql&lt;/div&gt;&lt;div&gt;&lt;directory&gt;&lt;/directory&gt;&lt;/div&gt;&lt;div&gt;&amp;lt;Directory "/usr/var/snorql"&amp;gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt; Options None &lt;/div&gt;&lt;div&gt; AllowOverride None &lt;/div&gt;&lt;div&gt; Order allow,deny&lt;/div&gt;&lt;div&gt; Allow from all&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&amp;lt;/Directory&amp;gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;2. Configure the Apache server as a proxy to the endpoint. If the port of the apache server and the endpoint are different, you need to make them appear the same for the AJAX to work. Add this to your http.conf file&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;ProxyRequests Off&lt;/div&gt;&lt;div&gt;&amp;lt;Proxy *&amp;gt;&lt;/div&gt;&lt;div&gt;    Order deny,allow&lt;/div&gt;&lt;div&gt;    Allow from all&lt;span class="Apple-tab-span" style="white-space: pre; "&gt; &lt;/span&gt;&lt;/div&gt;&lt;div&gt;&amp;lt;/Proxy&amp;gt;&lt;/div&gt;&lt;div&gt;ProxyPass /sparql http://localhost:8890/sparql &lt;/div&gt;&lt;div&gt;ProxyPassReverse /sparql http://localhost:8890/sparql&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;3. Configure SNORQL to use the endpoint.&lt;/div&gt;&lt;div&gt;Edit the snorql.js file and replace&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;this._endpoint = document.location.href.match(/^([^?]*)snorql\//)[1] + 'sparql';&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;with &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;this._endpoint = 'http://localhost/sparql';&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;4. Open your browser to the SNORQL URL (e.g. http://localhost/snorql) to query and navigate the results :-)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-1721986136307918047?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/1721986136307918047/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=1721986136307918047' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1721986136307918047'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1721986136307918047'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2010/05/getting-snorql-to-work-with-virtuoso.html' title='Getting SNORQL to work with Virtuoso'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-5557661969223073634</id><published>2010-04-27T10:48:00.000-07:00</published><updated>2010-04-27T10:52:56.550-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hpc'/><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='high performance computing'/><category scheme='http://www.blogger.com/atom/ns#' term='training'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><title type='text'>Compute Canada and the future of HPC computing</title><content type='html'>&lt;p class="MsoPlainText"&gt;Compute Canada is hosting a series of &lt;a href="https://computecanada.org/?mod=cms&amp;amp;pageId=898&amp;amp;lang=EN&amp;amp;"&gt;town hall meetings to discuss the future of high performance computing in Canada&lt;/a&gt;. Here are some thoughts:&lt;/p&gt;&lt;p class="MsoPlainText"&gt;In order to increase Canada’s HPC capability and make them more relevant for today’s scientific computing needs, it will have to embrace new computing models.&lt;/p&gt;&lt;p class="MsoPlainText"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;The next generation in computing is &lt;a href="http://en.wikipedia.org/wiki/Cloud_computing"&gt;cloud computing&lt;/a&gt;. Compute Canada should embrace this model as part of its service offering such that researchers can use the cloud across Compute Canada infrastructure. Importantly, it must be possible for researchers to grow their cloud from local private clouds (we already have one setup in our lab), into the Compute Canada cloud and ultimately into commercial clouds (such as &lt;a href="http://aws.amazon.com/ec2/"&gt;Amazon EC2&lt;/a&gt;), if necessary. Compute Canada also needs to invest in data storage, and create the means by which such storage may be accessed using data access standards (e.g. &lt;a href="http://aws.amazon.com/s3/"&gt;Amazon S3&lt;/a&gt;) and provisioned through networks (e.g. &lt;a href="http://canarie.ca"&gt;CANARIE&lt;/a&gt; &lt;-&gt; university &lt;-&gt; commodity networks).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Compute Canada should endeavor to use open standards and ensure interoperability for any deployment.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoPlainText"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;&lt;o:p&gt;A major issue with Canada’s HPC centers is that we cannot currently (AFAIK) host public services on them. As a bioinformatician, it would be invaluable for me to have Compute Canada host a server using my image, thereby ensuring scalable capacity and continuity for the software tool (as reported in publications). Importantly, some of our services require on-the-fly compute resources to accomplish their task, and it would be ideal if we could setup asynchronous services that use cloud facilities to compute, and then provide a link to users as to where they can find their results (stored on an S3 compatible store).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Being able to do this would be a game changer for bioinformatics (and I suspect in other computing fields), and would create a new paradigm for open service provisioning across the world.&lt;/o:p&gt;&lt;/p&gt;&lt;p class="MsoPlainText"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;&lt;o:p&gt; While full fledged cloud computing software is currently available as both open source and commercial solutions, this would present an opportunity to acquire such software 'en masse' to get started, while also creating new capacity in developing cloud computing solutions. I can imagine training my 4th year project/honours students to develop bioinformatic software using Compute Canada’s compute/storage resources – but not until we see cloud computing.&lt;/o:p&gt;&lt;/p&gt;&lt;p class="MsoPlainText"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-5557661969223073634?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/5557661969223073634/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=5557661969223073634' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/5557661969223073634'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/5557661969223073634'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2010/04/compute-canada-and-future-of-hpc.html' title='Compute Canada and the future of HPC computing'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-1696970236435627544</id><published>2009-07-22T07:32:00.001-07:00</published><updated>2009-07-22T07:34:02.488-07:00</updated><title type='text'>The trouble with ontology is ... the lack of "shared understanding"</title><content type='html'>&lt;div&gt;This kind of analysis goes to the heart of one of the problems i have with monolithic enterprises that trivially assume agreement will come from some shared understanding of "reality". Nothing could be further from reality:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;a href="http://www.nytimes.com/2009/07/21/science/21angier.html?_r=1&amp;amp;ref=science"&gt;http://www.nytimes.com/2009/07/21/science/21angier.html?_r=1&amp;amp;ref=science&lt;/a&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-1696970236435627544?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/1696970236435627544/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=1696970236435627544' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1696970236435627544'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/1696970236435627544'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2009/07/trouble-with-ontology-is-lack-of-shared.html' title='The trouble with ontology is ... the lack of &quot;shared understanding&quot;'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-2917357042038808865</id><published>2009-06-08T17:36:00.001-07:00</published><updated>2009-06-08T17:36:42.169-07:00</updated><title type='text'>Setting out clear Ontology Design Principles</title><content type='html'>I've documented them here:&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://code.google.com/p/semanticscience/wiki/ODP"&gt;http://code.google.com/p/semanticscience/wiki/ODP&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-2917357042038808865?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/2917357042038808865/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=2917357042038808865' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/2917357042038808865'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/2917357042038808865'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2009/06/setting-out-clear-ontology-design.html' title='Setting out clear Ontology Design Principles'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-3718753577038830723</id><published>2009-05-22T10:01:00.000-07:00</published><updated>2009-05-22T10:04:54.112-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ontology'/><category scheme='http://www.blogger.com/atom/ns#' term='OBO Foundry'/><category scheme='http://www.blogger.com/atom/ns#' term='principles'/><category scheme='http://www.blogger.com/atom/ns#' term='design'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic  web'/><title type='text'>Critique of OBO Foundry Principles</title><content type='html'>&lt;p class="MsoNormal"&gt;The OBO Foundry aims to create a suite of orthogonal interoperable reference ontologies in the biomedical domain. They have outlined their principles here: &lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.obofoundry.org/wiki/index.php/OBO_Foundry_Principles"&gt;http://www.obofoundry.org/wiki/index.php/OBO_Foundry_Principles&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In reading some of these I found that they poorly expressed true principles in ontology design. I provide here a brief critique on some of the contentious points:&lt;/p&gt;  &lt;p class="MsoNormal"&gt;"3. The ontologies possesses a unique identifier space within the OBO Foundry. The identifier uniquely and persistently identifies a definition, which itself unambiguous identifies some type of biological entity. The identifier is for the definition: it is NOT the name and it is NOT an identifier for the name.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;There are systems that use alphanumeric id's - eg MetaCyc. This should be dis-encouraged, especially as these have semantic content."&lt;/p&gt;  &lt;p class="MsoNormal"&gt;This mixes up a number of issues. An identifier is a symbol for an entity, which should guarantee uniqueness in the lexical space, unlike human readable names which are not required to be unique. So it doesn’t matter whether the identifier is numeric, alphanumeric or alphabetic and thus the latter part of this principle, referring to alphanumeric MetaCyc ids, is pure nonsense. It is the description of the entity that *matters*, and that the textual description is arguably unchanging (What does OBOF say about when a description changes by even one word? Should a new identifier be crafted? How does one assess whether the previous identifier is in fact compatible with the new one? Should one be directed to use the new identifier – is it possible that the semantics are *&lt;b&gt;fundamentally&lt;/b&gt;* different? These are far more important questions to address)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;"6. The ontology must be orthogonal to other ontologies already lodged within OBO. For each domain, there should be convergence upon a single reference ontology that is recommended for use by those who wish to become involved with the Foundry initiative"&lt;/p&gt;  &lt;p class="MsoNormal"&gt;This is a contestable claim. Given that there is no universal agreement on many biological terms, any given ontology will not necessarily capture the semantics of what one wants to express. Anyone familiar with the word "gene" can easily demonstrate this as a case in point. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;"7. The ontologies include textual definitions for all terms."&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Textual descriptions aren't really useful unless they succinctly capture the essence of the entity in question.  For instance, definitions in the (OWL version) BFO are incomprehensible to many people (certainly to my undergrad students). In many other cases the textual descriptions can be shown to be either overly vague or constraining in unrealistic ways.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Einstein said "Make everything as simple as possible, but not simpler" - a good mantra in crafting term descriptions is "Be as accurate as possible, while not adding superfluous information or imposing unnecessary constraints.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;"9. The ontology is well documented."&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Be more specific - What does "well documented" mean?&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;"10. The ontology has a plurality of independent users."&lt;/p&gt;  &lt;p class="MsoNormal"&gt;This is another unreasonable demand. The defining characteristic is that for every ontology, there exists requirements (possibly in the form of use cases) that the ontology can be demonstrated to satisfy.  Ultimately, an ontology should have demonstrated utility. Paraphrasing Salinger - If you build it, (and have shown it to be useful) they will come.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;"11. The ontology will be developed collaboratively with other OBO Foundry members."&lt;/p&gt;  &lt;p class="MsoNormal"&gt;A long standing myth is that ontologies need to be developed collaboratively - but in fact, we have found that such an approach is in fact wholly unproductive. What is productive is collecting use cases, undertaking focused development, and conducting a peer review and refinement process in which the needs of the community can be publicly solicited and addressed. This kind of procedure is in place at the W3C, and results in high quality standards. The OBO Foundry should consider setting up such a facility, with open calls for review across all relevant mailing lists, including quality assessment, additions/removals etc - particularly before things get published as a so-called "standard"&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-3718753577038830723?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/3718753577038830723/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=3718753577038830723' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/3718753577038830723'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/3718753577038830723'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2009/05/critique-of-obo-foundry-principles.html' title='Critique of OBO Foundry Principles'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-4205291226371931599</id><published>2009-04-06T13:55:00.000-07:00</published><updated>2009-04-06T13:57:25.745-07:00</updated><title type='text'>ChemAxon - Chemical Ontology</title><content type='html'>&lt;div&gt;Posted at: &lt;a href="http://wwmm.ch.cam.ac.uk/blogs/adams/?p=195"&gt;http://wwmm.ch.cam.ac.uk/blogs/adams/?p=195&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Making the distinction between a substance and a molecule is indeed important and valuable from an ontological perspective, particularly when it comes to reasoning about the domain. The distinction is often blurred simply because it is more pragmatic not to consider them different. Indeed, most chemists might agree that while there is a conceptual distinction, they don’t want to navigate through a set of high-brow concepts to find such simple (indirect) relationships. Have you considered the impact of representing knowledge in this way with respect to usability?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another important issue is that of identity - you are making the argument that every different feature effectively warrants a different identifier. That if i want to make a statement about glucose in one form versus another, i need two different identifiers. This will lead to enormous numbers of different “concepts”, which may affect reasoning capability (especially in OWL!) and potentially also lead to sparsely populated knowledge bases. An alternative is to capture the semantics of non-structural features in relations to the main component. For instance, I could reuse the “Glucose” class by adding additional restrictions, in the context of some process or experimental result. e.g. in my experiment i found glucose to be in its chair form. Indeed, expressing the behaviour (or structural conformation in this case) wrt to the context leads itself to modular reuse and also improves our representation of knowledge.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A couple of pointers that you might find interesting:&lt;/div&gt;&lt;div&gt;1 - Contextual knowledge representation - &lt;a href="http://dumontierlab.com/pdf/2008_OWLEDEU_MR.pdf"&gt;http://dumontierlab.com/pdf/2008_OWLEDEU_MR.pdf&lt;/a&gt;&lt;/div&gt;&lt;div&gt;2 - Biochemical Identifiers - &lt;a href="http://www.slideshare.net/micheldumontier/accurate-biochemical-knowledge-starting-with-precise-structurebased-criteria-for-molecular-identity"&gt;http://www.slideshare.net/micheldumontier/accurate-biochemical-knowledge-starting-with-precise-structurebased-criteria-for-molecular-identity&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We’ve also worked on chemical ontologies in OWL - you can see them here:&lt;/div&gt;&lt;div&gt;&lt;a href="http://dumontierlab.com/index.php?page=ontologies"&gt;http://dumontierlab.com/index.php?page=ontologies&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;it would be nice to merge the two and work towards a comprehensive OWL-based ontology for the chemistry domain. Let me know if that would interest you.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-4205291226371931599?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/4205291226371931599/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=4205291226371931599' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/4205291226371931599'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/4205291226371931599'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2009/04/chemaxon-chemical-ontology.html' title='ChemAxon - Chemical Ontology'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8399195079684236447.post-6125640645946588325</id><published>2008-09-17T07:52:00.001-07:00</published><updated>2008-09-17T08:03:39.753-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='storage bio2rdf'/><title type='text'>Wouldn't 48 terabytes be nice to have?</title><content type='html'>Apparently, &lt;a href="http://www.sun.com/"&gt;SUN Microsystems&lt;/a&gt; is making an offer to &lt;a href="http://www.phy.anl.gov/atlas/ATLAS-flyer.pdf"&gt;ATLAS sites&lt;/a&gt; including those involved in the  &lt;span style="text-decoration: underline;"&gt;&lt;/span&gt;&lt;a href="http://en.wikipedia.org/wiki/Large_Hadron_Collider"&gt;CERN LHC (Large Hadron Collider) project&lt;/a&gt; to purchase to extra storage space... $25,000 for the &lt;a href="http://www.sun.com/servers/x64/x4500/"&gt;x4540&lt;/a&gt;, capable of storing 48 TERABYTES. The offer is exclusive to participants, but there's a catch - you have to buy 5 of them to get the good deal ;-)&lt;br /&gt;&lt;br /&gt;I want one! This could be useful for hosting my &lt;a href="http://bio2rdf.org"&gt;Bio2RDF&lt;/a&gt; node...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8399195079684236447-6125640645946588325?l=micheldumontier.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://micheldumontier.blogspot.com/feeds/6125640645946588325/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8399195079684236447&amp;postID=6125640645946588325' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/6125640645946588325'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8399195079684236447/posts/default/6125640645946588325'/><link rel='alternate' type='text/html' href='http://micheldumontier.blogspot.com/2008/09/wouldnt-48-terabytes-be-nice-to-have.html' title='Wouldn&apos;t 48 terabytes be nice to have?'/><author><name>Michel Dumontier</name><uri>https://profiles.google.com/118402274835118487325</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-UGyw2wd65VQ/AAAAAAAAAAI/AAAAAAAAAAA/GtzEO-MTmOU/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry></feed>
