<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AI3:::Adaptive Information &#187; Description Logics</title>
	<atom:link href="http://www.mkbergman.com/category/description-logics/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mkbergman.com</link>
	<description>Mike Bergman on the semantic Web and structured Web</description>
	<lastBuildDate>Tue, 24 Jan 2012 15:52:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Brown Bag Lunch: An Intrepid Guide to Ontologies</title>
		<link>http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/</link>
		<comments>http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/#comments</comments>
		<pubDate>Fri, 26 Nov 2010 08:43:48 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Brown Bag Lunch]]></category>
		<category><![CDATA[Description Logics]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[knowledge management]]></category>
		<category><![CDATA[Knowledge Representation]]></category>
		<category><![CDATA[Ontology]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=936</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: An Intrepid Guide to Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=Description Logics&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-11-26&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/&amp;rft.language=English"></span>
There&#8217;s an Endless Variety of World Views, and Almost as Many Ways to Organize and Describe Them Ontology is one of the more daunting terms for those exposed for the first time to the semantic Web. Not only is the word long and without many common antecedents, but it is also a term that has [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: An Intrepid Guide to Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=Description Logics&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-11-26&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/&amp;rft.language=English"></span>
<h2>There&#8217;s an Endless Variety of World Views, and Almost as Many Ways to Organize and Describe Them<a href="http://www.mkbergman.com/834/announcing-the-sporadic-friday-brown-bag-lunch"><img style="border: 0px solid; float: left; margin-right: 10px;" title="Friday Brown Bag Lunch" src="../wp-content/themes/ai3/images/lunchbag_225.jpg" alt="Friday     Brown Bag Lunch" width="158" height="179" /></a></h2>
<p>Ontology  is one of the more daunting terms for those exposed for the first time  to the semantic Web.  Not only is the word long and without many common  antecedents, but it is also a term that has widely divergent use and  understanding within the community.  It can be argued that this  not-so-little word is one of the barriers to mainstream understanding of  the semantic Web.</p>
<p>The root of the term is the Greek <em>ontos</em>, or being or the nature of things.  Literally &#8212; and in classical philosophy &#8212; ontology was used in relation to the study of the nature of being or the world, <a href="http://en.wikipedia.org/wiki/Ontology">the nature of existence</a>.  <a href="http://tomgruber.org/">Tom Gruber</a>, among others, made the term popular in relation to <a href="http://en.wikipedia.org/wiki/Ontology_%28computer_science%29">computer science</a> and artificial intelligence <a href="http://tomgruber.org/writing/ontolingua-kaj-1993.htm">about 15 years ago</a> when he defined ontology as a &#8220;formal specification of a conceptualization.&#8221;</p>
<p>While there have been attempts to strap on more or less formal  understandings or machinery around ontology, it still has very much the  sense of a world view, a means of viewing and organizing and  conceptualizing and defining a domain of interest.  As is made clear  below, I personally prefer a loose and embracing understanding of the  term (consistent with Deborah McGuinness&#8217;s 2003 paper, <a href="http://www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-mit-press-%28with-citation%29.htm">Ontologies Come of Age</a> [<a href="#onto1">1</a>]).</p>
<p>There has been a resurgence of interest in ontologies of late.  Two  reasons have been the emergence of Web 2.0 and tagging and folksonomies,  as well as the nascent emergence of the structured Web.  In fact, on April 23-24 one of the noted communities of practice around ontologies, <a title="http://ontolog.cim3.net/wiki/" rel="nofollow" href="http://ontolog.cim3.net/wiki/">Ontolog</a>, sponsored the <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2007">Ontology Summit 2007</a> ,&#8221;Ontology, Taxonomy, Folksonomy: Understanding the Distinctions.&#8221;</p>
<p>These events have sparked my preparing this guide to ontologies.  I  have to admit this is a somewhat intrepid endeavor given the wealth of  material and diversity of opinions.</p>
<div class="boxBrownDotted center_ok" style="min-height: 80px; max-width: 460px;"><img style="width: 64px; height: 73px; float: left; margin-right: 10px;" title="Friday Brown Bag    Lunch" src="../wp-content/themes/ai3/images/lunchbag_64.png" alt="Friday      Brown Bag Lunch" /> This <a href="../834/announcing-the-sporadic-friday-brown-bag-lunch">Friday      brown bag leftover</a> was first placed into the <span style="font-weight: bold; color: #993300;">AI3</span> <a href="../chronological-listing/">refrigerator</a> more than three years ago on <a href="http://www.mkbergman.com/374/an-intrepid-guide-to-ontologies/">May 16, 2007</a>.  This reprise is unchanged since its original posting, though there is a more recent executive-level <a href="http://techwiki.openstructs.org/index.php/Intro_to_Ontologies">intro to ontologies</a> on the <a href="http://openstructs.org/">OpenStructs</a>&#8216; <a href="http://techwiki.openstructs.org/index.php/Main_Page">TechWiki</a>.</div>
<h3>Overview and Role of Ontologies</h3>
<p>Of course, a fancy name is not sufficient alone to warrant an  interest in ontologies.  There are reasons why understanding, using and  manipulating ontologies can bring practical benefit:</p>
<ul>
<li>Depending on their degree of formalism (an important dimension), ontologies help make explicit the scope, definition, and language and meaning (semantics) of a given domain or world view</li>
<li>Ontologies may provide the power to generalize about their domains</li>
<li>Ontologies, if hierarchically structured in part (and not all are), can provide the power of inheritance</li>
<li>Ontologies provide guidance for how to correctly &#8220;place&#8221; information in relation to other information in that domain</li>
<li>Ontologies may provide the basis to reason or infer over its domain (again as a function of its formalism)</li>
<li>Ontologies can provide a more effective basis for information extraction or content clustering</li>
<li>Ontologies, again depending on their formalism, may be a source of  structure and controlled vocabularies helpful for disambiguating  context; they can inform and provide structure to the &#8220;lexicons&#8221; in  particular domains</li>
<li>Ontologies can provide guiding structure for browsing or discovery within a domain, and</li>
<li>Ontologies can help relate and &#8220;place&#8221; other ontologies or world  views in relation to one another; in other words, ontologies can  organize ontologies from the most specific to the most abstract.</li>
</ul>
<p>Both structure and formalism are dimensions for classifying  ontologies, which combined are often referred to as an ontology&#8217;s  &#8220;expressiveness.&#8221;  How one describes this structure and formality  differs.  One recent attempt is this figure from the <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2007">Ontology Summit 2007</a>&#8216;s wrap-up communique:</p>
<div><a href="http://ontolog.cim3.net/file/work/OntologySummit2007/workshop/ontology-dimensions-map_20070423b.png"><img class="center_ok" src="http://ontolog.cim3.net/file/work/OntologySummit2007/workshop/ontology-dimensions-map_20070423b.png" alt="Ontology Summit 2007 Communique Diagram" width="600" height="434" /></a></div>
<p>Note the bridging role that an ontology plays between a domain and  its content.  (By its nature, every ontology attempts to &#8220;define&#8221; and  bound a domain.)  Also note that the Summit&#8217;s 50 or so participants were  focused on the trade-off between semantics v. pragmatic considerations.   This was a result of the ongoing attempts within the community to  understand, embrace and (possibly) legitimize &#8220;less formal&#8221; Web 2.0  efforts such as tagging and the folksonomies that can result from them.</p>
<p>There is an <a href="http://en.wikipedia.org/wiki/M._C._Escher">M.C. Escher</a>-like  recursion of the lizard eating its tail when one observes ontologists  creating an ontology to describe the ontological domain.  The above  diagram, which itself would be different with a slight change in Summit  participation or editorship, is, of course, but one representative view  of the world.  Indeed, a tremendous variety of scientific and research  disciplines concern themselves with classifying and organizing the  &#8220;nature of things.&#8221;  Those disciplines go by such names as logicians,  taxonomists, philosophers, information architects, computer scientists,  librarians, operations researchers, systematicists, statisticians,  historians, and so forth.  (In short, given our ontos,  every area of human endeavor has the urge to classify, to organize.)   In each of these areas not only do their domains differ, but so do the  adopted structures and classification schemes often used.</p>
<p>There are at least 40 terms or concepts across these various  disciplines, most related to Web and general knowledge content, that  have organizational or classificatory aspects that &#8212; loosely defined &#8212;  could be called an &#8220;ontology&#8221; framework or approach:</p>
<div style="margin-top: 15px; margin-bottom: 15px;">
<table class="center_ok" border="0" cellspacing="2" cellpadding="2">
<tbody>
<tr>
<td>
<ul>
<li><a title="Tag cloud" href="http://en.wikipedia.org/wiki/Tag_cloud">Tag cloud</a></li>
<li><a title="Controlled vocabulary" href="http://en.wikipedia.org/wiki/Controlled_vocabulary">Controlled vocabulary</a></li>
<li><a title="Thesauri" href="http://en.wikipedia.org/wiki/Thesauri">Thesauri</a></li>
<li><a title="Collaborative tagging" href="http://en.wikipedia.org/wiki/Collaborative_tagging">Collaborative tagging</a></li>
<li><a title="Folk taxonomy" href="http://en.wikipedia.org/wiki/Folk_taxonomy">Folk taxonomy</a></li>
<li><a href="http://en.wikipedia.org/wiki/Web_directory">Directory</a></li>
<li><a href="http://www.mulberrytech.com/Extreme/Proceedings/html/2005/Newcomb01/EML2005Newcomb01.html">Subject Map</a></li>
<li><a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a></li>
<li><a href="http://en.wikipedia.org/wiki/Cladistics">Cladistics</a></li>
<li><a href="http://en.wikipedia.org/wiki/Markup_language">Markup languages</a></li>
</ul>
</td>
<td>
<ul>
<li><a title="Social bookmarking" href="http://en.wikipedia.org/wiki/Social_bookmarking">Social bookmarking</a></li>
<li><a title="Tag (metadata)" href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">Tags</a></li>
<li><a title="Tagging" href="http://en.wikipedia.org/wiki/Tagging">Tagging</a></li>
<li><a title="Taxonomy" href="http://en.wikipedia.org/wiki/Taxonomy">Taxonomy</a></li>
<li><a href="http://en.wikipedia.org/wiki/Folksonomy">Folksonomy</a></li>
<li><a href="http://en.wikipedia.org/wiki/Library_classification">Classification</a></li>
<li><a href="http://en.wikipedia.org/wiki/Categorization">Categorization</a></li>
<li><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a></li>
<li><a href="http://en.wikipedia.org/wiki/Metadata">Metadata</a></li>
<li><a href="http://en.wikipedia.org/wiki/Systematics">Systematics</a></li>
</ul>
</td>
<td>
<ul>
<li><a href="http://en.wikipedia.org/wiki/Ontology_%28computer_science%29">Ontology</a></li>
<li><a href="http://en.wikipedia.org/wiki/Microformats">Microformats</a></li>
<li><a href="http://en.wikipedia.org/wiki/Data_dictionary">Data dictionary</a></li>
<li><a href="http://en.wikipedia.org/wiki/OPML">OPML</a></li>
<li><a href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></li>
<li><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a></li>
<li><a href="http://en.wikipedia.org/wiki/Tree_structure">Subject Trees</a></li>
<li><a href="http://en.wikipedia.org/wiki/Information_architecture">Information Architecture</a></li>
<li><a href="http://en.wikipedia.org/wiki/Data_Reference_Model">Data Reference Model</a></li>
<li><a href="http://en.wikipedia.org/wiki/Phylogeny">Phylogeny</a></li>
</ul>
</td>
<td>
<div>
<ul>
<li><a href="http://en.wikipedia.org/wiki/Topic_map">Topic Maps</a></li>
<li><a href="http://en.wikipedia.org/wiki/Concept_map">Concept Maps</a></li>
<li><a href="http://en.wikipedia.org/wiki/Synset">Synsets</a></li>
<li><a href="http://en.wikipedia.org/wiki/Glossary">Glossary</a></li>
<li><a href="http://en.wikipedia.org/wiki/WordNet">WordNet</a></li>
<li><a href="http://en.wikipedia.org/wiki/Metadata">Metadata</a></li>
<li><a href="http://en.wikipedia.org/wiki/Faceted_classification">Facets</a></li>
<li><a href="http://en.wikipedia.org/wiki/Structure_%28mathematical_logic%29">Structure</a></li>
<li><a href="http://en.wikipedia.org/wiki/Dublin_Core">Dublin Core</a></li>
<li><a href="http://en.wikipedia.org/wiki/Typology">Typology</a></li>
</ul>
</div>
</td>
</tr>
</tbody>
</table>
</div>
<p>Actual domains or subject coverage are then mostly orthogonal to these approaches.</p>
<p>Loosely defined, the number of possible ontologies is therefore close to infinite:  domain X perspective X schema.  (Just kidding &#8212; sort of!  In fact, UMBC&#8217;s <a href="http://swoogle.umbc.edu/">Swoogle</a> ontology search service claims 10,000 ontologies presently on the Web; the actual data from August 2006 ranges from about <a href="http://ebiquity.umbc.edu/blogger/untangling-ontologies-on-the-semantic-web/">16,000 to 92,000 ontologies</a>, depending on how &#8220;formal&#8221; the definition.  These counts are also limited to OWL-based ontologies.)</p>
<p>Many have misunderstood the semantic Web because of this diversity  and the slipperiness of the concept of an ontology.  This  misunderstanding becomes flat wrong when people claim the semantic Web  implies one single grand ontology or organizational schema, <a href="http://en.wikipedia.org/wiki/One_Ring">One Ring to Rule Them All</a>.  Human and domain diversities makes this viewpoint patently false.</p>
<h3>Diversity, &#8216;Naturalness&#8217; and Change</h3>
<p>The choice of an ontological approach to organize Web and structured  content can be contentious. Publishers and authors perhaps have too many  choices: from straight Atom or RSS feeds and feeds with tags to  informal folksonomies and then Outline Processor Markup Language or  microformats. From there, the formalism increases further to include the  standard RDF ontologies such as SIOC (Semantically-Interlinked Online  Communities), SKOS (Simple Knowledge Organizing System), DOAP  (Description of a Project), and FOAF (Friend of a Friend) and the still  greater formalism of OWL&#8217;s various dialects.</p>
<p>Arguing which of these is the theoretical best method is doomed to  failure, except possibly in a bounded enterprise environment. We live in  the real world, where multiple options will always have their advocates  and their applications. All of us should welcome whatever structure we  can add to our information base, no matter where it comes from or how  it&#8217;s done. The sooner we can embrace content in any of these formats and  convert it to a canonical form, we can then move on to needed  developments in semantic mediation, the threshold condition for the  semantic Web.</p>
<div class="boxGrayDotted" style="float: right; margin-left: 10px; margin-right: 0px; width: 240px; font-size: 1.1em;">There are at least 40 concepts &#8212; loosely defined &#8212; that could be called an &#8220;ontology&#8221; framework or approach.</div>
<p>So, diversity is inevitable and should be accepted.  But that observation need not also embrace chaos.</p>
<p>In my early training in biological systematics, <a title="Ernst Haeckel" href="http://en.wikipedia.org/wiki/Ernst_Haeckel">Ernst Haeckel&#8217;s</a> <a title="Recapitulation theory" href="http://en.wikipedia.org/wiki/Recapitulation_theory">recapitulation theory</a> that &#8220;ontogeny recapitulates phylogeny&#8221; (note the same ontos  root, the difference from ontology being growth v. study) was losing  favor fast.  The theory was that the development of an organism through  its embryological phases mirrors its evolutionary history.  Today,  modern biologists recognize numerous connections between ontogeny and  phylogeny, explain them using <a title="Evolutionary developmental biology" href="http://en.wikipedia.org/wiki/Evolutionary_developmental_biology">evolutionary theory</a>, or view them as supporting evidence for that theory.</p>
<p>Yet, like the construction of phylogenetic trees, systematicists  strive for their classifications of the relatedness of organisms to be  &#8220;natural&#8221;, to reflect the true nature  of the relationship.  Thus, over time, that understanding of a  &#8220;natural&#8221; system has progressed from appearance → embryology →  embryology + detailed morphology → species and interbreeding → DNA.   While details continue to be worked out, the degree of genetic  relatedness is now widely accepted by biologists as a &#8220;natural&#8221; basis  for organizing the <a href="http://en.wikipedia.org/wiki/Tree_of_life_%28Science%29">Tree of Life</a>.</p>
<p>It is not unrealistic to also seek &#8220;naturalness&#8221; in the organization  of other knowledge domains, to seek &#8220;naturalness&#8221; in the organization of  their underlying ontologies.  Like natural systems in biology, this  naturalness should emerge from the shared understandings and perceptions  of the domain&#8217;s participants.  While subject matter expertise and  general and domain knowledge are essential to this development, they are  not the only factors.  As tagging systems on the Web are showing,  common usage and broad acceptance by the community at hand is important  as well.</p>
<p>While it may appear that a domain such as the biological relatedness  of organisms is more empirical than the concepts and ambiguous words in  most domains of human endeavor, these attempts at naturalness are still  not foolish.  The phylogeny example shows that understanding changes  over time as knowledge is gained.  We now accept DNA over the  recapitulation theory.</p>
<p>As the formal SKOS organizational schema for knowledge systems  recognizes (see below), the ideas of narrower and broader concepts can  be readily embraced, as well as concepts of relatedness and aliases  (synonyms).  These simple constructs, I would argue, plus the  application of knowledge being gained in related domains, will enable  tomorrow&#8217;s understandings to be more &#8220;natural&#8221; than today&#8217;s, no matter  the particular domain at hand.</p>
<p>So, in seeking a &#8220;naturalness&#8221; within our organizational schema, we  can also see that change is a constant.  We also see that the tools and  ideas underlying the seemingly abstract cause of merging and relating  existing ontologies to one another will further a greater &#8220;naturalness&#8221;  within our organizations of the world.</p>
<h3>A Spectrum of Formalisms</h3>
<p>According to the Summit, expressiveness is the extent and ease by which an ontology can describe domain semantics.  Structure they define as the degree of organization or hierarchical extent of the ontology.  They further define granularity  as the level of detail in the ontology.  And, as the diagram above  alludes, they define other dimensions of use, logical basis, purpose and  so forth of an ontology.</p>
<p>The over fifty respondents from 42 communities submitted some 70  different ontologies under about 40 terms to a survey that was used by  the Summit to construct their diagram.  These submissions included:</p>
<div class="boxYellowDotted">. . .  formal ontologies (e.g., <a href="http://www.ifomis.uni-saarland.de/bfo/">BFO</a>, <a href="http://www.loa-cnr.it/DOLCE.html">DOLCE</a>, <a href="http://www.ontologyportal.org/">SUMO</a>), biomedical ontologies (e.g., Gene Ontology, SNOMED CT, UMLS, ICD), thesauri (e.g., <a href="http://www.nlm.nih.gov/mesh/termscon.html">MeSH</a>, National Agricultural Library Thesaurus), folksonomies (e.g., Social bookmarking tags), general ontologies (<a href="http://www.cogsci.princeton.edu/%7Ewn">WordNet</a>, <a href="http://www.cyc.com/cyc/opencyc/overview">OpenCyc</a>) and specific ontologies (e.g., Process Specification Language). The list also includes markup languages (e.g., <a href="http://www.neuroml.org/">NeuroML</a>),  representation formalisms (e.g., Entity-Relation model, OWL, WSDL-S)  and various ISO standards (e.g., ISO 11179). This [Ontolog] sample  clearly illustrates the diversity of artifacts collected under  &#8220;ontology&#8221;.</div>
<p>I think the simplest spectrum for such distinctions is the formalism  of the ontology and its approach (and language or syntax, not further  discussed here).  More formal ontologies have greater expressiveness and  structure and inferential power, less formal ones the opposite.   Constructing more formal ontologies is more demanding, and takes more  effort and rigor, resulting in an approach that is more powerful but  also more rigid and less flexible.  Like anything else, there are always  trade-offs.</p>
<p>Based on work by Leo Obrst of Mitre as interpreted by Dan McCreary, we can view this as a trade-off as one of semantic clarity <em>v.</em> the time and money required to construct the formalism [<a href="#onto12">12,</a> <a href="#onto13">13</a>]:</p>
<div style="text-align: center;"><a href="../wp-content/themes/ai3/images/2007Posts/070501d_SemanticSpectrum.png"><img class="center_ok" title="Structure and Formalism Increases Semantic Expressiveness" src="../wp-content/themes/ai3/images/2007Posts/070501d_SemanticSpectrum.png" alt="Structure and Formalism Increases Semantic Expressiveness" width="600" height="428" align="middle" /></a><br />
 [Click on image for full-size pop-up]</div>
<p>Note this diagram reflects the more conventional, practitioner&#8217;s view  of the &#8220;formal&#8221; ontology, which does not include taxonomies or  controlled vocabularies (for example) in the definition.  This  represents the more &#8220;closely defined&#8221; end of the <a href="http://en.wikipedia.org/wiki/Semantic_spectrum">ontology (semantic) spectrum</a>.</p>
<p>However, since we are speaking here of ontologies and the structured Web  or the semantic Web, I believe we need to embrace a concept of ontology  aligned to actual practice.  Not all content providers can or want to  employ ontology engineers to enable formal inferencing of their content.   Yet, on the other hand, their content in its various forms does have  some meaningful structure, some organization.  The trick is to extract  this structure for more meaningful use such as data exchange or data  merging.</p>
<h3>Ontology Approaches on the Web</h3>
<p>Under such &#8220;loosely defined&#8221; bases we can thus see a spectrum of  ontology approaches on the Web, proceeding from less structure and  formalism to more so:</p>
<div style="margin-top: 15px; margin-bottom: 15px;">
<table class="center_ok" style="text-align: left; vertical-align: top; width: 80%;" border="0" cellspacing="0" cellpadding="4">
<tbody>
<tr>
<td style="background-color: #ffffcc; width: 3px;"></td>
<td style="font-weight: bold; text-align: center; background-color: #ffffcc; width: 25%;">Type or Schema</td>
<td style="font-weight: bold; text-align: center; background-color: #ffffcc; width: 25%;">Examples</td>
<td style="font-weight: bold; text-align: center; background-color: #ffffcc; width: 49%;">Comments on Structure and Formalism</td>
</tr>
<tr>
<td style="background-color: #ffd3d3;"></td>
<td style="vertical-align: top;">Standard Web Page</td>
<td style="vertical-align: top;">entire Web</td>
<td style="vertical-align: top;">General metadata fields in the  and internal HTML codes and tags provide minimal, but useful sources of structure; other HTTP and retrieval data can also contribute</td>
</tr>
<tr>
<td style="background-color: #ffbbbb;"></td>
<td style="vertical-align: top;">Blog / Wiki Page</td>
<td style="vertical-align: top;">examples from <a href="http://technorati.com/tag/">Technorati</a>, <a href="http://www.bloglines.com/">Bloglines</a>, <a href="http://en.wikipedia.org/wiki/Main_Page">Wikipedia</a></td>
<td style="vertical-align: top;">Provides still greater formalism for the organization and characterization of content (subjects, categories, posts, comments, date/time stamps, etc.).  Importantly, with the addition of plug-ins, some of the basic software may also provide other structured characterizations or output (SIOC, FOAF, etc.; highly varied and site-specific given the diversity of publishing systems and plug-ins)</td>
</tr>
<tr>
<td style="background-color: #ffa3a3;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Rss">RSS</a> / <a href="http://en.wikipedia.org/wiki/Atom_%28standard%29">Atom</a> feeds</td>
<td style="vertical-align: top;">most blogs and most news feeds</td>
<td style="vertical-align: top;">RSS extends basic XML schema for more robust syndication of content with a tightly controlled vocabulary for feed concepts and their relationships.  Because of its ubiquity, this is becoming a useful baseline of structure and formalism; also, the nature of adoption shows much about how ontological structure is an <span style="font-style: italic;">artifact</span>, not <span style="font-style: italic;">driver</span>, for use</td>
</tr>
<tr>
<td style="background-color: #ff8b8b;"></td>
<td style="vertical-align: top;">RSS / Atom feeds with <a href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">tags</a> or <a href="http://en.wikipedia.org/wiki/OPML">OPML</a></td>
<td style="vertical-align: top;"><a title="http://www.grazr.com" rel="nofollow" href="http://www.grazr.com/">Grazr</a>, most newsfeed aggregators can import and export OPML lists of RSS feeds</td>
<td style="vertical-align: top;">The OPML specification defines an outline as a hierarchical, ordered list of arbitrary elements. The specification is fairly open which makes it suitable for many types of list data.  See also <a title="OML" href="http://en.wikipedia.org/wiki/OML">OML</a> and <a title="XOXO" href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></td>
</tr>
<tr>
<td style="background-color: #ff7373;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Faceted_classification">Hierarchical Faceted Metadata</a></td>
<td style="vertical-align: top;"><a href="http://xfml.org/">XFML</a>, <a href="http://flamenco.berkeley.edu/index.html">Flamenco</a></td>
<td style="vertical-align: top;">These and related efforts from the information architecture (IA) community are geared more to library science.  However, they directly contribute to faceted browsing, which is one of the first practical instantiations of the semantic Web</td>
</tr>
<tr>
<td style="background-color: #ff5b5b;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Folksonomy">Folksonomies</a></td>
<td style="vertical-align: top;"><a title="Flickr" href="http://en.wikipedia.org/wiki/Flickr">Flickr</a>, <a title="Del.icio.us" href="http://en.wikipedia.org/wiki/Del.icio.us">del.icio.us</a></td>
<td style="vertical-align: top;">Based on user-generated tags and informal organizations of the same; not linked to any &#8220;standard&#8221; Web protocols.  Both tags and hierarchical structure are arbitrary, but some researchers now believe over large enough participant sets that structural consensus and value does emerge</td>
</tr>
<tr>
<td style="background-color: #ff4343;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Microformats">Microformats</a></td>
<td style="vertical-align: top;">Example formats include <a title="HAtom" href="http://en.wikipedia.org/wiki/HAtom">hAtom</a>, <a title="HCalendar" href="http://en.wikipedia.org/wiki/HCalendar">hCalendar</a>, <a title="HCard" href="http://en.wikipedia.org/wiki/HCard">hCard</a>, <a class="new" title="HReview" href="http://en.wikipedia.org/w/index.php?title=HReview&amp;action=edit">hReview</a>, <a title="HResume" href="http://en.wikipedia.org/wiki/HResume">hResume</a>, <a class="new" title="Rel-directory" href="http://en.wikipedia.org/w/index.php?title=Rel-directory&amp;action=edit">rel-directory</a>, <a class="new" title="XFolk" href="http://en.wikipedia.org/w/index.php?title=XFolk&amp;action=edit">xFolk</a>, <a title="XHTML Friends Network" href="http://en.wikipedia.org/wiki/XHTML_Friends_Network">XFN</a> and <a title="XOXO" href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></td>
<td style="vertical-align: top;">A microformat is HTML mark up to express semantics with strictly controlled vocabularies.  This markup is embedded using specific HTML attributes such as class, rel, and rev.  This method is easy to implement and understand, but is not free-form</td>
</tr>
<tr>
<td style="background-color: #ff2b2a;"></td>
<td style="vertical-align: top;">Embedded RDF</td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/RDFa">RDFa</a>, eRDF</td>
<td style="vertical-align: top;">An embedded format, like microformats, but free-form, and not subject to the approval strictures associated with microformats</td>
</tr>
<tr>
<td style="background-color: #ff1312;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Topic_map">Topic Maps</a></td>
<td style="vertical-align: top;"><a href="http://www.topicmaps.net/">Infoloom</a>, <a href="http://search.topicmaps.de/">Topic Maps Search Engine</a></td>
<td style="vertical-align: top;">A topic map can represent information using topics (representing any concept, from people, countries, and organizations to software modules, individual files, and events), associations (which represent the relationships between them), and occurrences (which represent relationships between topics and information resources relevant to them)</td>
</tr>
<tr>
<td style="background-color: #f90000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a></td>
<td style="vertical-align: top;">Many; <a href="http://dbpedia.org/">DBpedia</a>, etc.</td>
<td style="vertical-align: top;">RDF has become the canonical data model since it represents a &#8220;universal&#8221; conversion format</td>
</tr>
<tr>
<td style="background-color: #e10000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/RDF_Schema">RDF Schema</a></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/SKOS">SKOS</a>, <a href="http://en.wikipedia.org/wiki/SIOC">SIOC</a>, <a href="http://en.wikipedia.org/wiki/DOAP">DOAP</a>, <a href="http://en.wikipedia.org/wiki/FOAF_%28software%29">FOAF</a></td>
<td style="vertical-align: top;">RDFS or RDF Schema is an extensible knowledge representation language, providing basic elements for the description of ontologies, otherwise called RDF vocabularies, intended to structure RDF resources.  This becomes the canonical ontology common meeting ground</td>
</tr>
<tr>
<td style="background-color: #c90000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> Lite</td>
<td style="vertical-align: top;" rowspan="3">Here are some existing <a title="http://protege.cim3.net/cgi-bin/wiki.pl?ProtegeOntologiesLibrary#nid81S" rel="nofollow" href="http://protege.cim3.net/cgi-bin/wiki.pl?ProtegeOntologiesLibrary#nid81S">OWL ontologies</a>; also see <a href="http://swoogle.umbc.edu/">Swoogle</a> for OWL search facilities</td>
<td style="vertical-align: top;" rowspan="3">The Web Ontology Language (OWL) is a language for defining and instantiating Web ontologies. An OWL ontology may include descriptions of classes, along with their related properties and instances. OWL is designed for use by applications that need to process the content of information instead of just presenting information to humans. It facilitates greater machine interpretability of Web content than that supported by XML, RDF, and RDF Schema (RDF-S) by providing additional vocabulary along with a formal semantics.  The three language versions are in order of increasing expressiveness</td>
</tr>
<tr>
<td style="background-color: #b10000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> DL</td>
</tr>
<tr>
<td style="background-color: #9b0000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> Full</td>
</tr>
<tr>
<td style="background-color: #820000;"></td>
<td style="vertical-align: top;">Higher-order &#8220;formal&#8221; and &#8220;upper-level&#8221; ontologies</td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Suggested_Upper_Merged_Ontology">SUMO</a>, <a title="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf" rel="nofollow" href="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf">DOLCE</a>, <a href="http://proton.semanticweb.org/D1_8_1.pdf">PROTON</a>, <a title="http://www.ifomis.org/bfo" rel="nofollow" href="http://www.ifomis.org/bfo">BFO</a>, <a href="http://en.wikipedia.org/wiki/Cyc">Cyc</a>, OpenCyc</td>
<td style="vertical-align: top;">These provide comprehensive ontologies and often related knowledge bases, with the goal of enabling AI applications to perform human-like reasoning.  Their reasoning languages often use higher-order logics</td>
</tr>
</tbody>
</table>
</div>
<p>As a rule of thumb, items that are less &#8220;formal&#8221; can be converted to a  more formal expression, but the most formal forms can generally not be  expressed in less formal forms.</p>
<p>As latter sections elaborate, I see RDF as the universal data model  for representing this structure into a common, canonical format, with  RDF Schema (specifically SKOS, but also supplemented by FOAF, DOAP and  SIOC) as the organizing ontology <a href="http://en.wikipedia.org/wiki/Knowledge_representation">knowledge representation language</a> (KRL).</p>
<p>This is not to say that the various dialects of OWL should be  neglected.  In bounded environments, they can provide superior reasoning  power and are warranted if they can be sufficiently mandated or  enforced.  But the RDF and RDF-S systems represent the most tractable  &#8220;meeting place&#8221; or &#8220;middle ground,&#8221;  IMHO.</p>
<h3>Still-Another &#8220;Level&#8221; of Ontologies</h3>
<p>As if the formalism dimension were not complicated enough, there is  also the practice within the ontology community to characterize  ontologies by &#8220;levels&#8221;, specifically upper, middle and lower levels.  For example, chances are that you have heard particularly of <a href="http://en.wikipedia.org/wiki/Upper_ontology_%28computer_science%29">&#8220;upper-level&#8221; ontologies</a>.</p>
<p>The following figure helps illustrate this &#8220;level&#8221; dimension. This diagram is also from Leo Obrst of Mitre [<a href="#onto12">12</a>], and was also used in another 2006 talk by <a href="http://ontolog.cim3.net/file/resource/presentation/JackPark-PatrickDurusau_20060427/Avoiding_Hobson-s_Choice_In_Choosing_An_Ontology--JackPark-PatrickDurusau_20060427.ppt">Jack Park and Patrick Durusau</a> (discussed further below for other reasons):</p>
<div><img class="center_ok" src="../wp-content/themes/ai3/images/2007Posts/070501b_OntologyLevels.png" alt="Ontology Levels" width="569" height="353" /></div>
<p>Examples of upper-level ontologies include the Suggested Upper Merged Ontology (<a href="http://en.wikipedia.org/wiki/Suggested_Upper_Merged_Ontology">SUMO</a>), the Descriptive Ontology for Linguistic and Cognitive Engineering (<a title="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf" rel="nofollow" href="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf">DOLCE</a>), <a href="http://proton.semanticweb.org/D1_8_1.pdf">PROTON</a>, <a href="http://en.wikipedia.org/wiki/Cyc">Cyc</a> and <a title="http://www.ifomis.org/bfo" rel="nofollow" href="http://www.ifomis.org/bfo">BFO</a> (Basic Formal Ontology).  Most of the content in their upper-levels is  akin to broad, abstract relations or concepts (similar to the primary  classes, for example, in a <a href="http://en.wikipedia.org/wiki/Roget%27s_Thesaurus">Roget&#8217;s Thesaurus</a> &#8212; that is, real ontos  stuff) than to &#8220;generic common knowledge.&#8221;  Most all of them have both a  hierarchical and networked structure, though their actual subject  structure relating to concrete things is generally pretty weak [<a href="#onto2">2</a>].</p>
<p>The above diagram conveys a sense of how multiple ontologies can  relate to one another both in terms of narrower and broader topic matter  and at the same &#8220;levels&#8221; of generalization.  Such &#8220;meta-structure&#8221; (if  you will) can provide a reference structure for relating multiple  ontologies to one another.</p>
<div class="boxGrayDotted" style="float: right; margin-left: 10px; margin-right: 0px; width: 240px; font-size: 1.1em;">The relationships and mappings amongst ontologies  is a critical infrastructure component of the semantic Web.</div>
<p>It resides exactly in such bindings or relationships that we can  foresee the promise of querying and relating multiple endpoints on the  Web with accurate semantics in order to connect dots and combine  knowledge bases.  Thus, the understanding of the relationships and  mappings amongst ontologies becomes a critical infrastructural component  of the semantic Web.</p>
<h3>The SUMO Example</h3>
<p>We can better understand these mapping and inter-relationship  concepts by using a concrete example with a formal ontology.  We&#8217;ll  choose to use the <a href="http://www.ontologyportal.org/">Suggested Upper Merged Ontology</a> simply because it is one of the best known.  We could have also selected another upper-level system such as PROTON [<a href="#onto3">3</a>] or Cyc [<a href="#onto4">4</a>] or one of the many with narrower concept or subject coverage.</p>
<p>SUMO is one of the formal ontologies that has been mapped to the <a href="http://www.cogsci.princeton.edu/%7Ewn/">WordNet</a> lexicon, which adds to its semantic richness. SUMO is written in the <a href="http://sigmakee.cvs.sourceforge.net/*checkout*/sigmakee/sigma/suo-kif.pdf">SUO-KIF</a> language. SUMO is free and owned by the <a href="http://www.ieee.org/portal/site">IEEE</a>. The ontologies that extend SUMO are available under <a href="http://www.gnu.org/copyleft/gpl.html">GNU General Public License</a>.</p>
<p>The abstract, conceptual organization of SUMO is shown by this diagram, which also points to its related <a href="http://sigmakee.cvs.sourceforge.net/*checkout*/sigmakee/KBs/Mid-level-ontology.kif">MILO (MId-Level Ontology)</a>,  which is being developed as a bridge between the abstract content of  the SUMO and the richer detail of various domain ontologies:</p>
<div><img class="center_ok" src="http://www.ontologyportal.org/images/SUMOMILO.gif" alt="" width="348" height="368" /></div>
<p>At this level, the structure is quite abstract.  But one can easily browse the SUMO structure.  A nifty tool to do so is the <a href="http://virtual.cvut.cz/ksmsaWeb/browser/title">KSMSA (Knowledge Support for Modeling and Simulation) ontology browser</a>. Using a hierarchical tree representation, you can navigate through SUMO, MILO, <a href="http://www.cogsci.princeton.edu/%7Ewn/">WordNet</a>, and (with the locally installed version) Wikipedia.</p>
<p>The figure below shows the upper-level entity concept on the left; the right-hand panel shows a drill-down into the example atom entity:</p>
<div><a href="../wp-content/themes/ai3/images/2007Posts/070501a_SUMOExample.png"><img class="center_ok" title="Example SUMO Categories" src="../wp-content/themes/ai3/images/2007Posts/070501a_SUMOExample.png" alt="Example SUMO Categories" width="600" height="846" align="middle" /></a><br />
 [Click on image for full-size pop-up]</div>
<p>These views may be a bit misleading because the actual underlying  structure, while it has hierarchical aspects as shown here, really is in  the form of a directed acyclic graph (showing other relatedness  options, not just hierarchical ones).  So, alternate visualizations  include traditional network graphs.</p>
<p>The other thing to note is that the &#8220;things&#8221; covered in the ontology,  the entities, are also fairly abstract.  That is because the intention  of a standard &#8220;upper-level&#8221; ontology is to cover all relevant knowledge  aspects of each entity&#8217;s domain.  This approach results in a subject and  topic coverage that feels less &#8220;concrete&#8221; than the coverage in, say, an  encyclopedia, directory or card catalog.</p>
<h3>Ontology Binding and Integration Mechanisms</h3>
<p>According to Park and Durusau, upper ontologies are diverse, middle  ontologies are even more diverse, and lower ontologies are more diverse  still.  A key observation is that ontological diversity is a given and  increases as we approach real user interaction levels.  Moreover,  because of the &#8220;loose&#8221; nature of ontologies on the Web (now and into the  future), diversity of approach is a further key factor.</p>
<p>Recall the initial discussion on the role and objectives of  ontologies.  About half of those roles involve effectively accessing or  querying more than one ontology.  The objective of &#8220;upper-level&#8221;  ontologies, many with their own binding layers, is also expressly geared  to ontology integration or federation.  So, what are the possible  mechanisms for such binding or integration?</p>
<p>A fundamental distinction within mechanisms to combine ontologies is  whether it is a unified or centralized approach (often imposed or  required by some party) or whether it is a schema mapping or binding  approach.  We can term this distinction centralized v. federated.</p>
<h4>Centralized Approaches</h4>
<p>Centralized approaches can take a number of forms.  At the most  extreme, adherence to a centralized approach can be contractual.  At the  other end are reference models or standards.  For example, illustrative  reference models include:</p>
<ul>
<li>the <a href="http://en.wikipedia.org/wiki/Data_Reference_Model">Data Reference Model</a> (<a href="http://www.whitehouse.gov/omb/egov/documents/DRM_2_0_Final.pdf">DRM</a>), one of the five reference models of the Federal Enterprise Architecture (FEA)</li>
<li><a href="http://en.wikipedia.org/wiki/UDEF">UDEF</a> (Unified Data Element Framework), an approach toward a unified description framework, or</li>
<li>the eXtended MetaData Registry (<a href="http://xmdr.org/">XMDR</a>) project.</li>
</ul>
<p>Though I have argued that One Ring to Rule them All  is not appropriate to the general Web, there may be cases within  certain enterprises or where through funding clout (such as government  contracts), some form of centralized approach could be imposed [<a href="#onto5">5</a>].   And, frankly, even where compliance can not be assured, there are  advantages in economy, efficiency and interoperability to attempt  central ontologies.  Certain industries &#8212; notably pharmaceuticals and  petrochemicals &#8212; and certain disciplines &#8212; such as some areas of  biology among others &#8212; have through trade associations or community  consensus done admirable jobs in adopting centralized approaches.</p>
<h4>Federated Approaches</h4>
<p>However, combining ontologies in the context of the broader Internet  is more likely through federated approaches.  (Though federated  approaches can also be improved when there are consensual standards  within specific communities.)  The key aspect of a federated approach is  to acknowledge that multiple schema need to be brought together, and  that each contributing data set and its schema will not be altered  directly and will likely remain in place.</p>
<p>Thus, the key distinctions within this category are the mechanisms by  which those linkages may take place  An important goal in any federated  approach is to achieve interoperability at the data or instance level  without unacceptable loss of information or corruption of the semantics.   Numerous specific approaches are possible, but three example areas in  RDF-topic map interoperability, the use of &#8220;subject maps&#8221;, and binding  layers can illustrate some of the issues at hand.</p>
<p>In 2006 the W3C set up a working group to look at the issue of RDF  and topic maps interoperability.  Topic maps have been embraced by the  library and information architecture community for some time, and have  standards that have been adopted under ISO.  Somewhat later but also in  parallel was the development of the RDF standard by W3C.  The  interesting thing was that the conceptual underpinnings and objectives  between these two efforts were quite similar.  Also, because of the  substantive thrust of topic maps and the substantive needs of its  community, quite a few topic maps had been developed and implemented.</p>
<p>One of the first efforts of the W3C work group was to evaluate and  compare five or six extant proposals for how to relate RDF and topic  maps [<a href="#onto6">6</a>].   That report is very interesting reading for any one desirous of  learning more about specific issues in combining ontologies and their  interoperability.  The result of that evaluation then led to some  guidelines for best practices in how to complete this mapping [<a href="#onto7">7</a>].   Evaluations such as these provide confidence that interoperability can  be achieved between relatively formal schema definitions without  unacceptable loss in meaning.</p>
<p>A different, &#8220;looser&#8221; approach, but one which also grew out of the  topic map community, is the idea of &#8220;subject maps.&#8221;  This effort, backed  by Park and Durusau noted above, but also with the support of other  topic map experts such as Steve Newcomb and Robert Barta via their  proposed <a href="http://www.jtc1sc34.org/repository/0710.pdf">Topic Maps Reference Model</a> (ISO 13250-5), seems to be one of the best attempts I&#8217;ve seen that both  respects the reality of the actual Web while proposing a workable,  effective scheme for federation.</p>
<p>The basic idea of a subject map is built around a set of subject  &#8220;proxies.&#8221;  A subject proxy is a computer representation of a subject  that can be implemented as an object, must have an identity, and must be  addressable (this point provides the URI connector to RDF).  Each  contributing schema thus defines its own subjects, with the mappings  becoming meta-objects.  These, in turn, would benefit from having some  accepted subject reference schema (not specifically addressed by the  proponents) to reduce the breadth of the ultimate mapped proxy &#8220;space.&#8221;</p>
<p>I don&#8217;t have the expertise to judge further the specifics, but I find the presentation and papers by Park and Durusau, <a href="http://ontolog.cim3.net/file/resource/presentation/JackPark-PatrickDurusau_20060427/Avoiding_Hobson-s_Choice_In_Choosing_An_Ontology--JackPark-PatrickDurusau_20060427.ppt">Avoiding Hobson&#8217;s Choice In Choosing An Ontology</a> and <a href="http://protege.stanford.edu/conference/2006/submissions/abstracts/3.3_Park_SubjectCentricProtege.pdf">Towards Subject-centric Merging of Ontologies</a> to be worthwhile reading in any case.  I highly recommend these papers for further background and clarity.</p>
<p>As the third example, &#8220;binding layers&#8221; are a comparatively newer  concept.  Leading upper-level ontologies such as SUMO or PROTON propose  their own binding protocols to their &#8220;lower&#8221; domains, but that approach  takes place within the construct of the parent upper ontology and  language.  Such designs are not yet generalized solutions.  By far the  most promising generalized binding solution is the <a href="http://en.wikipedia.org/wiki/SKOS">SKOS</a> (Simple Knowledge Organization System).  Because of its importance, the next section is devoted to it.</p>
<p>Finally, with respect to federated approaches, there are quite a few  software tools that have been developed to aid or promote some of these  specific methods.  For, example, about twenty of the software  applications in my <a href="..//?page_id=325">Sweet Tools</a> listing of 500+ semantic Web and -related tools could be interpreted as  aiding ontology mapping or creation.  You may want to check out some of  these specific tools depending on your preferred approach [<a href="#onto8">8</a>].</p>
<h3>The Role of SKOS &#8211; the Simple Knowledge Organization System</h3>
<p>SKOS, or the Simple Knowledge Organization System, is a formal  language and schema designed to represent such structured information  domains as <a title="Thesaurus" href="http://en.wikipedia.org/wiki/Thesaurus">thesauri</a>, <a title="Classification scheme" href="http://en.wikipedia.org/wiki/Classification_scheme">classification schemes</a>, <a title="Taxonomy" href="http://en.wikipedia.org/wiki/Taxonomy">taxonomies</a>, <a title="Authority control" href="http://en.wikipedia.org/wiki/Authority_control">subject-heading systems</a>, <a title="Controlled vocabulary" href="http://en.wikipedia.org/wiki/Controlled_vocabulary">controlled vocabularies</a>, or others; in short, most all of the &#8220;loosely defined&#8221; ontology approaches discussed herein.  It is a <a title="World Wide Web Consortium" href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium">W3C</a> initiative more fully defined in its <a href="http://www.w3.org/TR/swbp-skos-core-guide">SKOS Core Guide</a> [<a href="#onto9">9</a>].</p>
<p>SKOS is built upon the <a title="Resource Description Framework" href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> data model of the subject-predicate-object &#8220;triple.&#8221;  The subjects and objects are akin to nouns, the predicate a verb, in a simple Dick-sees-Jane sentence.  Subjects and predicates by convention are related to a <a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">URI</a> that provides the definitive reference to the item.  Objects  may be either a URI resource or a literal (in which case it might be  some indexed text, an actual image, number to be used in a calculation,  etc.).</p>
<p>Being an <a title="RDF Schema" href="http://en.wikipedia.org/wiki/RDF_Schema">RDF Schema</a> simply means that SKOS adds some language and defined relationships to  this RDF baseline.  This is a bit of recursive understanding, since RDFS  is itself defined in RDF by virtue of adding some controlled vocabulary  and relations.  The power, though, is that these schema additions are  also easily expressed and referenced.</p>
<p>This RDFS combination can thus be shown as a standard RDF triple  graph, but with the addition of the extended vocabulary and relations:</p>
<div><img class="center_ok" src="http://www.w3.org/TR/2005/WD-swbp-skos-core-guide-20051102/img/ex-triple.png" alt="Standard RDF Graph Model" width="600" height="201" /></div>
<p>The power of the approach arises from the ability of the triple to  express virtually any concept, further extended via the RDFS language  defined for SKOS.  SKOS includes concepts such as &#8220;broader&#8221; and  &#8220;narrower&#8221;, which enable hierarchical relations to be modeled, as well  as &#8220;related&#8221; and &#8220;member&#8221; to support networks and arrays, respectively [<a href="#onto9">9</a>].</p>
<p>We can visualize this transforming power by looking at how an  &#8220;ontology&#8221; in a totally foreign scheme can be related to the canonical  SKOS scheme.  In the figure below the left-hand portion shows the native  hierarchical taxonomy structure of the UK Archival Thesaurus (<a href="http://www.ukat.org.uk/thesaurus/hierarchy.php?m=625">UKAT</a>),  next as converted to SKOS on the right (with the overlap of categories  shown in dark purple).  Note the hierarchical relationships visualize  better via a taxonomy, but that the RDF graph model used by SKOS allows a  richer set of additional relationships including related and  alternative names:</p>
<div style="text-align: center;"><a href="../wp-content/themes/ai3/images/2007Posts/070501e_SKOS_UKAT.png"><img class="center_ok" title="Example Structural Comparison of Hierarchical Taxonomy with Network Graph" src="../wp-content/themes/ai3/images/2007Posts/070501e_SKOS_UKAT.png" alt="Example Structural Comparison of Hierarchical Taxonomy with Network Graph" width="600" height="507" align="middle" /></a><br />
 [Click on image for full-size pop-up]</div>
<p>SKOS also has a rich set of annotation and labeling properties to enhance human readability of schema developed in it [<a href="#onto9">9</a>].   There is also a useful draft schema that the W3C&#8217;s SWEO (Semantic Web  Education and Outreach) group is developing to organize semantic  Web-related information [<a href="#onto10">10</a>].</p>
<p>Combined, these constructs provide powerful mechanisms for giving  contributory ontologies a common conceptualization.  When added to other  sibling RDF schema such as FOAF or SIOC or DOAP, still additional  concepts can be collated.</p>
<h3>Conclusions</h3>
<p>While not addressed directly in this piece, it is obviously of first  importance to have content with structure before the questions of  connecting that information can even arise.  Then, that structure must  also be available in a form suitable for merging or connection.</p>
<p>At that point, the subjects of this posting come into play.</p>
<div class="boxGrayDotted" style="float: left; margin-right: 10px; margin-left: 0px; width: 240px; font-size: 1.1em;">We are stubbing our toes on the rocks while we gaze at the heavens.</div>
<p>We see that the daily Web has a diversity of schema or ontologies  &#8220;loosely defined&#8221; for representing the structure of the content.  These  representations can be transferred to more complex schema, but not in  the opposite direction.  Moreover, the semantic basis for how to make  these mappings also needs some common referents.</p>
<p>RDF provides the canonical data model for the data transfers and  representations.  RDFS, especially in the form of SKOS, appears to form  one basis for the syntax and language for these transformations.  And  SKOS, with other schema, also appears to offer much of the appropriate  &#8220;middle ground&#8221; for data relationships mapping.</p>
<p>However, lacking in this story is a referential structure for subject relationships [<a href="#onto11">11</a>].  (Also lacking are the ultimately critical domain specifics required for actual implementation.)</p>
<p>Abstract concepts of interest to philosophers and deep thinkers have  been given much attention.  Sadly, to date, concrete subject structures  in which tangible things and tangible actions can be shared, is still  very, very weak.  We are stubbing our toes on the rocks while we gaze at  the heavens.</p>
<p>Yet, despite this, simple and powerful infrastructures are well  in-hand to address all foreseeable syntactic and semantic issues.  There  appear to be no substantive limits to needed next steps.</p>
<p>Lastly, many valuable resources for further reading and learning may be found within the <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage">Ontolog Community</a>, <a href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium">W3C</a>, <a href="http://tagcommons.org/">TagCommons</a> and <a href="http://www.topicmaps.org/">Topics Maps</a> groups.  Enjoy!  And be wary of ontology no longer.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto1" name="onto1"></a>1] Deborah  L. McGuinness. &#8220;Ontologies Come of Age&#8221;. In Dieter Fensel, Jim Hendler,  Henry Lieberman, and Wolfgang Wahlster, editors. <span style="text-decoration: underline;">Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential</span>. MIT Press, 2003.  See <a href="http://www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-mit-press-%28with-citation%29.htm">http://www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-mit-press-(with-citation).htm</a></div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto2" name="onto2"></a>2] I think it would be much clearer to refer to &#8220;upper level&#8221; ontologies as abstract or conceptual, &#8220;mid levels&#8221; as mapping or binding, and &#8220;lower levels&#8221; as domain (without any hierarchical distinctions such as lower or lowest or sub-domain), but current practice is probably too entrenched to change now.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto3" name="onto3"></a>3]  There are many aspects that make <a href="http://proton.semanticweb.org/D1_8_1.pdf">PROTON</a> one of the more attractive reference ontologies. The PROTON ontology (PROTo ONtology), developed within the scope of the <a href="http://en.wikipedia.org/wiki/SEKT">SEKT project,</a> is attractive because of its understandability, relatively small size,  modular architecture and a simple subsumption hierarchy.  It is  available in an OWL Lite form and is easy to adopt and extend.  On the  face of it, the Topic class within PROTON, which is meant to serve as a  bridge between different ontologies, may also provide a binding layer to  specific subject topics as sub-classes or class instances.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto4" name="onto4"></a>4] See my <a href="..//?p=365">earlier post on Cyc</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto5" name="onto5"></a>5]  Even with such clout, it is questionable to get rather complete adherence, as <a href="http://en.wikipedia.org/wiki/Ada_%28programming_language%29">Ada</a> showed within the Federal government.  However, where circumstances  allow it, central schema and ontologies may be worth pursuing because of  improved interoperability and lower costs, even where some portions do  not adhere and are more chaotic like the standard Web.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto6" name="onto6"></a>6] See,  <a href="http://www.w3.org/TR/rdftm-survey">A Survey of RDF/Topic Maps Interoperability Proposals</a>, W3C Working Group Note 10 February 2006, Pepper, Vitali, Garshol, Gessa, Presutti (eds.)</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto7" name="onto7"></a>7] See, <a href="http://www.w3.org/2001/sw/BestPractices/RDFTM/guidelines-20060630.html">Guidelines for RDF/Topic Maps Interoperability</a>, W3C Editor&#8217;s Draft 30 June 2006, Pepper, Presutti, Garshol, Vitali (eds.)</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;">[<a title="onto8" name="onto8"></a>8] Here are some <a href="..//?page_id=325">Sweet Tools</a> that may have a usefulness to ontology federation and creation:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li><a href="http://www.aktors.org/technologies/adaptiva/">Adaptiva</a> &#8212;  is a user-centered ontology building environment, based on using  multiple strategies to construct an ontology, minimising user input by  using adaptive information extraction</li>
<li><a href="http://www.altova.com/products_semanticworks.html">Altova SemanticWorks</a> &#8212; is a visual RDF and OWL editor that auto-generates RDF/XML or nTriples based on visual ontology design</li>
<li><a href="http://projects.semwebcentral.org/projects/ontologymapping/">CMS</a> &#8212; the CROSI Mapping System is a structure matching system that  capitalizes on the rich semantics of the OWL constructs found in source  ontologies and on its modular architecture that allows the system to  consult external linguistic resources</li>
<li><a href="http://www.aktors.org/technologies/conceptool/">ConcepTool</a> &#8212; is a system to model, analyze, verify, validate, share, combine, and  reuse domain knowledge bases and ontologies, reasoning about their  implication</li>
<li><a href="http://www.aktors.org/technologies/conref/">ConRef</a> &#8212;  is a service discovery system which uses ontology mapping techniques to support different user vocabularies</li>
<li><a href="http://www.aifb.uni-karlsruhe.de/WBS/meh/foam/">FOAM</a> &#8212; is the Framework for Ontology Alignment and Mapping. It is based on  heuristics (similarity) of the individual entities (concepts, relations,  and instances)</li>
<li><a href="http://sourceforge.net/projects/hmafra">hMAFRA</a> (Harmonize Mapping Framework) &#8212; is a set of tools supporting semantic  mapping definition and data reconciliation between ontologies. The  targeted formats are XSD, RDFS and KAON</li>
<li><a href="http://www.aktors.org/technologies/ifmap/">IF-Map</a> &#8212; is an Information Flow based ontology mapping method. It is based on  the theoretical grounds of logic of distributed systems and provides an  automated streamlined process for generating mappings between  ontologies of the same domain</li>
<li><a href="http://www.alphaworks.ibm.com/tech/semanticstk">IODT</a> &#8212; is IBM&#8217;s toolkit for ontology-driven development. The toolkit  includes EMF Ontology Definition Metamodel (EODM), EODM workbench, and  an OWL Ontology Repository (named Minerva)</li>
<li><a href="http://kaon.semanticweb.org/">KAON</a> &#8212; is an open-source ontology management infrastructure targeted for  business applications. It includes a comprehensive tool suite allowing  easy ontology creation and management and provides a framework for  building ontology-based applications. An important focus of KAON is  scalable and efficient reasoning with ontologies</li>
<li><a href="http://www.landcglobal.com/pages/linkfactory.php">LinKFactory</a> &#8212; is Language &amp; Computing&#8217;s ontology management tool.  It provides  an effective and user-friendly way to create, maintain and extend  extensive multilingual terminology systems and ontologies (English,  Spanish, French, etc.). It is designed to build, manage and maintain  large, complex, language independent ontologies</li>
<li><a href="http://www.m3t4.com/index.jsp">M3t4.Studio Semantic Toolkit</a> &#8212; is Metatomix&#8217;s free set of Eclipse plug-ins to allow developers to create and manage OWL ontologies and RDF documents</li>
<li><a href="http://mafra-toolkit.sourceforge.net/">MAFRA Toolkit</a> &#8212; the Ontology MApping FRAmework Toolkit allows to create semantic  relations between two (source and target) ontologies, and apply such  relations in translating source ontology instances into target ontology  instances</li>
<li><a href="http://projects.semwebcentral.org/projects/ontoengine/">OntoEngine</a> &#8212; is a step toward allowing agents to communicate even though they use  different formal languages (i.e., different ontologies). It translates  data from a &#8220;source&#8221; ontology to a &#8220;target.&#8221;</li>
<li><a href="http://www.ontoportal.org.uk/">OntoPortal</a> &#8212; enables the authoring and navigation of large semantically-powered portals</li>
<li><a href="http://www.dfki.de/%7Eklusch/owls-mx/">OWLS-MX</a> &#8212; the hybrid semantic Web service matchmaker OWLS-MX 1.0 utilizes both  description logic reasoning, and token based IR similarity measures. It  applies different filters to retrieve OWL-S services that are most  relevant to a given query</li>
<li><a href="http://powl.sourceforge.net/index.php">pOWL</a> &#8212; is a semantic Web development platform for ontologies in PHP. pOWL consists of a number of components, including RAP</li>
<li><a href="http://protege.stanford.edu/">Protege</a> &#8212; is an open source visual ontology editor written in Java with many plug-in tools</li>
<li><a href="https://sourceforge.net/projects/semantag">Semantic Net Generator</a> &#8212; is a utility for generating topic maps automatically from different  data sources by using rules definitions specified with Jelly XML syntax.  This Java library provides Jelly tags to access and modify data sources  (also RDF) to create a semantic network</li>
<li><a href="http://sofa.projects.semwebcentral.org/">SOFA</a> &#8212; is a Java API for modeling ontologies and Knowledge Bases in  ontology and Semantic Web applications. It provides a simple, abstract  and language neutral ontology object model, inferencing mechanism and  representation of the model with OWL, DAML+OIL and RDFS languages</li>
<li><a href="http://www.seco.tkk.fi/projects/semweb/dist.php">Terminator</a> &#8212; is a tool for creating term to ontology resource mappings (documentation in Finnish)</li>
<li><a href="http://kmi.open.ac.uk/projects/webonto/">WebOnto</a> &#8212; supports the browsing, creation and editing of ontologies through  coarse grained and fine grained visualizations and direct manipulation.</li>
</ul>
</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;">[<a title="onto9" name="onto9"></a>9] The SKOS language has the following classes:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li><a title="Collectable Property" href="http://www.w3.org/TR/swbp-skos-core-spec/#CollectableProperty">CollectableProperty</a> &#8212; A property which can be used with a skos:Collection</li>
<li><a title="Collection" href="http://www.w3.org/TR/swbp-skos-core-spec/#Collection">Collection</a> &#8212; A meaningful collection of concepts</li>
<li><a title="Concept" href="http://www.w3.org/TR/swbp-skos-core-spec/#Concept">Concept </a> &#8212; An abstract idea or notion; a unit of thought</li>
<li><a title="Concept Scheme" href="http://www.w3.org/TR/swbp-skos-core-spec/#ConceptScheme">ConceptScheme</a> &#8212; A set of concepts, optionally including statements about semantic  relationships between those concepts.  Thesauri, classification schemes,  subject heading lists, taxonomies, &#8216;folksonomies&#8217;, and other types of  controlled vocabulary are all examples of concept schemes. Concept  schemes are also embedded in glossaries and terminologies.</li>
<li><a title="Ordered Collection" href="http://www.w3.org/TR/swbp-skos-core-spec/#OrderedCollection">OrderedCollection</a> &#8212; An ordered collection of concepts, where both the grouping and the ordering are meaningful</li>
</ul>
<div style="margin: 10px 0pt;">. . . and the following properties:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li><a title="alternative label" href="http://www.w3.org/TR/swbp-skos-core-spec/#altLabel">altLabel</a> &#8212; An alternative lexical label for a resource.  Acronyms,  abbreviations, spelling variants, and irregular plural/singular forms  may be included among the alternative labels for a concept</li>
<li><a title="alternative symbolic label" href="http://www.w3.org/TR/swbp-skos-core-spec/#altSymbol">altSymbol</a> &#8212; An alternative symbolic label for a resource</li>
<li><a title="has broader" href="http://www.w3.org/TR/swbp-skos-core-spec/#broader">broader</a> &#8212; A concept that is more general in meaning. Broader concepts are typically rendered as parents in a concept hierarchy (tree)</li>
<li><a title="change note" href="http://www.w3.org/TR/swbp-skos-core-spec/#changeNote">changeNote</a> &#8212; A note about a modification to a concept</li>
<li><a title="definition" href="http://www.w3.org/TR/swbp-skos-core-spec/#definition">definition</a> &#8212; A statement or formal explanation of the meaning of a concept</li>
<li><a title="editorial note" href="http://www.w3.org/TR/swbp-skos-core-spec/#editorialNote">editorialNote</a> &#8212; A note for an editor, translator or maintainer of the vocabulary</li>
<li><a title="example" href="http://www.w3.org/TR/swbp-skos-core-spec/#example">example</a> &#8212; An example of the use of a concept</li>
<li><a title="has top concept" href="http://www.w3.org/TR/swbp-skos-core-spec/#hasTopConcept">hasTopConcept</a> &#8212; A top level concept in the concept scheme</li>
<li><a title="hidden label" href="http://www.w3.org/TR/swbp-skos-core-spec/#hiddenLabel">hiddenLabel</a> &#8212; A lexical label for a resource that should be hidden when generating  visual displays of the resource, but should still be accessible to free  text search operations</li>
<li><a title="history note" href="http://www.w3.org/TR/swbp-skos-core-spec/#historyNote">historyNote</a> &#8212; A note about the past state/use/meaning of a concept</li>
<li><a title="in scheme" href="http://www.w3.org/TR/swbp-skos-core-spec/#inScheme">inScheme</a> &#8212; A concept scheme in which the concept is included. A concept may be a member of more than one concept scheme</li>
<li><a title="is primary subject of" href="http://www.w3.org/TR/swbp-skos-core-spec/#isPrimarySubjectOf">isPrimarySubjectOf</a> &#8212; A resource for which the concept is the primary subject</li>
<li><a title="is subject of" href="http://www.w3.org/TR/swbp-skos-core-spec/#isSubjectOf">isSubjectOf</a> &#8211;A resource for which the concept is a subject</li>
<li><a title="member" href="http://www.w3.org/TR/swbp-skos-core-spec/#member">member</a> &#8212; A member of a collection</li>
<li><a title="member list" href="http://www.w3.org/TR/swbp-skos-core-spec/#memberList">memberList</a> &#8212; An RDF list containing the members of an ordered collection</li>
<li><a title="has narrower" href="http://www.w3.org/TR/swbp-skos-core-spec/#narrower">narrower</a> &#8212; A concept that is more specific in meaning.  Narrower concepts are  typically rendered as children in a concept hierarchy (tree)</li>
<li><a title="note" href="http://www.w3.org/TR/swbp-skos-core-spec/#note">note</a> &#8212; A general note, for any purpose. The other human-readable properties  of definition, scopeNote, example, historyNote, editorialNote and  changeNote are all sub-properties of note</li>
<li><a title="preferred label" href="http://www.w3.org/TR/swbp-skos-core-spec/#prefLabel">prefLabel</a> &#8212; The preferred lexical label for a resource, in a given language. No  two concepts in the same concept scheme may have the same value for  skos:prefLabel in a given language</li>
<li><a title="preferred symbolic label" href="http://www.w3.org/TR/swbp-skos-core-spec/#prefSymbol">prefSymbol</a> &#8212; The preferred symbolic label for a resource</li>
<li><a title="has primary subject" href="http://www.w3.org/TR/swbp-skos-core-spec/#primarySubject">primarySubject</a> &#8212; A concept that is the primary subject of the resource.  A resource may have only one primary subject per concept scheme</li>
<li><a title="related to" href="http://www.w3.org/TR/swbp-skos-core-spec/#related">related</a> &#8212; A concept with which there is an associative semantic relationship</li>
<li><a title="scope note" href="http://www.w3.org/TR/swbp-skos-core-spec/#scopeNote">scopeNote</a> &#8212; A note that helps to clarify the meaning of a concept</li>
<li><a title="semantic relation" href="http://www.w3.org/TR/swbp-skos-core-spec/#semanticRelation">semanticRelation</a> &#8212; A concept related by meaning. This property should not be used  directly, but as a super-property for all properties denoting a  relationship of meaning between concepts</li>
<li><a title="has subject" href="http://www.w3.org/TR/swbp-skos-core-spec/#subject">subject</a> &#8212; A concept that is a subject of the resource</li>
<li><a title="subject indicator" href="http://www.w3.org/TR/swbp-skos-core-spec/#subjectIndicator">subjectIndicator</a> &#8212; A subject indicator for a concept. [The notion of 'subject  indicator' is defined here with reference to the latest definition  endorsed by the OASIS Published Subjects Technical Committee]</li>
<li><a title="symbolic label" href="http://www.w3.org/TR/swbp-skos-core-spec/#symbol">symbol</a> &#8212; An image that is a symbolic label for the resource. This property is  roughly analagous to rdfs:label, but for labelling resources with  images that have retrievable representations, rather than RDF literals.  Symbolic labelling means labelling a concept with an image.</li>
</ul>
</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;">[<a title="onto10" name="onto10"></a>10]  The <a href="http://esw.w3.org/topic/SweoIG/TaskForces/InfoGathering/ClassificationOntology">SWEO classification ontology</a> is still under active development and has these draft classes.  Note,  however, the relative lack of actual subject or topic matter:</div>
<div style="margin: 10px 0pt;">Classes are currently defined as:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>article &#8211; magazine article</li>
<li>blog &#8211; blog discussing SW topics</li>
<li>book &#8211; indicates a textbook, applies to the book&#8217;s home page, review or listing in Amazon or such</li>
<li>casestudy &#8211; Article on a business case</li>
<li>conference/event &#8211; conferences or events where you can learn about the Semantic Web</li>
<li>demo/demonstration &#8211; interactive SW demo</li>
<li>forum &#8211; a forum on semantic web or related topics</li>
<li>presentation &#8211; Powerpoint or similar slide show</li>
<li>person &#8211; If this is a person&#8217;s home page or blog, see below</li>
<li>publication &#8211; a scientific publication</li>
<li>ontology &#8211; a formalisation of a shared conceptualization using OWL, RDFS, SKOS or something else based on RDF</li>
<li>organization &#8211; If the page is the home page of an organization, research, vendor etc, see below</li>
<li>portal &#8211; a portal website Semantic Web or related topics, usually hosting information items, mailinglists, community tools</li>
<li>project &#8211; a research (for example EU-IST) or other project that addresses Semantic Web issues</li>
<li>mailinglist &#8211; a mailinglist on semantic Web or related topics</li>
<li>person &#8211; ideally a person that is well known regarding the Semantic  Web (people who can do keynote speakers), may also be any related person</li>
<li>press &#8211; a press release by a company or an article about Semantic Web</li>
<li>recommended &#8211; If the resource is seen to be in the top 10 of its kind</li>
<li>specification &#8211; a Semantic Web specification (RDF, RDF/S, OWL, etc)</li>
<li>categories &#8211; (perhaps using tags or other free form annotation</li>
<li>successstory &#8211; Article that can contain advertisment and clearly shows the benefit of semantic web</li>
<li>tutorial &#8211; a tutorial teaching some aspect of semantic web, an example</li>
<li>vocabulary &#8211; a RDF vocabulary</li>
<li>software project/tool &#8211; For product/project home pages</li>
</ul>
<div style="margin: 10px 0pt;">If the page describes an organization, it can be tagged as:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>vendor</li>
<li>research</li>
<li>enduser</li>
</ul>
<div style="margin: 10px 0pt;">If the page is a person&#8217;s home page or blog or similar, it could be:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>opinionleader</li>
<li>researcher</li>
<li>journalist</li>
<li>executive</li>
<li>geek</li>
</ul>
<div style="margin: 10px 0pt;">The type of audience can also be tagged, for example:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>general public</li>
<li>beginners</li>
<li>technicians</li>
<li>researchers.</li>
</ul>
</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto11" name="onto11"></a>11] The <a href="http://www.oasis-open.org/home/index.php">OASIS</a> Topic Maps Published Subjects Technical Committee was formed a number  of years back to promote Topic Maps interoperability through the use of  Published Subjects Indicators (PSIs).  Their <a href="http://www.oasis-open.org/committees/download.php/3050/pubsubj-pt1-1.02-cs.pdf">resulting report</a> was a very interesting effort that unfortunately did not lead to wide  adoption, perhaps because the effort was a bit ahead of its time or it  was in advance of the broader acceptance of RDF.  This general topic is  the subject of a<a href="..//?p=375"> later posting by me</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[12<a title="onto12" name="onto12"></a>] See further, Leo Obrst, &#8220;The Semantic Spectrum &amp; Semantic Models,&#8221; a Powerpoint presentation (<a href="http://ontolog.cim3.net/file/resource/presentation/LeoObrst_20060112/OntologySpectrumSemanticModels--LeoObrst_20060112.ppt">http://ontolog.cim3.net/file/resource/presentation/LeoObrst_20060112/OntologySpectrumSemanticModels&#8211;LeoObrst_20060112.ppt</a>)<br />
 made as part of an Ontolog Forum (<a href="http://ontolog.cim3.net/">http://ontolog.cim3.net/</a>) presentation in two parts, &#8220;What is an Ontology? &#8211; A Briefing on the Range of Semantic Models&#8221; (see <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2006_01_12">http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2006_01_12</a>), in January 2006.  Leo Obrst is a principal artificial intelligence scientist at MITRE&#8217;s (<a href="http://www.mitre.org/">http://www.mitre.org</a>)  Center for Innovative Computing and Informatics and a co-convener of  the Ontolog Forum.  His presentation is a rich source of practical  overview information on ontologies.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[13<a title="onto13" name="onto13"></a>] The actual diagram is an unattributed modification by Dan McCreary (see <a href="http://www.danmccreary.com/presentations/sem_int/sem_int.ppt">http://www.danmccreary.com/presentations/sem_int/sem_int.ppt</a>) based on Obrst&#8217;s material in [12].</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Seven Pillars of the Open Semantic Enterprise</title>
		<link>http://www.mkbergman.com/859/seven-pillars-of-the-open-semantic-enterprise/</link>
		<comments>http://www.mkbergman.com/859/seven-pillars-of-the-open-semantic-enterprise/#comments</comments>
		<pubDate>Tue, 12 Jan 2010 20:26:54 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Description Logics]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Ontology Best Practices]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[Web-oriented Architecture]]></category>
		<category><![CDATA[adaptive ontologies]]></category>
		<category><![CDATA[ontology-driven apps]]></category>
		<category><![CDATA[open semantic enterprise]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>
		<category><![CDATA[web oriented architecture]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=859</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Seven Pillars of the <i>Open Semantic Enterprise</i>&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=Linked Data&amp;rft.subject=Ontologies&amp;rft.subject=Ontology Best Practices&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Dynamics&amp;rft.subject=Web-oriented Architecture&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-01-12&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/859/seven-pillars-of-the-open-semantic-enterprise/&amp;rft.language=English"></span>
Guideposts for How to Make the Transition The beginning of a new year and a new decade is a perfect opportunity to take stock of how the world is changing and how we can change with it. Over the past year I have been writing on many foundational topics relevant to the use of semantic [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Seven Pillars of the <i>Open Semantic Enterprise</i>&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=Linked Data&amp;rft.subject=Ontologies&amp;rft.subject=Ontology Best Practices&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Dynamics&amp;rft.subject=Web-oriented Architecture&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-01-12&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/859/seven-pillars-of-the-open-semantic-enterprise/&amp;rft.language=English"></span>
<p><img style="border: 0px solid; width: 250px; height: 211px; float: left; margin-right: 10px;" title="Seven Pillars of the Open Semantic Enterprise" src="../wp-content/themes/ai3/images/2010Posts/100110_7pillars.png" alt="Seven Pillars of the Open Semantic Enterprise" align="left" /></p>
<h2>Guideposts for How to Make the Transition</h2>
<p>The beginning of a new year and a new decade is a perfect opportunity         to take stock of how the world is changing and how we can change with         it. Over the past year I have been writing on many foundational topics         relevant to the use of semantic technologies in enterprises.</p>
<p>In this post I bring those threads together to present a unified view         of these foundations &#8212; some seven pillars &#8212; to the <span style="font-weight: bold; font-style: italic;">open semantic         enterprise</span>.</p>
<p>By <span style="font-weight: bold; font-style: italic;">open semantic         enterprise</span> we mean an organization that uses the languages and         standards of the <a href="http://en.wikipedia.org/wiki/Semantic_Web">semantic Web</a>, including         <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a>,         <a href="http://en.wikipedia.org/wiki/RDF_Schema">RDFS</a>, <a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a>, <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL</a> and <a href="http://en.wikipedia.org/wiki/Semantic_Web#Components">others</a> to integrate existing information assets,         using the best practices of <a href="http://en.wikipedia.org/wiki/Linked_Data">linked data</a> and the <a href="http://en.wikipedia.org/wiki/Open_world_assumption">open         world assumption</a>, and targeting knowledge management applications. It         does so using some or all of the seven foundational pieces (&#8220;pillars&#8221;)         noted herein.</p>
<p>The foundational approaches to the open semantic enterprise do not necessarily mean open data nor open source (though they are suitable for these purposes with many open source tools available <a href="#ose3">[3]</a>). The techniques can equivalently be applied to internal, closed, proprietary data and structures. The techniques can themselves be used as a basis for bringing external information into the enterprise. &#8216;Open&#8217; is in reference to the critical use of the open world assumption.</p>
<p>These practices do not require replacing current systems and assets;         they can be applied equally to public or proprietary information; and         they can be tested and deployed incrementally at low risk and cost. The         very foundations of the practice encourage a learn-as-you-go approach         and active and agile adaptation. While embracing the open semantic         enterprise can lead to quite disruptive benefits and changes, it can be         accomplished as such with minimal disruption in itself. This is its         most compelling aspect.</p>
<p>Like any change in practice or learning, embracing the open semantic         enterprise is fundamentally a people process. This is the pivotal piece         to the puzzle, but also the one that does not lend itself to ready         formula about pillars or best practices. Leadership and vision is         necessary to begin the process. People are the fuel for impelling it.         So, we&#8217;ll take this fuel as a given below, and concentrate instead on         the mechanics and techniques by which this vision can be achieved. In         this sense, then, there are really <span style="font-style: italic; text-decoration: underline;">eight</span> pillars         to the open semantic enterprise, with people residing at the apex.</p>
<p>This article is synthetic, with links to (largely) my preparatory blog         postings and topics that preceded it. Assuming you are interested in         becoming one of those leaders who wants to bring the benefits of an         open semantic enterprise to your organization, I encourage you to         follow the reference links for more background and detail.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar0.png" alt="Benefits" /> A Review of the Benefits</h3>
<p>OK, so what&#8217;s the big deal about an open semantic enterprise and why         should my organization care?</p>
<p>We should first be clear that the natural scope of the open semantic         enterprise is in knowledge management and representation <a href="#ose1">[1]</a>. Suitable         applications include data federation, data warehousing, search,         enterprise information integration, business intelligence, competitive         intelligence, knowledge representation, and so forth <a href="#ose2">[2]</a>. In the         knowledge domain, the benefits for embracing the open semantic         enterprise can be summarized as <span class="double_u">greater insight</span> with <span class="double_u">lower         risk</span>, <span class="double_u">lower cost</span>, <span class="double_u">faster deployment</span>, and more <span class="double_u">agile responsiveness</span>.</p>
<p>The intersection of knowledge domain, semantic technologies and the         approaches herein means it is possible to start small in testing the         transition to a semantic enterprise. These efforts can be done         incrementally and with a focus on early, high-value applications and         domains.</p>
<p>There is absolutely no need to abandon past practices. There         is much that can be done to leverage existing assets. Indeed, those         prior investments are often the requisite starting basis to inform         semantic initiatives.</p>
<p>Embracing the pillars of the open semantic enterprise brings these knowledge management benefits:</p>
<ul>
<li>Domains can be analyzed and inspected incrementally</li>
<li>Schema can be incomplete and developed and refined incrementally</li>
<li>The data and the structures within these frameworks can be used and         expressed in a piecemeal or incomplete manner</li>
<li>Data with partial characterizations can be combined with other data         having complete characterizations</li>
<li>Systems built with these frameworks are flexible and robust; as new         information or structure is gained, it can be incorporated without         negating the information already resident, and</li>
<li>Both open and closed world subsystems can be bridged.</li>
</ul>
<p>Moreover, by building on successful Web architectures, we can also put         in place loosely coupled, distributed systems that can grow and         interoperate in a decentralized manner. These also happen to be perfect         architectures for flexible collaboration systems and networks.</p>
<p>These benefits arise both from individual pillars in the open semantic         enterprise foundation, as well as in the interactions between them.         Let&#8217;s now re-introduce these seven pillars.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar1.png" alt="Pillar #1" />Pillar         #1: The RDF Data Model</h3>
<p>As I stated on the occasion of the 10th birthday of the <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">Resource         Description Framework</a> data model, I belief RDF is the single most         important foundation to the open semantic enterprise <a href="#ose4">[4]</a>. RDF can be         applied equally to all structured, semi-structured and unstructured         content. By defining new types and predicates, it is possible to create         more expressive vocabularies within RDF. This expressiveness enables         RDF to define controlled vocabularies with exact semantics. These         features make RDF a powerful data model and language for data         federation and interoperability across disparate datasets.</p>
<p>Via various processors or extractors, RDF can capture and convey the         metadata or information in unstructured (say, text), semi-structured         (say, HTML documents) or structured sources (say, standard databases).         This makes RDF almost a “universal solvent” for         representing data structure.</p>
<p>Because of this universality, there are now more than 150 off-the-shelf         ‘RDFizers’ for converting various non-RDF notations (data         formats and serializations) to RDF <a href="#ose5">[5]</a>. Because of its diversity of         serializations and simple data model, it is also easy to create new         converters. Once in a common RDF representation, it is easy to         incorporate new datasets or new attributes. It is also easy to         aggregate disparate data sources as if they came from a single source.         This enables meaningful compositions of data from different applications         regardless of format or serialization.</p>
<p>What this practically means is that the integration layer can be based         on RDF, but that all source data and schema can still reside in their         native forms <a href="#ose6">[6]</a>. If it is easier or more convenient to author,         transfer or represent data in non-RDF forms, great <a href="#ose7">[7]</a>. RDF is only         necessary at the point of federation, and not all knowledge workers         need be versed in the framework.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar2.png" alt="Pillar #2" /> Pillar #2: Linked Data Techniques</h3>
<p>Linked data is a set of best practices for publishing and deploying         instance and class data using the RDF data model. Two of the best         practices are to name the data objects using uniform resource         identifiers (URIs), and to expose the data for access via the HTTP         protocol. Both of these practices enable the Web to become a         distributed database, which also means that Web architectures can also         be readily employed (see Pillar #5 below).</p>
<p>Linked data is applicable to public or enterprise data, open or         proprietary. It is really straightforward to employ. Structured         Dynamics has published a <a href="http://structureddynamics.com/linked_data.html">useful FAQ</a> on         linked data.</p>
<p>Additional linked data best practices relate to how to characterize and         classify data, especially in the use of predicates with the proper         semantics for establishing the degree of relatedness for linked data         items from disparate sources.</p>
<p>Linked data has been a frequent topic of this blog, including how         adding linkages creates value for existing data, with a four-part         series about a year ago on linked data best practices <a href="#ose8">[8]</a>. As advocated         by Structured Dynamics, our linked data best practices are geared to         data interconnections, interrelationships and context that is equally         useful to both humans and machine agents.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar3.png" alt="Pillar #3" /> Pillar #3: Adaptive Ontologies</h3>
<p>Ontologies are the guiding structures for how information is         interrelated and made coherent using RDF and its related schema and         ontology vocabularies, <a href="http://en.wikipedia.org/wiki/RDF_Schema">RDFS</a> and <a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> <a href="#ose10">[10]</a>.         Thousands of off-the-shelf ontologies exist &#8212; a minority of which are         suitable for re-use &#8212; and new ones appropriate to any domain or scope         at hand can be readily constructed.</p>
<p>In standard form, semantic Web ontologies may range from the small and         simple to the large and complex, and may perform the roles of defining         relationships among concepts, integrating instance data, orienting to         other knowledge and domains, or mapping to other schema <a href="#ose11">[11]</a>. These are         explicit uses in the way that we construct ontologies; we also believe         it is important to keep concept definitions and relationships expressed         separately from instance data and their attributes <a href="#ose9">[9]</a>.</p>
<p>But, in addition to these standard roles, we also look to ontologies to         stand on their own as guiding structures for ontology-driven         applications (see next pillar). With a relatively few minor and new         best practices, ontologies can take on the double role of informing         user interfaces in addition to standard information integration.</p>
<p>In this vein we term our structures <span style="font-style: italic;">adaptive ontologies</span> [<a href="#ose11">11</a>,<a href="#ose12">12</a>,<a href="#ose13">13</a>]. Some of         the user interface considerations that can be driven by adaptive         ontologies include: attribute labels and tooltips; navigation and         browsing structures and trees; menu structures; auto-completion of         entered data; contextual dropdown list choices; spell checkers; online         help systems; etc. Put another way, what makes an ontology adaptive is         to supplement the standard machine-readable purpose of ontologies to         add human-readable labels, synonyms, definitions and the like.</p>
<p>A neat trick occurs with this slight expansion of roles. The knowledge         management effort can now shift to the actual description, nature and         relationships of the information environment. In other words,         ontologies themselves become the focus of effort and development. The         KM problem no longer needs to be abstracted to the IT department or         third-party software. The actual concepts, terminology and relations         that comprise coherent ontologies now become the explicit focus of KM         activities.</p>
<p>Any existing structure (or multiples thereof) can become a starting         basis for these ontologies and their vocabularies, from spreadsheets to         naïve data structures and lists and taxonomies. So, while producing an         operating ontology that meets the best practice thresholds noted herein         has certain requirements, kicking off or contributing to this process         poses few technical or technology demands.</p>
<p>The skills needed to create these adaptive ontologies are logic,         coherent thinking and domain knowledge. That is, any subject matter         expert or knowledge worker likely has the necessary skills to         contribute to useful ontology development and refinement. With adaptive         ontologies powering ontology-driven apps (see next), we thus see a shift         in roles and responsibilities away from IT to the knowledge workers         themselves. This shift acts to democratize the knowledge management         function and flatten the organization.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar4.png" alt="Pillar #4" /> Pillar #4: Ontology-driven Applications</h3>
<p>The complement to adaptive ontologies are <span style="font-style: italic;">ontology-driven applications</span>. By         definition, ontology-driven apps are modular, generic software         applications designed to operate in accordance with the specifications         contained in an adaptive ontology. The relationships and structure of         the information driving these applications are based on the standard         functions and roles of ontologies, as supplemented by the human and         user interface roles noted above [<a href="#ose11">11</a>,<a href="#ose12">12</a>,<a href="#ose13">13</a>].</p>
<p>Ontology-driven apps fulfill specific generic tasks. Examples of         current ontology-driven apps include imports and exports in various         formats, dataset creation and management, data record creation and         management, reporting, browsing, searching, data visualization, user         access rights and permissions, and similar. These applications provide         their specific functionality in response to the specifications in the         ontologies fed to them.</p>
<p>The applications are designed more similarly to widgets or API-based         frameworks than to the dedicated software of the past, though the         dedicated functionality (<span style="font-style: italic;">e.g.</span>,         graphing, reporting, etc.) is obviously quite similar. The major change         in these ontology-driven apps is to accommodate a relatively common         abstraction layer that responds to the structure and conventions of the         guiding ontologies. The major advantage is that single generic         applications can supply shared functionality based on any properly         constructed adaptive ontology.</p>
<p>This design thus limits software brittleness and maximizes software         re-use. Moreover, as noted above, it shifts the locus of effort from         software development and maintenance to the creation and modification         of knowledge structures. The KM emphasis can shift from programming and         software to logic and terminology <a href="#ose12">[12]</a>.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar5.png" alt="Pillar #5" /> Pillar #5: A Web-oriented Architecture</h3>
<p>A Web-oriented architecture (WOA) is a subset of the <a href="http://en.wikipedia.org/wiki/Service-oriented_architecture">service-oriented         architectural</a> (SOA) style, wherein discrete functions are packaged         into modular and shareable elements (”services”) that are         made available in a distributed and loosely coupled manner. WOA uses         the representational state transfer (REST) style. REST provides         principles for how resources are defined and used and addressed with         simple interfaces without additional messaging layers such as <a href="http://en.wikipedia.org/wiki/SOAP">SOAP</a> or <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPC</a>. The         principles are couched within the framework of a generalized         architectural style and are not limited to the Web, though they are a         foundation to it <a href="#ose14">[14]</a>.</p>
<p>REST and WOA stand in contrast to earlier Web service styles that are         often known by the WS-* acronym (such as <a href="http://en.wikipedia.org/wiki/Web_Services_Description_Language">WSDL</a>,         <a href="http://en.wikipedia.org/wiki/List_of_Web_service_specifications">etc</a>.).         WOA has proven itself to be highly scalable and robust for         decentralized users since all messages and interactions are         self-contained.</p>
<p>Enterprises have much to learn from the Web’s success. WOA has a         simple design with REST and idempotent operations, simple messaging,         distributed and modular services, and simple interfaces. It has a         natural synergy with linked data via the use of URI identifiers and the         HTTP transport protocol. As we see with the explosion of searchable         dynamic databases exposed via the Web, so too can we envision the same         architecture and design providing a distributed framework for data         federation. Our daily experience with browser access of the Web shows         how incredibly diverse and distributed systems can meaningfully         interoperate <a href="#ose15">[15]</a>.</p>
<p>This same architecture has worked beautifully in linking documents; it         is now pointing the way to linking data; and we are seeing but the         first phases of linking people and groups together via meaningful         collaboration. While generally based on only the most rudimentary basis         of connections, today&#8217;s social networking platforms are changing the         nature of contacts and interaction.</p>
<p>The foundations herein provide a basis for marrying data and documents         in a design geared from the ground up for collaboration. These         capabilities are proven and deployable today. The only unclear aspects         will be the scale and nature of the benefits <a href="#ose16">[16]</a>.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar6.png" alt="Pillar #6" /> Pillar #6: An Incremental, Layered Approach</h3>
<p>To this point, you&#8217;ll note that we have been speaking in what are         essentially &#8220;layers&#8221;. We began with existing assets, both internal and         external, in many diverse formats. These are then converted or         transformed into RDF-capable forms. These various sources are then         exposed via a WOA Web services layer for distributed and         loosely-coupled access. Then, we integrate and federate this         information via adaptive ontologies, which then can be searched,         inspected and managed via ontology-driven apps. We have presented this         layered architecture before <a href="#ose13">[13]</a>, and have also expressed this design         in relation to current Structured Dynamics&#8217; products <a href="#ose17">[17]</a>.</p>
<p>A slight update of this layered view is presented below, made even more         general for the purposes of this foundational discussion:</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091213_open_enterprise.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 500px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091213_open_enterprise.png" alt="Open Enterprise Architecture" width="982" height="818" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>Semantic technology does not change or alter the fact that most         activities of the enterprise are transactional, communicative or         documentary in nature. Structured, relational data systems for         transactions or records are proven, performant and understood. On its         very face, it should be clear that the <span style="font-style: italic;">meaning</span> of these activities — their         <span style="font-style: italic;">semantics</span>, if you will —         is by nature an augmentation or added layer to how to conduct the         activities themselves.</p>
<p>This simple truth affirms that semantic technologies are not a starting         basis, then, for these activities, but a way of expressing and         interoperating their outcomes. Sure, some semantic understanding and         common vocabularies at the front end can help bring consistency and a         common language to an enterprise’s activities. This is good         practice, and the more that can be done within reason while not         stifling innovation, all the better. But we all know that the budget         department and function has its own way of doing things separate from         sales or R&amp;D. And that is perfectly OK and natural.</p>
<p>Clearly, then, an obvious benefit to the semantic enterprise is to         federate across existing data silos. This should be an objective of the         first semantic &#8220;layer&#8221;, and to do so in a way that leverages existing         information already in hand. This approach is inherently incremental;         if done right, it is also low cost and low risk.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_pillar7.png" alt="Pillar #7" /> Pillar #7: The Open World Mindset</h3>
<p>As these pillars took shape in our thinking and arguments over the past         year, an illusive piece seemed always to be missing. It was like having         one of those meaningful dreams, and then waking up in the morning         wracking your memory trying to recall that essential, missing insight.</p>
<p>As I most recently wrote <a href="#ose1">[1]</a>, that missing piece for <span style="font-weight: bold; font-style: italic; text-decoration: underline;">this</span> story is the open world assumption (OWA). I argue that this somewhat         obscure concept holds within it the key as to why there have been         decades of too-frequent failures in the enterprise in <a href="http://en.wikipedia.org/wiki/Business_intelligence">business         intelligence</a>, <a href="http://en.wikipedia.org/wiki/Data_warehouse">data warehousing</a>,         <a href="http://en.wikipedia.org/wiki/Data_integration">data         integration</a> and <a href="http://en.wikipedia.org/wiki/Federated_database_system">federation</a>,         and <a href="http://en.wikipedia.org/wiki/Knowledge_management">knowledge         management</a>.</p>
<p>Enterprises have been captive to the mindset of traditional relational         data management and its (most often unstated) <a href="http://en.wikipedia.org/wiki/Closed_World_Assumption">closed world         assumption</a> (CWA). Given the success of relational systems for         transaction and operational systems &#8212; applications for which they are         still clearly superior &#8212; it is understandable and not surprising         that this same mindset has seemed logical for knowledge management         problems as well.  But knowledge and KM are by their nature         incomplete, changing and uncertain. A closed-world mindset carries with         it certainty and logic implications not supportable by real         circumstances.</p>
<p>This is not an esoteric point, but a fundamental one. How one thinks         about the world and evaluates it is pivotal to what can be learned and         how and with what information. Transactions require completeness and         performance; insight requires drawing connections in the face of         incompleteness or unknowns.</p>
<p>The absolute applicability of the semantic Web stack to an open-world         circumstance is the elephant in the room <a href="#ose1">[1]</a>. By itself, the open world mindset         provides no assurance of gaining insight or wisdom. But, absent it, we         place thresholds on information and understanding that may neither be         affordable nor achievable with traditional, closed-world approaches.</p>
<p>And, by either serendipity or some cosmic beauty, the open world         mindset also enables incremental development, testing and refinement.         Even if my basic argument of the open world advantage for knowledge         management purposes is wrong, we can test that premise at low cost and         risk. So, within available budget, pick a doable proof-of-concept, and         decide for yourself.</p>
<h3><img style="vertical-align: middle;" src="../wp-content/themes/ai3/images/2010Posts/100110_7pillars_small.png" alt="Seven Pillars" /> The Foundations for the <span style="font-style: italic;">Open Semantic         Enterprise</span></h3>
<p>The seven pillars above are not magic bullets and each is likely not         absolutely essential. But, based on today&#8217;s understandings and with         still-emerging use cases being developed, we can see our <span style="font-weight: bold; font-style: italic;">open semantic         enterprise</span> as resulting from the interplay of these seven         factors:</p>
<div style="margin: 10px;"><img class="center_ok" style="border: 0px solid; width: 414px; height: 404px;" title="Seven Pillars of the Open Semantic Enterprise" src="http://mkbergman.com/wp-content/themes/ai3/images/2010Posts/100110_ose.png" alt="Open Semantic Enterprise" width="414" height="404" /></div>
<p>Thirty years of disappointing knowledge management projects and much         wasted money and effort compel that better ways must be found. On         the other hand, until recently, too much of the semantic Web discussion         has been either revolutionary (<span style="font-style: italic;">&#8220;change everything!!&#8221;</span>) or argued from         pie-in-the-sky bases. Something needs to give.</p>
<p>Our work over the past few years &#8212; but especially as focused in the         last 12 months &#8212; tells us that meaningful semantic Web initiatives can         be mounted in the enterprise with potentially huge benefits, all at         manageable risks and costs. These seven pillars point to way to how         this might happen. What is now required is that eighth pillar &#8212; you.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose1"></a> [1] See, M.K. Bergman, 2009. <a href="../852/the-open-world-assumption-elephant-in-the-room/"> &#8220;The Open World Assumption: Elephant in the Room</a>&#8220;, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog,         December 21, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose2"></a> [2] In most instances, semantic technologies are poorly suited to         transactional or operational applications. Also, there are instances in         modeling specific closed-world domains where ontologies can be quite         useful, such as in aerospace, petrochemicals, engineering, etc., where         the scope of the domain can be precisely bounded and defined. Such         efforts tend to be high cost with lengthy lead times. There are vendors         who support efforts in these areas, though my company, <a href="http://structureddynamics.com/">Structured Dynamics</a>, does not. Our         focus and the more generally suitable case for semantic technologies we         believe is in knowledge representation and management.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose3"></a> [3] The standard <a style="font-weight: bold; font-style: italic; color: #990000;" href="../new-version-sweet-tools-sem-web/">Sweet         Tools</a> listing on my <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive         Information</span></a> blog contains more than 800 semantic Web and         -related tools, most of which are open source, which can be inspected         via filtered and faceted search.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose4"></a> [4] See, M.K. Bergman, 2009. <a href="../483/advantages-and-myths-of-rdf/">&#8220;Advantages         and Myths of RDF&#8221;</a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive         Information</span></a> blog, April 8, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose5"></a> [5] For example, see this listing of more than 150 specific <a href="http://openstructs.org/resources/rdfizers">format options</a> available as open source. These converters can also work directly with         major application APIs.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose6"></a> [6] For an expansion on RDF as a canonical data model, see further M.K.         Bergman, 2009. <a href="../533/structure-the-world/">&#8220;Structure the         World&#8221;</a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive         Information</span></a> blog, August 3, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose7"></a> [7] For example, for dataset authoring, Structured Dynamics has         developed <a href="http://openstructs.org/iron"><span style="font-style: italic; font-weight: bold;">irON</span></a>, an instance         record and object notation that can be serialized as JSON (called         <span style="font-style: italic;">irJSON</span>), XML (called         <span style="font-style: italic;">irXML</span>) or comma-separated         values (or CSV comma-delimited files, called <span style="font-style: italic;">commON</span>). The purpose of these notations is         to provide easier authoring environments and scripting support to         RDF-ready datasets. The advantage is to shield users from the nuances         of RDF. The design of <span style="font-style: italic;">commON</span> is especially geared to using spreadsheets as authoring environments         for instance record tables or simple outline structures.  See         further the <a href="http://openstructs.org/iron/iron-specification"><span style="font-style: italic; font-weight: bold;">irON</span> specification</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose8"></a> [8] For a general listing of linked data articles, please see <a href="../category/linked-data/">that category</a> on         this <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive         Information</span></a> blog. Specific articles of interest include the         four-part series on &#8220;Making Linked Data Reasonable Using Description         Logics&#8221; [9] (<a href="../474/making-linked-data-reasonable-using-description-logics-part-1/">February         11</a>, <a href="../476/making-linked-data-reasonable-using-description-logics-part-2/"> February 15</a>, <a href="../477/making-linked-data-reasonable-using-description-logics-part-3/"> February 18</a> and <a href="../478/making-linked-data-reasonable-using-description-logics-part-4/"> February 23</a>, 2009) and the <a href="../837/the-law-of-linked-data/">&#8220;The Law of         Linked Data&#8221;</a> (October 11, 2009).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose9"></a> [9] Our best practices approach makes explicit splits between the         &#8220;<a href="http://en.wikipedia.org/wiki/Abox">ABox</a>&#8221; (for instance         data) and “<a href="http://en.wikipedia.org/wiki/Tbox">TBox</a>” (for ontology         schema) in accordance with our <a title="Permanent Link to Thinking ?Inside the Box? with Description Logics" href="../466/thinking-inside-the-box-with-description-logics/"> working definition</a> for <a href="http://en.wikipedia.org/wiki/Description_logics">description         logics</a>, a fundamental underpinning for how we use RDF:</p>
<div class="boxGraySolid">&#8220;Description logics and their semantics traditionally split           <span style="font-style: italic;">concepts</span> and their           relationships from the different treatment of <span style="font-style: italic;">instances</span> and their attributes and           roles, expressed as fact assertions. The concept split is known as           the TBox (for <em>terminological</em> knowledge, the basis for           <span style="font-style: italic;">T</span> in <span style="font-style: italic;">TBox</span>) and represents the schema or           taxonomy of the domain at hand. The TBox is the structural and           intensional component of conceptual relationships. The second split           of instances is known as the ABox (for <span style="font-style: italic;">assertions</span>, the basis for <span style="font-style: italic;">A</span> in <span style="font-style: italic;">ABox</span>) and describes the attributes of           instances (and individuals), the roles between instances, and other           assertions about instances regarding their class membership with the           TBox concepts.&#8221;</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose10"></a> [10] Those unfamiliar with the term <span style="font-style: italic;">ontology</span> might be interested in my first         introduction to the subject: M.K. Bergman, 2007. <a href="../374/an-intrepid-guide-to-ontologies/"><span style="font-style: italic;"> &#8220;</span>An Intrepid Guide to Ontologies<span style="font-style: italic;">&#8220;</span></a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog, May         16, 2007.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose11"></a> [11] See M.K. Bergman, 2009. <a href="../492/ontology-best-practices-for-data-driven-applications-part-3/"> <span style="font-style: italic;">&#8220;</span>Ontologies as the         ‘Engine’ for Data-Driven Applications<span style="font-style: italic;">&#8220;</span></a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog, June         10, 2009. This is the most detailed explanation, but the specific term         <span style="font-style: italic;">adaptive ontology</span> was not yet         used. The first dedicated focus on adaptive ontologies was in <a href="../553/confronting-misconceptions-with-adaptive-ontologies/"> &#8220;Confronting Misconceptions with Adaptive Ontologies&#8221;</a> (August 17,         2009). See also [12] and [13].</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose12"></a> [12] See, M.K. Bergman, 2009. <a href="../847/ontology-driven-applications-using-adaptive-ontologies/"> &#8220;Ontology-driven Applications Using Adaptive Ontologies&#8221;</a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog,         November 23, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose13"></a> [13] See, M.K. Bergman, 2009. <a href="../825/fresh-perspectives-on-the-semantic-enterprise/"> &#8220;Fresh Perspectives on the Semantic Enterprise&#8221;</a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog,         September 28, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose14"></a> [14] See, M.K. Bergman, 2009. <a href="../486/a-general-web-oriented-architecture-woa-for-structured-data/"> &#8220;A General Web-oriented Architecture (WOA) for Structured Data&#8221;</a>,         <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog, May         3, 2009. Also, see the related <a href="../category/web-oriented-architecture-woa/">WOA         category</a> for other articles in this area.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose15"></a> [15] See, M.K. Bergman, 2008. <a href="../459/woa-a-new-enterprise-partner-for-linked-data/"> &#8220;WOA: A New Enterprise Partner for Linked Data&#8221;</a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog,         October 12, 2008.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose16"></a> [16] See, M.K. Bergman, 2009. <a href="../497/structwsf-a-framework-for-collaboration-networks/"> &#8220;structWSF: A Framework for Collaboration Networks&#8221;</a>, <a style="font-weight: bold;" href="http://mkbergman.com/"><span style="font-style: italic;">AI3:::Adaptive Information</span></a> blog, July         2, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="ose17"></a> [17] See <a href="http://structureddynamics.com/products.html">http://structureddynamics.com/products.html</a> for a general descriptive illustration of Structured Dynamics&#8217; product         stack. There is also a longer <a href="http://www.slideshare.net/mkbergman/structured-dynamicss-semantic-technologies-product-stack"> slideshow</a>, with particular reference to slide #37.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/859/seven-pillars-of-the-open-semantic-enterprise/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>The Open World Assumption: Elephant in the Room</title>
		<link>http://www.mkbergman.com/852/the-open-world-assumption-elephant-in-the-room/</link>
		<comments>http://www.mkbergman.com/852/the-open-world-assumption-elephant-in-the-room/#comments</comments>
		<pubDate>Tue, 22 Dec 2009 04:20:14 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Description Logics]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[closed world assumption]]></category>
		<category><![CDATA[cwa]]></category>
		<category><![CDATA[knowledge management]]></category>
		<category><![CDATA[open world assumption]]></category>
		<category><![CDATA[owa]]></category>
		<category><![CDATA[owl]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[relational model]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=852</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=The Open World Assumption: Elephant in the Room&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-12-21&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/852/the-open-world-assumption-elephant-in-the-room/&amp;rft.language=English"></span>
OWA Enables Incremental, Low-risk Wins for the Semantic Enterprise In speaking of the semantic Web, it is not infrequent that the open world assumption (OWA) gets mentioned. What this post argues is that this somewhat obscure concept may hold within it the key as to why there have been decades of too-frequent failures in the [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=The Open World Assumption: Elephant in the Room&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-12-21&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/852/the-open-world-assumption-elephant-in-the-room/&amp;rft.language=English"></span>
<p><img style="border: 0px solid; width: 250px; height: 276px; margin-right: 10px;" title="Open World" src="../wp-content/themes/ai3/images/2009Posts/091221_open_globe_elephant.png" alt="Open World" width="367" height="405" align="left" /></p>
<h2>OWA Enables Incremental, Low-risk Wins for the Semantic Enterprise</h2>
<p>In speaking of the <a href="http://en.wikipedia.org/wiki/Semantic_web">semantic Web</a>, it is not         infrequent that the <a href="http://en.wikipedia.org/wiki/Open_world_assumption">open world         assumption</a> (OWA) gets mentioned. What this post argues is that this         somewhat obscure concept may hold within it the key as to why there         have been decades of too-frequent failures in the enterprise in         <a href="http://en.wikipedia.org/wiki/Business_intelligence">business         intelligence</a>, <a href="http://en.wikipedia.org/wiki/Data_warehouse">data warehousing</a>,         <a href="http://en.wikipedia.org/wiki/Data_integration">data         integration</a> and <a href="http://en.wikipedia.org/wiki/Federated_database_system">federation</a>,         and <a href="http://en.wikipedia.org/wiki/Knowledge_management">knowledge         management</a>.</p>
<p>This is a fairly bold assertion. In order to support it, we first need         to look to the logic and mindset assumptions associated with         traditional relational data management and the semantic Web. We then         need to look to the nature of knowledge itself and its relation to data         federation. It is in this intersection that the key of decades of         faulty premises may reside.</p>
<p>The main argument is that the <a href="http://en.wikipedia.org/wiki/Closed_World_Assumption">closed world         assumption</a> (CWA) and its prevalent mindset in traditional database         systems have hindered the ability of enterprises and the vendors that         support them to adopt incremental, low-risk means to knowledge systems         and management. CWA, in turn, has led to over-engineered schema,         too-complicated architectures and massive specification efforts that         have led to high deployment costs, blown schedules and brittleness.</p>
<p>The good news is that abandoning these failed practices and embracing         the open world approach can be done immediately based on existing         assets. Simply shifting from the closed world to open world premise         can, I argue, improve the odds for enterprise IT success in these         areas.</p>
<p>It is time to meet the elephant in the room.</p>
<h3>Scope and Some Root Causes of Enterprise IT Failures</h3>
<p>It is, of course, a bit of editorial hyperbole to label most enterprise         initiatives in business intelligence and knowledge management as being         failures over the past few decades. And, insofar as failures have         occurred, I also do not believe they are the result of vendor greed or         cynicism, or IT management mistakes or incompetence. Rather, I believe         the fault resides in the attempt to pound a square peg (relational         model) into a round hole (knowledge representation).</p>
<p>The scope of these failures is not known. We have seen anecdotal claims         of trillions of dollars in annual loses due to IT project failures         worldwide; failure rates for major IT projects in the 65% to 80%         ranges; and analysis of waste and failures in individual firms that are         fairly eye-popping <a href="#owa1">[1]</a>. The real point of this post is not to try to         quantify these problems. However, in my many years within IT it has         been a common perception and concern that many &#8212; if not most &#8212;         large-scale information technology deployments have disappointed in one         way or another.</p>
<p>These disappointments range from cost overruns, to late delivery, to         unmet objectives, or to low user acceptance. Many initiatives are         simply cancelled before any such metrics can be documented. Whatever         the absolute quantification, I think most experienced IT managers and         executives would agree that these failures and disappointments have         been all too commonplace.</p>
<div class="boxGreenDotted" style="margin: 5px 0pt 5px 10px; float: right; text-align: center; width: 400px; font-style: italic; color: #666666; font-weight: bold; font-size: 110%;">“Business       Intelligence projects are famous for low success rates, high costs and       time overruns. The economics of BI are visibly broken, and have been for       years. Yet BI remains the #1 technology priority according to       Gartner.”<span style="font-size: x-small;"><a href="#owa2">[2]</a></span></div>
<p>Why might this be?</p>
<p>I truly believe the reasons for these disappointments do not reside in         bad faith or incompetence. The potential importance of IT knowledge         projects to improve competitive position, lower costs, or aid         innovation for new markets is understood by all. <a href="http://en.wikipedia.org/wiki/Dilbert">Dilbert</a> aside, I find it         simply incomprehensible that disappointments or failures are rooted in         these causes.</p>
<p>Rather, I suspect the root cause resides in the success of the         relational model in the enterprise.</p>
<p>As transaction systems and for modeling narrowly bound and structured         domains (such as products, inventory or customer lists), the relational         model and its proven and optimized RDBMs and SQL query language have         been resounding successes. It is natural to take a successful approach         and try to extend it to other areas.</p>
<p>However, beginning with data warehouses in the 1980s, business         intelligence (BI) systems in the 1990s, and the general issue of most         enterprise information being bound up in documents for decades, the         application of the relational model to these areas has been         disappointing.</p>
<p>The reasons for this do not reside in areas such as storage or         hardware; these areas have seen remarkable improvements over the         decades. Rather, the problem resides in the nature of the relational         model itself, and its lack of suitability to knowledge-based problems.</p>
<h3>Technical Aspects of OWA, Broadly Defined</h3>
<p>I have noted the importance of the open world assumption to the         semantic enterprise in many of my more recent posts [<a href="#owa3">3</a>,<a href="#owa4">4</a>]. But I, like         many others, often refer to the open world assumption with facile         summaries such as it means that a lack of information does not imply         the missing information to be false. Yet to fully understand the         implications of OWA and many of its associated assumptions, it is         necessary to delve deeper.</p>
<p>I am using here a shorthand that poses the closed world assumption         (CWA) <span style="font-style: italic;">vs.</span> the open world         assumption (OWA). Actually, the data models behind these approaches         (<a href="http://en.wikipedia.org/wiki/Datalog">Datalog</a> or <a href="http://en.wikipedia.org/wiki/Non-monotonic_logic">non-monotonic         logic</a> in the case of CWA; <a href="http://en.wikipedia.org/wiki/Monotonic#Monotonic_logic">monotonic</a> in the case of OWA <a href="#owa5">[5]</a>; OWA is also firmly grounded in description         logics <a href="#owa4">[4]</a>) tend be coupled with a few other assumptions. I use the         shorthand of relational approach <span style="font-style: italic;">vs</span>. (open) semantic Web approach to         contrast these two models.</p>
<p>There are instances where the relational model can embrace the open         world assumption (for example, the <a href="http://en.wikipedia.org/wiki/Null_%28SQL%29">null in SQL</a>) and         there are instances where semantic Web approaches can be closed world         (as with frame logic or Prolog or other special considerations; see         conclusion). But, as generally applied and as generally understood,         this contrast between typical relational practice and the semantic Web         (based on RDF and OWL) tends to hold.</p>
<p>From a theoretical standpoint, I have found the treatment of         Patel-­Schneider and Horrocks <a href="#owa6">[6]</a> to be most useful in comparing these         approaches. However, the <span style="font-style: italic;">Description         Logics Handbook</span> and some other varied sources are also helpful         [<a href="#owa7">7</a>,<a href="#owa5">5</a>]. Much of the technical aspects summarized in the table below are         from these sources; I refer you to these sources for more informed         technical discussions:</p>
<table class="center_ok" style="text-align: left; width: 620px;" border="1" cellspacing="0" cellpadding="5">
<tbody>
<tr>
<td style="vertical-align: top; font-weight: bold; font-size: 100%; width: 300px; text-align: center; background-color: #ffffcc;">Relational Approach</td>
<td style="vertical-align: top; font-weight: bold; font-size: 100%; width: 300px; text-align: center; background-color: #ffffcc;">(Open) Semantic Web Approach</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Closed World Assumption (CWA)</p>
<p style="font-size: 90%;">That which is not known to be true is presumed to be false; it                 needs to be explicitly stated as true. <span style="font-style: italic;">Negation as failure</span> (NAF) is a                 related assumption, since it assumes as false every predicate                 that cannot be proven to be true. Under CWA, any statement not                 known to be true is false.</p>
<p style="font-size: 90%;">Everything is prohibited until it is permitted.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Open World Assumption (OWA)</p>
<p style="font-size: 90%;">The lack of a given assertion or fact being available does not                 imply whether that possible assertion is true or false: it                 simply is not known. In other words, lack of knowledge does not                 imply falsity.</p>
<p style="font-size: 90%;">Everything is permitted until it is prohibited.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Unique Name Assumption (UNA)</p>
<p style="font-size: 90%;">The unique name assumption (UNA) is premised that different                 names always refer to different entities in the world.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Duplicate Labels Allowed</p>
<p style="font-size: 90%;">OWL allows different synonym labels to be used for the same                 object; same names may refer to different objects. Identity                 assertions must be explicitly stated.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Complete Information</p>
<p style="font-size: 90%;">The data system at hand is assumed to be complete. (Missing                 information is often handled via the <a href="http://en.wikipedia.org/wiki/Null_%28SQL%29">null statement in                 SQL</a>, but that has been controversial and contentious in its                 own right.) This is also known as the <span style="font-style: italic;">domain-closure assumption</span>.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Incomplete Information</p>
<p style="font-size: 90%;">A central tenet of OWA is that information is incomplete. A                 corollary is that the attributes of specific objects or                 instances may also be incomplete or partially known.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Single Schema (one world)</p>
<p style="font-size: 90%;">A single schema is necessary to define the scope and                 interpretation of the world (domain at hand).</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Many World Interpretations</p>
<p style="font-size: 90%;">Schema and data instance assertions are kept separate. Multiple                 interpretations (worlds) for the same data are possible.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Integrity Constraints</p>
<p style="font-size: 90%;">Integrity constraints prevent “incorrect” values                 from being asserted in the relational model. It is useful for                 validation/parsing/data input and is related to the single                 model that contains only the facts asserted. Strict cardinality                 is used for checking validation.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Logical Axioms (restrictions)</p>
<p style="font-size: 90%;">Logical axioms provide restrictions through property domains                 and ranges. Everything can be true unless proven otherwise, and                 multiple possible models can satisfy the axioms. This provides                 more powerful inferencing, though can also be unintuitive at                 times. Cardinality and range restrictions exhibit different                 behavior for objects (inferred) or datatypes.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Non-monotonic Logic</p>
<p style="font-size: 90%;">The set of conclusions warranted on the basis of a given                 knowledge base does not increase (in fact, it likely shrinks)                 with the size of the knowledge base <a href="#owa5">[5]</a>.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Monotonic Logic</p>
<p style="font-size: 90%;">The hypotheses of any derived fact may be freely extended with                 additional assumptions. Additional assertions tend to reduce                 the inferences or entailments that can be applied. A new piece                 of knowledge cannot reduce what is known <a href="#owa5">[5]</a>. New knowledge can                 arise through inference.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Fixed and Brittle</p>
<p style="font-size: 90%;">Changing the schema requires re-architecting the database; not                 inherently extensible.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Reusable and Extensible</p>
<p style="font-size: 90%;">Designed from the ground up to reuse existing ontologies                 (axioms) and to be extensible. Database design and management                 can be more agile, with schema evolving incrementally.</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Flat Structure; Strong Typing</p>
<p style="font-size: 90%;">Information organized into flat tables; linkages and                 connections between tables based on foreign keys or joins.                 Strong data typing orientation.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Graph Structure; Open Typing</p>
<p style="font-size: 90%;">Inherent graph structure, supporting of linkage and                 connectivity analysis. Datatypes are inherently loose, though                 axioms can add strong types. Datatypes treated in the same way                 as classes, and datatype values are treated in the same way as                 individual identiers (<span style="font-style: italic;">i.e.</span>, a data value is treated as                 referring to an object).</p>
</td>
</tr>
<tr>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Querying and Tooling</p>
<p style="font-size: 90%;">SQL and query optimizers well developed. Tooling well                 developed. Disjunction not supported; negation must be                 accommodated through approaches such as NAF. Sums and counts                 are easier due to unique name premise. Answer closure (one                 answer passable to a next calculation) is easier than OWA. Most                 tools are not suitable for any arbitrary schema.</p>
</td>
<td style="vertical-align: top;">
<p style="font-weight: bold; text-align: center; font-size: 90%;">Querying and Tooling</p>
<p style="font-size: 90%;">SPARQL and emerging rule languages used for querying;                 performance at scale and with broad distribution a concern.                 Queries require contextual information for proper set                 selection. Negation and disjunction are allowed and are                 powerful constructs. Tools generally less developed. Exciting                 opportunities for <span style="font-style: italic;">ontology-driven applications</span> working against a small set of generic tools.</p>
</td>
</tr>
</tbody>
</table>
<p>In well-characterized or self-contained domains (seats on a plane,         books in a library, customers of a company, products sold via         distribution channels), the traditional relational model works well. A         closed-world assumption is performant for transaction operations with         easier data validation. The number of negative facts about a given         domain is typically much greater than the number of the positive ones.         So, in many bounded applications, the number of negative facts is so         large that their explicit representation can become practically         impossible <a href="#owa7">[7]</a>. In such cases, it is simpler and shorter to state known         &#8220;true&#8221; statements than to enumerate all &#8220;false&#8221; conditions.</p>
<p>However, the relational model is a paradigm where the information must         be complete and it must be described by a single schema. Traditional         databases require an agreement on a schema, which must be made before         data can be stored and queried. The relational model assumes that the         only objects and relationships that exist in the domain are those that         are explicitly represented in the database, and that names uniquely         identify objects in this domain. The result of these assumptions is         that there is a <span style="font-style: italic;">single</span> (canonical) model for relational systems where objects and         relationships are in a one-to-one correspondence with the data in the         database <a href="#owa6">[6]</a>.</p>
<p>This makes CWA and its related assumptions a very poor choice when         attempting to combine information from multiple sources, to deal with         uncertainty or incompleteness in the world, or to try to integrate         internal, proprietary information with external data.</p>
<p>The process of describing an open, semantic Web &#8220;world&#8221; can proceed         incrementally, sequentially asserting new statements or conditions. The         schema in the open semantic Web &#8212; the <span style="font-style: italic;">ontology</span> &#8212; consists of sets of statements         (called axioms) that describe characteristics that must be satisfied by         the ontology designer&#8217;s idea of “reasonable” states of the         world. Formally, such statements correspond to logical sentences, and         an ontology corresponds to a logical theory <a href="#owa6">[6]</a>.</p>
<p>Irregularity and incompleteness are toxic to relational model design.         In the open semantic Web, data that is structured differently can still         be stored together via RDF triple statements (<span style="font-style: italic;">subject</span> &#8211; <span style="font-style: italic;">predicate</span> &#8211; <span style="font-style: italic;">object</span>). For example, OWA allows suppliers         without cities and names to be stored along alongside suppliers with         that information. Information can be combined about similar objects or         individuals even though they have different or non-overlapping         attributes. Duplicate checking now occurs based on the logic of the         system and not unique name evaluations. Data validation in OWA systems         can both become more complicated (via testing against restriction         statements) or partially easier (via inference).</p>
<p>It is interesting to note that the theoretical underpinnings of CWA by         Reiter <a href="#owa8">[8]</a> began to be understood about the same time (1978) that data         federation and knowledge representation (KR) activities also began to         come to the fore. CWA and later work on (for example) default reasoning         <a href="#owa5">[5]</a> appeared to have informed early work in description logics and its         alternative OWA approach. This heavily influenced the development of         the semantic Web languages RDF and OWL. However, the early path toward         KM work based on the relational model also appears to have been set in         this timeframe.</p>
<p>We are still reaping the whirlwind from this unfortunate early choice         of the relational model for KR, KM and BI purposes. Moreover, though         there is quite a bit of theoretical and logical discussion of the         alternative OWA and CWA data models, there are surprisingly few         discussions of what the implications of these models are to the         enterprise. (That is, the elephant in the room.) The next two sections         tackle this gap.</p>
<h3>The Knowledge Management Argument for OWA</h3>
<p>The above should make clear that the relational model and CWA are         appropriate for defined and bounded systems. However, many of the new         <a href="http://en.wikipedia.org/wiki/Knowledge_economy">knowledge         economy</a> challenges are anything but defined and bounded. These         applications all reside in the broad category of <a href="http://en.wikipedia.org/wiki/Knowledge_management">knowledge         management</a> (KM), and include such applications as data federation,         data warehousing, enterprise information integration, business         intelligence, competitive intelligence, knowledge representation, and         so forth.</p>
<p>Let&#8217;s looks at the characteristics of such knowledge systems and why         they are more appropriately modeled through the open world assumption         (OWA) rather than the relational model and CWA:</p>
<ul>
<li style="padding-top: 9px;"> <span style="font-style: italic; font-weight: bold;">Knowledge is           never complete</span> &#8212; gaining and using knowledge is a process,           and is never complete. A completeness assumption around knowledge is           by definition inappropriate</li>
<li style="padding-top: 9px;"> <span style="font-style: italic; font-weight: bold;">Knowledge is           found in structured, semi-structured and unstructured forms</span> &#8212;           structured databases represent only a portion of structured           information in the enterprise (spreadsheets and other non-relational           datastores provide the remainder). Further, general estimates are           that 80% of information available to enterprises reside in documents,           with a growing importance to metadata, Web pages, markup documents           and other semi-structured sources. A proper data model for knowledge           representation should be equally applicable to these various           information forms; the open semantic language of RDF is specifically           designed for this purpose</li>
<li style="padding-top: 9px;"> <span style="font-weight: bold; font-style: italic;">Knowledge can be           found anywhere</span> &#8212; the open world assumption does not imply           open information only. However, it is also just as true that relevant           information about customers, products, competitors, the environment           or virtually any knowledge-based topic can also not be gained via           internal information alone. The emergence of the Internet and the           universal availability and access to mountains of public and shared           information demands its thoughtful incorporation into KM systems.           This requirement, in turn, demands OWA data models</li>
<li style="padding-top: 9px;"> <span style="font-weight: bold; font-style: italic;">Knowledge           structure evolves with the incorporation of more information</span> &#8212; our ability to describe and understand the world or our problems           at hand requires inspection, description and definition.           Birdwatchers, botanists and experts in all domains know well how           inspection and study of specific domains leads to more discerning           understanding and &#8220;seeing&#8221; of that domain. Before learning,           everything is just a shade of green or a herb, shrub or tree to the           incipient botanist; eventually, she learns how to discern entire           families and individual plant species, all accompanied by a rich           domain language. This truth of how increased knowledge leads to more           structure and more vocabulary needs to be explicitly reflected in our           KM systems</li>
<li style="padding-top: 9px;"> <span style="font-style: italic; font-weight: bold;">Knowledge is           contextual</span> &#8212; the importance or meaning of given information           changes by perspective and context. Further, exactly the same           information may be used differently or given different importance           depending on circumstance. Still further, what is important to           describe (the &#8220;attributes&#8221;) about certain information also varies by           context and perspective. Large knowledge management initiatives that           attempt to use the relational model and single perspectives or schema           to capture this information are doomed in one of two ways:            either they fail to capture the relevant perspectives of some users;           or they take forever and massive dollars and effort to embrace all           relevant stakeholders&#8217; contexts</li>
<li style="padding-top: 9px;"> <span style="font-weight: bold; font-style: italic;">Knowledge should           be coherent</span> &#8212; <a href="../450/when-is-content-coherent/">coherence</a> is the state of having internal logical consistency. A library of           books organized by the <a href="http://en.wikipedia.org/wiki/Dewey_Decimal_Classification">Dewey           Decimal Classification</a> <span style="font-style: italic;">v.</span> the <a href="http://en.wikipedia.org/wiki/Library_of_Congress_Classification">Library           of Congress Classification</a> <span style="font-style: italic;">v.</span> the <a href="http://en.wikipedia.org/wiki/Colon_classification">Colon           classification</a> system (or others) is not inherently correct or           wrong, but it is important that whatever system is used be applied           consistently. Because of the power of OWA logics in inferencing and           entailments, whatever &#8220;world&#8221; is chosen for a given knowledge           representation should be coherent.  Fantasies such as <a href="http://en.wikipedia.org/wiki/Avatar_%282009_film%29">Avatar</a> and           the <a href="http://en.wikipedia.org/wiki/The_Lord_of_the_Rings_film_trilogy">Lord           of the Rings</a> trilogy, even though not real, can be made           believable and compelling by virtue of their coherence</li>
<li style="padding-top: 9px;"> <span style="font-weight: bold; font-style: italic;">Knowledge is           about connections</span> &#8212; the epistemological nature of <a href="http://en.wikipedia.org/wiki/Knowledge">knowledge</a> can be argued           endlessly, but I submit much of what distinguishes knowledge from           information is that knowledge makes the connections between disparate           pieces of relevant information. As these relationships accrete, the           knowledge base grows. Again, RDF and the open world approach are           essentially connective in nature. New connections and relationships           tend to break brittle relational models, and</li>
<li style="padding-top: 9px;"> <span style="font-weight: bold; font-style: italic;">Knowledge is           about its users defining its structure and use</span> &#8212; since           knowledge is a state of understanding by practitioners and experts in           a given domain, it is also important that those very same users be           active in its gathering, organization (structure) and use. Data           models that allow more direct involvement and authoring and           modification by users &#8212; as is inherently the case with RDF and OWA           approaches &#8212; bring the knowledge process closer to hand. Besides           this ability to manipulate the model directly, there are also the           immediacy advantages of incremental changes, tests and tweaks of the           OWA model. The schema consensus and delays from single-world views           inherent to CWA remove this immediacy, and often result in delays of           months or years before knowledge structures can actually be used and           tested <a href="#owa9">[9]</a>.</li>
</ul>
<p>To be sure, there are many circumstances where large stores of instance         data and their analysis are necessary for knowledge purposes. In these         cases, hybrid CWA-OWA systems (see conclusion) may make sense.</p>
<p>But, as these points emphasize, the general assembly and organization         of knowledge is open world in nature. Trying to fit KM and related         applications into the straightjacket of the relational model is folly.         The relational model and CWA for KM is the elephant in the room. Three         decades of failures and disappointments affirm this fact.</p>
<h3>The Business Argument for OWA</h3>
<p>Besides the native match of knowledge systems with OWA, there are sound         business arguments for embracing the (open) semantic enterprise as         well. These arguments can be summarized as <span class="double_u">lower risk</span>, <span class="double_u">lower         cost</span>, <span class="double_u">faster deployment</span>, and         more <span class="double_u">agile responsiveness</span>. What is         there not to love?</p>
<p>It should now be clear that it is possible to start small in testing         the transition to a semantic enterprise. These efforts can be done         incrementally and with a focus on early, high-value applications and         domains.</p>
<p>Open world does not necessarily mean open data and it does not mean         open source. Open world is simply a way to think about the information         we have and how we act on it. OWA technologies are neutral to the         question of open or public sources. The techniques can equivalently be         applied to internal, closed, proprietary data and structures. Moreover,         the technologies can themselves be used as a basis for bringing         external information into the enterprise. An open world assumption         merely asserts that we never have all necessary information and lacking         that information does not itself lead to any conclusions.</p>
<p>Further, we need not abandon past practices. There is much that can be         done to leverage existing assets. Indeed, those prior investments are         often the requisite starting basis to inform semantic initiatives.         However, in leveraging those assets, it is important that the         enterprise begin to embrace and understand the open world assumption.</p>
<p>We also see that RDF and OWL, while important behind the scenes as a         canonical data model and languages for organizing this information,         need not be exposed as such to most users. Most instance data can be         expressed as is with the data languages of choice such as XML, JSON or         whatever. We are merely using the techniques of the (open) semantic Web         as the data model to organize our information assets at hand. These         assets need not themselves be represented in the native RDF or OWL         languages.</p>
<p>Thus, open world frameworks provide some incredibly important benefits         for knowledge management applications in the enterprise:</p>
<ul>
<li>Domains can be analyzed and inspected incrementally</li>
<li>Schema can be incomplete and developed and refined incrementally</li>
<li>The data and the structures within these open world frameworks can         be used and expressed in a piecemeal or incomplete manner</li>
<li>We can readily combine data with partial characterizations with         other data having complete characterizations</li>
<li>Systems built with open world frameworks are flexible and robust;         as new information or structure is gained, it can be incorporated         without negating the information already resident, and</li>
<li>Open world systems can readily bridge or embrace closed world         subsystems.</li>
</ul>
<p>One might argue, as we believe, that the biggest impediment to the         semantic enterprise is the mind shift necessary to start thinking about         and accepting the open world premise. Again, this perspective is not         applicable to all problems and domains. But, where it is, much can be         left in place and leveraged with semantic technologies, so long as the         enterprise begins to look at these existing assets through a different         open-world lens.</p>
<p>In most real world circumstances, there is much we don&#8217;t know and we         interact in complex and external environments. Knowledge management         inherently occupies this space. Ultimately, data interoperability         implies a global context. Open world is the proper logic premise for         these circumstances. Via the OWA framework, we can readily change and         grow our conceptual understanding and coverage of the world, including         incorporation of external ontologies and data. Since this can easily         co-exist with underlying closed-world data, the semantic enterprise can         readily bridge both worlds.</p>
<p>So, we can now define the <span style="font-weight: bold; font-style: italic;">open semantic         enterprise</span> as one that embraces OWA for its knowledge management         applications and engages in rapid and low-risk testing of incremental         learning. The open world assumption is the proper framework to reverse         decades of failure and disappointment for knowledge projects in the         enterprise.</p>
<h3>Some Open Questions about OWA</h3>
<p>In our own discussions about ABox &#8211; TBox splits <a href="#owa10">[10]</a>, we have, in         essence, supported a hybrid OWA-CWA argument for the enterprise. It is         beyond the scope of this current piece to describe these approaches in         detail, but some of the options include local CWA, the addition of rule         languages and constraints to basic OWA, use of the new OWL 2,         TopQuadrant&#8217;s SPIN notation, and others <a href="#owa11">[11]</a>. I will address some of         these in a later post.</p>
<p>There are also questions about performance and scalability with open         semantic technologies. Here, too, progress is rapid, with billion         triple thresholds rapidly falling with daily reports of better         performance <a href="#owa12">[12]</a>. Fortunately, the incremental approach that we         advocate herein dovetails well with these rapid developments. There         should be no arguing the benefits of a successful incremental project         in a smaller domain, perhaps repeated across multiple domains, in         comparison to large, costly initiatives that never produce (even though         their underlying technologies are performant).</p>
<p>There are also architecture issues inherent in these OWA designs. In         one of our next posts, we return to the topic of <a href="../category/web-oriented-architecture-woa/">Web-oriented         architecture</a> and its role in support of these OWA knowledge         management initiatives.</p>
<p>In the end, there is no substitute for doing and learning. KM based on         OWA for the open semantic enterprise can be started today, in a focused         manner with tangible benefits and outcomes, at low cost and risk. Let&#8217;s         push the elephant out of the room and let the learning and doing begin.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa1" name="owa1"></a> [1] For example, see Roger Sessions,         2009. <a style="font-style: italic;" href="http://simplearchitectures.blogspot.com/2009/09/cost-of-it-failure.html"> Cost of IT Failure</a>, September 28, 2009. This analysis suggests         failure rates of 65% with a total estimated worldwide cost of $6.2         trillion in 2009. Commenters have raised questions as to what         constitutes failure and have questioned some of the analysis         assumptions. Nonetheless, even with over-estimates, the scale of the         numbers is alarming; see Jorge Dominguez, 2009. <a style="font-style: italic;" href="file:///F:/5-WebSites/All%20In%20Progress/The%20CHAOS%20Report%202009%20on%20IT%20Project%20Failure">The CHAOS         Report 2009 on IT Project Failure</a>, June 16, 2009, which indicates         combined failure and challenge rates for IT projects have ranged from         65% to 84% over the period 1994 to 2009; see Dan Galorath, 2008.         <a style="font-style: italic;" href="http://www.galorath.com/wp/software-project-failure-costs-billions-better-estimation-planning-can-help.php"> Software Project Failure Costs Billions; Better Estimation &amp;         Planning Can Help</a>, June 7, 2008. In this report, Galorath compares         and combines many of the available IT failure studies and summarizes         that 3 of 5 IT projects do not do what they were supposed to for the         expected costs, with 49% showing budget overruns, 47% showing higher         than expected maintenance costs, and 41% failing to deliver expected         business value; the anecdotal failure rate for years for IT projects         has been claimed as 80%, with business intelligence and data         warehousing particularly failure-prone areas; in 2001, a study by Mark         N. Frolick and Keith Lindsey, <a style="font-style: italic;" href="http://www.tdwi.org/research/display.aspx?ID=6592">Critical Factors         for Data Warehouse Failures</a>, for the Data Warehousing Institute         noted conventional wisdom says the failure rate of data warehousing         projects is 70 to 80 percent, with a then-recent study in the insurance         industry found a 90-percent failure rate. This report is useful for         combining many historical studies.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa2" name="owa2"></a> [2] According to this article, by Antone         Gonsalves, <a><span style="font-style: italic;">Poor Use Of Data         Integration Tools Can Waste $500,000 Annually: Gartner</span></a> (April 27, 2009), which reports on a recent Gartner Report, large         global 2000 companies, using several data integration tools with         overlapping features, can reduce costs by more than $500,000 annually         by eliminating redundant software and leveraging a shared services         model. In a further report by Roman Stanek, <a style="font-style: italic;" href="http://romanstanek.ulitzer.com/node/935202">Business Intelligence         Projects are Famous for Low Success Rates, High Costs and Time         Overruns</a> (April 25, 2009), Gartner is talking about a dirty little         secret in the world of data integration, the fact that the data         integration technology in place is based on generations of data         integration technology being layered in the enterprise over the years.         Thus, technology that was purchased to solve data integration problems,         and reduce costs, is actually making the data integration problem more         complex and no longer cost efficient.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa3" name="owa3"></a> [3] Here are some of my earlier postings         dealing in some degree with OWA: <a style="font-style: italic;" href="../847/ontology-driven-applications-using-adaptive-ontologies/"> Ontology-driven Applications Using Adaptive Ontologies</a>, November         23, 2009; <a style="font-style: italic;" href="../825/fresh-perspectives-on-the-semantic-enterprise/"> Fresh Perspectives on the Semantic Enterprise</a>, September 28, 2009;         <a style="font-style: italic;" href="../553/confronting-misconceptions-with-adaptive-ontologies/"> Confronting Misconceptions with Adaptive Ontologies</a>, August 17,         2009; <a style="font-style: italic;" href="../483/advantages-and-myths-of-rdf/">Advantages         and Myths of RDF</a>, April 8, 2009; <a style="font-style: italic;" href="../476/making-linked-data-reasonable-using-description-logics-part-2/"> Making Linked Data Reasonable using Description Logics, Part 2</a>,         February 15, 2009, which specifically relates OWA to the ABox and TBox         <a href="#owa4">[4]</a>; and, <a style="font-style: italic;" href="../441/the-role-of-umbel-stuck-in-the-middle-with-you/"> The Role of UMBEL: Stuck in the Middle with You . . .</a>, May 11,         2008.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa4" name="owa4"></a> [4] We use the reference to &#8220;<a href="http://en.wikipedia.org/wiki/Abox">ABox</a>&#8221; and “<a href="http://en.wikipedia.org/wiki/Tbox">TBox</a>” in accordance with         our <a title="Permanent Link to Thinking ?Inside the Box? with Description Logics" href="../466/thinking-inside-the-box-with-description-logics/"> working definition</a> for <a href="http://en.wikipedia.org/wiki/Description_logics">description         logics</a>:</p>
<div class="boxGraySolid">&#8220;Description logics and their semantics traditionally split           <span style="font-style: italic;">concepts</span> and their           relationships from the different treatment of <span style="font-style: italic;">instances</span> and their attributes and           roles, expressed as fact assertions. The concept split is known as           the TBox (for <em>terminological</em> knowledge, the basis for           <span style="font-style: italic;">T</span> in <span style="font-style: italic;">TBox</span>) and represents the schema or           taxonomy of the domain at hand. The TBox is the structural and           intensional component of conceptual relationships. The second split           of instances is known as the ABox (for <span style="font-style: italic;">assertions</span>, the basis for <span style="font-style: italic;">A</span> in <span style="font-style: italic;">ABox</span>) and describes the attributes of           instances (and individuals), the roles between instances, and other           assertions about instances regarding their class membership with the           TBox concepts.&#8221;</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa5" name="owa5"></a> [5] <strong style="font-weight: normal;">A <span style="font-style: italic;">model         theory</span></strong> is a formal semantic theory which relates         expressions to interpretations. A &#8220;model&#8221; refers to a given logical         &#8220;interpretation&#8221; or &#8220;world&#8221;. (See, for example, the discussion of         interpretation in Patrick Hayes, ed., 2004. <a style="font-style: italic;" href="ttp://www.w3.org/TR/rdf-mt/">RDF Semantics         &#8211; W3C Recommendation</a>, 10 February 2004.) The logic or inference         system of classical model theory is <strong style="font-style: italic;">monotonic</strong>. That is, it has the behavior         that if S entails E then (S + T) entails E. In other words, adding         information to some prior conditions or assertions cannot invalidate a         valid entailment. The basic intuition of         model-theoretic semantics is that asserting a statement makes a claim         about the world: it is another way of saying that the world is, in         fact, so arranged as to be an interpretation which makes the statement         true. An assertion amounts to stating a constraint on the possible ways         the world might be. In comparison, a <strong style="font-style: italic;">non-monotonic</strong> logic system may include         <em>default reasoning</em>, where one assumes a &#8216;normal&#8217; general truth         unless it is contradicted by more particular information (birds         normally fly, but penguins don&#8217;t fly); <em>negation-by-failure</em>,         commonly assumed in logic programming systems, where one concludes,         from a failure to prove a proposition, that the proposition is false;         and <em>implicit closed-world assumptions</em>, often assumed in         database applications, where one concludes from a lack of information         about an entity in some corpus that the information is false         (<span style="font-style: italic;">e.g</span>., that if someone is not         listed in an employee database, that he or she is not an employee.) See         further, <a style="font-style: italic;" href="http://plato.stanford.edu/entries/logic-nonmonotonic/">Non-monotonic         Logic</a> from the <a href="http://plato.stanford.edu/">Stanford         Encyclopedia of Philosophy</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa6" name="owa6"></a> [6] Peter F. Patel-­Schneider and Ian         Horrocks, 2006. Position Paper: A Comparison of Two Modelling Paradigms         in the Semantic Web,&#8221; in <em>WWW2006</em>, May 22–-26, 2006, Edinburgh,         UK. See <a href="http://www.comlab.ox.ac.uk/people/ian.horrocks/Publications/download/2006/PaHo06a.pdf"> http://www.comlab.ox.ac.uk/people/ian.horrocks/Publications/download/2006/PaHo06a.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa7" name="owa7"></a> [7] Other resources include: Franz         Baader, Diego Calvanese, Deborah McGuiness, Daniele Nardi, and Peter         Patel-Schneider, eds., 2003. <span style="font-style: italic;">The         Description Logic Handbook: Theory, Implementation and         Applications</span>, Cambridge University Press, 2003. Online access to         much of the book is available at <a href="http://www.inf.unibz.it/%7Efranconi/dl/course/">http://www.inf.unibz.it/~franconi/dl/course/</a>;         see esp. Chapters 1, 2, 4 and 16 relate to this topic; Jos de Bruijn,         Axel Polleres, Ruben Lara and Dieter Fensel, 2005. <a style="font-style: italic;" href="http://www2005.org/cdrom/docs/p623.pdf">OWL         DL vs. OWL Flight: Conceptual Modeling and Reasoning for the Semantic         Web</a>, in <span style="font-style: italic;">Proceedings</span> <span style="font-style: italic;">of the Ninth World Wide Web         Conference</span>, Japan, May 2005. This paper argues against the use         of description logics for the semantic Web; Andrew Newman, 2007.         <a style="font-style: italic;" href="http://www.xml.com/pub/a/2007/03/14/a-relational-view-of-the-semantic-web.html"> A Relational View of the Semantic Web</a>, March 14, 2007; Hai Wang,         2006. <a style="font-style: italic;" href="http://protege.stanford.edu/conference/2006/submissions/slides/7.2wang_protege2006.pdf"> Frames and OWL Side by Side</a>, presented at the 9th International         Protégé Conference, July 23-26, 2006, Stanford, CA; Nick Drummond and         Rob Shearer, 2006. <a style="font-style: italic;" href="http://www.cs.manchester.ac.uk/%7Edrummond/presentations/OWA.pdf">The         Open World Assumption</a>, Powerpoint presentation at <span style="font-style: italic;">The Chris Date Seminar: The Closed World of         Databases Meets the Open World of the Semantic Web</span>, e-Science         Institute, Edinburgh, Scotland, 12 Ocotober 2006; Yulia Levin, 2008.         <a style="font-style: italic;" href="http://www.cs.tau.ac.il/%7Eannaz/teaching/TAU_winter08/Seminar/yulia.pdf"> Closed World Reasoning</a>, presentation at <span style="font-style: italic;">Non-classical Logics and Applications Seminar &#8211;         Winter 2008</span>, Tel Aviv University; and Pat Hayes, 2001. &#8220;Why must         the web be monotonic?&#8221;, email thread at <a href="http://lists.w3.org/Archives/Public/www-rdf-logic/2001Jul/0067.html">http://lists.w3.org/Archives/Public/www-rdf-logic/2001Jul/0067.html</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa8" name="owa8"></a> [8] Raymond Reiter, 1978. “On         Closed World Data Bases”, in <span style="font-style: italic;">Logic and Data Bases</span>, H. Gallaire and J.         Minker, eds., New York: Plenum Press, 55-76; see also, Raymond Reiter,         1980. &#8220;A Logic for Default Reasoning,&#8221; <em>Artificial Intelligence</em>,         13:81-132.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa9" name="owa9"></a> [9] See this Google search on <a href="http://www.google.com/custom?domains=mkbergman.com&amp;q=driven+analysis&amp;sitesearch=mkbergman.com&amp;hl=en"> ontology-driven applications</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa10" name="owa10"></a> [10] See this Google search on <a href="http://www.google.com/custom?domains=mkbergman.com&amp;q=abox+tbox&amp;sitesearch=mkbergman.com&amp;hl=en"> ABox-TBox</a> articles.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa11" name="owa11"></a> [11] See, as examples: J. Heflin and H.         Munoz-Avila, 2002. LCW-Based Agent Planning for the Semantic Web, in         <span style="font-style: italic;">AAAI &#8217;02 Workshop on Ontologies and         the Semantic Web</span>, AAAI Press, pp. 63–70. See <a href="http://www.cse.lehigh.edu/%7Eheflin/pubs/lcw-aaai02.pdf">http://www.cse.lehigh.edu/~heflin/pubs/lcw-aaai02.pdf</a> (one of the first local CWA suggestions in specific regard to the         semantic Web); K. Golden, O. Etzioni and D. Weld, D. 1994. Omnipresence         Without Omniscience: Efficient Sensor Managment for Planning, in         <span style="font-style: italic;">Proceedings of AAAI-94</span> (one of         the first to propose LCWA in general); Evren Sirin, Michael Smith and         Evan Wallace, 2008. <a style="font-style: italic;" href="http://www.webont.org/owled/2008/papers/owled2008eu_submission_30.pdf"> Integrity constraints: Opening, Closing Worlds — On Integrity         Constraints</a>, presented at <span style="font-style: italic;">OWL:         Experiences and Directions (OWLED 2008), Fifth International         Workshop</span>, Karlsruhe, Germany, October 26-27, 2008; Timothy L.         Hinrichs, Jui-Yi Kao and Michael R. Genesereth, 2009. <a style="font-style: italic;" href="http://people.cs.uchicago.edu/%7Ethinrich/papers/hinrichs2009inconsistencytr.pdf"> Inconsistency-tolerant Reasoning with Classical Logic and Large         Databases</a>, in <span style="font-style: italic;">Proceedings of the         Eighth Symposium on Abstraction, Reformulation, and Approximation         (SARA2009)</span>, July 2009; S. Gómez, C.         Chesñevar and G. Simari 2008. <a style="font-style: italic;" href="http://www.cse.unsw.edu.au/%7Ekr2008/krow-papers/gomez-ea.pdf">An         Argumentative Approach to Reasoning with Inconsistent Ontologies</a>,         in <span style="font-style: italic;">Proceedings of the KR Workshop on         Knowledge Representation and Ontologies</span> (KROW 2008), Conferences         in Research and Practice in Information Technology, Vol. 90, pp. 11-20.         Eds. T.Meyer, M. Orgun. Australian Computer Society, Sidney, Australia,         July 2008. Holger Knoblauch, <a style="font-style: italic;" href="http://composing-the-semantic-web.blogspot.com/2009/01/object-oriented-semantic-web-with-spin.html"> The Object-Oriented Semantic Web with SPIN</a>, Sunday, January 18,         2009, that discusses the SPIN (SPARQL Inferencing Notation) Modeling         Vocabulary, which is a light-weight collection of RDF properties and         classes to support the use of SPARQL to specify rules and logical         constraints.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="owa12" name="owa12"></a> [12] For example, the BigOWLIM can         perform reasoning against 12 billion explicit statements and loads         about 12,000 statements per second on a standard server; see         <a href="http://www.ontotext.com/owlim/benchmarking/lubm.html">http://www.ontotext.com/owlim/benchmarking/lubm.html</a>;         also, see Orri Erling&#8217;s blog regarding performance of the Virtuoso RDF         triple store (<a href="http://www.openlinksw.com/weblog/oerling/">http://www.openlinksw.com/weblog/oerling/</a>).         In any case, these performance benchmarks continue to rise steadily and         indicate the performance of RDF as an ontology integration layer.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/852/the-open-world-assumption-elephant-in-the-room/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Ontology-driven Applications Using Adaptive Ontologies</title>
		<link>http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/</link>
		<comments>http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/#comments</comments>
		<pubDate>Mon, 23 Nov 2009 16:24:29 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Description Logics]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[Ontology Best Practices]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[adaptive ontologies]]></category>
		<category><![CDATA[conStruct]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[ontology-driven applications]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>
		<category><![CDATA[structWSF]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=847</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Ontology-driven Applications Using Adaptive Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=irON&amp;rft.subject=Ontology Best Practices&amp;rft.subject=Structured Dynamics&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-23&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/&amp;rft.language=English"></span>
A Low-risk Path to the Open World, Semantic Enterprise OK, you&#8217;ve been reading the literature and perhaps have attended a conference or two. You have heard a lot about semantic technologies, but have some real questions and concerns: How do we get started, especially with smaller proofs-of-concept? Do we need to abandon our past practices [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Ontology-driven Applications Using Adaptive Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=irON&amp;rft.subject=Ontology Best Practices&amp;rft.subject=Structured Dynamics&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-23&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/&amp;rft.language=English"></span>
<p><img style="border: 0px solid; width: 250px; height: 130px; float: left; margin-right: 10px;" title="Open World" src="../wp-content/themes/ai3/images/2009Posts/091122_world_atlas.png" alt="Open World - from worldatlas.com" /></p>
<h2>A Low-risk Path to the Open World, Semantic Enterprise</h2>
<p>OK, you&#8217;ve been reading the literature and perhaps have attended a         conference or two. You have heard a lot about semantic technologies,         but have some real questions and concerns:</p>
<ul>
<li style="margin-left: 30px;">How do we get started, especially with         smaller proofs-of-concept?</li>
<li style="margin-left: 30px;">Do we need to abandon our past practices         and systems in order to gain semantic advantages?</li>
<li style="margin-left: 30px;">To gain the advantages of         interoperability, do we have to convert everything into RDF or OWL?</li>
<li style="margin-left: 30px;">Are semantic technologies limited to         open or public data; how do we accommodate our proprietary information?</li>
</ul>
<p>Such questions &#8212; and more &#8212; are not infrequent when organizations         first contemplate making the transition to become a semantic         enterprise.</p>
<h3>Overview</h3>
<p>The diagram below shows a general workflow for migrating existing         instance data into the semantic enterprise. The diagram is broken down         into three parts. The first part is to characterize and stage existing         data and information into the underlying structured data framework.         This is what SD (that is, my firm, <a href="http://structureddynamics.com/">Structured Dynamics</a>) does as data         architects using our particular approach to adaptive ontologies. I&#8217;ll         touch on this again in a moment.</p>
<p>Jumping to the right-hand side of the diagram is the access and display         part. It is here that developers or users can make selections from         dropdown lists and so forth to define the &#8220;slices&#8221; of diced results         sets they wish to display. The results of those interactions are         structured data results sets that are pre-staged to &#8220;drive&#8221; various         applications and displays [<a href="#adapto1">1</a>,<a href="#adapto2">2</a>]. These same capabilities can also be         embedded into standard Web end user applications, such as content         management systems.</p>
<p>The third and middle part of the diagram is the critical part, the         pivot point. It is the interface layer between the structured data on         the left and the display and presentation of that data on the right. As provided by SD,         this abstraction layer is the <span style="font-weight: bold;">structWSF</span> Web services framework that         &#8220;bridges&#8221; between the black box of what happens with RDF and semantic         Web structured data characterizations on the left in order to feed, or         &#8220;drive&#8221;, useful services and functions on the right.</p>
<p>We call this general design and architecture &#8220;ontology-driven         applications&#8221;. The bulk of this posting explains each of these three         parts in a bit more detail, organized from left-to-right by these Parts         1 to 3.</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_adaptive_ontology_workflow.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 359px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_adaptive_ontology_workflow.png" alt="Adaptive Ontology Workflow" width="1218" height="728" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<h3>Part 1: Structured Data Instances and Ontologies</h3>
<p>Our approach relies on what we call &#8220;adaptive ontologies&#8221;. These         ontologies set the structural basis for all subsequent data display,         analysis, inferencing, entailments, and the like. We call them         &#8220;adaptive&#8221; because we embrace a set of unique best practices. These         practices enable the ontologies to do the double-duty of first         structuring data and then driving generic applications by properly         informing user interfaces, dropdown lists, menus and the like.</p>
<p>This structuring results in faceting key important dimensions and         attributes of available content. Structured data gets organized.         Unstructured data (text) gets tagged via this structure and integrated         with it.</p>
<p>As Structured Dynamics&#8217; general product schema makes clear (see         the diagram at <a href="#adapto3">[3]</a>), our approach leverages existing assets as         much as possible. Often, this means leaving most existing data         structures in place. These existing assets are staged and converted in         two complementary manners that largely correspond to the conceptual         <span style="font-style: italic;">ABox</span> (instance) and <span style="font-style: italic;">TBox </span>(concepts and schema) split central to         description logics and pivotal to SD&#8217;s methodology <a href="#adapto4">[4]</a>.</p>
<p>Whether transitioning small chunks or big chunks, this staging of         existing data in Part 1 results in an RDF-accessible characterization         of the starting content. Instances and their attributes are represented         via a common notation, generally based on <span style="font-weight: bold;">irON</span> (<span style="font-style: italic;">instance         record</span> and <span style="font-style: italic;">Object Notation</span>) <a href="#adapto5">[5]</a>, that is an         extensible notation and vocabulary for capturing the data characterizations,         attributes and metadata of the candidate instance data (&#8220;records&#8221; in         RDBMS parlance). These instances may either be internal or proprietary         records, or instance data on the Web or in the public domain. By         properly matching same or similar instances to one another, any source         of instance characterization can be meaningfully combined.</p>
<p>This instance notation is extremely lightweight, and really is merely         an RDF representation of data characterizations. In the         characterizations to this point there is not yet any &#8220;world view&#8221;         involved:  we are simply describing instances and their attributes         in a manner akin to key-value pairs. The process to this point is         entirely descriptive.</p>
<p>However, these instance characteristics do contain within them the         semantics as to how to describe these attributes (your &#8220;glad&#8221; is my         &#8220;happy&#8221;), as well as potentially a schematic or conceptual view of how         these instances relate to one another and to the broader world.         Instance characterizations provide the building blocks, that are then         related and made semantically whole via a second &#8220;terminological&#8221;         level.</p>
<p>These terminological, or conceptual, relationships (the <span style="font-style: italic;">TBox</span> <a href="#adapto4">[4]</a>),         reside at a different level from simply decribing things. Rather,         these schema &#8212; what in this context are best known         as <span style="font-style: italic;">ontologies</span> &#8212; provide         a precise language and means for describing conceptual relationships.         If these structural relationships are done well, they are <a href="../450/when-is-content-coherent/">coherent</a>:         the hip bone is connected to the thigh bone and not to the ear.         Coherence is a matter of a consistent world view that &#8220;hangs together&#8221;         when analyzed via powerful logical techniques available via description         logics and other broader mechanisms of the semantic enterprise.</p>
<p>Thus, as we transition from the existing, the operational workflow         splits the input data stream into two pathways:</p>
<ul>
<li>Instances, and their descriptive characteristics, and</li>
<li>Conceptual relationships, or ontologies.</li>
</ul>
<p>A sequential flow of these steps and splits is provided by this diagram         below that shows: 1) the conceptual structure of the concept ontology;         as 2) matched with the instances and their descriptive attributes that         populate that schema.</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_ontology_build_methodology.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 355px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_ontology_build_methodology.png" alt="Ontology and Instance Build Methodology" width="1232" height="728" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>A key point is that &#8212; while a proper starting ontology is essential to         our process and proofs-of-concept &#8212; it can be grown and scaled         incrementally. We leverage as much existing starting structure as         possible and can readily bound the scope to meet budget and delivery         imperatives.</p>
<p>The concepts and entities that occur within these structures help         inform our fairly simple tagging system, <a style="font-weight: bold;" href="http://structureddynamics.com/scones.html">scones</a> <a href="#adapto3">[3]</a>. (There are         also benefits from &#8220;triangulating&#8221; between entity or instance         identification and concept identification that helps inform         disambiguation nearly for free; see further <a href="#adapto6">[6])</a>. It is also possible         to integrate these initial proof-of-concept approaches with third-party         tools (<span style="font-style: italic;">e.g.</span>, <a href="http://www.opencalais.com/">Calais</a>, Expert System (<a href="http://www.expertsystem.net/page.asp?id=1515&amp;idd=200">Cogito</a>),         etc.) to improve unstructured content characterization.</p>
<p>These approaches are pretty straightforward for any organization         wanting to test the idea of becoming a semantic enterprise. Real         benefits &#8212; such as concept retrievals overcoming the limitations of         standard keyword search &#8212; can be demonstrated from even small starting         ontologies and structures. Given the inherent connectedness of the         data, it is possible to expand the scope and usefulness of the         information incrementally within fixed and manageable budgets.</p>
<h3>Part 2: structWSF: A Web-oriented Services API and Framework</h3>
<p>A pivotal part of SD&#8217;s infrastructure software is <span style="font-weight: bold;">structWSF</span> <a href="#adapto7">[7]</a>, our platform-independent Web         services middleware. <span style="font-weight: bold;">structWSF</span> is an abstraction layer that provides the APIs, search endpoints, and         specific Web services for accessing, querying or getting results sets         from the underlying structured data and ontologies.</p>
<p><span style="font-weight: bold;">structWSF</span> has a standard set of access and retrieval services including browse, full-text search, CRUD, direct record retrievals, and the like. It is embedded within an access and permissions service that acts at the level of registered datasets. Then, based on the requested protocol, <span style="font-weight: bold;">structWSF</span> returns the         filtered results set. These results sets can be delivered as XML, JSON,         or any of the other formats already available <a href="#adapto7">[7]</a>. They can readily and         dynamically populate HTML pages and forms in any deployment framework.         For specific purposes, these results sets can also be returned as         pre-staged, properly formatted results streams for driving specific         applications.</p>
<p>As an API, the <span style="font-weight: bold;">structWSF</span> Web         services can be interacted with and driven via standard HTTP requests.         Alternatively, these requests can come from simple to complicated Web         apps that create the API queries based on user interface choices such         as selections from dropdown lists or clicking on various listed         options. An interactive demo of this approach is shown by SD&#8217;s         <span style="font-weight: bold;">conStruct</span> application <a href="#adapto8">[8]</a>, though even simpler Web pages or forms may         drive the query interface.</p>
<p>Queries and requests to <span style="font-weight: bold;">structWSF</span> may also include a parameter for         results sets to be returned in particular formats. SD&#8217;s <span style="font-weight: bold;">irON</span> protocol <a href="#adapto5">[5]</a> supports requests or results         in CSV, XML or JSON, in addition to other flavors including multiple         serializations of RDF.</p>
<p>In this manner, only a simple converter need be added to the         <span style="font-weight: bold;">structWSF</span> Web services stack in         order to &#8220;drive&#8221; a new application with a particularly formatted         results set stream.</p>
<p><span style="font-weight: bold;">structWSF</span> thus acts as a         single, uniform Web interface to all of the &#8220;black box&#8221; nuances of the         structured data system organized by the adaptive ontologies. Further,         virtually any data structure may be ingested and converted from         external sources via an import service and made part of the underlying canonical structure,         making the framework perfect for data federation <a href="#adapto9">[9]</a>. Lastly, the         dataset nature of the framework, and its neutrality to underlying data         stores or content management systems, also makes <span style="font-weight: bold;">structWSF</span> an excellent framework for one or         many nodes to share information and collaborate across the Web <a href="#adapto10">[10]</a>.</p>
<p>The following diagram shows how a diverse, Web-based network, involving         a diversity of Web portals and data gateways and hubs, can work via the         <span style="font-weight: bold;">structWSF</span> framework to         establish a complete collaboration network. Via datasets and         differential access rights and permissions, virtually any combination         of potential interactions can be supported:</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_collaboration_network.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 483px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_collaboration_network.png" alt="Example Collaboration Network" width="746" height="601" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>These potentials are really fundamentally new, and we ourselves are         still trying to find the language and analogies to best explain them. <span style="font-weight: bold;">structWSF</span> was         initially designed as a platform-independent layer between the         structured data representation of existing assets and the         ontology-driven applications that interact with them. We are now finding that deployment in         a broader Web-based context provides additional exciting         prospects for integrating various regional offices or to         enable direct collaboration with customers, partners or         suppliers.</p>
<h3>Part 3: Ontology-driven Applications</h3>
<p>The basic design of <span style="font-weight: bold;">structWSF</span> is to provide a middleware layer that fulfills one or more of these         broad user interaction modes:</p>
<ul>
<li>To create, update, delete or otherwise manage data records</li>
<li>To browse or view existing records or record sets, based on simple         to possible complex selection or filtering criteria, or</li>
<li>To take one of these results sets and progress it through a         workflow of some nature, involving specialized analysis, applications,         or visualization.</li>
</ul>
<p>SD has developed generic applications in these areas (with many more         possible), the operations of which are guided by the instructions and         nature of the underlying data that feeds them. We have proven it is         possible to adopt data characterization practices within those         ontologies so as to stage or &#8220;drive&#8221; such generic applications.</p>
<p>In the case of a standard structured data display (say, a simple table         like a Wikipedia <a href="http://en.wikipedia.org/wiki/Category:Infobox_templates">infobox</a>,         for example), such generic design includes templates tailored to         various instance types (say, locational information presenting on a map         versus people information warranting a image and vital statistics).         Alternatively, in the generic design for some specialized application         (say, <a href="http://www.adobe.com/software/flash/about/">Adobe         Flash</a>), the information output of the results set may need to         contain certain formats and attributes.</p>
<p>SD&#8217;s &#8220;ontology-driven apps&#8221;, then, are really informed structured         results sets that are outputted in a form suitable to various intended         applications. This output form can include a variety of serializations,         formats or metadata. This flexibility of output that is tailored to and         responsive to particular generic applications is what makes our         ontologies &#8220;adaptive&#8221;.</p>
<p>Expressed in this manner, &#8220;ontology-driven apps&#8221; seem neither         remarkably profound nor clever. They are simply attentive to their         intended uses.</p>
<p>Using this structure, then, it is possible to either &#8220;drive&#8221; queries         and results sets selections via direct HTTP request or via simple         dropdown selections on HTML forms (that is, from <span style="font-style: italic;">right</span> to <span style="font-style: italic;">left</span> as shown on the first diagram). Similarly, it is possible with a         single parameter change to drive either a visualization app or a         structured table template from the equivalent query request (that is, from <span style="font-style: italic;">left</span> to <span style="font-style: italic;">right</span> on the first diagram).</p>
<p>&#8220;Ontology-driven apps&#8221; through SD&#8217;s architecture design thus provide two         profound benefits.  First, the entire system can be driven via         simple selections or interactions without the need for any programming         or technical expertise. And, second, simple additions of new and minor         output converters can work to power entirely new applications available         to the system. If, say, Adobe graphics applications need to change         tomorrow for Microsoft Silverlight, that switch is easy and can be         made transparent to the designer.</p>
<h3>The Complete Picture: Embrace the Open World</h3>
<p>The ability to develop these systems incrementally and the ability to         integrate with external, public data is fundamentally dependent on         the <a href="http://en.wikipedia.org/wiki/Open_world_assumption">open world         assumption</a>. The open world assumption is a different logic premise         than what many enterprises are used to; relational database systems,         for example, embrace the alternate <a href="http://en.wikipedia.org/wiki/Closed_world_assumption">closed world         premise</a>.</p>
<p>Open world does not necessarily mean open data and it does not mean open source.         Open world is merely a way to think about the information we have and         how we act on it. An open world assumption accepts that we never have         all necessary information and lacking that information does not itself lead         to any conclusions.</p>
<p>Some enterprise circumstances &#8211; say a complete enumeration of customers or products or even controlled engineering or design environments &#8212; may warrant a closed world approach. In those circumstances, the domain of inquiry is well bounded and we can get relatively complete information about it. Engineering an oil drilling platform or launching the Space Shuttle in fact demands that.</p>
<p>But, in most real world circumstances, there is much we don&#8217;t know and         we interact in complex and external environments. Open world is the         proper logic premise for these circumstances. These circumstances also         happen to be the very basis in which most most knowledge workers         and analysts reside.</p>
<p>Open world frameworks provide some         incredibly important benefits if the circumstances of their use apply:</p>
<ul>
<li>Domains can be analyzed and inspected incrementally</li>
<li>Schema can be incomplete and developed and refined incrementally</li>
<li>The data and the structures within these open world frameworks can         be used and expressed in a piecemeal or incomplete manner</li>
<li>We can readily combine data with partial characterizations with other data having complete characterizations</li>
<li>Systems built with open world frameworks are flexible and robust;         as new information or structure is gained, it can be incorporated         without negating the information already resident, and</li>
<li>Open world systems can readily bridge or embrace closed world         subsystems.</li>
</ul>
<p>One might argue, as we believe, that the biggest impediment to the semantic enterprise is the mind shift necessary to start thinking about and accepting the open world premise. Again, this perspective is not applicable to all problems and domains.  But, where it is, much can be left in place and leveraged with semantic technologies, so long as the enterprise begins to look at these existing assets through a different open-world lens.</p>
<h3>Summary</h3>
<p>So, let&#8217;s return to the rhetorical questions that began this posting.</p>
<p>It should now be clear that it is possible to start small in testing         the transition to a semantic enterprise. These efforts can be done incrementally and         with focus on early, high-value applications and domains.</p>
<p>Further, we need not abandon past practices. There is much that can be done to leverage existing assets. Indeed, those prior investments are often the requisite starting basis to inform semantic initiatives. However, in leveraging those assets, it is important that the enterprise begin to embrace and understand the open world assumption.</p>
<p>We also see that RDF and OWL, while important behind the scenes as a         canonical data model and languages for organizing this information,         need not be exposed as such to most users. Most instance data can be         expressed as is with the data languages of choice such as XML, JSON or         whatever.</p>
<p>We also see these technologies are neutral to the question of open         or public sources. The techniques can equivalently be applied to         internal, closed, proprietary data and structures. Moreover, the         technologies can themselves be used as a basis for bringing external         information into the enterprise.</p>
<p>Without a doubt, some of the early years in describing semantic         technologies were burdened with some unfortunate bad information and         lack of sophistication. Today&#8217;s semantic Web is nimble, agile, and         ready to be deployed immediately at low cost and risk. So, jump on in! We think you&#8217;ll find the water to be just fine.</p>
<div class="boxYellowDotted">This post is Part V of an occasional <span style="color: #993300; font-weight: bold;">AI3</span> series on         <a href="../category/ontologies/">ontology</a> <a href="../category/ontology-best-practices/">best         practices</a>.</div>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto1"></a> [1] These selections and requests need not occur <span style="font-style: italic;">only</span> via user interfaces or HTML forms,         but also programmatically via API or direct Web services calls.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto2"></a> [2] There are two main classes of visualizations possible with our         systems:  1) navigations or explorers of the concept space, which         is a particularly open challenge for large, graph-based knowledge bases         (see, for example, our Subject Concept Explorer using the <a href="http://umbel.org/">UMBEL</a> <a href="http://umbel.structureddynamics.com/explorer.php?concept=http://umbel.org/umbel/sc/FinancialAccount"> Financial Account concept</a>, and click on the bubbles); or 2)         conventional data visualizations or graphics or mappings of instance         data. Both are shown as workflow boxes on the first diagram above.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto3"></a> [3] See <a href="http://structureddynamics.com/products.html">http://structureddynamics.com/products.html</a> for a general descriptive illustration of Structured Dynamics&#8217; product         stack. There is also a longer <a href="http://www.slideshare.net/mkbergman/structured-dynamicss-semantic-technologies-product-stack"> slideshow</a>, from which this diagram is drawn as slide #37.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto4"></a> [4] We use the reference to <em><a href="http://en.wikipedia.org/wiki/Abox">ABox</a></em> and <em><a href="http://en.wikipedia.org/wiki/Tbox">TBox</a></em> in accordance with         our <a title="Permanent Link to Thinking ?Inside the Box? with Description Logics" href="../466/thinking-inside-the-box-with-description-logics/"> working definition</a> for <a href="http://en.wikipedia.org/wiki/Description_logics">description         logics</a>:</p>
<div class="boxGraySolid">&#8220;Description logics and their semantics traditionally split           <span style="font-style: italic;">concepts</span> and their           relationships from the different treatment of <span style="font-style: italic;">instances</span> and their attributes and           roles, expressed as fact assertions. The concept split is known as           the TBox (for <em>terminological</em> knowledge, the basis for           <span style="font-style: italic;">T</span> in <span style="font-style: italic;">TBox</span>) and represents the schema or           taxonomy of the domain at hand. The TBox is the structural and           intensional component of conceptual relationships. The second split           of instances is known as the ABox (for <span style="font-style: italic;">assertions</span>, the basis for <span style="font-style: italic;">A</span> in <span style="font-style: italic;">ABox</span>) and describes the attributes of           instances (and individuals), the roles between instances, and other           assertions about instances regarding their class membership with the           TBox concepts.&#8221;</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto5"></a> [5] For the specification and a use case of <span style="font-weight: bold;">irON</span> using the CSV (<span style="font-weight: bold;">commON</span>) serialization, see <a href="http://openstructs.org/iron">http://openstructs.org/iron.</a></div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto6"></a> [6] Via this approach we now can assess concept matches in addition to entity matches. This means we can triangulate between the two assessments to aid disambiguation. Because of these logical segmentations, we also have multiple “clusters” (that is, either the <span style="font-style: italic;">concept</span>,         <span style="font-style: italic;">type</span>, <span style="font-style: italic;">superType</span> or <span style="font-style: italic;">dimension</span>) upon which to do our disambiguation evaluations, either between concepts and entities or within the various concept clusters. We can do so via either multiple <a href="http://en.wikipedia.org/wiki/Vector_space_model">semantic         vectors</a> (for statistical-based methods) or multiple <a href="http://en.wikipedia.org/wiki/Features_%28pattern_recognition%29">features</a> (for <a href="http://en.wikipedia.org/wiki/Machine_learning">machine         learning</a> methods). In other words, because of logical segmentation, we have increased the informational power of our concept graph. See further <a href="../759/supertypes-and-logical-segmentation-of-instances/">http://www.mkbergman.com/759/supertypes-and-logical-segmentation-of-instances/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto7"></a> [7] See <a href="http://openstructs.org/structwsf/architecture">http://openstructs.org/structwsf/architecture</a>; also, the available export         formats are shown at <a href="http://constructscs.com/documentation/instructions/export">http://constructscs.com/documentation/instructions/export</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto8"></a> [8] There is an online demo of <span style="font-weight: bold;">conStruct</span> using the <a href="../new-version-sweet-tools-sem-web/">Sweet Tools</a> database of semantic Web and -related tools at <a href="http://constructscs.com/conStruct/browse/">http://constructscs.com/conStruct/browse/</a>; for background on this use case, see <a href="../845/a-most-un-common-way-to-author-datasets/">http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto9"></a> [9] See, for example, <a href="../496/structwsf-a-framework-for-data-mixing/">http://www.mkbergman.com/496/structwsf-a-framework-for-data-mixing/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto10"></a> [10] See, for example, <a href="../497/structwsf-a-framework-for-collaboration-networks/"> http://www.mkbergman.com/497/structwsf-a-framework-for-collaboration-networks/</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>

