<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AI3:::Adaptive Information &#187; Brown Bag Lunch</title>
	<atom:link href="http://www.mkbergman.com/category/brown-bag-lunch/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mkbergman.com</link>
	<description>Mike Bergman on the semantic Web and structured Web</description>
	<lastBuildDate>Tue, 24 Jan 2012 15:52:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Brown Bag Lunch: ‘Structs’: Naïve Data Formats and the ABox</title>
		<link>http://www.mkbergman.com/951/brown-bag-lunch-%e2%80%98structs%e2%80%99-naive-data-formats-and-the-abox/</link>
		<comments>http://www.mkbergman.com/951/brown-bag-lunch-%e2%80%98structs%e2%80%99-naive-data-formats-and-the-abox/#comments</comments>
		<pubDate>Fri, 18 Mar 2011 08:08:16 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Brown Bag Lunch]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[data structs]]></category>
		<category><![CDATA[Description logic]]></category>
		<category><![CDATA[Knowledge Representation]]></category>
		<category><![CDATA[Resource Description Framework]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web Ontology Language]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=951</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: ‘Structs’: Naïve Data Formats and the ABox&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=irON&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-03-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/951/brown-bag-lunch-%e2%80%98structs%e2%80%99-naive-data-formats-and-the-abox/&amp;rft.language=English"></span>
Writing and Sharing Data Can be Lightened Up Ever since I first started to learn in earnest about ontology, something has been gnawing at me. The term seemed to be (shall I say?) an obtuse one whose obscurity was not the result of subtle precision or technicality, but rather one of fuzziness. As I introduced [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: ‘Structs’: Naïve Data Formats and the ABox&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=irON&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-03-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/951/brown-bag-lunch-%e2%80%98structs%e2%80%99-naive-data-formats-and-the-abox/&amp;rft.language=English"></span>
<h2>Writing and Sharing Data Can be Lightened Up <a href="http://www.mkbergman.com/834/announcing-the-sporadic-friday-brown-bag-lunch"><img style="border: 0px solid; float: left; margin-right: 10px;" title="Friday Brown Bag Lunch" src="../wp-content/themes/ai3/images/lunchbag_225.jpg" alt="Friday     Brown Bag Lunch" width="158" height="179" /></a></h2>
<p>Ever since I first started to learn in earnest about <a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Ontology_(information_science)">ontology</a>, something has been gnawing at me. The term seemed to be (shall I say?) an obtuse one whose obscurity was not the result of subtle precision or technicality, but rather one of fuzziness. As I introduced my <a style="font-style: italic;" href="../?p=374">Intrepid Guide to Ontology</a> two years ago, I noted:</p>
<div class="boxGrayDotted" style="margin-left: 240px;">The root of the [ontology] term is the Greek <span style="font-style: italic;">ontos</span>, or <span style="font-style: italic;">being</span> or the <span style="font-style: italic;">nature of things</span>. Literally and in classical philosophy, ontology was used in relation to the study of the nature of being or the world, <a href="http://en.wikipedia.org/wiki/Ontology">the nature of existence</a>. <a href="http://tomgruber.org/">Tom Gruber</a>, among others, made the term popular in relation to computer science and artificial intelligence <a href="http://tomgruber.org/writing/ontolingua-kaj-1993.htm">about 15 years ago</a> when he defined ontology as a &#8220;formal specification of a conceptualization.&#8221;</div>
<p><img style="border: 0px solid; margin-left: 10px; width: 200px; height: 184px; float: right;" src="../wp-content/themes/ai3/images/2009Posts/090122_abc_struct.jpg" alt="Simple Data Structs" hspace="5" vspace="5" />Since then, I have continued to find ontology one of the hardest concepts to communicate to clients and quite a muddled mess even as used by practitioners. I have come to the conclusion that this problem is not because I have failed to grasp some ephemeral nuance, but because the term as used in practice is indeed fuzzy and imprecise.</p>
<h3>What Isn&#8217;t an Ontology?</h3>
<p>Even two years ago, I noted more than 40 different types of information structure that have at one time or another been labelled as an example of an &#8220;ontology&#8221;:</p>
<div class="center_ok smallIndent" style="margin: 15px; font-size: 10px;">
<table style="text-align: left; margin-left: 0px; width: 90%;" border="0" cellspacing="2" cellpadding="2">
<tbody>
<tr>
<td style="vertical-align: top; width: 25%;">
<ul>
<li><a title="Tag cloud" href="http://en.wikipedia.org/wiki/Tag_cloud">Tag cloud</a></li>
<li> <a title="Controlled vocabulary" href="http://en.wikipedia.org/wiki/Controlled_vocabulary">Controlled vocabulary</a></li>
<li> <a title="Thesauri" href="http://en.wikipedia.org/wiki/Thesauri">Thesauri</a></li>
<li> <a title="Collaborative tagging" href="http://en.wikipedia.org/wiki/Collaborative_tagging">Collaborative tagging</a></li>
<li> <a title="Folk taxonomy" href="http://en.wikipedia.org/wiki/Folk_taxonomy">Folk taxonomy</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Web_directory">Directory</a></li>
<li> <a href="http://www.mulberrytech.com/Extreme/Proceedings/html/2005/Newcomb01/EML2005Newcomb01.html">Subject Map</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Cladistics">Cladistics</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Markup_language">Markup languages</a></li>
</ul>
</td>
<td style="width: 25%; vertical-align: top;">
<ul>
<li> <a title="Social bookmarking" href="http://en.wikipedia.org/wiki/Social_bookmarking">Social bookmarking</a></li>
<li> <a title="Tag (metadata)" href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">Tags</a></li>
<li> <a title="Tagging" href="http://en.wikipedia.org/wiki/Tagging">Tagging</a></li>
<li> <a title="Taxonomy" href="http://en.wikipedia.org/wiki/Taxonomy">Taxonomy</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Folksonomy">Folksonomy</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Library_classification">Classification</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Categorization">Categorization</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Metadata">Metadata</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Systematics">Systematics</a></li>
</ul>
</td>
<td style="width: 25%; vertical-align: top;">
<ul>
<li><a href="http://en.wikipedia.org/wiki/Ontology_%28computer_science%29">Ontology</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Microformats">Microformats</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Data_dictionary">Data dictionary</a></li>
<li> <a href="http://en.wikipedia.org/wiki/OPML">OPML</a></li>
<li> <a href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Tree_structure">Subject Trees</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Information_architecture">Information Architecture</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Data_Reference_Model">Data Reference Model</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Phylogeny">Phylogeny</a></li>
</ul>
</td>
<td style="width: 25%; vertical-align: top;">
<ul>
<li><a href="http://en.wikipedia.org/wiki/Topic_map">Topic Maps</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Concept_map">Concept Maps</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Synset">Synsets</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Glossary">Glossary</a></li>
<li> <a href="http://en.wikipedia.org/wiki/WordNet">WordNet</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Metadata">Metadata</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Faceted_classification">Facets</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Structure_%28mathematical_logic%29">Structure</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Dublin_Core">Dublin Core</a></li>
<li> <a href="http://en.wikipedia.org/wiki/Typology">Typology</a></li>
</ul>
</td>
</tr>
</tbody>
</table>
</div>
<p>Since then, I could add even more terms to this list.</p>
<p>Lack of precision as to what ontology means has meant that it has been sloppily defined. As I have <a href="../?p=426">harped</a> <a href="../?p=437">upon</a> <a href="../?p=440">many</a> <a href="../?p=450">times</a> regarding semantic Web terminology, this is a sad state of affairs for the semWeb endeavor that has <span style="font-style: italic;">meaning</span> at the core of its purpose.</p>
<p>I&#8217;m pretty sure that the original intent in embracing the concept of ontology within the realm of <a href="http://en.wikipedia.org/wiki/Knowledge_representation">knowledge representation</a> was not to see this term so broadly misused or mis-applied. I suspect, as well, that if we could sharpen up our understanding and remove some of the fuzziness that we could improve communications with the lay public across many levels of the semWeb enterprise.</p>
<h3>The Useful Distinction of the TBox and ABox</h3>
<p>Recently, I have been looking to the semantic Web&#8217;s roots in <a href="http://en.wikipedia.org/wiki/Description_logic">description logics</a>. One of my writings, <a style="font-style: italic;" title="Thinking ?Inside the Box? with Description Logics" href="../?p=466">Thinking &#8216;Inside the Box&#8217; with Description Logics</a>, looked at the conceptual distinctions between the so-called &#8216;<a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Tbox">TBox</a>&#8216; and &#8216;<a style="font-style: italic;" href="http://en.wikipedia.org/wiki/ABox">ABox</a>&#8216;. That is, a knowledge base is a logical schema of roles and concepts and the relationships between them (the <span style="font-style: italic;">TBox</span>), which is populated by the actual data (instances) asserting memberships and attributes (&#8220;facts&#8221;) (the <span style="font-style: italic;">ABox</span>).</p>
<p>By analogy, in a conventional relational database system, the database or logical schema would correspond to the TBox; the actual data records or tables would correspond to the ABox. Often, the term <span style="font-style: italic;">ontology</span> is used to cover both ABox and TBox statements (which, I argue, only makes the understanding of the &#8216;ontology&#8217; concept more difficult).</p>
<p>My recent writing, <a style="font-style: italic;" title="Back to the Future with Description Logics" href="../?p=470">Back to the Future with Description Logics</a>, discussed at some length the advantages of keeping the TBox and ABox separate. This current article now expands on those thoughts, particularly with respect to the definition and understanding of ontology.</p>
<p>The starting point for this new mindset is to return to the ideas of data records or data tables <span style="font-style: italic;">v.</span> the logical schema that is prevalent in relational databases.</p>
<h3>So Many Structs, So Little Time</h3>
<p>The last time I took a census, about a year ago, there were more than 100 converters of various record and data structure types to RDF <a href="#light2">[2]</a>. These converters &#8212; also sometimes known as translators or &#8216;RDFizers&#8217; &#8212; generally take some input data records with varying formats or <a href="http://en.wikipedia.org/wiki/Serialization">serializations</a> and convert them to a form of RDF serialization (such as RDF/XML or <a href="http://www.w3.org/DesignIssues/Notation3.html">N3</a>), often with some ontology matching or characterizations. That last census listed these converters:</p>
<div class="mediumIndent" style="margin: 15px; font-size: 10px;">
<table style="text-align: left; margin-left: 0px; width: 90%;" border="0" cellspacing="2" cellpadding="2">
<tbody>
<tr>
<td style="vertical-align: top; width: 25%;">
<ul>
<li>RDF</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>Serialization formats:</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>RDF/XML</li>
<li>N3</li>
<li>Turtle</li>
</ul>
</li>
<li>Automatically recognized ontologies:</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>SIOC</li>
<li>SKOS</li>
<li>FOAF</li>
<li> <span class="wikiword">AtomOWL</span></li>
<li>Annotea</li>
<li>Music Ontology</li>
<li>Bibliographic Ontology</li>
<li>EXIF</li>
<li>vCard</li>
<li>Others</li>
</ul>
</li>
</ul>
</li>
<li>(X)HTML pages</li>
<li>HTML header metadata</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>Dublin Core</li>
</ul>
</li>
<li>Embedded microformats</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>eRDF</li>
<li>RDFa</li>
<li>hCard</li>
<li>hCalendar</li>
<li>XFN</li>
<li>xFolk</li>
</ul>
</li>
<li>Syndication Formats:</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>RSS 2.0</li>
<li>Atom</li>
<li>OPML</li>
<li>OCS</li>
<li>XBEL (for bookmarks)</li>
</ul>
</li>
<li>GRDDL <a href="#light1">[1]</a></li>
</ul>
</td>
<td style="width: 25%; vertical-align: top;">
<ul>
<li>REST-style Web service APIs:</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>Google Base</li>
<li>Flickr</li>
<li>Del.icio.us</li>
<li>Ning</li>
<li>Amazon</li>
<li>eBay</li>
<li>Freebase</li>
<li>Facebook</li>
<li>raw HTTP</li>
<li>Etc.</li>
</ul>
</li>
<li>Files (multitude of file formats and MIME types, including):</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>MS Office</li>
<li>OpenOffice</li>
<li>Open Document Format</li>
<li>images</li>
<li>audio</li>
<li>video</li>
<li>Etc.</li>
</ul>
</li>
<li>Web services:</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>BPEL</li>
<li>WSDL</li>
<li>XBRL</li>
<li>XBEL</li>
</ul>
</li>
<li>Data exchange formats</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li>iCalendar</li>
<li>vCard</li>
</ul>
</li>
<li>Virtuoso VADs</li>
<li> OpenLink license files</li>
<li>Third party metadata extraction frameworks:</li>
<li style="list-style-type: none; list-style-image: none; list-style-position: outside; display: inline;">
<ul>
<li><a href="http://aperture.sourceforge.net/">Aperture</a></li>
<li>Spotlight</li>
<li> <a href="http://simile.mit.edu/wiki/RDFizers">SIMILE RDFizers</a></li>
</ul>
</li>
</ul>
</td>
<td style="width: 25%; vertical-align: top;">Note that MIT&#8217;s <a href="http://simile.mit.edu/wiki/RDFizers">SIMILE RDFizers</a> also recognizes these formats:</p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<ul>
<li> <a title="JPEG RDFizer" href="http://simile.mit.edu/wiki/JPEG_RDFizer">JPEG </a>→ RDF</li>
<li> <a title="MARC/MODS RDFizer" href="http://simile.mit.edu/wiki/MARC/MODS_RDFizer">MARC/MODS </a>→ RDF</li>
<li> <a title="OAI-PMH RDFizer" href="http://simile.mit.edu/wiki/OAI-PMH_RDFizer">OAI-PMH </a>→ RDF</li>
<li> <a title="OCW RDFizer" href="http://simile.mit.edu/wiki/OCW_RDFizer">OCW </a>→ RDF</li>
<li> <a title="Email RDFizer" href="http://simile.mit.edu/wiki/Email_RDFizer">EMail </a>→ RDF</li>
<li> <a title="BibTeX RDFizer" href="http://simile.mit.edu/wiki/BibTeX_RDFizer">BibTEX </a>→ RDF</li>
<li> <a title="Maven POM RDFizer" href="http://simile.mit.edu/wiki/Maven_POM_RDFizer">POM </a>→ RDF</li>
<li> <a class="new" title="DEB RDFizer" href="http://simile.mit.edu/mediawiki/index.php?title=DEB_RDFizer&amp;action=edit">DEB </a>→<span class="new"> RDF</span></li>
<li> <a class="new" title="CRW RDFizer" href="http://simile.mit.edu/mediawiki/index.php?title=CRW_RDFizer&amp;action=edit">CRW </a>→<span class="new"> RDF</span></li>
<li> <a class="new" title="Flat RDFizer" href="http://simile.mit.edu/mediawiki/index.php?title=Flat_RDFizer&amp;action=edit">Flat </a>→<span class="new"> RDF</span></li>
<li> <a class="new" title="Weather RDFizer" href="http://simile.mit.edu/mediawiki/index.php?title=Weather_RDFizer&amp;action=edit">Weather </a>→<span class="new"> RDF</span></li>
<li> <a title="Java RDFizer" href="http://simile.mit.edu/wiki/Java_RDFizer">Java </a>→ RDF</li>
<li> <a title="Javadoc RDFizer" href="http://simile.mit.edu/wiki/Javadoc_RDFizer">Javadoc </a>→ RDF</li>
<li> <a title="Jira RDFizer" href="http://simile.mit.edu/wiki/Jira_RDFizer">Jira </a>→ RDF</li>
<li> <a title="Subversion RDFizer" href="http://simile.mit.edu/wiki/Subversion_RDFizer">Subversion </a>→ RDF</li>
<li> Random → RDF</li>
</ul>
</td>
<td style="width: 25%; vertical-align: top;">There is a growing list of third-party RDFizers as well:</p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<ul>
<li><a title="http://www.w3.org/2000/10/swap/pim/ldif2n3.py" href="http://www.w3.org/2000/10/swap/pim/ldif2n3.py">LDIF </a>→ RDF</li>
<li><a class="external text" title="http://www.w3.org/2002/12/cal/fromIcal.py" rel="nofollow" href="http://www.w3.org/2002/12/cal/fromIcal.py">iCal </a>→<span class="external text"> RDF</span></li>
<li><a title="http://dev.w3.org/cvsweb/2001/palmagent" href="http://dev.w3.org/cvsweb/2001/palmagent">Palm </a>→ RDF</li>
<li><a class="external text" title="http://www.w3.org/2000/10/swap/pim/lookout.py" rel="nofollow" href="http://www.w3.org/2000/10/swap/pim/lookout.py">Outlook </a><span class="external text"> </span>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.w3.org/2000/04/maillog2rdf/aboutMsg.py" rel="nofollow" href="http://www.w3.org/2000/04/maillog2rdf/aboutMsg.py">RFC822 </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.w3.org/2000/10/swap/pim/fromGarmin.py" rel="nofollow" href="http://www.w3.org/2000/10/swap/pim/fromGarmin.py">Garmin </a><span class="external text"> </span>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.w3.org/2000/10/swap/util/fink2n3.py" rel="nofollow" href="http://www.w3.org/2000/10/swap/util/fink2n3.py">Fink </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.wiwiss.fu-berlin.de/suhl/bizer/d2rq/index.htm" rel="nofollow" href="http://www.wiwiss.fu-berlin.de/suhl/bizer/d2rq/index.htm">D2RQ</a></li>
<li><a class="external text" title="http://www.wiwiss.fu-berlin.de/suhl/bizer/d2rmap/D2Rmap.htm" rel="nofollow" href="http://www.wiwiss.fu-berlin.de/suhl/bizer/d2rmap/D2Rmap.htm">D2RMAP</a></li>
<li><a class="external text" title="http://www.mindswap.org/%7Erreck/excel2rdf.shtml" rel="nofollow" href="http://www.mindswap.org/%7Erreck/excel2rdf.shtml">XLS </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.mindswap.org/%7Erreck/excel2rdf.shtml" rel="nofollow" href="http://www.mindswap.org/%7Erreck/excel2rdf.shtml">CSV </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://rdf123.umbc.edu/" rel="nofollow" href="http://rdf123.umbc.edu/">RDF123</a></li>
<li><a class="external text" title="http://rhizomik.net/redefer/" rel="nofollow" href="http://rhizomik.net/redefer/">XSD </a>→<span class="external text"> OWL</span></li>
<li><a class="external text" title="http://rhizomik.net/content/" rel="nofollow" href="http://rhizomik.net/content/">XML </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://rhizomik.net/redefer/" rel="nofollow" href="http://rhizomik.net/redefer/">MPEG-7/CS </a>→<span class="external text"> OWL</span></li>
<li><a class="external text" title="http://www.inf.unideb.hu/~jeszy/xmp/" rel="nofollow" href="http://www.inf.unideb.hu/%7Ejeszy/xmp/">XMP </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.inf.unideb.hu/~jeszy/rdfizers/" rel="nofollow" href="http://www.inf.unideb.hu/%7Ejeszy/rdfizers/">BitTorrent </a><span class="external text"> </span>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.inf.unideb.hu/~jeszy/rdfizers/" rel="nofollow" href="http://www.inf.unideb.hu/%7Ejeszy/rdfizers/">RPM </a>→<span class="external text"> RDF</span></li>
<li><a class="external text" title="http://www.l3s.de/~siberski/bibtex2rdf/" rel="nofollow" href="http://www.l3s.de/%7Esiberski/bibtex2rdf/">BibTeX </a>→<span class="external text"> RDF</span></li>
</ul>
</td>
</tr>
</tbody>
</table>
</div>
<p>This wealth of formats shows the robustness of the RDF data model to capture structure and data relationships from virtually any input form. This is what makes RDF so exciting as a canonical target for getting data to interoperate.</p>
<h3>Let&#8217;s Make this Elementary, Dr. Watson</h3>
<p>However &#8212; and this is crucial &#8212; standard users for decades have preferred simple, text-based and human readable formats for writing and transferring their structured data.</p>
<p>These various forms, sometimes well specified with APIs and sometimes almost ad hoc as in spreadsheet listings, are what we call &#8216;<span style="font-style: italic;">structs</span>&#8216;. <span style="font-style: italic;">Structs</span> can all be displayed as text and have, at minimum, explicit or inferrable key-value pairs to convey data relationships and attributes, with data types and values often noted by various white space, delimiter or other text conventions.</p>
<p>There is no doubt that the vast majority of extant data is found in such formats, including the results of data or information extraction from unstructured text. Indeed, even HTML and many markup languages with their angle bracket-delimited fields fall into this category.</p>
<p>There have literally been hundreds of <a href="http://en.wikipedia.org/wiki/Lightweight_markup_language">various</a> formats proposed over decades for conveying lightweight data structures. Most have been proprietary or limited to specific domains or users. Some, such as <a href="http://en.wikipedia.org/wiki/Fielded_text">fielded text</a>, <a href="http://www.zope.org/Documentation/Articles/STX">structured text</a>, <a href="http://en.wikipedia.org/wiki/Simple_Declarative_Language">simple declarative language</a> (SDL), or more recently <a href="http://en.wikipedia.org/wiki/YAML">YAML</a> or its simpler cousin <a href="http://en.wikipedia.org/wiki/JSON">JSON</a>, have become more widely adopted and supported by formal specifications, tools or APIs. JSON, especially, is a preferred form for Web 2.0 applications.</p>
<p>Some, like <a href="http://en.wikipedia.org/wiki/Microformats">microformats</a> or this example <a href="http://en.wikipedia.org/wiki/Bibtex">BibTeX</a> record below (with some non-standard extensions), rely less on syntax conventions and may use reserved keywords (such as AUTHOR or TITLE as shown) to signal the key type for the key-value pair:</p>
<div class="boxYellowSolid">
<pre>ID_LOCAL arXiv:0711.3808
AUTHOR &lt;a href="#Schramm_O"&gt;Oded Schramm&lt;/a&gt;
BIBTYPE ARTICLE
ID arXiv:0711.3808
JOURNAL Electron. Res. Announc. Math. Sci.
PAGES 17--23
SUBJECTS geom
TITLE Hyperfinite graph limits
URL http://www.aimsciences.org/journals/doIpChk.jsp?paperID=3117&amp;mode=full
URL http://www.aimsciences.org/journals/displayPapers0.jsp?comments=&amp;pubID=221&amp;journID=14&amp;pubString_num=Volume:
15, 2008 Journal Issue
VOLUME 15
YEAR 2008</pre>
</div>
<p>Some of these simple formats have been more successful than others, though none have achieved market dominance. There also appear to be few universal principles that have emerged as to syntax or format. Nonetheless, any of these various <span style="font-style: italic;">struct</span> forms are easy for casual readers to understand and easy for domain experts to write.</p>
<p>For modeling and interoperability purposes, many of these forms are patently inadequate. That is why many of these simpler forms might be called &#8220;naïve&#8221;: they achieve their immediate purpose of simple relationships and communication, but require understood or explicit context in order to be meaningfully (semantically) related to other forms or data.</p>
<p>Yet, if we have learned nothing else with the phenomenal success of the Web it is this: simplicity trumps elegance or expressivity.</p>
<h3>RDF and the Skinny ABox</h3>
<p>The RDF (<a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">Resource Description Framework</a>) data model is expressed as simple <span style="font-style: italic;">subject</span>-<span style="font-style: italic;">predicate</span>-<span style="font-style: italic;">object</span> &#8220;triple&#8221; statements. That sounds fancy, but just substitute verb for predicate and noun for subject and object. In other words: Dick sees Jane; or, the ball is round. It may sound like a kindergartner reader, but it is how data can be easily represented and built up into more complex structures and stories.</p>
<p>RDF triples can be applied equally to all structured, semi-structured and unstructured content. RDF is clearly a most capable data model that &#8212; through its ability to be extended with further concepts and relationships (vocabulary) &#8212; can create elegant and logical structures to represent comprehensive domains and knowledge bases. Finding such a model has been a quest in my professional life; I believe we finally have a winner to facilitate data interoperability using RDF.</p>
<p>But RDF has not achieved the market acceptance that its suitability as a data representation model might suggest. I think there are three reasons for this:</p>
<ul>
<li>First, RDF was first presented and &#8220;sold&#8221; as an XML serialization. This failing has been well understood for some time. This unfortunate early linkage of RDF caused confusion between data model and the XML syntax. The rather simple and incremental building blocks of triple RDF statements when presented in the nested XML syntax led to lengthy and hard-to-read specifications (for easier reading and use, see either the <a href="http://www.w3.org/DesignIssues/Notation3.html">N3</a> or <a href="http://www.dajobe.org/2004/01/turtle/">Turtle</a> syntaxes)</li>
<li>Second, triples by definition are 50% more complicated than a key-value pair. While the basic RDF statement might be simple like a Dick-and-Jane reader, as a data specification format it is still more complex than my personal attributes of <span style="font-family: monospace;">sex:Male</span> and <span style="font-family: monospace;">hair:Red</span> and <span style="font-family: monospace;">born:California</span>. Those three &#8220;facts&#8221; can not be said nearly so quickly in RDF. And, if we also adhere to <a href="http://en.wikipedia.org/wiki/Linked_Data">linked data</a>, each one of these items requires a URI unique identifier to boot! It is important not to ignore the desire for simple and human readable data-specification formats</li>
<li>Third, as this entry began and as we will conclude, RDF and its fuzzy relationship to ontology has led to over-specification of what needs to be included in the data record. What could simply be a record specification of an object and its attributes presented as simple key-value pairs has become burdened with &#8220;ontology&#8221; and &#8220;conceptual&#8221; relationships.</li>
</ul>
<p>Canonical forms embody all of the specification that the canon guiding them requires. What we may have failed to see in embracing RDF, however, is that getting useful data into the system need not carry all of this burden.</p>
<h3>Lightening Up and Shifting Work to the TBox</h3>
<div style="float: right;"><a href="../wp-content/themes/ai3/images/2009Posts/090119_tbox_abox.png"><img style="border: 0px solid; width: 200px; height: 279px;" title="Click to enlarge" src="../wp-content/themes/ai3/images/2009Posts/090119_tbox_abox.png" alt="ABox - TBox Split" /></a></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<div style="font-style: italic;"><a href="../wp-content/themes/ai3/images/2009Posts/090119_tbox_abox.png"><small> Click for full-size</small></a></div>
</div>
<p>So, what does all of this have to do with my starting diatribe about the term <span style="font-style: italic;">ontology</span>?</p>
<p>Whether a single database or the federation across all information known to human kind, we have data records (<span style="font-style: italic;">structs</span> of instances) and a logical schema (<span style="font-style: italic;">ontology</span> of concepts and relationships) by which we try to relate this information. This is a natural and meaningful split: structure and relationships <em>v.</em> the instances that populate that structure.</p>
<p>Stated this way, particularly for anyone with a relational database background, the split between schema and data is clear and obvious. Yet, the RDF, semantic Web and linked data communities have done an abysmal job of recognizing this fundamental separation of concerns.</p>
<p>We create &#8220;ontologies&#8221; that mix instances and schema. We insist on simple data record conversions that are burdened with relationship specifications as well. We tout a &#8220;linked data&#8221; infrastructure that is based solely on the same identity of instances without respect or attention to structure or conceptual relationships. We dismiss communities that work to express their data with useful local structures. We insist on standards and practices up and down the data staging and preparation chain that turns off the general market and makes us seem arrogant and dismissive. Frankly, in so many ways, we just don&#8217;t get it <a href="#light3">[3]</a>.</p>
<p>What has struck me personally over the past few months as these realizations have unfolded has been how much our own mindsets and language may be trapping us.</p>
<ul>
<li>Does existing structured data need to be expressed as RDF in order to be useful and integrated?</li>
<li>Exposing linked and instance data is great, but to what end; what are the conceptual or structural schema?</li>
<li>Why is our standards process so inward looking and parochial (often petty)? What purpose or who does this serve?</li>
</ul>
<p>At least for this diatribe, my essential conclusion is that we need to shift the burden of the schema and conceptual relations and (yes) world views to the TBox. We need to skinny down the ABox and make it a warm and welcoming environment by which any structured data (including the most naïve) can join.</p>
<p>So, ultimately, the bottom line is this: the burden of the semantic Web rests on us, not the providers of structured data.</p>
<p>It is time to streamline the ABox to smooth data contributions, assume as publishers the responsibility for the TBox, and keep those concerns separate. As for instance-related stuff, I now intend to refer to them as <span style="font-style: italic;">structs</span> governed by a controlled vocabulary (at most). I intend to reserve <span style="font-style: italic;">ontology</span> as a means to describe a given world view, a TBox, the schema and its relations of the domain at hand. And, frankly, this definition of ontology brings it back in balance with its roots in <span style="font-style: italic;">ontos</span> and the nature of the world.</p>
<p>It&#8217;s a good time to lighten up!</p>
<div class="boxBrownDotted center_ok" style="min-height: 80px; max-width: 460px;"><img style="width: 64px; height: 73px; float: left; margin-right: 10px;" title="Friday Brown Bag    Lunch" src="../wp-content/themes/ai3/images/lunchbag_64.png" alt="Friday      Brown Bag Lunch" /> This <a href="../834/announcing-the-sporadic-friday-brown-bag-lunch">Friday      brown bag leftover</a> was first placed into the <span style="font-weight: bold; color: #993300;">AI3</span> <a href="../chronological-listing/">refrigerator</a> on <a href="http://www.mkbergman.com/471/structs-naive-data-formats-and-the-abox/">January 22, 2009</a>, and is one of the more popular historical posts of this blog.   This reprise is unchanged since its original posting, though we have continued to make progress on constructs such as<a href="http://techwiki.openstructs.org/index.php/Category:IrON"> irON</a> to capture this idea. <a href="http://dev.w3.org/html5/md/">Microdata</a> in HTML5 is also an important contribution, to which we will devote some attention in the near future.</div>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a title="light1" name="light1"></a> [1] <a href="http://www.w3.org/TR/grddl/">GRDDL</a> (Gleaning Resource Descriptions from Dialects of Languages) is a W3C markup format for getting RDF data out of XML and XHTML documents using explicitly associated transformation algorithms, typically represented in XSLT GRDDL accomodates a wide variety of dialects (see <a href="http://esw.w3.org/topic/CustomRdfDialects">one listing</a>) and can be combined with arbitrary transformation mechanisms (though currently mostly based on XSLTs).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="light2" name="light2"></a> [2] Also see the listing of &#8220;dynamic&#8221; RDFizers at <a href="http://esw.w3.org/topic/DynamicRDFizers">http://esw.w3.org/topic/DynamicRDFizers</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="light3" name="light3"></a> [3] I don&#8217;t mean to imply that there are not those in the community interested in lightweight data structures or their conversion, just that they have been more of a minority to date.  For example, the <span lang="EN-US"><a href="http://semanticscripting.org/SFSW2009/">5th Workshop on Scripting and Development for the Semantic Web</a> is coming up this summer in conjunction with the <a href="http://www.eswc2009.org/">6th European Semantic Web Conference</a> in Crete, Greece; this year&#8217;s organizers are </span>Gunnar Aastrand Grimnes (<a href="http://www.dfki.de/web/welcome?set_language=en&amp;cl=en">DFKI Knowledge Management Lab</a>), Chris Bizer (<a href="http://www.fu-berlin.de/">Freie U</a><a href="http://www.uni-leipzig.de/">niversität </a><a href="http://www.fu-berlin.de/"> Berlin</a>) and Sören Auer (<a href="http://www.uni-leipzig.de/">U</a><a href="http://www.uni-leipzig.de/">niversität </a><a href="http://www.uni-leipzig.de/"> Leipzig</a>).  As other examples focusing on JSON, there are a couple of efforts to define representation conventions from <a href="http://n2.talis.com/wiki/RDF_JSON_Specification">Talis</a> and <a href="http://www.gbv.de/wikis/cls/RDF_in_JSON">GBV</a> for serializing RDF; <a href="http://jibbering.com/rdf-parser/">Jim Ley</a>, <a href="http://www.kanzaki.com/works/2006/misc/0308turtle.html">Kanzaki Masahide</a> and <a href="http://librdf.org/rasqal/roqet.html">Dave Beckett</a> (likely among others) have written simple and straightforward RDF and <a href="http://www.dajobe.org/2004/01/turtle/">Turtle</a> parsers and converters; there was a floated idea for an RDF version of JSON called <a href="http://lists.w3.org/Archives/Public/semantic-web/2007Jul/0323.html">RDFON</a> that has now evolved into the <a href="http://www.urf.name/">TURF</a> approach; and <a href="http://jdil.org/">JDIL</a> (JSON data integration layer) instructs how to add namespaces to JSON to enable encoding RDF.  Still further examples are Beckett&#8217;s <a href="http://triplr.org/">Triplr</a> and Auer&#8217;s <a href="http://aksw.org/">ASKW</a> <a href="http://triplify.org/Overview">Triplify</a> lightweight conversion services involving many different formats. These are all laudable efforts with good relevance to a lighter ABox approach, I think.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/951/brown-bag-lunch-%e2%80%98structs%e2%80%99-naive-data-formats-and-the-abox/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Brown Bag Lunch: An Intrepid Guide to Ontologies</title>
		<link>http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/</link>
		<comments>http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/#comments</comments>
		<pubDate>Fri, 26 Nov 2010 08:43:48 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Brown Bag Lunch]]></category>
		<category><![CDATA[Description Logics]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[knowledge management]]></category>
		<category><![CDATA[Knowledge Representation]]></category>
		<category><![CDATA[Ontology]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=936</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: An Intrepid Guide to Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=Description Logics&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-11-26&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/&amp;rft.language=English"></span>
There&#8217;s an Endless Variety of World Views, and Almost as Many Ways to Organize and Describe Them Ontology is one of the more daunting terms for those exposed for the first time to the semantic Web. Not only is the word long and without many common antecedents, but it is also a term that has [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: An Intrepid Guide to Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=Description Logics&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-11-26&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/&amp;rft.language=English"></span>
<h2>There&#8217;s an Endless Variety of World Views, and Almost as Many Ways to Organize and Describe Them<a href="http://www.mkbergman.com/834/announcing-the-sporadic-friday-brown-bag-lunch"><img style="border: 0px solid; float: left; margin-right: 10px;" title="Friday Brown Bag Lunch" src="../wp-content/themes/ai3/images/lunchbag_225.jpg" alt="Friday     Brown Bag Lunch" width="158" height="179" /></a></h2>
<p>Ontology  is one of the more daunting terms for those exposed for the first time  to the semantic Web.  Not only is the word long and without many common  antecedents, but it is also a term that has widely divergent use and  understanding within the community.  It can be argued that this  not-so-little word is one of the barriers to mainstream understanding of  the semantic Web.</p>
<p>The root of the term is the Greek <em>ontos</em>, or being or the nature of things.  Literally &#8212; and in classical philosophy &#8212; ontology was used in relation to the study of the nature of being or the world, <a href="http://en.wikipedia.org/wiki/Ontology">the nature of existence</a>.  <a href="http://tomgruber.org/">Tom Gruber</a>, among others, made the term popular in relation to <a href="http://en.wikipedia.org/wiki/Ontology_%28computer_science%29">computer science</a> and artificial intelligence <a href="http://tomgruber.org/writing/ontolingua-kaj-1993.htm">about 15 years ago</a> when he defined ontology as a &#8220;formal specification of a conceptualization.&#8221;</p>
<p>While there have been attempts to strap on more or less formal  understandings or machinery around ontology, it still has very much the  sense of a world view, a means of viewing and organizing and  conceptualizing and defining a domain of interest.  As is made clear  below, I personally prefer a loose and embracing understanding of the  term (consistent with Deborah McGuinness&#8217;s 2003 paper, <a href="http://www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-mit-press-%28with-citation%29.htm">Ontologies Come of Age</a> [<a href="#onto1">1</a>]).</p>
<p>There has been a resurgence of interest in ontologies of late.  Two  reasons have been the emergence of Web 2.0 and tagging and folksonomies,  as well as the nascent emergence of the structured Web.  In fact, on April 23-24 one of the noted communities of practice around ontologies, <a title="http://ontolog.cim3.net/wiki/" rel="nofollow" href="http://ontolog.cim3.net/wiki/">Ontolog</a>, sponsored the <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2007">Ontology Summit 2007</a> ,&#8221;Ontology, Taxonomy, Folksonomy: Understanding the Distinctions.&#8221;</p>
<p>These events have sparked my preparing this guide to ontologies.  I  have to admit this is a somewhat intrepid endeavor given the wealth of  material and diversity of opinions.</p>
<div class="boxBrownDotted center_ok" style="min-height: 80px; max-width: 460px;"><img style="width: 64px; height: 73px; float: left; margin-right: 10px;" title="Friday Brown Bag    Lunch" src="../wp-content/themes/ai3/images/lunchbag_64.png" alt="Friday      Brown Bag Lunch" /> This <a href="../834/announcing-the-sporadic-friday-brown-bag-lunch">Friday      brown bag leftover</a> was first placed into the <span style="font-weight: bold; color: #993300;">AI3</span> <a href="../chronological-listing/">refrigerator</a> more than three years ago on <a href="http://www.mkbergman.com/374/an-intrepid-guide-to-ontologies/">May 16, 2007</a>.  This reprise is unchanged since its original posting, though there is a more recent executive-level <a href="http://techwiki.openstructs.org/index.php/Intro_to_Ontologies">intro to ontologies</a> on the <a href="http://openstructs.org/">OpenStructs</a>&#8216; <a href="http://techwiki.openstructs.org/index.php/Main_Page">TechWiki</a>.</div>
<h3>Overview and Role of Ontologies</h3>
<p>Of course, a fancy name is not sufficient alone to warrant an  interest in ontologies.  There are reasons why understanding, using and  manipulating ontologies can bring practical benefit:</p>
<ul>
<li>Depending on their degree of formalism (an important dimension), ontologies help make explicit the scope, definition, and language and meaning (semantics) of a given domain or world view</li>
<li>Ontologies may provide the power to generalize about their domains</li>
<li>Ontologies, if hierarchically structured in part (and not all are), can provide the power of inheritance</li>
<li>Ontologies provide guidance for how to correctly &#8220;place&#8221; information in relation to other information in that domain</li>
<li>Ontologies may provide the basis to reason or infer over its domain (again as a function of its formalism)</li>
<li>Ontologies can provide a more effective basis for information extraction or content clustering</li>
<li>Ontologies, again depending on their formalism, may be a source of  structure and controlled vocabularies helpful for disambiguating  context; they can inform and provide structure to the &#8220;lexicons&#8221; in  particular domains</li>
<li>Ontologies can provide guiding structure for browsing or discovery within a domain, and</li>
<li>Ontologies can help relate and &#8220;place&#8221; other ontologies or world  views in relation to one another; in other words, ontologies can  organize ontologies from the most specific to the most abstract.</li>
</ul>
<p>Both structure and formalism are dimensions for classifying  ontologies, which combined are often referred to as an ontology&#8217;s  &#8220;expressiveness.&#8221;  How one describes this structure and formality  differs.  One recent attempt is this figure from the <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?OntologySummit2007">Ontology Summit 2007</a>&#8216;s wrap-up communique:</p>
<div><a href="http://ontolog.cim3.net/file/work/OntologySummit2007/workshop/ontology-dimensions-map_20070423b.png"><img class="center_ok" src="http://ontolog.cim3.net/file/work/OntologySummit2007/workshop/ontology-dimensions-map_20070423b.png" alt="Ontology Summit 2007 Communique Diagram" width="600" height="434" /></a></div>
<p>Note the bridging role that an ontology plays between a domain and  its content.  (By its nature, every ontology attempts to &#8220;define&#8221; and  bound a domain.)  Also note that the Summit&#8217;s 50 or so participants were  focused on the trade-off between semantics v. pragmatic considerations.   This was a result of the ongoing attempts within the community to  understand, embrace and (possibly) legitimize &#8220;less formal&#8221; Web 2.0  efforts such as tagging and the folksonomies that can result from them.</p>
<p>There is an <a href="http://en.wikipedia.org/wiki/M._C._Escher">M.C. Escher</a>-like  recursion of the lizard eating its tail when one observes ontologists  creating an ontology to describe the ontological domain.  The above  diagram, which itself would be different with a slight change in Summit  participation or editorship, is, of course, but one representative view  of the world.  Indeed, a tremendous variety of scientific and research  disciplines concern themselves with classifying and organizing the  &#8220;nature of things.&#8221;  Those disciplines go by such names as logicians,  taxonomists, philosophers, information architects, computer scientists,  librarians, operations researchers, systematicists, statisticians,  historians, and so forth.  (In short, given our ontos,  every area of human endeavor has the urge to classify, to organize.)   In each of these areas not only do their domains differ, but so do the  adopted structures and classification schemes often used.</p>
<p>There are at least 40 terms or concepts across these various  disciplines, most related to Web and general knowledge content, that  have organizational or classificatory aspects that &#8212; loosely defined &#8212;  could be called an &#8220;ontology&#8221; framework or approach:</p>
<div style="margin-top: 15px; margin-bottom: 15px;">
<table class="center_ok" border="0" cellspacing="2" cellpadding="2">
<tbody>
<tr>
<td>
<ul>
<li><a title="Tag cloud" href="http://en.wikipedia.org/wiki/Tag_cloud">Tag cloud</a></li>
<li><a title="Controlled vocabulary" href="http://en.wikipedia.org/wiki/Controlled_vocabulary">Controlled vocabulary</a></li>
<li><a title="Thesauri" href="http://en.wikipedia.org/wiki/Thesauri">Thesauri</a></li>
<li><a title="Collaborative tagging" href="http://en.wikipedia.org/wiki/Collaborative_tagging">Collaborative tagging</a></li>
<li><a title="Folk taxonomy" href="http://en.wikipedia.org/wiki/Folk_taxonomy">Folk taxonomy</a></li>
<li><a href="http://en.wikipedia.org/wiki/Web_directory">Directory</a></li>
<li><a href="http://www.mulberrytech.com/Extreme/Proceedings/html/2005/Newcomb01/EML2005Newcomb01.html">Subject Map</a></li>
<li><a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a></li>
<li><a href="http://en.wikipedia.org/wiki/Cladistics">Cladistics</a></li>
<li><a href="http://en.wikipedia.org/wiki/Markup_language">Markup languages</a></li>
</ul>
</td>
<td>
<ul>
<li><a title="Social bookmarking" href="http://en.wikipedia.org/wiki/Social_bookmarking">Social bookmarking</a></li>
<li><a title="Tag (metadata)" href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">Tags</a></li>
<li><a title="Tagging" href="http://en.wikipedia.org/wiki/Tagging">Tagging</a></li>
<li><a title="Taxonomy" href="http://en.wikipedia.org/wiki/Taxonomy">Taxonomy</a></li>
<li><a href="http://en.wikipedia.org/wiki/Folksonomy">Folksonomy</a></li>
<li><a href="http://en.wikipedia.org/wiki/Library_classification">Classification</a></li>
<li><a href="http://en.wikipedia.org/wiki/Categorization">Categorization</a></li>
<li><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a></li>
<li><a href="http://en.wikipedia.org/wiki/Metadata">Metadata</a></li>
<li><a href="http://en.wikipedia.org/wiki/Systematics">Systematics</a></li>
</ul>
</td>
<td>
<ul>
<li><a href="http://en.wikipedia.org/wiki/Ontology_%28computer_science%29">Ontology</a></li>
<li><a href="http://en.wikipedia.org/wiki/Microformats">Microformats</a></li>
<li><a href="http://en.wikipedia.org/wiki/Data_dictionary">Data dictionary</a></li>
<li><a href="http://en.wikipedia.org/wiki/OPML">OPML</a></li>
<li><a href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></li>
<li><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a></li>
<li><a href="http://en.wikipedia.org/wiki/Tree_structure">Subject Trees</a></li>
<li><a href="http://en.wikipedia.org/wiki/Information_architecture">Information Architecture</a></li>
<li><a href="http://en.wikipedia.org/wiki/Data_Reference_Model">Data Reference Model</a></li>
<li><a href="http://en.wikipedia.org/wiki/Phylogeny">Phylogeny</a></li>
</ul>
</td>
<td>
<div>
<ul>
<li><a href="http://en.wikipedia.org/wiki/Topic_map">Topic Maps</a></li>
<li><a href="http://en.wikipedia.org/wiki/Concept_map">Concept Maps</a></li>
<li><a href="http://en.wikipedia.org/wiki/Synset">Synsets</a></li>
<li><a href="http://en.wikipedia.org/wiki/Glossary">Glossary</a></li>
<li><a href="http://en.wikipedia.org/wiki/WordNet">WordNet</a></li>
<li><a href="http://en.wikipedia.org/wiki/Metadata">Metadata</a></li>
<li><a href="http://en.wikipedia.org/wiki/Faceted_classification">Facets</a></li>
<li><a href="http://en.wikipedia.org/wiki/Structure_%28mathematical_logic%29">Structure</a></li>
<li><a href="http://en.wikipedia.org/wiki/Dublin_Core">Dublin Core</a></li>
<li><a href="http://en.wikipedia.org/wiki/Typology">Typology</a></li>
</ul>
</div>
</td>
</tr>
</tbody>
</table>
</div>
<p>Actual domains or subject coverage are then mostly orthogonal to these approaches.</p>
<p>Loosely defined, the number of possible ontologies is therefore close to infinite:  domain X perspective X schema.  (Just kidding &#8212; sort of!  In fact, UMBC&#8217;s <a href="http://swoogle.umbc.edu/">Swoogle</a> ontology search service claims 10,000 ontologies presently on the Web; the actual data from August 2006 ranges from about <a href="http://ebiquity.umbc.edu/blogger/untangling-ontologies-on-the-semantic-web/">16,000 to 92,000 ontologies</a>, depending on how &#8220;formal&#8221; the definition.  These counts are also limited to OWL-based ontologies.)</p>
<p>Many have misunderstood the semantic Web because of this diversity  and the slipperiness of the concept of an ontology.  This  misunderstanding becomes flat wrong when people claim the semantic Web  implies one single grand ontology or organizational schema, <a href="http://en.wikipedia.org/wiki/One_Ring">One Ring to Rule Them All</a>.  Human and domain diversities makes this viewpoint patently false.</p>
<h3>Diversity, &#8216;Naturalness&#8217; and Change</h3>
<p>The choice of an ontological approach to organize Web and structured  content can be contentious. Publishers and authors perhaps have too many  choices: from straight Atom or RSS feeds and feeds with tags to  informal folksonomies and then Outline Processor Markup Language or  microformats. From there, the formalism increases further to include the  standard RDF ontologies such as SIOC (Semantically-Interlinked Online  Communities), SKOS (Simple Knowledge Organizing System), DOAP  (Description of a Project), and FOAF (Friend of a Friend) and the still  greater formalism of OWL&#8217;s various dialects.</p>
<p>Arguing which of these is the theoretical best method is doomed to  failure, except possibly in a bounded enterprise environment. We live in  the real world, where multiple options will always have their advocates  and their applications. All of us should welcome whatever structure we  can add to our information base, no matter where it comes from or how  it&#8217;s done. The sooner we can embrace content in any of these formats and  convert it to a canonical form, we can then move on to needed  developments in semantic mediation, the threshold condition for the  semantic Web.</p>
<div class="boxGrayDotted" style="float: right; margin-left: 10px; margin-right: 0px; width: 240px; font-size: 1.1em;">There are at least 40 concepts &#8212; loosely defined &#8212; that could be called an &#8220;ontology&#8221; framework or approach.</div>
<p>So, diversity is inevitable and should be accepted.  But that observation need not also embrace chaos.</p>
<p>In my early training in biological systematics, <a title="Ernst Haeckel" href="http://en.wikipedia.org/wiki/Ernst_Haeckel">Ernst Haeckel&#8217;s</a> <a title="Recapitulation theory" href="http://en.wikipedia.org/wiki/Recapitulation_theory">recapitulation theory</a> that &#8220;ontogeny recapitulates phylogeny&#8221; (note the same ontos  root, the difference from ontology being growth v. study) was losing  favor fast.  The theory was that the development of an organism through  its embryological phases mirrors its evolutionary history.  Today,  modern biologists recognize numerous connections between ontogeny and  phylogeny, explain them using <a title="Evolutionary developmental biology" href="http://en.wikipedia.org/wiki/Evolutionary_developmental_biology">evolutionary theory</a>, or view them as supporting evidence for that theory.</p>
<p>Yet, like the construction of phylogenetic trees, systematicists  strive for their classifications of the relatedness of organisms to be  &#8220;natural&#8221;, to reflect the true nature  of the relationship.  Thus, over time, that understanding of a  &#8220;natural&#8221; system has progressed from appearance → embryology →  embryology + detailed morphology → species and interbreeding → DNA.   While details continue to be worked out, the degree of genetic  relatedness is now widely accepted by biologists as a &#8220;natural&#8221; basis  for organizing the <a href="http://en.wikipedia.org/wiki/Tree_of_life_%28Science%29">Tree of Life</a>.</p>
<p>It is not unrealistic to also seek &#8220;naturalness&#8221; in the organization  of other knowledge domains, to seek &#8220;naturalness&#8221; in the organization of  their underlying ontologies.  Like natural systems in biology, this  naturalness should emerge from the shared understandings and perceptions  of the domain&#8217;s participants.  While subject matter expertise and  general and domain knowledge are essential to this development, they are  not the only factors.  As tagging systems on the Web are showing,  common usage and broad acceptance by the community at hand is important  as well.</p>
<p>While it may appear that a domain such as the biological relatedness  of organisms is more empirical than the concepts and ambiguous words in  most domains of human endeavor, these attempts at naturalness are still  not foolish.  The phylogeny example shows that understanding changes  over time as knowledge is gained.  We now accept DNA over the  recapitulation theory.</p>
<p>As the formal SKOS organizational schema for knowledge systems  recognizes (see below), the ideas of narrower and broader concepts can  be readily embraced, as well as concepts of relatedness and aliases  (synonyms).  These simple constructs, I would argue, plus the  application of knowledge being gained in related domains, will enable  tomorrow&#8217;s understandings to be more &#8220;natural&#8221; than today&#8217;s, no matter  the particular domain at hand.</p>
<p>So, in seeking a &#8220;naturalness&#8221; within our organizational schema, we  can also see that change is a constant.  We also see that the tools and  ideas underlying the seemingly abstract cause of merging and relating  existing ontologies to one another will further a greater &#8220;naturalness&#8221;  within our organizations of the world.</p>
<h3>A Spectrum of Formalisms</h3>
<p>According to the Summit, expressiveness is the extent and ease by which an ontology can describe domain semantics.  Structure they define as the degree of organization or hierarchical extent of the ontology.  They further define granularity  as the level of detail in the ontology.  And, as the diagram above  alludes, they define other dimensions of use, logical basis, purpose and  so forth of an ontology.</p>
<p>The over fifty respondents from 42 communities submitted some 70  different ontologies under about 40 terms to a survey that was used by  the Summit to construct their diagram.  These submissions included:</p>
<div class="boxYellowDotted">. . .  formal ontologies (e.g., <a href="http://www.ifomis.uni-saarland.de/bfo/">BFO</a>, <a href="http://www.loa-cnr.it/DOLCE.html">DOLCE</a>, <a href="http://www.ontologyportal.org/">SUMO</a>), biomedical ontologies (e.g., Gene Ontology, SNOMED CT, UMLS, ICD), thesauri (e.g., <a href="http://www.nlm.nih.gov/mesh/termscon.html">MeSH</a>, National Agricultural Library Thesaurus), folksonomies (e.g., Social bookmarking tags), general ontologies (<a href="http://www.cogsci.princeton.edu/%7Ewn">WordNet</a>, <a href="http://www.cyc.com/cyc/opencyc/overview">OpenCyc</a>) and specific ontologies (e.g., Process Specification Language). The list also includes markup languages (e.g., <a href="http://www.neuroml.org/">NeuroML</a>),  representation formalisms (e.g., Entity-Relation model, OWL, WSDL-S)  and various ISO standards (e.g., ISO 11179). This [Ontolog] sample  clearly illustrates the diversity of artifacts collected under  &#8220;ontology&#8221;.</div>
<p>I think the simplest spectrum for such distinctions is the formalism  of the ontology and its approach (and language or syntax, not further  discussed here).  More formal ontologies have greater expressiveness and  structure and inferential power, less formal ones the opposite.   Constructing more formal ontologies is more demanding, and takes more  effort and rigor, resulting in an approach that is more powerful but  also more rigid and less flexible.  Like anything else, there are always  trade-offs.</p>
<p>Based on work by Leo Obrst of Mitre as interpreted by Dan McCreary, we can view this as a trade-off as one of semantic clarity <em>v.</em> the time and money required to construct the formalism [<a href="#onto12">12,</a> <a href="#onto13">13</a>]:</p>
<div style="text-align: center;"><a href="../wp-content/themes/ai3/images/2007Posts/070501d_SemanticSpectrum.png"><img class="center_ok" title="Structure and Formalism Increases Semantic Expressiveness" src="../wp-content/themes/ai3/images/2007Posts/070501d_SemanticSpectrum.png" alt="Structure and Formalism Increases Semantic Expressiveness" width="600" height="428" align="middle" /></a><br />
 [Click on image for full-size pop-up]</div>
<p>Note this diagram reflects the more conventional, practitioner&#8217;s view  of the &#8220;formal&#8221; ontology, which does not include taxonomies or  controlled vocabularies (for example) in the definition.  This  represents the more &#8220;closely defined&#8221; end of the <a href="http://en.wikipedia.org/wiki/Semantic_spectrum">ontology (semantic) spectrum</a>.</p>
<p>However, since we are speaking here of ontologies and the structured Web  or the semantic Web, I believe we need to embrace a concept of ontology  aligned to actual practice.  Not all content providers can or want to  employ ontology engineers to enable formal inferencing of their content.   Yet, on the other hand, their content in its various forms does have  some meaningful structure, some organization.  The trick is to extract  this structure for more meaningful use such as data exchange or data  merging.</p>
<h3>Ontology Approaches on the Web</h3>
<p>Under such &#8220;loosely defined&#8221; bases we can thus see a spectrum of  ontology approaches on the Web, proceeding from less structure and  formalism to more so:</p>
<div style="margin-top: 15px; margin-bottom: 15px;">
<table class="center_ok" style="text-align: left; vertical-align: top; width: 80%;" border="0" cellspacing="0" cellpadding="4">
<tbody>
<tr>
<td style="background-color: #ffffcc; width: 3px;"></td>
<td style="font-weight: bold; text-align: center; background-color: #ffffcc; width: 25%;">Type or Schema</td>
<td style="font-weight: bold; text-align: center; background-color: #ffffcc; width: 25%;">Examples</td>
<td style="font-weight: bold; text-align: center; background-color: #ffffcc; width: 49%;">Comments on Structure and Formalism</td>
</tr>
<tr>
<td style="background-color: #ffd3d3;"></td>
<td style="vertical-align: top;">Standard Web Page</td>
<td style="vertical-align: top;">entire Web</td>
<td style="vertical-align: top;">General metadata fields in the  and internal HTML codes and tags provide minimal, but useful sources of structure; other HTTP and retrieval data can also contribute</td>
</tr>
<tr>
<td style="background-color: #ffbbbb;"></td>
<td style="vertical-align: top;">Blog / Wiki Page</td>
<td style="vertical-align: top;">examples from <a href="http://technorati.com/tag/">Technorati</a>, <a href="http://www.bloglines.com/">Bloglines</a>, <a href="http://en.wikipedia.org/wiki/Main_Page">Wikipedia</a></td>
<td style="vertical-align: top;">Provides still greater formalism for the organization and characterization of content (subjects, categories, posts, comments, date/time stamps, etc.).  Importantly, with the addition of plug-ins, some of the basic software may also provide other structured characterizations or output (SIOC, FOAF, etc.; highly varied and site-specific given the diversity of publishing systems and plug-ins)</td>
</tr>
<tr>
<td style="background-color: #ffa3a3;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Rss">RSS</a> / <a href="http://en.wikipedia.org/wiki/Atom_%28standard%29">Atom</a> feeds</td>
<td style="vertical-align: top;">most blogs and most news feeds</td>
<td style="vertical-align: top;">RSS extends basic XML schema for more robust syndication of content with a tightly controlled vocabulary for feed concepts and their relationships.  Because of its ubiquity, this is becoming a useful baseline of structure and formalism; also, the nature of adoption shows much about how ontological structure is an <span style="font-style: italic;">artifact</span>, not <span style="font-style: italic;">driver</span>, for use</td>
</tr>
<tr>
<td style="background-color: #ff8b8b;"></td>
<td style="vertical-align: top;">RSS / Atom feeds with <a href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">tags</a> or <a href="http://en.wikipedia.org/wiki/OPML">OPML</a></td>
<td style="vertical-align: top;"><a title="http://www.grazr.com" rel="nofollow" href="http://www.grazr.com/">Grazr</a>, most newsfeed aggregators can import and export OPML lists of RSS feeds</td>
<td style="vertical-align: top;">The OPML specification defines an outline as a hierarchical, ordered list of arbitrary elements. The specification is fairly open which makes it suitable for many types of list data.  See also <a title="OML" href="http://en.wikipedia.org/wiki/OML">OML</a> and <a title="XOXO" href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></td>
</tr>
<tr>
<td style="background-color: #ff7373;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Faceted_classification">Hierarchical Faceted Metadata</a></td>
<td style="vertical-align: top;"><a href="http://xfml.org/">XFML</a>, <a href="http://flamenco.berkeley.edu/index.html">Flamenco</a></td>
<td style="vertical-align: top;">These and related efforts from the information architecture (IA) community are geared more to library science.  However, they directly contribute to faceted browsing, which is one of the first practical instantiations of the semantic Web</td>
</tr>
<tr>
<td style="background-color: #ff5b5b;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Folksonomy">Folksonomies</a></td>
<td style="vertical-align: top;"><a title="Flickr" href="http://en.wikipedia.org/wiki/Flickr">Flickr</a>, <a title="Del.icio.us" href="http://en.wikipedia.org/wiki/Del.icio.us">del.icio.us</a></td>
<td style="vertical-align: top;">Based on user-generated tags and informal organizations of the same; not linked to any &#8220;standard&#8221; Web protocols.  Both tags and hierarchical structure are arbitrary, but some researchers now believe over large enough participant sets that structural consensus and value does emerge</td>
</tr>
<tr>
<td style="background-color: #ff4343;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Microformats">Microformats</a></td>
<td style="vertical-align: top;">Example formats include <a title="HAtom" href="http://en.wikipedia.org/wiki/HAtom">hAtom</a>, <a title="HCalendar" href="http://en.wikipedia.org/wiki/HCalendar">hCalendar</a>, <a title="HCard" href="http://en.wikipedia.org/wiki/HCard">hCard</a>, <a class="new" title="HReview" href="http://en.wikipedia.org/w/index.php?title=HReview&amp;action=edit">hReview</a>, <a title="HResume" href="http://en.wikipedia.org/wiki/HResume">hResume</a>, <a class="new" title="Rel-directory" href="http://en.wikipedia.org/w/index.php?title=Rel-directory&amp;action=edit">rel-directory</a>, <a class="new" title="XFolk" href="http://en.wikipedia.org/w/index.php?title=XFolk&amp;action=edit">xFolk</a>, <a title="XHTML Friends Network" href="http://en.wikipedia.org/wiki/XHTML_Friends_Network">XFN</a> and <a title="XOXO" href="http://en.wikipedia.org/wiki/XOXO">XOXO</a></td>
<td style="vertical-align: top;">A microformat is HTML mark up to express semantics with strictly controlled vocabularies.  This markup is embedded using specific HTML attributes such as class, rel, and rev.  This method is easy to implement and understand, but is not free-form</td>
</tr>
<tr>
<td style="background-color: #ff2b2a;"></td>
<td style="vertical-align: top;">Embedded RDF</td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/RDFa">RDFa</a>, eRDF</td>
<td style="vertical-align: top;">An embedded format, like microformats, but free-form, and not subject to the approval strictures associated with microformats</td>
</tr>
<tr>
<td style="background-color: #ff1312;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Topic_map">Topic Maps</a></td>
<td style="vertical-align: top;"><a href="http://www.topicmaps.net/">Infoloom</a>, <a href="http://search.topicmaps.de/">Topic Maps Search Engine</a></td>
<td style="vertical-align: top;">A topic map can represent information using topics (representing any concept, from people, countries, and organizations to software modules, individual files, and events), associations (which represent the relationships between them), and occurrences (which represent relationships between topics and information resources relevant to them)</td>
</tr>
<tr>
<td style="background-color: #f90000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a></td>
<td style="vertical-align: top;">Many; <a href="http://dbpedia.org/">DBpedia</a>, etc.</td>
<td style="vertical-align: top;">RDF has become the canonical data model since it represents a &#8220;universal&#8221; conversion format</td>
</tr>
<tr>
<td style="background-color: #e10000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/RDF_Schema">RDF Schema</a></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/SKOS">SKOS</a>, <a href="http://en.wikipedia.org/wiki/SIOC">SIOC</a>, <a href="http://en.wikipedia.org/wiki/DOAP">DOAP</a>, <a href="http://en.wikipedia.org/wiki/FOAF_%28software%29">FOAF</a></td>
<td style="vertical-align: top;">RDFS or RDF Schema is an extensible knowledge representation language, providing basic elements for the description of ontologies, otherwise called RDF vocabularies, intended to structure RDF resources.  This becomes the canonical ontology common meeting ground</td>
</tr>
<tr>
<td style="background-color: #c90000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> Lite</td>
<td style="vertical-align: top;" rowspan="3">Here are some existing <a title="http://protege.cim3.net/cgi-bin/wiki.pl?ProtegeOntologiesLibrary#nid81S" rel="nofollow" href="http://protege.cim3.net/cgi-bin/wiki.pl?ProtegeOntologiesLibrary#nid81S">OWL ontologies</a>; also see <a href="http://swoogle.umbc.edu/">Swoogle</a> for OWL search facilities</td>
<td style="vertical-align: top;" rowspan="3">The Web Ontology Language (OWL) is a language for defining and instantiating Web ontologies. An OWL ontology may include descriptions of classes, along with their related properties and instances. OWL is designed for use by applications that need to process the content of information instead of just presenting information to humans. It facilitates greater machine interpretability of Web content than that supported by XML, RDF, and RDF Schema (RDF-S) by providing additional vocabulary along with a formal semantics.  The three language versions are in order of increasing expressiveness</td>
</tr>
<tr>
<td style="background-color: #b10000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> DL</td>
</tr>
<tr>
<td style="background-color: #9b0000;"></td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> Full</td>
</tr>
<tr>
<td style="background-color: #820000;"></td>
<td style="vertical-align: top;">Higher-order &#8220;formal&#8221; and &#8220;upper-level&#8221; ontologies</td>
<td style="vertical-align: top;"><a href="http://en.wikipedia.org/wiki/Suggested_Upper_Merged_Ontology">SUMO</a>, <a title="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf" rel="nofollow" href="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf">DOLCE</a>, <a href="http://proton.semanticweb.org/D1_8_1.pdf">PROTON</a>, <a title="http://www.ifomis.org/bfo" rel="nofollow" href="http://www.ifomis.org/bfo">BFO</a>, <a href="http://en.wikipedia.org/wiki/Cyc">Cyc</a>, OpenCyc</td>
<td style="vertical-align: top;">These provide comprehensive ontologies and often related knowledge bases, with the goal of enabling AI applications to perform human-like reasoning.  Their reasoning languages often use higher-order logics</td>
</tr>
</tbody>
</table>
</div>
<p>As a rule of thumb, items that are less &#8220;formal&#8221; can be converted to a  more formal expression, but the most formal forms can generally not be  expressed in less formal forms.</p>
<p>As latter sections elaborate, I see RDF as the universal data model  for representing this structure into a common, canonical format, with  RDF Schema (specifically SKOS, but also supplemented by FOAF, DOAP and  SIOC) as the organizing ontology <a href="http://en.wikipedia.org/wiki/Knowledge_representation">knowledge representation language</a> (KRL).</p>
<p>This is not to say that the various dialects of OWL should be  neglected.  In bounded environments, they can provide superior reasoning  power and are warranted if they can be sufficiently mandated or  enforced.  But the RDF and RDF-S systems represent the most tractable  &#8220;meeting place&#8221; or &#8220;middle ground,&#8221;  IMHO.</p>
<h3>Still-Another &#8220;Level&#8221; of Ontologies</h3>
<p>As if the formalism dimension were not complicated enough, there is  also the practice within the ontology community to characterize  ontologies by &#8220;levels&#8221;, specifically upper, middle and lower levels.  For example, chances are that you have heard particularly of <a href="http://en.wikipedia.org/wiki/Upper_ontology_%28computer_science%29">&#8220;upper-level&#8221; ontologies</a>.</p>
<p>The following figure helps illustrate this &#8220;level&#8221; dimension. This diagram is also from Leo Obrst of Mitre [<a href="#onto12">12</a>], and was also used in another 2006 talk by <a href="http://ontolog.cim3.net/file/resource/presentation/JackPark-PatrickDurusau_20060427/Avoiding_Hobson-s_Choice_In_Choosing_An_Ontology--JackPark-PatrickDurusau_20060427.ppt">Jack Park and Patrick Durusau</a> (discussed further below for other reasons):</p>
<div><img class="center_ok" src="../wp-content/themes/ai3/images/2007Posts/070501b_OntologyLevels.png" alt="Ontology Levels" width="569" height="353" /></div>
<p>Examples of upper-level ontologies include the Suggested Upper Merged Ontology (<a href="http://en.wikipedia.org/wiki/Suggested_Upper_Merged_Ontology">SUMO</a>), the Descriptive Ontology for Linguistic and Cognitive Engineering (<a title="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf" rel="nofollow" href="http://wonderweb.semanticweb.org/deliverables/documents/D18.pdf">DOLCE</a>), <a href="http://proton.semanticweb.org/D1_8_1.pdf">PROTON</a>, <a href="http://en.wikipedia.org/wiki/Cyc">Cyc</a> and <a title="http://www.ifomis.org/bfo" rel="nofollow" href="http://www.ifomis.org/bfo">BFO</a> (Basic Formal Ontology).  Most of the content in their upper-levels is  akin to broad, abstract relations or concepts (similar to the primary  classes, for example, in a <a href="http://en.wikipedia.org/wiki/Roget%27s_Thesaurus">Roget&#8217;s Thesaurus</a> &#8212; that is, real ontos  stuff) than to &#8220;generic common knowledge.&#8221;  Most all of them have both a  hierarchical and networked structure, though their actual subject  structure relating to concrete things is generally pretty weak [<a href="#onto2">2</a>].</p>
<p>The above diagram conveys a sense of how multiple ontologies can  relate to one another both in terms of narrower and broader topic matter  and at the same &#8220;levels&#8221; of generalization.  Such &#8220;meta-structure&#8221; (if  you will) can provide a reference structure for relating multiple  ontologies to one another.</p>
<div class="boxGrayDotted" style="float: right; margin-left: 10px; margin-right: 0px; width: 240px; font-size: 1.1em;">The relationships and mappings amongst ontologies  is a critical infrastructure component of the semantic Web.</div>
<p>It resides exactly in such bindings or relationships that we can  foresee the promise of querying and relating multiple endpoints on the  Web with accurate semantics in order to connect dots and combine  knowledge bases.  Thus, the understanding of the relationships and  mappings amongst ontologies becomes a critical infrastructural component  of the semantic Web.</p>
<h3>The SUMO Example</h3>
<p>We can better understand these mapping and inter-relationship  concepts by using a concrete example with a formal ontology.  We&#8217;ll  choose to use the <a href="http://www.ontologyportal.org/">Suggested Upper Merged Ontology</a> simply because it is one of the best known.  We could have also selected another upper-level system such as PROTON [<a href="#onto3">3</a>] or Cyc [<a href="#onto4">4</a>] or one of the many with narrower concept or subject coverage.</p>
<p>SUMO is one of the formal ontologies that has been mapped to the <a href="http://www.cogsci.princeton.edu/%7Ewn/">WordNet</a> lexicon, which adds to its semantic richness. SUMO is written in the <a href="http://sigmakee.cvs.sourceforge.net/*checkout*/sigmakee/sigma/suo-kif.pdf">SUO-KIF</a> language. SUMO is free and owned by the <a href="http://www.ieee.org/portal/site">IEEE</a>. The ontologies that extend SUMO are available under <a href="http://www.gnu.org/copyleft/gpl.html">GNU General Public License</a>.</p>
<p>The abstract, conceptual organization of SUMO is shown by this diagram, which also points to its related <a href="http://sigmakee.cvs.sourceforge.net/*checkout*/sigmakee/KBs/Mid-level-ontology.kif">MILO (MId-Level Ontology)</a>,  which is being developed as a bridge between the abstract content of  the SUMO and the richer detail of various domain ontologies:</p>
<div><img class="center_ok" src="http://www.ontologyportal.org/images/SUMOMILO.gif" alt="" width="348" height="368" /></div>
<p>At this level, the structure is quite abstract.  But one can easily browse the SUMO structure.  A nifty tool to do so is the <a href="http://virtual.cvut.cz/ksmsaWeb/browser/title">KSMSA (Knowledge Support for Modeling and Simulation) ontology browser</a>. Using a hierarchical tree representation, you can navigate through SUMO, MILO, <a href="http://www.cogsci.princeton.edu/%7Ewn/">WordNet</a>, and (with the locally installed version) Wikipedia.</p>
<p>The figure below shows the upper-level entity concept on the left; the right-hand panel shows a drill-down into the example atom entity:</p>
<div><a href="../wp-content/themes/ai3/images/2007Posts/070501a_SUMOExample.png"><img class="center_ok" title="Example SUMO Categories" src="../wp-content/themes/ai3/images/2007Posts/070501a_SUMOExample.png" alt="Example SUMO Categories" width="600" height="846" align="middle" /></a><br />
 [Click on image for full-size pop-up]</div>
<p>These views may be a bit misleading because the actual underlying  structure, while it has hierarchical aspects as shown here, really is in  the form of a directed acyclic graph (showing other relatedness  options, not just hierarchical ones).  So, alternate visualizations  include traditional network graphs.</p>
<p>The other thing to note is that the &#8220;things&#8221; covered in the ontology,  the entities, are also fairly abstract.  That is because the intention  of a standard &#8220;upper-level&#8221; ontology is to cover all relevant knowledge  aspects of each entity&#8217;s domain.  This approach results in a subject and  topic coverage that feels less &#8220;concrete&#8221; than the coverage in, say, an  encyclopedia, directory or card catalog.</p>
<h3>Ontology Binding and Integration Mechanisms</h3>
<p>According to Park and Durusau, upper ontologies are diverse, middle  ontologies are even more diverse, and lower ontologies are more diverse  still.  A key observation is that ontological diversity is a given and  increases as we approach real user interaction levels.  Moreover,  because of the &#8220;loose&#8221; nature of ontologies on the Web (now and into the  future), diversity of approach is a further key factor.</p>
<p>Recall the initial discussion on the role and objectives of  ontologies.  About half of those roles involve effectively accessing or  querying more than one ontology.  The objective of &#8220;upper-level&#8221;  ontologies, many with their own binding layers, is also expressly geared  to ontology integration or federation.  So, what are the possible  mechanisms for such binding or integration?</p>
<p>A fundamental distinction within mechanisms to combine ontologies is  whether it is a unified or centralized approach (often imposed or  required by some party) or whether it is a schema mapping or binding  approach.  We can term this distinction centralized v. federated.</p>
<h4>Centralized Approaches</h4>
<p>Centralized approaches can take a number of forms.  At the most  extreme, adherence to a centralized approach can be contractual.  At the  other end are reference models or standards.  For example, illustrative  reference models include:</p>
<ul>
<li>the <a href="http://en.wikipedia.org/wiki/Data_Reference_Model">Data Reference Model</a> (<a href="http://www.whitehouse.gov/omb/egov/documents/DRM_2_0_Final.pdf">DRM</a>), one of the five reference models of the Federal Enterprise Architecture (FEA)</li>
<li><a href="http://en.wikipedia.org/wiki/UDEF">UDEF</a> (Unified Data Element Framework), an approach toward a unified description framework, or</li>
<li>the eXtended MetaData Registry (<a href="http://xmdr.org/">XMDR</a>) project.</li>
</ul>
<p>Though I have argued that One Ring to Rule them All  is not appropriate to the general Web, there may be cases within  certain enterprises or where through funding clout (such as government  contracts), some form of centralized approach could be imposed [<a href="#onto5">5</a>].   And, frankly, even where compliance can not be assured, there are  advantages in economy, efficiency and interoperability to attempt  central ontologies.  Certain industries &#8212; notably pharmaceuticals and  petrochemicals &#8212; and certain disciplines &#8212; such as some areas of  biology among others &#8212; have through trade associations or community  consensus done admirable jobs in adopting centralized approaches.</p>
<h4>Federated Approaches</h4>
<p>However, combining ontologies in the context of the broader Internet  is more likely through federated approaches.  (Though federated  approaches can also be improved when there are consensual standards  within specific communities.)  The key aspect of a federated approach is  to acknowledge that multiple schema need to be brought together, and  that each contributing data set and its schema will not be altered  directly and will likely remain in place.</p>
<p>Thus, the key distinctions within this category are the mechanisms by  which those linkages may take place  An important goal in any federated  approach is to achieve interoperability at the data or instance level  without unacceptable loss of information or corruption of the semantics.   Numerous specific approaches are possible, but three example areas in  RDF-topic map interoperability, the use of &#8220;subject maps&#8221;, and binding  layers can illustrate some of the issues at hand.</p>
<p>In 2006 the W3C set up a working group to look at the issue of RDF  and topic maps interoperability.  Topic maps have been embraced by the  library and information architecture community for some time, and have  standards that have been adopted under ISO.  Somewhat later but also in  parallel was the development of the RDF standard by W3C.  The  interesting thing was that the conceptual underpinnings and objectives  between these two efforts were quite similar.  Also, because of the  substantive thrust of topic maps and the substantive needs of its  community, quite a few topic maps had been developed and implemented.</p>
<p>One of the first efforts of the W3C work group was to evaluate and  compare five or six extant proposals for how to relate RDF and topic  maps [<a href="#onto6">6</a>].   That report is very interesting reading for any one desirous of  learning more about specific issues in combining ontologies and their  interoperability.  The result of that evaluation then led to some  guidelines for best practices in how to complete this mapping [<a href="#onto7">7</a>].   Evaluations such as these provide confidence that interoperability can  be achieved between relatively formal schema definitions without  unacceptable loss in meaning.</p>
<p>A different, &#8220;looser&#8221; approach, but one which also grew out of the  topic map community, is the idea of &#8220;subject maps.&#8221;  This effort, backed  by Park and Durusau noted above, but also with the support of other  topic map experts such as Steve Newcomb and Robert Barta via their  proposed <a href="http://www.jtc1sc34.org/repository/0710.pdf">Topic Maps Reference Model</a> (ISO 13250-5), seems to be one of the best attempts I&#8217;ve seen that both  respects the reality of the actual Web while proposing a workable,  effective scheme for federation.</p>
<p>The basic idea of a subject map is built around a set of subject  &#8220;proxies.&#8221;  A subject proxy is a computer representation of a subject  that can be implemented as an object, must have an identity, and must be  addressable (this point provides the URI connector to RDF).  Each  contributing schema thus defines its own subjects, with the mappings  becoming meta-objects.  These, in turn, would benefit from having some  accepted subject reference schema (not specifically addressed by the  proponents) to reduce the breadth of the ultimate mapped proxy &#8220;space.&#8221;</p>
<p>I don&#8217;t have the expertise to judge further the specifics, but I find the presentation and papers by Park and Durusau, <a href="http://ontolog.cim3.net/file/resource/presentation/JackPark-PatrickDurusau_20060427/Avoiding_Hobson-s_Choice_In_Choosing_An_Ontology--JackPark-PatrickDurusau_20060427.ppt">Avoiding Hobson&#8217;s Choice In Choosing An Ontology</a> and <a href="http://protege.stanford.edu/conference/2006/submissions/abstracts/3.3_Park_SubjectCentricProtege.pdf">Towards Subject-centric Merging of Ontologies</a> to be worthwhile reading in any case.  I highly recommend these papers for further background and clarity.</p>
<p>As the third example, &#8220;binding layers&#8221; are a comparatively newer  concept.  Leading upper-level ontologies such as SUMO or PROTON propose  their own binding protocols to their &#8220;lower&#8221; domains, but that approach  takes place within the construct of the parent upper ontology and  language.  Such designs are not yet generalized solutions.  By far the  most promising generalized binding solution is the <a href="http://en.wikipedia.org/wiki/SKOS">SKOS</a> (Simple Knowledge Organization System).  Because of its importance, the next section is devoted to it.</p>
<p>Finally, with respect to federated approaches, there are quite a few  software tools that have been developed to aid or promote some of these  specific methods.  For, example, about twenty of the software  applications in my <a href="..//?page_id=325">Sweet Tools</a> listing of 500+ semantic Web and -related tools could be interpreted as  aiding ontology mapping or creation.  You may want to check out some of  these specific tools depending on your preferred approach [<a href="#onto8">8</a>].</p>
<h3>The Role of SKOS &#8211; the Simple Knowledge Organization System</h3>
<p>SKOS, or the Simple Knowledge Organization System, is a formal  language and schema designed to represent such structured information  domains as <a title="Thesaurus" href="http://en.wikipedia.org/wiki/Thesaurus">thesauri</a>, <a title="Classification scheme" href="http://en.wikipedia.org/wiki/Classification_scheme">classification schemes</a>, <a title="Taxonomy" href="http://en.wikipedia.org/wiki/Taxonomy">taxonomies</a>, <a title="Authority control" href="http://en.wikipedia.org/wiki/Authority_control">subject-heading systems</a>, <a title="Controlled vocabulary" href="http://en.wikipedia.org/wiki/Controlled_vocabulary">controlled vocabularies</a>, or others; in short, most all of the &#8220;loosely defined&#8221; ontology approaches discussed herein.  It is a <a title="World Wide Web Consortium" href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium">W3C</a> initiative more fully defined in its <a href="http://www.w3.org/TR/swbp-skos-core-guide">SKOS Core Guide</a> [<a href="#onto9">9</a>].</p>
<p>SKOS is built upon the <a title="Resource Description Framework" href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> data model of the subject-predicate-object &#8220;triple.&#8221;  The subjects and objects are akin to nouns, the predicate a verb, in a simple Dick-sees-Jane sentence.  Subjects and predicates by convention are related to a <a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">URI</a> that provides the definitive reference to the item.  Objects  may be either a URI resource or a literal (in which case it might be  some indexed text, an actual image, number to be used in a calculation,  etc.).</p>
<p>Being an <a title="RDF Schema" href="http://en.wikipedia.org/wiki/RDF_Schema">RDF Schema</a> simply means that SKOS adds some language and defined relationships to  this RDF baseline.  This is a bit of recursive understanding, since RDFS  is itself defined in RDF by virtue of adding some controlled vocabulary  and relations.  The power, though, is that these schema additions are  also easily expressed and referenced.</p>
<p>This RDFS combination can thus be shown as a standard RDF triple  graph, but with the addition of the extended vocabulary and relations:</p>
<div><img class="center_ok" src="http://www.w3.org/TR/2005/WD-swbp-skos-core-guide-20051102/img/ex-triple.png" alt="Standard RDF Graph Model" width="600" height="201" /></div>
<p>The power of the approach arises from the ability of the triple to  express virtually any concept, further extended via the RDFS language  defined for SKOS.  SKOS includes concepts such as &#8220;broader&#8221; and  &#8220;narrower&#8221;, which enable hierarchical relations to be modeled, as well  as &#8220;related&#8221; and &#8220;member&#8221; to support networks and arrays, respectively [<a href="#onto9">9</a>].</p>
<p>We can visualize this transforming power by looking at how an  &#8220;ontology&#8221; in a totally foreign scheme can be related to the canonical  SKOS scheme.  In the figure below the left-hand portion shows the native  hierarchical taxonomy structure of the UK Archival Thesaurus (<a href="http://www.ukat.org.uk/thesaurus/hierarchy.php?m=625">UKAT</a>),  next as converted to SKOS on the right (with the overlap of categories  shown in dark purple).  Note the hierarchical relationships visualize  better via a taxonomy, but that the RDF graph model used by SKOS allows a  richer set of additional relationships including related and  alternative names:</p>
<div style="text-align: center;"><a href="../wp-content/themes/ai3/images/2007Posts/070501e_SKOS_UKAT.png"><img class="center_ok" title="Example Structural Comparison of Hierarchical Taxonomy with Network Graph" src="../wp-content/themes/ai3/images/2007Posts/070501e_SKOS_UKAT.png" alt="Example Structural Comparison of Hierarchical Taxonomy with Network Graph" width="600" height="507" align="middle" /></a><br />
 [Click on image for full-size pop-up]</div>
<p>SKOS also has a rich set of annotation and labeling properties to enhance human readability of schema developed in it [<a href="#onto9">9</a>].   There is also a useful draft schema that the W3C&#8217;s SWEO (Semantic Web  Education and Outreach) group is developing to organize semantic  Web-related information [<a href="#onto10">10</a>].</p>
<p>Combined, these constructs provide powerful mechanisms for giving  contributory ontologies a common conceptualization.  When added to other  sibling RDF schema such as FOAF or SIOC or DOAP, still additional  concepts can be collated.</p>
<h3>Conclusions</h3>
<p>While not addressed directly in this piece, it is obviously of first  importance to have content with structure before the questions of  connecting that information can even arise.  Then, that structure must  also be available in a form suitable for merging or connection.</p>
<p>At that point, the subjects of this posting come into play.</p>
<div class="boxGrayDotted" style="float: left; margin-right: 10px; margin-left: 0px; width: 240px; font-size: 1.1em;">We are stubbing our toes on the rocks while we gaze at the heavens.</div>
<p>We see that the daily Web has a diversity of schema or ontologies  &#8220;loosely defined&#8221; for representing the structure of the content.  These  representations can be transferred to more complex schema, but not in  the opposite direction.  Moreover, the semantic basis for how to make  these mappings also needs some common referents.</p>
<p>RDF provides the canonical data model for the data transfers and  representations.  RDFS, especially in the form of SKOS, appears to form  one basis for the syntax and language for these transformations.  And  SKOS, with other schema, also appears to offer much of the appropriate  &#8220;middle ground&#8221; for data relationships mapping.</p>
<p>However, lacking in this story is a referential structure for subject relationships [<a href="#onto11">11</a>].  (Also lacking are the ultimately critical domain specifics required for actual implementation.)</p>
<p>Abstract concepts of interest to philosophers and deep thinkers have  been given much attention.  Sadly, to date, concrete subject structures  in which tangible things and tangible actions can be shared, is still  very, very weak.  We are stubbing our toes on the rocks while we gaze at  the heavens.</p>
<p>Yet, despite this, simple and powerful infrastructures are well  in-hand to address all foreseeable syntactic and semantic issues.  There  appear to be no substantive limits to needed next steps.</p>
<p>Lastly, many valuable resources for further reading and learning may be found within the <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?WikiHomePage">Ontolog Community</a>, <a href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium">W3C</a>, <a href="http://tagcommons.org/">TagCommons</a> and <a href="http://www.topicmaps.org/">Topics Maps</a> groups.  Enjoy!  And be wary of ontology no longer.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto1" name="onto1"></a>1] Deborah  L. McGuinness. &#8220;Ontologies Come of Age&#8221;. In Dieter Fensel, Jim Hendler,  Henry Lieberman, and Wolfgang Wahlster, editors. <span style="text-decoration: underline;">Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential</span>. MIT Press, 2003.  See <a href="http://www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-mit-press-%28with-citation%29.htm">http://www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-mit-press-(with-citation).htm</a></div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto2" name="onto2"></a>2] I think it would be much clearer to refer to &#8220;upper level&#8221; ontologies as abstract or conceptual, &#8220;mid levels&#8221; as mapping or binding, and &#8220;lower levels&#8221; as domain (without any hierarchical distinctions such as lower or lowest or sub-domain), but current practice is probably too entrenched to change now.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto3" name="onto3"></a>3]  There are many aspects that make <a href="http://proton.semanticweb.org/D1_8_1.pdf">PROTON</a> one of the more attractive reference ontologies. The PROTON ontology (PROTo ONtology), developed within the scope of the <a href="http://en.wikipedia.org/wiki/SEKT">SEKT project,</a> is attractive because of its understandability, relatively small size,  modular architecture and a simple subsumption hierarchy.  It is  available in an OWL Lite form and is easy to adopt and extend.  On the  face of it, the Topic class within PROTON, which is meant to serve as a  bridge between different ontologies, may also provide a binding layer to  specific subject topics as sub-classes or class instances.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto4" name="onto4"></a>4] See my <a href="..//?p=365">earlier post on Cyc</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto5" name="onto5"></a>5]  Even with such clout, it is questionable to get rather complete adherence, as <a href="http://en.wikipedia.org/wiki/Ada_%28programming_language%29">Ada</a> showed within the Federal government.  However, where circumstances  allow it, central schema and ontologies may be worth pursuing because of  improved interoperability and lower costs, even where some portions do  not adhere and are more chaotic like the standard Web.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto6" name="onto6"></a>6] See,  <a href="http://www.w3.org/TR/rdftm-survey">A Survey of RDF/Topic Maps Interoperability Proposals</a>, W3C Working Group Note 10 February 2006, Pepper, Vitali, Garshol, Gessa, Presutti (eds.)</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto7" name="onto7"></a>7] See, <a href="http://www.w3.org/2001/sw/BestPractices/RDFTM/guidelines-20060630.html">Guidelines for RDF/Topic Maps Interoperability</a>, W3C Editor&#8217;s Draft 30 June 2006, Pepper, Presutti, Garshol, Vitali (eds.)</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;">[<a title="onto8" name="onto8"></a>8] Here are some <a href="..//?page_id=325">Sweet Tools</a> that may have a usefulness to ontology federation and creation:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li><a href="http://www.aktors.org/technologies/adaptiva/">Adaptiva</a> &#8212;  is a user-centered ontology building environment, based on using  multiple strategies to construct an ontology, minimising user input by  using adaptive information extraction</li>
<li><a href="http://www.altova.com/products_semanticworks.html">Altova SemanticWorks</a> &#8212; is a visual RDF and OWL editor that auto-generates RDF/XML or nTriples based on visual ontology design</li>
<li><a href="http://projects.semwebcentral.org/projects/ontologymapping/">CMS</a> &#8212; the CROSI Mapping System is a structure matching system that  capitalizes on the rich semantics of the OWL constructs found in source  ontologies and on its modular architecture that allows the system to  consult external linguistic resources</li>
<li><a href="http://www.aktors.org/technologies/conceptool/">ConcepTool</a> &#8212; is a system to model, analyze, verify, validate, share, combine, and  reuse domain knowledge bases and ontologies, reasoning about their  implication</li>
<li><a href="http://www.aktors.org/technologies/conref/">ConRef</a> &#8212;  is a service discovery system which uses ontology mapping techniques to support different user vocabularies</li>
<li><a href="http://www.aifb.uni-karlsruhe.de/WBS/meh/foam/">FOAM</a> &#8212; is the Framework for Ontology Alignment and Mapping. It is based on  heuristics (similarity) of the individual entities (concepts, relations,  and instances)</li>
<li><a href="http://sourceforge.net/projects/hmafra">hMAFRA</a> (Harmonize Mapping Framework) &#8212; is a set of tools supporting semantic  mapping definition and data reconciliation between ontologies. The  targeted formats are XSD, RDFS and KAON</li>
<li><a href="http://www.aktors.org/technologies/ifmap/">IF-Map</a> &#8212; is an Information Flow based ontology mapping method. It is based on  the theoretical grounds of logic of distributed systems and provides an  automated streamlined process for generating mappings between  ontologies of the same domain</li>
<li><a href="http://www.alphaworks.ibm.com/tech/semanticstk">IODT</a> &#8212; is IBM&#8217;s toolkit for ontology-driven development. The toolkit  includes EMF Ontology Definition Metamodel (EODM), EODM workbench, and  an OWL Ontology Repository (named Minerva)</li>
<li><a href="http://kaon.semanticweb.org/">KAON</a> &#8212; is an open-source ontology management infrastructure targeted for  business applications. It includes a comprehensive tool suite allowing  easy ontology creation and management and provides a framework for  building ontology-based applications. An important focus of KAON is  scalable and efficient reasoning with ontologies</li>
<li><a href="http://www.landcglobal.com/pages/linkfactory.php">LinKFactory</a> &#8212; is Language &amp; Computing&#8217;s ontology management tool.  It provides  an effective and user-friendly way to create, maintain and extend  extensive multilingual terminology systems and ontologies (English,  Spanish, French, etc.). It is designed to build, manage and maintain  large, complex, language independent ontologies</li>
<li><a href="http://www.m3t4.com/index.jsp">M3t4.Studio Semantic Toolkit</a> &#8212; is Metatomix&#8217;s free set of Eclipse plug-ins to allow developers to create and manage OWL ontologies and RDF documents</li>
<li><a href="http://mafra-toolkit.sourceforge.net/">MAFRA Toolkit</a> &#8212; the Ontology MApping FRAmework Toolkit allows to create semantic  relations between two (source and target) ontologies, and apply such  relations in translating source ontology instances into target ontology  instances</li>
<li><a href="http://projects.semwebcentral.org/projects/ontoengine/">OntoEngine</a> &#8212; is a step toward allowing agents to communicate even though they use  different formal languages (i.e., different ontologies). It translates  data from a &#8220;source&#8221; ontology to a &#8220;target.&#8221;</li>
<li><a href="http://www.ontoportal.org.uk/">OntoPortal</a> &#8212; enables the authoring and navigation of large semantically-powered portals</li>
<li><a href="http://www.dfki.de/%7Eklusch/owls-mx/">OWLS-MX</a> &#8212; the hybrid semantic Web service matchmaker OWLS-MX 1.0 utilizes both  description logic reasoning, and token based IR similarity measures. It  applies different filters to retrieve OWL-S services that are most  relevant to a given query</li>
<li><a href="http://powl.sourceforge.net/index.php">pOWL</a> &#8212; is a semantic Web development platform for ontologies in PHP. pOWL consists of a number of components, including RAP</li>
<li><a href="http://protege.stanford.edu/">Protege</a> &#8212; is an open source visual ontology editor written in Java with many plug-in tools</li>
<li><a href="https://sourceforge.net/projects/semantag">Semantic Net Generator</a> &#8212; is a utility for generating topic maps automatically from different  data sources by using rules definitions specified with Jelly XML syntax.  This Java library provides Jelly tags to access and modify data sources  (also RDF) to create a semantic network</li>
<li><a href="http://sofa.projects.semwebcentral.org/">SOFA</a> &#8212; is a Java API for modeling ontologies and Knowledge Bases in  ontology and Semantic Web applications. It provides a simple, abstract  and language neutral ontology object model, inferencing mechanism and  representation of the model with OWL, DAML+OIL and RDFS languages</li>
<li><a href="http://www.seco.tkk.fi/projects/semweb/dist.php">Terminator</a> &#8212; is a tool for creating term to ontology resource mappings (documentation in Finnish)</li>
<li><a href="http://kmi.open.ac.uk/projects/webonto/">WebOnto</a> &#8212; supports the browsing, creation and editing of ontologies through  coarse grained and fine grained visualizations and direct manipulation.</li>
</ul>
</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;">[<a title="onto9" name="onto9"></a>9] The SKOS language has the following classes:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li><a title="Collectable Property" href="http://www.w3.org/TR/swbp-skos-core-spec/#CollectableProperty">CollectableProperty</a> &#8212; A property which can be used with a skos:Collection</li>
<li><a title="Collection" href="http://www.w3.org/TR/swbp-skos-core-spec/#Collection">Collection</a> &#8212; A meaningful collection of concepts</li>
<li><a title="Concept" href="http://www.w3.org/TR/swbp-skos-core-spec/#Concept">Concept </a> &#8212; An abstract idea or notion; a unit of thought</li>
<li><a title="Concept Scheme" href="http://www.w3.org/TR/swbp-skos-core-spec/#ConceptScheme">ConceptScheme</a> &#8212; A set of concepts, optionally including statements about semantic  relationships between those concepts.  Thesauri, classification schemes,  subject heading lists, taxonomies, &#8216;folksonomies&#8217;, and other types of  controlled vocabulary are all examples of concept schemes. Concept  schemes are also embedded in glossaries and terminologies.</li>
<li><a title="Ordered Collection" href="http://www.w3.org/TR/swbp-skos-core-spec/#OrderedCollection">OrderedCollection</a> &#8212; An ordered collection of concepts, where both the grouping and the ordering are meaningful</li>
</ul>
<div style="margin: 10px 0pt;">. . . and the following properties:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li><a title="alternative label" href="http://www.w3.org/TR/swbp-skos-core-spec/#altLabel">altLabel</a> &#8212; An alternative lexical label for a resource.  Acronyms,  abbreviations, spelling variants, and irregular plural/singular forms  may be included among the alternative labels for a concept</li>
<li><a title="alternative symbolic label" href="http://www.w3.org/TR/swbp-skos-core-spec/#altSymbol">altSymbol</a> &#8212; An alternative symbolic label for a resource</li>
<li><a title="has broader" href="http://www.w3.org/TR/swbp-skos-core-spec/#broader">broader</a> &#8212; A concept that is more general in meaning. Broader concepts are typically rendered as parents in a concept hierarchy (tree)</li>
<li><a title="change note" href="http://www.w3.org/TR/swbp-skos-core-spec/#changeNote">changeNote</a> &#8212; A note about a modification to a concept</li>
<li><a title="definition" href="http://www.w3.org/TR/swbp-skos-core-spec/#definition">definition</a> &#8212; A statement or formal explanation of the meaning of a concept</li>
<li><a title="editorial note" href="http://www.w3.org/TR/swbp-skos-core-spec/#editorialNote">editorialNote</a> &#8212; A note for an editor, translator or maintainer of the vocabulary</li>
<li><a title="example" href="http://www.w3.org/TR/swbp-skos-core-spec/#example">example</a> &#8212; An example of the use of a concept</li>
<li><a title="has top concept" href="http://www.w3.org/TR/swbp-skos-core-spec/#hasTopConcept">hasTopConcept</a> &#8212; A top level concept in the concept scheme</li>
<li><a title="hidden label" href="http://www.w3.org/TR/swbp-skos-core-spec/#hiddenLabel">hiddenLabel</a> &#8212; A lexical label for a resource that should be hidden when generating  visual displays of the resource, but should still be accessible to free  text search operations</li>
<li><a title="history note" href="http://www.w3.org/TR/swbp-skos-core-spec/#historyNote">historyNote</a> &#8212; A note about the past state/use/meaning of a concept</li>
<li><a title="in scheme" href="http://www.w3.org/TR/swbp-skos-core-spec/#inScheme">inScheme</a> &#8212; A concept scheme in which the concept is included. A concept may be a member of more than one concept scheme</li>
<li><a title="is primary subject of" href="http://www.w3.org/TR/swbp-skos-core-spec/#isPrimarySubjectOf">isPrimarySubjectOf</a> &#8212; A resource for which the concept is the primary subject</li>
<li><a title="is subject of" href="http://www.w3.org/TR/swbp-skos-core-spec/#isSubjectOf">isSubjectOf</a> &#8211;A resource for which the concept is a subject</li>
<li><a title="member" href="http://www.w3.org/TR/swbp-skos-core-spec/#member">member</a> &#8212; A member of a collection</li>
<li><a title="member list" href="http://www.w3.org/TR/swbp-skos-core-spec/#memberList">memberList</a> &#8212; An RDF list containing the members of an ordered collection</li>
<li><a title="has narrower" href="http://www.w3.org/TR/swbp-skos-core-spec/#narrower">narrower</a> &#8212; A concept that is more specific in meaning.  Narrower concepts are  typically rendered as children in a concept hierarchy (tree)</li>
<li><a title="note" href="http://www.w3.org/TR/swbp-skos-core-spec/#note">note</a> &#8212; A general note, for any purpose. The other human-readable properties  of definition, scopeNote, example, historyNote, editorialNote and  changeNote are all sub-properties of note</li>
<li><a title="preferred label" href="http://www.w3.org/TR/swbp-skos-core-spec/#prefLabel">prefLabel</a> &#8212; The preferred lexical label for a resource, in a given language. No  two concepts in the same concept scheme may have the same value for  skos:prefLabel in a given language</li>
<li><a title="preferred symbolic label" href="http://www.w3.org/TR/swbp-skos-core-spec/#prefSymbol">prefSymbol</a> &#8212; The preferred symbolic label for a resource</li>
<li><a title="has primary subject" href="http://www.w3.org/TR/swbp-skos-core-spec/#primarySubject">primarySubject</a> &#8212; A concept that is the primary subject of the resource.  A resource may have only one primary subject per concept scheme</li>
<li><a title="related to" href="http://www.w3.org/TR/swbp-skos-core-spec/#related">related</a> &#8212; A concept with which there is an associative semantic relationship</li>
<li><a title="scope note" href="http://www.w3.org/TR/swbp-skos-core-spec/#scopeNote">scopeNote</a> &#8212; A note that helps to clarify the meaning of a concept</li>
<li><a title="semantic relation" href="http://www.w3.org/TR/swbp-skos-core-spec/#semanticRelation">semanticRelation</a> &#8212; A concept related by meaning. This property should not be used  directly, but as a super-property for all properties denoting a  relationship of meaning between concepts</li>
<li><a title="has subject" href="http://www.w3.org/TR/swbp-skos-core-spec/#subject">subject</a> &#8212; A concept that is a subject of the resource</li>
<li><a title="subject indicator" href="http://www.w3.org/TR/swbp-skos-core-spec/#subjectIndicator">subjectIndicator</a> &#8212; A subject indicator for a concept. [The notion of 'subject  indicator' is defined here with reference to the latest definition  endorsed by the OASIS Published Subjects Technical Committee]</li>
<li><a title="symbolic label" href="http://www.w3.org/TR/swbp-skos-core-spec/#symbol">symbol</a> &#8212; An image that is a symbolic label for the resource. This property is  roughly analagous to rdfs:label, but for labelling resources with  images that have retrievable representations, rather than RDF literals.  Symbolic labelling means labelling a concept with an image.</li>
</ul>
</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;">[<a title="onto10" name="onto10"></a>10]  The <a href="http://esw.w3.org/topic/SweoIG/TaskForces/InfoGathering/ClassificationOntology">SWEO classification ontology</a> is still under active development and has these draft classes.  Note,  however, the relative lack of actual subject or topic matter:</div>
<div style="margin: 10px 0pt;">Classes are currently defined as:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>article &#8211; magazine article</li>
<li>blog &#8211; blog discussing SW topics</li>
<li>book &#8211; indicates a textbook, applies to the book&#8217;s home page, review or listing in Amazon or such</li>
<li>casestudy &#8211; Article on a business case</li>
<li>conference/event &#8211; conferences or events where you can learn about the Semantic Web</li>
<li>demo/demonstration &#8211; interactive SW demo</li>
<li>forum &#8211; a forum on semantic web or related topics</li>
<li>presentation &#8211; Powerpoint or similar slide show</li>
<li>person &#8211; If this is a person&#8217;s home page or blog, see below</li>
<li>publication &#8211; a scientific publication</li>
<li>ontology &#8211; a formalisation of a shared conceptualization using OWL, RDFS, SKOS or something else based on RDF</li>
<li>organization &#8211; If the page is the home page of an organization, research, vendor etc, see below</li>
<li>portal &#8211; a portal website Semantic Web or related topics, usually hosting information items, mailinglists, community tools</li>
<li>project &#8211; a research (for example EU-IST) or other project that addresses Semantic Web issues</li>
<li>mailinglist &#8211; a mailinglist on semantic Web or related topics</li>
<li>person &#8211; ideally a person that is well known regarding the Semantic  Web (people who can do keynote speakers), may also be any related person</li>
<li>press &#8211; a press release by a company or an article about Semantic Web</li>
<li>recommended &#8211; If the resource is seen to be in the top 10 of its kind</li>
<li>specification &#8211; a Semantic Web specification (RDF, RDF/S, OWL, etc)</li>
<li>categories &#8211; (perhaps using tags or other free form annotation</li>
<li>successstory &#8211; Article that can contain advertisment and clearly shows the benefit of semantic web</li>
<li>tutorial &#8211; a tutorial teaching some aspect of semantic web, an example</li>
<li>vocabulary &#8211; a RDF vocabulary</li>
<li>software project/tool &#8211; For product/project home pages</li>
</ul>
<div style="margin: 10px 0pt;">If the page describes an organization, it can be tagged as:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>vendor</li>
<li>research</li>
<li>enduser</li>
</ul>
<div style="margin: 10px 0pt;">If the page is a person&#8217;s home page or blog or similar, it could be:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>opinionleader</li>
<li>researcher</li>
<li>journalist</li>
<li>executive</li>
<li>geek</li>
</ul>
<div style="margin: 10px 0pt;">The type of audience can also be tagged, for example:</div>
<ul style="margin: 10px 0pt 10px 20px;">
<li>general public</li>
<li>beginners</li>
<li>technicians</li>
<li>researchers.</li>
</ul>
</div>
<div style="margin: 10px 0pt; font-size: 90%;">[<a title="onto11" name="onto11"></a>11] The <a href="http://www.oasis-open.org/home/index.php">OASIS</a> Topic Maps Published Subjects Technical Committee was formed a number  of years back to promote Topic Maps interoperability through the use of  Published Subjects Indicators (PSIs).  Their <a href="http://www.oasis-open.org/committees/download.php/3050/pubsubj-pt1-1.02-cs.pdf">resulting report</a> was a very interesting effort that unfortunately did not lead to wide  adoption, perhaps because the effort was a bit ahead of its time or it  was in advance of the broader acceptance of RDF.  This general topic is  the subject of a<a href="..//?p=375"> later posting by me</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[12<a title="onto12" name="onto12"></a>] See further, Leo Obrst, &#8220;The Semantic Spectrum &amp; Semantic Models,&#8221; a Powerpoint presentation (<a href="http://ontolog.cim3.net/file/resource/presentation/LeoObrst_20060112/OntologySpectrumSemanticModels--LeoObrst_20060112.ppt">http://ontolog.cim3.net/file/resource/presentation/LeoObrst_20060112/OntologySpectrumSemanticModels&#8211;LeoObrst_20060112.ppt</a>)<br />
 made as part of an Ontolog Forum (<a href="http://ontolog.cim3.net/">http://ontolog.cim3.net/</a>) presentation in two parts, &#8220;What is an Ontology? &#8211; A Briefing on the Range of Semantic Models&#8221; (see <a href="http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2006_01_12">http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2006_01_12</a>), in January 2006.  Leo Obrst is a principal artificial intelligence scientist at MITRE&#8217;s (<a href="http://www.mitre.org/">http://www.mitre.org</a>)  Center for Innovative Computing and Informatics and a co-convener of  the Ontolog Forum.  His presentation is a rich source of practical  overview information on ontologies.</div>
<div style="margin: 10px 0pt; font-size: 90%;">[13<a title="onto13" name="onto13"></a>] The actual diagram is an unattributed modification by Dan McCreary (see <a href="http://www.danmccreary.com/presentations/sem_int/sem_int.ppt">http://www.danmccreary.com/presentations/sem_int/sem_int.ppt</a>) based on Obrst&#8217;s material in [12].</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/936/brown-bag-lunch-an-intrepid-guide-to-ontologies/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Brown Bag Lunch: Structure Paves the Way to the Semantic Web</title>
		<link>http://www.mkbergman.com/889/brown-bag-lunch-structure-paves-the-way-to-the-semantic-web/</link>
		<comments>http://www.mkbergman.com/889/brown-bag-lunch-structure-paves-the-way-to-the-semantic-web/#comments</comments>
		<pubDate>Fri, 11 Jun 2010 05:55:31 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Brown Bag Lunch]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[Eisenhower]]></category>
		<category><![CDATA[IEEE]]></category>
		<category><![CDATA[Interstate highways]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=889</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: Structure Paves the Way to the Semantic Web&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-06-11&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/889/brown-bag-lunch-structure-paves-the-way-to-the-semantic-web/&amp;rft.language=English"></span>
How Shall We Measure Progress Over the Past Three Years? For a dozen years, my career has been centered on Internet search, dynamic content and the deep Web. For the past few years, I have been somewhat obsessed by two topics. The first topic, a conviction really, is that implicit structure needs to be extracted [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: Structure Paves the Way to the Semantic Web&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Brown Bag Lunch&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-06-11&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/889/brown-bag-lunch-structure-paves-the-way-to-the-semantic-web/&amp;rft.language=English"></span>
<h2>How Shall We Measure Progress Over the Past Three Years?</h2>
<p><img style="border: 0px solid; float: left; margin-right: 10px;" title="Friday Brown Bag Lunch" src="../wp-content/themes/ai3/images/lunchbag_225.jpg" alt="Friday     Brown Bag Lunch" width="158" height="179" /><br />
<a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2007Posts/070405a_colorado-hwy.jpg"><img style="border: 0px solid; float: right; margin-left: 10px;" title="Colorado  Interstate construction - 1970; courtesy National Archives" src="../wp-content/themes/ai3/images/2007Posts/070405a_colorado-hwy.jpg" alt="Colorado  Interstate construction - 1970; courtesy National Archives" width="272" /></a>For a dozen years, my career has been centered on Internet search,  dynamic content and the <a href="http://en.wikipedia.org/wiki/Deep_web">deep Web</a>. For the  past few years, I have been somewhat obsessed by two topics.</p>
<p>The first  topic, a conviction really, is that implicit structure needs to be  extracted from Web content to enable it to be disambiguated, organized,  shared and re-purposed. The second topic, more an open question as a  former academic married to a professor, is what might replace editorial  selections and peer review to establish the authoritativeness of  content. These topics naturally steer one to the <a href="http://en.wikipedia.org/wiki/Semantic_web">semantic Web</a>.</p>
<h3><span style="font-weight: bold">A  Millennial Perspective</span></h3>
<p>The semantic Web, by whatever name it comes to be called, is an  inevitability.  History tells us that as information content grows, so  do the mechanisms for organizing and managing it. Over human history,  innovations such as writing systems, alphabetization, pagination, tables  of contents, indexes, concordances, reference look-ups, classification  systems, tables, figures, and statistics have emerged in parallel with  content growth [<a href="#SWref19">19</a>].</p>
<p>When the Lycos search engine, one of the first profitable Internet  ventures, was publicly released in 1994, it indexed a mere 54,000 pages [<a href="#SWref1">1</a>].  When  Google wowed us with its page-ranking algorithm in 1998, it soon  replaced my then favorite search engine, AltaVista.  Now, tens of  billions of indexed documents later, I often find Google&#8217;s results to be  overwhelming dross &#8212; unfortunately true again for all of the major  search engines.  Faceted browsing, vertical search, and Web 2.0&#8242;s  tagging and folksonomies demonstrate humanity&#8217;s natural penchant to  fight this entropy, efforts that will next continue with the semantic  Web and then mechanisms unforeseen to manage the chaos of burgeoning  content.</p>
<p>An awful lot of hot air has been expelled over the false dichotomy of  whether the semantic Web will fail or is on the verge of nirvana.  Arguments extend from the epistemological versus ontological  (classically defined) to Web 3.0 versus SemWeb or Web services (WS*)  versus REST (Representational State Transfer). My RSS feed reader points  to at least one such dust up every week.</p>
<p>Some set the difficulties of resolving semantic heterogeneities as  absolutes, leading to an illogical and false rejection of semantic Web  objectives. In contrast, some advocates set equally divisive arguments  for semantic Web purity by insisting on formal ontologies and  descriptive logics. Meanwhile, studied leaks about &#8220;stealth&#8221; semantic  Web ventures mean you should grab your wallet while simultaneously  shaking your head.</p>
<h3><span style="font-weight: bold">A  Decades-Long Perspective</span></h3>
<p>My mental image of the semantic Web is a road from here to some  achievable destination &#8212; say, Detroit. Parts of the road are well paved;  indeed, portions are already superhighways with controlled on-ramps and  off-ramps. Other portions are two lanes, some with way too many traffic  lights and some with dangerous intersections. A few small portions  remain unpaved gravel and rough going.</p>
<div style="float: right;  margin-left: 10px"><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2007Posts/070405b_1919wreck_400.jpg"><img style="border: 0px solid; width: 400px;" title="1919 Wreck in Nebraska" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2007Posts/070405b_1919wreck_400.jpg" alt="1919 Wreck in Nebraska" align="middle" /></a></p>
<p align="center"><small>Wreck in Nebraska during the 1919  Transcontinental Motor Convoy</small></p>
</div>
<p>A lack of perspective makes things appear either too close or too far  away. The automobile isn&#8217;t yet a century old as a mass-produced item.  It wasn&#8217;t until 1919 that the US Army Transcontinental Motor Convoy made  the first automobile trip across the United States.</p>
<p>The 3,200 mile  route roughly followed today&#8217;s Lincoln Highway, US 30, from Washington,  D.C. to San Francisco. The convoy took 62 days and 250 recorded  accidents to complete the trip (see figure), half on dirt roads at an  average speed of 6 miles per hour. A tank officer on that trip later  observed Germany&#8217;s autobahns during World War II. When he subsequently  became President Dwight D. Eisenhower, he proposed and then signed the  Interstate Highway Act.</p>
<p>That was 50 years ago. Today, the US is  crisscrossed with 50,000 miles of interstates, which have completely  remade the nation&#8217;s economy and culture [<a href="#SWref2">2</a>].</p>
<h3><span style="font-weight: bold">Today&#8217;s  Perspective</span></h3>
<p>Like the interstate system in its early years, today&#8217;s semantic Web  lets you link together a complete trip, but the going isn&#8217;t as smooth or  as fast as it could be. Nevertheless, making the trip is doable and  keeps improving day by day, month by month.</p>
<p>My view of what&#8217;s required to smooth the road begins with extracting  structure and meaningful information according to understandable schema  from mostly uncharacterized content. Then we store the now-structured  content as RDF triples that can be further managed and manipulated at  scale. By necessity, the journey embraces tools and requirements that,  individually, might not constitute semantic Web technology as some  strictly define it. These tools and requirements are nonetheless  integral to reaching the destination. We are well into that journey&#8217;s  first leg, what I and others are calling the <span style="font-style: italic">structured Web</span>.</p>
<p>For the past six months or so I have been researching and assembling  as many semantic Web and related tools as I can find [<a href="#SWref3">3</a>].  That  <a href="http://www.mkbergman.com/new-version-sweet-tools-sem-web/"><span style="font-style:  italic; font-weight: bold">Sweet Tools</span></a> listing now exceeds  500 tools [<a href="#SWref4">4</a>] (with  its presentation using the nifty lightweight Exhibit publication system  from MIT&#8217;s Simile program [<a href="#SWref5">5</a>]).   I&#8217;ve come to understand the importance of many ancillary tool sets to  the entire semantic Web highway, such as natural language processing and  information extraction. I&#8217;ve also found new categories of pragmatic  tools that embody semantic Web and data mediation processes but don&#8217;t  label themselves as such.</p>
<p>In its entirety, the <a href="http://www.mkbergman.com/new-version-sweet-tools-sem-web/"><span style="font-style:  italic; font-weight: bold">Sweet Tools</span></a> listing provides a  pretty good picture of the semantic Web&#8217;s state. It&#8217;s a surprisingly  robust picture &#8212; though with some notable potholes &#8212; and includes  impressive open source options in all categories. Content publishing,  indexing, and retrieval at massive scales are largely solved problems.  We also have the infrastructure, languages, and (yes!) standards for  tying this content together meaningfully at the data and object levels.</p>
<p>I also think a degree of consensus has emerged on RDF as the  canonical data model for semantic information. RDF triple stores are  rapidly improving toward industrial strength, and RESTful designs enable  massive scalability, as terabyte- and petabyte-scale full-text indexes  prove.</p>
<p>Powerful and flexible middleware options, such as those from OpenLink  [<a href="#SWref6">6</a>], can  transform and integrate diverse file formats with a variety of back  ends. The World Wide Web Consortium&#8217;s GRDDL standard [<a href="#SWref7">7]</a> and  related tools, plus various &#8220;RDF-izers&#8221; from Massachusetts Institute of  Technology and elsewhere [<a href="#SWref8">8</a>],  largely provide the conversion infrastructure for getting Web data into  that canonical RDF form. Sure, some of these converters are still  research-grade, but getting them to operational capabilities at scale  now appears trivial.</p>
<p>Things start getting shakier when trying to structure information  into a semantic formalism. Controlled vocabularies and ontologies range  broadly and remain a contentious area. Publishers and authors perhaps  have too many choices: from straight Atom or RSS feeds and feeds with  tags to informal folksonomies and then Outline Processor Markup Language  [<a href="#SWref9">9</a>] or  microformats [<a href="#SWref10">10</a>].  From there, the formalism increases further to include the standard RDF  ontologies such as SIOC (Semantically-Interlinked Online Communities),  SKOS (Simple Knowledge Organizing System), DOAP (Description of a  Project), and FOAF (Friend of a Friend) [<a href="#SWref11">11</a>] and  the still greater formalism of OWL&#8217;s various dialects [<a href="#SWref12">12</a>].</p>
<div style="border: 1px solid #820000; background-color: #ffffe5; width: 460px; float: left; margin: 10px 10px 10px 0px; font-style: italic; font-size: 120%;">
<table border="0" cellspacing="4" cellpadding="4">
<tbody>
<tr>
<td style="vertical-align: middle; text-align: center"><em>If we compare the  semantic Web to the US interstate highway system, we&#8217;re still in the  early stages of a journey that will remake our economy and culture.</em></td>
</tr>
<tr>
<td style="text-align: center"><em>Many  potholes on the road to the semantic Web exist.</em></td>
</tr>
<tr>
<td style="text-align: center"><em>One ready  task is to transform existing structure to RDF. Another priority is to  refine tools to extract structure and meaningful information from  uncharacterized content.</em></td>
</tr>
</tbody>
</table>
</div>
<p>Arguing which of these is the theoretical best method is doomed to  failure, except possibly in a bounded enterprise environment. We live in  the real world, where multiple options will always have their advocates  and their applications.</p>
<p>All of us should welcome whatever structure we  can add to our information base, no matter where it comes from or how  it&#8217;s done. The sooner we can embrace content in any of these formats and  convert it into canonical RDF form, we can then move on to needed  developments in semantic mediation, some of the roughest road on the  journey.</p>
<h3 style="font-weight: bold">Potholes on  the Semantic Highway</h3>
<p>Semantic mediation requires appropriate structured content. Many  potholes on the road to the semantic Web exist because the content lacks  structured markup; others arise because existing structure requires  transformation. We need improved ways to address both problems. We also  need more intuitive means for applying schema to structure. Some have  referred to these issues as &#8220;who pays the tax.&#8221;</p>
<p>Recent experience with social software and collaboration proves that a  portion of the Internet user community is willing to tag and  characterize content. Furthermore, we can readily leverage that  resulting structure, and free riders are welcomed. The real pothole is  the lack of easy &#8212; even fun &#8212; data extractors and &#8220;structurizers.&#8221; But  we&#8217;re tantalizingly close.</p>
<p>Tools such as Solvent and Sifter from MIT&#8217;s Simile program [<a href="#SWref13">13</a>] and  Marmite from Carnegie Mellon University [<a href="#SWref14">14</a>] are  showing the way to match DOM (document object model) inspectors with  automated structure extractors. DBpedia,  the alpha version of Freebase,  and System One now provide large-scale, open Web data sets in RDF [<a href="#SWref15">15</a>],  including all of Wikipedia. Browser extensions such as Zotero [<a href="#SWref16">16</a>] are  showing how to integrate structure management into acceptable user  interfaces, as are services such as Zoominfo [<a href="#SWref17">17</a>]. Yet  we still lack easy means to design the differing structures suitable  for a plenitude of destinations.</p>
<p>Amazingly, a compelling road map for how all these pieces could truly  fit together is also incomplete. How do we actually get from here to  Detroit? Within specific components, architectural understandings are  sometimes OK (although documentation is usually awful for open source  projects, as most of the current tools are). Until our community better  documents that vision, attracting new contributors will be needlessly  slower, thus delaying the benefits of network effects.</p>
<p>So, let&#8217;s create a road map and get on with paving the gaps and  filling the potholes. It&#8217;s not a matter of standards or technology &#8212; we  have those in abundance. Let&#8217;s stop the silly squabbles and commit to  the journey in earnest. The <span style="font-style: italic">structured Web</span>&#8216;s ability to reach <span style="font-style: italic">Hyperland</span> [<a href="#SWref18">18</a>],  Douglas Adam&#8217;s prescient 1990 forecast of the semantic Web, now looks to  be no further away than Detroit.</p>
<div class="boxBrownDotted center_ok" style="min-height: 80px; max-width: 460px;"><img style="width: 64px; height: 73px; float: left; margin-right: 10px;" title="Friday Brown Bag    Lunch" src="../wp-content/themes/ai3/images/lunchbag_64.png" alt="Friday      Brown Bag Lunch" /> This <a href="../834/announcing-the-sporadic-friday-brown-bag-lunch">Friday      brown bag leftover</a> was first placed into the <span style="font-weight: bold; color: #993300;">AI3</span> <a href="../chronological-listing/">refrigerator</a> about three years     ago on <a href="http://www.mkbergman.com/357/structure-paves-the-way-to-the-semantic-web/">May 3, 2007</a>.  The piece was my answer to a request by <a href="http://www.mindswap.org/blog/">Jim  Hendler</a> to pen   some thoughts on the semantic Web, based on I believe what he thought might be a pragmatic perspective combining  Internet business with Web science. The formal piece appeared as a guest  editorial in  the May/June 2007 issue of <a href="http://www.computer.org/intelligent/">IEEE Intelligent Systems</a>. What appears above is unaltered from my original posting (aside from some minor formatting clean-up and &#8212; sorry to say &#8212; some of the projects are now defunct).</div>
<hr style="height: 1px; width: 33%; margin-left: 0px; margin-right: auto;" />
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref1" name="SWref1">[1]</a> Chris  Sherman, &#8220;Happy Birthday, Lycos!,&#8221; <span style="font-style: italic">Search Engine Watch</span>, August 14,  2002.  See <a href="http://searchenginewatch.com/showPage.html?page=2160551">http://searchenginewatch.com/showPage.html?page=2160551</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref2" name="SWref2">[2]</a> David  A. Pfeiffer, &#8220;Ike&#8217;s Interstates at 50: Anniversary of the Highway System  Recalls Eisenhower&#8217;s Role as Catalyst,&#8221; <span style="font-style: italic">Prologue Magazine</span>,  National Archives, Summer 2006, Vol. 38, No. 2. See: <a href="http://www.archives.gov/publications/prologue/2006/summer/interstates.html">http://www.archives.gov/publications/prologue/2006/summer/interstates.html</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref3" name="SWref3">[3]</a> The  mention of specific tool names is meant to be illustrative and not  necessarily a recommendation.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref4" name="SWref4">[4]</a> <span style="font-weight: bold">Sweet Tools</span> (SemWeb) listing; see <a href="http://www.mkbergman.com/new-version-sweet-tools-sem-web/">http://www.mkbergman.com/new-version-sweet-tools-sem-web/</a> .</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref5" name="SWref5">[5]</a> See <a href="http://simile.mit.edu/exhibit/">http://simile.mit.edu/exhibit/.</a></div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref6" name="SWref6">[6]</a> OpenLink Software&#8217;s Virtuoso and Data Spaces products; see <a href="http://www.openlinksw.com/">http://www.openlinksw.com/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref7" name="SWref7">[7]</a> W3C&#8217;s  Gleaning Resource Descriptions from Dialects of Languages (GRDDL,  pronounced &#8220;griddle&#8221;).  See <a href="http://www.w3.org/2004/01/rdxh/spec">http://www.w3.org/2004/01/rdxh/spec</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref8" name="SWref8">[8]</a> See <a href="http://simile.mit.edu/wiki/RDFizers">http://simile.mit.edu/wiki/RDFizers</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref9" name="SWref9">[9]</a> Outline  Processor Markup Language (OPML); see <a href="http://www.opml.org/">http://www.opml.org/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref10" name="SWref10">[10]</a> Microformats; see <a href="http://microformats.org/">http://microformats.org</a>/.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref11" name="SWref11">[11]</a> <a href="http://en.wikipedia.org/wiki/DOAP">DOAP</a> (<a href="http://en.wikipedia.org/wiki/DOAP">Description of a Project</a>),  <a href="http://en.wikipedia.org/wiki/FOAF">FOAF</a> (<a href="http://en.wikipedia.org/wiki/FOAF">Friend of a Friend</a>), <a href="http://en.wikipedia.org/wiki/SIOC">SIOC</a> (<a href="http://en.wikipedia.org/wiki/SIOC">Semantically-Interlinked  Online Communities</a>) and <a href="http://en.wikipedia.org/wiki/SKOS">SKOS</a> (<a href="http://en.wikipedia.org/wiki/SKOS">Simple Knowledge Organizing  System</a>).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref12" name="SWref12">[12]</a> W3C&#8217;s Web Ontology Language (OWL).  See <a href="http://www.w3.org/TR/owl-features/">http://www.w3.org/TR/owl-features/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref13" name="SWref13">[13]</a> Solvent (<a href="http://simile.mit.edu/wiki/Solvent">http://simile.mit.edu/wiki/Solvent</a>)  and Sifter (<a href="http://simile.mit.edu/wiki/Sifter">http://simile.mit.edu/wiki/Sifter</a>)  are from MIT&#8217;s Simile program.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref14" name="SWref14">[14]</a> Marmite (<a href="http://www.cs.cmu.edu/%7Ejasonh/projects/marmite/">http://www.cs.cmu.edu/~jasonh/projects/marmite/</a>)  is from Carnegie Mellon University.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref15" name="SWref15">[15]</a> DBpedia (<a href="http://dbpedia.org/docs/">http://dbpedia.org/docs/</a>) and  Freebase (in alpha, by invitation only at <a href="http://www.freebase.com/">http://www.freebase.com/</a>)  are two of the first large-scale open datasets on the Web; Wikipedia  has also been converted to RDF by System One (<a href="http://labs.systemone.at/wikipedia3">http://labs.systemone.at/wikipedia3</a>).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref16" name="SWref16">[16]</a> Zotero is produced by George Mason University&#8217;s Center for History and  New Media; see <a href="http://www.zotero.org/">http://www.zotero.org</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref17" name="SWref17">[17]</a> ZoomInfo (<a href="http://www.zoominfo.com/">http://www.zoominfo.com/</a>)  provides online structured search of companies and people, plus broader  services to enterprises.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref18" name="SWref18">[18]</a> The  late <a title="Douglas Adams" href="http://www.douglasadams.com/">Douglas  Adams</a>, of <em>Doctor Who </em>and <em>A Hitchhiker&#8217;s Guide to the  Galaxy</em> fame, produced a TV program for BBC2 presaging the Internet  called <a style="font-style: italic" href="http://en.wikipedia.org/wiki/Hyperland">Hyperland</a>.  This 50-min  video can be seen in five parts via YouTube at Part <a href="http://www.youtube.com/watch?v=rOsPKjbMvxY">1 of 5</a>, <a href="http://www.youtube.com/watch?v=ELSZ7pAmvKE">2 of 5</a>, <a href="http://www.youtube.com/watch?v=VF8dm9sK8as">3 of 5</a>, <a href="http://www.youtube.com/watch?v=6dB3_GcFV_0">4 of 5</a> and <a href="http://www.youtube.com/watch?v=b8pvOdMnflI">5 of 5</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a title="SWref19" name="SWref19">[19]</a> Since I first wrote this piece, I have systematized these developments in my <a href="http://www.mkbergman.com/temp-exhibit/">Timeline of Information History</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/889/brown-bag-lunch-structure-paves-the-way-to-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Brown Bag Lunch: Historical Origins of the Knowledge Economy</title>
		<link>http://www.mkbergman.com/879/brown-bag-lunch-historical-origins-of-the-knowledge-economy/</link>
		<comments>http://www.mkbergman.com/879/brown-bag-lunch-historical-origins-of-the-knowledge-economy/#comments</comments>
		<pubDate>Fri, 23 Apr 2010 06:48:02 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Adaptive Innovation]]></category>
		<category><![CDATA[Brown Bag Lunch]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=879</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: Historical Origins of the Knowledge Economy&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Brown Bag Lunch&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-04-23&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/879/brown-bag-lunch-historical-origins-of-the-knowledge-economy/&amp;rft.language=English"></span>
A Reprise AI3 Post from Four Years Ago In 2002 Joel Mokyr, an economic historian from Northwestern University, wrote a book that should be read by anyone interested in knowledge and its role in economic growth. The Gifts of Athena : Historical Origins of the Knowledge Economy is a sweeping and comprehensive account of the [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Brown Bag Lunch: Historical Origins of the Knowledge Economy&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Brown Bag Lunch&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-04-23&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/879/brown-bag-lunch-historical-origins-of-the-knowledge-economy/&amp;rft.language=English"></span>
<h2>A Reprise AI3 Post from Four Years Ago</h2>
<p><img style="border: 0px solid; float: left; margin-right: 10px;" title="Friday Brown Bag Lunch" src="../wp-content/themes/ai3/images/lunchbag_225.jpg" alt="Friday    Brown Bag Lunch" width="158" height="179" /><a href="http://www.amazon.com/exec/obidos/tg/detail/-/0691120137/ref=olp_product_details/104-6346161-2267928?%5Fencoding=UTF8&amp;v=glance"><img style="border: 0px solid; float: right; margin-left: 10px;" src="http://images.amazon.com/images/P/0691120137.01._SCTZZZZZZZ_.jpg" alt="" width="72" height="110" /></a><br />
In 2002 Joel Mokyr, an economic historian from Northwestern  University, wrote a book that should be read by anyone interested in  knowledge and its role in economic growth.  <a href="http://www.amazon.com/gp/product/0691120137/sr=8-1/qid=1152193714/ref=sr_1_1/104-6346161-2267928?ie=UTF8"><span>The Gifts of Athena : Historical Origins of the  Knowledge Economy</span></a> is a sweeping and comprehensive account of  the period from 1760 (in what Mokyr calls the &#8220;Industrial  Enlightenment&#8221;) through the Industrial Revolution beginning roughly in  1820 and then continuing through the end of the 19th century.</p>
<p>The book  (and related expansions by Mokyr available as <a href="http://faculty.wcas.northwestern.edu/~jmokyr/papers.html">separate PDFs</a> on the  Internet) should be considered as the definitive reference on this topic  to date.  The book contains 40 pages of references to all of the  leading papers and writers on diverse technologies from mining to  manufacturing to health and the household.  The scope of subject  coverage, granted mostly focused on western Europe and America, is truly  impressive.</p>
<p>Mokyr deals with &#8216;useful knowledge,&#8217; as he acknowledges <a title="Simon Kuznets" href="http://en.wikipedia.org/wiki/Simon_Kuznets">Simon Kuznets</a>&#8216;  phrase.  Mokyr argues that the growth of recent centuries was driven by  the accumulation of knowledge and the declining costs of access to it.   Mokyr helps to break past logjams that have attempted to link single  factors such as the growth in science or the growth in certain  technologies (such as the steam engine or electricity) as the key  drivers of the massive increases in economic growth that coincided with  the era now known as the Industrial Revolution.</p>
<p>Mokyr cracks some of these prior impasses by picking up on ideas  first articulated through <a title="Michael Polanyi" href="http://en.wikipedia.org/wiki/Michael_Polanyi">Michael Polanyi</a>&#8216;s  &#8220;<a href="http://en.wikipedia.org/wiki/Tacit_knowledge">tacit knowing</a>&#8221;  (among other recent philosophers interested in the nature and  definition of knowledge).  Mokyr&#8217;s own schema posits <span style="font-style: italic">propositional  knowledge</span>, which he defines as the science, beliefs or the <span style="font-style: italic">epistemic  base</span> of knowledge, which he labels omega (<span style="font-weight: bold;">Ω</span>), in  combination with <span style="font-style: italic">prescriptive knowledge</span>, which are  the techniques (&#8220;recipes&#8221;), and which he also labels lambda (<span style="font-weight: bold;">λ</span>).   Mokyr notes that an addition to omega (<span style="font-weight: bold;">Ω</span>) is a <span style="font-style: italic">discovery</span>, an addition to  lambda (<span style="font-weight: bold;">λ</span>)  is an <span style="font-style: italic">invention</span>.</p>
<p>One of Mokyr&#8217;s key points is that both knowledge types reinforce one  another and, of course, the Industrial Revolution was a period of  unprecedented growth in such knowledge.  Another key point, easily  overlooked when &#8220;discoveries&#8221; are seemingly more noteworthy, is that <span style="font-style: italic">techniques</span> and <span style="font-style: italic">practical  applications </span>of knowledge can provide a multiplier effect and  are equivalently important.  For example, in addition to his main case  studies of the factory, health and the household, he says:</p>
<blockquote><p>The inventions of writing, paper, and printing not only  greatly reduced access costs but also materially<br />
affected human cognition, including the way people thought about their  environment.</p></blockquote>
<p>Mokyr also correctly notes how the accumulation of knowledge in  science and the <span style="font-style:  italic">epistemic base</span> promotes productivity and more still-more  efficient discovery mechanisms:</p>
<blockquote><p>The range of experimentation possibilities that needs to  be searched over is far larger if the searcher knows nothing about the  natural principles at work.  To paraphrase Pasteur&#8217;s famous aphorism  once more, fortune may sometimes favor unprepared minds, but only for a  short while.  It is in this respect that the width of the epistemic base  makes the big difference.</p></blockquote>
<p>In my own opinion, I think Mokyr starts to get closer to the mark  when he discusses knowledge &#8220;storage&#8221;, access costs and multiplier  effects from basic knowledge-based technologies or techniques.  Like  some other recent writers, he also tries to find analogies with  evolutionary biology.  For example:</p>
<blockquote><p>Much like DNA, useful knowledge does not exist by itself;  it has to be &#8220;carried&#8221; by people or in storage<br />
devices.  Unlike DNA, however, carriers can acquire and shed knowledge  so that the selection process is quite different.  This difference  raises the question of how it is transmitted over time, and whether it  can actually shrink as well as expand.</p></blockquote>
<p>One of the real advantages of this book is to move forward a re-think  of the &#8220;great man&#8221; or &#8220;great event&#8221; approach to history.  There are  indeed complicated forces at work.  I think Mokyr summarizes well this  transition when he states:</p>
<blockquote><p>A century ago, historians of technology felt that  individual inventors were the main actors that brought about<br />
the Industrial Revolution.  Such heroic interpretations were discarded  in favor of views that emphasized deeper economic and social factors  such as institutions, incentives, demand, and factor prices.  It seems,  however, that the crucial elements were neither brilliant individuals  nor the impersonal forces governing the masses, but a small group of at  most a few thousand peopled who formed a creative community based on the  exchange of knowledge.  Engineers, mechanics, chemists, physicians, and  natural philosophers formed circles in which access to knowledge was  the primary objective.  Paired with the appreciation that such knowledge  could be the base of ever-expanding prosperity, these elite networks  were indispensible, even if individual members were not.  Theories that  link education and human capital of technological progress need to  stress the importance of these small creative communities jointly with  wider phenomena such as literacy rates and universal schooling.</p></blockquote>
<p>There is so much to like and to be impressed with this book and even  later Mokyr writings.  My two criticisms are that, first, I found the  pseudo-science of his knowledge labels confusing (I kept having to  mentally translate the omega symbol) and I disliked the naming  distinctions between <em>propositional</em> and <em>prescriptive</em>, even  though I think the concepts are spot on.</p>
<p>My second criticism, a more major one, is that Mokyr notes, but does  not adequately pursue, &#8220;In the decades after 1815, a veritable explosion  of technical literature took place.  Comprehensive technical compendia  appeared in every industrial field.&#8221;  Statements such as these, and  there are many in the book, hint at perhaps some fundamental drivers.</p>
<p>Mokyr has provided the raw grist for answering his starting question of <span style="font-style: italic; text-decoration: underline;">why</span> such massive economic growth occurred in conjunction with the era of the  Industrial Revolution.  He has made many insights and posited new factors to explain this salutary discontinuity from all prior human  history.  But, in this reviewer&#8217;s opinion, he still leaves the <span style="font-style: italic; text-decoration: underline;">why</span> tantalizingly close but still unanswered. The fixity of information and  growing storehouses because of declining production and access costs  remain too poorly explored.</p>
<div class="boxBrownDotted center_ok" style="min-height: 80px; max-width: 460px;"><img style="width: 64px; height: 73px; float: left; margin-right: 10px;" title="Friday Brown Bag    Lunch" src="../wp-content/themes/ai3/images/lunchbag_64.png" alt="Friday    Brown Bag Lunch" /> This <a href="../834/announcing-the-sporadic-friday-brown-bag-lunch">Friday    brown bag leftover</a> was first placed into the <span style="font-weight: bold; color: #993300;">AI3</span> <a href="../chronological-listing/">refrigerator</a> about four years   ago on <a href="http://www.mkbergman.com/249/historical-origins-of-the-knowledge-economy/">July 6, 2006</a>. It was part of a series of book reviews I was doing at that time getting at the importance of bulk paper production as a key enabler of economic growth.  No changes have been made to the original  posting.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/879/brown-bag-lunch-historical-origins-of-the-knowledge-economy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

