<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AI3:::Adaptive Information &#187; irON</title>
	<atom:link href="http://www.mkbergman.com/category/iron/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mkbergman.com</link>
	<description>Mike Bergman on the semantic Web and structured Web</description>
	<lastBuildDate>Mon, 26 Jul 2010 05:31:20 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Consolidating a Coherent Message with OSF</title>
		<link>http://www.mkbergman.com/894/consolidating-a-coherent-message-with-osf/</link>
		<comments>http://www.mkbergman.com/894/consolidating-a-coherent-message-with-osf/#comments</comments>
		<pubDate>Tue, 06 Jul 2010 06:52:55 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Semantic Web Tools]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[Web-oriented Architecture]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[open semantic framework]]></category>
		<category><![CDATA[semantic components]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=894</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Consolidating a Coherent Message with OSF&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Ontologies&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=Web-oriented Architecture&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-07-06&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/894/consolidating-a-coherent-message-with-osf/&amp;rft.language=English"></span>

Release of Semantic Components Adds Final Layer, Leads to Streamlined Sites
Yesterday Fred Giasson announced the release of code associated with Structured Dynamics&#8216; open source semantics components (also called sComponents).  A semantic component is an ontology-driven component, or widget, based on Flex. Such a component takes record descriptions, ontologies and target attributes/types as inputs and then [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Consolidating a Coherent Message with OSF&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Ontologies&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=Web-oriented Architecture&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-07-06&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/894/consolidating-a-coherent-message-with-osf/&amp;rft.language=English"></span>
<p><a href="http://openstructs.org/open-semantic-framework"><img style="border: 0px solid; width: 216px; height: 216px; float: left; margin-right: 10px;" title="Consolidating Under the Open Semantic Framework" src="../wp-content/themes/ai3/images/2010Posts/100706_osf_consolidation.png" alt="Consolidating Under the Open Semantic Framework" /></a></p>
<h2>Release of Semantic Components Adds Final Layer, Leads to Streamlined Sites</h2>
<p>Yesterday <a href="http://fgiasson.com/blog/index.php/2010/07/05/semantic-components/">Fred Giasson announced</a> the release of code associated with <a href="http://structureddynamics.com/">Structured Dynamics</a>&#8216; open source <a style="font-weight: bold; font-style: italic;" href="http://openstructs.org/semantic-components">semantics components</a> (also called <span style="font-weight: bold; font-style: italic;">sComponents</span>).  A <span style="font-weight: bold; font-style: italic;">semantic component</span> is an ontology-driven component, or widget, based on <a href="http://en.wikipedia.org/wiki/Adobe_Flex">Flex</a>. Such a component takes record descriptions, ontologies and target attributes/types as inputs and then outputs some (possibly interactive) visualizations of the records.</p>
<p>Though not all layers are by any means complete, from an architectural standpoint the release of these <span style="font-weight: bold; font-style: italic;">semantic components</span> provides the last and missing layer to complete our <a style="font-weight: bold; font-style: italic;" href="http://openstructs.org/open-semantic-framework">open semantic framework</a>. Completing this layer now also enables Structured Dynamics to rationalize its open source Web sites and various groups and mailing lists associated with them.</p>
<h3>The OSF &#8220;Semantic Muffin&#8221;</h3>
<p>We <a href="../891/domain-specific-instantiations-based-on-the-open-semantic-framework/">first announced</a> the <a href="http://openstructs.org/open-semantic-framework"><span style="font-weight: bold; font-style: italic;">open semantic framework</span></a> &#8212; or <span style="font-weight: bold; font-style: italic;">OSF</span> &#8212; a couple of weeks back. Refer to <a href="../891/domain-specific-instantiations-based-on-the-open-semantic-framework/">that original post</a> for more description of the general design <a href="#consol1">[1]</a>. However, we can show this framework with the <span style="font-weight: bold; font-style: italic;">semantic components</span> layer as illustrated by what some have called the &#8220;semantic muffin&#8221;:</p>
<div style="clear: both;"><a href="../wp-content/themes/ai3/images/2010Posts/100706_osf_sc_layer.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 382px;" title="Semantic Componetn Layer of the Open Semantic Framework" src="../wp-content/themes/ai3/images/2010Posts/100706_osf_sc_layer.png" alt="Incremental Layers of the Open Semantic Framework" width="758" height="482" /></a></p>
<p style="font-style: italic;" align="center"><small>(click for <a href="../wp-content/themes/ai3/images/2010Posts/100706_osf_sc_layer.png"> full size</a>)</small></p>
</div>
<p>The <span style="font-weight: bold; font-style: italic;">OSF</span> stack consists of these layers, moving from existing assets upward through increasing semantics and usability:</p>
<ul>
<li>Existing assets &#8212; any and all existing information and data assets, ranging from unstructured to structured. Preserving and leveraging those assets is a key premise</li>
<li>scones / irON &#8212; this layer is for general conversion of non-RDF data and data schema to <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> (via <a href="http://openstructs.org/iron">irON</a> or <a href="http://openstructs.org/resources/rdfizers">RDFizers</a>) or for information extraction of subject concepts or named entities (<a href="http://structureddynamics.com/scones.html">scones</a>)</li>
<li><a href="http://openstructs.org/structwsf">structWSF</a> &#8212; is the pivotal Web services framework layer, and provides the standard, common interface by which existing information assets get represented and presented to the outside world and to other layers in the <span style="font-weight: bold; font-style: italic;">OSF</span> stack</li>
<li><a href="http://openstructs.org/semantic-components">Semantic components</a> &#8212; the highlighted layer in the &#8220;semantic muffin&#8221;; in essence, this is the visualization and data interaction layer in the <span style="font-weight: bold; font-style: italic;">OSF</span> stack; see more below</li>
<li>Ontologies &#8212; are the layer containing the structured assets &#8220;driving&#8221; the system; this includes the concepts and relationships of the domain at hand, and administrative ontologies that guide how the user interfaces or widgets in the system should behave, and</li>
<li><a href="http://openstructs.org/conStruct">conStruct</a> &#8212; is the content management system (CMS) layer based on Drupal and the thinnest layer with respect to <span style="font-weight: bold; font-style: italic;">OSF</span>; this optional layer provides the theming, user rights and permissions, or other functionality drawn from Drupal&#8217;s 6500 third-party modules.</li>
</ul>
<p>Not all of these layers are required in a given deployment and their adoption need not be sequential or absolutely depend on prior layers. Nonetheless, they do layer and interact with one another in the general manner shown.</p>
<h3>The Semantics Components Layer</h3>
<p>Current <span style="font-weight: bold; font-style: italic;">semantic components</span>, or widgets, include: filter; tabular templates          (similar to <a href="http://en.wikipedia.org/wiki/Help:Infobox">infoboxes</a>);  maps; bar,         pie or linear charts; relationship (concept)  browser; story and text         annotator and viewer; workbench for  creating structured views; and         dashboard for presenting  pre-defined views and component arrangements.         These are generic  tools that respond to the structures and data fed to them,          adaptable to any domain without modification.</p>
<p>Though <a href="http://fgiasson.com/blog/index.php/2010/07/05/semantic-components/">Fred&#8217;s post</a> goes into more detail &#8212; with subsequent posts to get into the technical nuances of the <span style="font-weight: bold; font-style: italic;">semantic components</span> &#8212; the main idea of these components is shown by the diagram below.</p>
<p>These various <span style="font-weight: bold; font-style: italic;">semantic components</span> get embedded in a layout canvas for the Web page. By interacting with the various components, new queries are generated (most often as <a href="http://en.wikipedia.org/wiki/Sparql">SPARQL</a> queries) to the various <a href="http://openstructs.org/structwsf">structWSF</a> Web services endpoints. The result of these requests is to generate a structured results set, which includes various types and attributes.</p>
<p>An internal ontology that embodies the desired behavior and display options (SCO, the <a href="http://openstructs.org/semantic-components/manual/semantic-component-ontology">Semantic Component Ontology</a>) is matched with these types and attributes to generate the formal instructions to the <span style="font-weight: bold; font-style: italic;">semantic components</span>. These instructions are presented via the sControl component, that determines which widgets (individual components, with multiples possible depending on the inputs) need to be invoked and displayed on the layout canvas. Here is a picture of the general workflow:</p>
<div><a href="../wp-content/themes/ai3/images/2010Posts/100706_semantic_component.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 597px;" title="Semantic Components Workflow" src="../wp-content/themes/ai3/images/2010Posts/100706_semantic_component.png" alt="Semantic Components Workflow" width="686" height="682" /></a></p>
<p style="font-style: italic;" align="center"><small>(click for <a href="../wp-content/themes/ai3/images/2010Posts/100706_semantic_component.png"> full size</a>)</small></p>
</div>
<p>New interactions with the resulting displays and components cause the iteration path to be generated anew, again starting a new cycle of queries and results sets. As these pathways and associated display components get created, they can be named and made persistent for later re-use or within dashboard invocations.</p>
<h3>Consolidating and Rationalizing Web Sites and Mailing Lists</h3>
<p><a href="http://openstructs.org/"><img style="border: 0px solid; width: 90px; height: 90px; float: right; margin-left: 10px;" title="OpenStructs and Open Semantic Framework Logo" src="../wp-content/themes/ai3/images/2010Posts/triple_90.png" alt="OpenStructs and Open Semantic Framework Logo" /></a>As the release of the <span style="font-weight: bold; font-style: italic;">semantic components</span> drew near, it was apparent that releases of previous layers had led to some fragmentation of Web sites and mailing lists. The umbrella nature of the <span style="font-weight: bold; font-style: italic;">open semantic framework</span> enabled us to consolidate and rationalize these resources.</p>
<p>Our first change was to consolidate all <span style="font-weight: bold; font-style: italic;">OSF</span>-related material under the existing <a href="http://openstructs.org/">OpenStructs.org </a>Web site. It already contained the links and background material to structWSF and irON. To that, we added the conStruct and <span style="font-weight: bold; font-style: italic;">OSF</span> material as well. This consolidation also allowed us to retire the previous conStruct Web site as well, which now re-directs to <a href="http://openstructs.org/">OpenStructs</a>.</p>
<p>We also had fragmentation in user groups and mailing lists. Besides shared materials, these had many shared members. The Google groups for irON, structWSF and conStruct were thus archived and re-directed to the new <a href="http://groups.google.com/group/open-semantic-framework?hl=en"><span style="font-weight: bold; font-style: italic;">Open Semantic Framework</span> Google group and mailing list</a>. Personal notices of the change and invites have been issued to all members of the earlier groups. For those interested in development work and interchange with other developers on any of these OSF layers, please now direct your membership and attention to the <a href="http://groups.google.com/group/open-semantic-framework?hl=en"><span style="font-weight: bold; font-style: italic;">OSF</span> group</a>.</p>
<p>There has also been a revigoration of the developers&#8217; community Web site at <a href="http://community.openstructs.org/">http://community.openstructs.org/</a>. It remains the location for all central developer resources, including bug and issue tracking and links to SVNs.</p>
<p>Actual code SVN repositories are unchanged. These code repositories may be found at:</p>
<ul>
<li><a href="http://code.google.com/p/structwsf/">structWSF</a></li>
<li><a href="http://drupal.org/project/construct">conStruct</a></li>
<li><a href="http://code.google.com/p/semanticcomponents/">Semantic Components</a></li>
<li><a href="http://code.google.com/p/iron-notation/">irON Parsers</a>.</li>
</ul>
<p>We hope you find these consolidations helpful. And, of course, we welcome new participants and contributors!</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="consol1"></a> [1] An alternative view of this layer diagram is shown by the general Structured Dynamics <a href="http://structureddynamics.com/products.html">product stack and architecture</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/894/consolidating-a-coherent-message-with-osf/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Bipolar Disorder of Linked Data</title>
		<link>http://www.mkbergman.com/880/the-bipolar-disorder-of-linked-data/</link>
		<comments>http://www.mkbergman.com/880/the-bipolar-disorder-of-linked-data/#comments</comments>
		<pubDate>Wed, 28 Apr 2010 23:12:38 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[ABox]]></category>
		<category><![CDATA[structured data]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=880</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=The Bipolar Disorder of Linked Data&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Linked Data&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/880/the-bipolar-disorder-of-linked-data/&amp;rft.language=English"></span>

An Acceptance of Its Natural Role is the Prozac Substitute
There has been a bit of a manic-depressive character on the Web waves         of late with respect to linked data. On the one         hand, we have seen huzzahs and celebrations [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=The Bipolar Disorder of Linked Data&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Linked Data&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2010-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/880/the-bipolar-disorder-of-linked-data/&amp;rft.language=English"></span>
<p><a href="http://commons.wikimedia.org/wiki/File:VanGogh-starry_night_ballance1.jpg"><img style="border: 0px solid; width: 250px; height: 198px; float: left; margin-right: 10px;" title="The Starry Night, from Vincent Van Gogh" src="../wp-content/themes/ai3/images/2010Posts/250-VanGogh-starry_night_ballance1.jpg" alt="The Starry Night, from Vincent Van Gogh" hspace="5" vspace="5" align="left" /></a></p>
<h2>An Acceptance of Its Natural Role is the Prozac Substitute</h2>
<p>There has been a bit of a manic-depressive character on the Web waves         of late with respect to <a href="http://en.wikipedia.org/wiki/Linked_Data">linked data</a>. On the one         hand, we have seen huzzahs and celebrations from the likes of <a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php"> ReadWriteWeb</a> and <a href="http://www.semanticweb.com/">Semantic         Web.com</a> and, just concluded, the Linked Data on the Web (<a href="http://events.linkeddata.org/ldow2010/">LDOW</a>) workshop at <a href="http://www2010.org/www/">WWW2010</a>. This treatment has tended to         tout the coming of the linked data era and to seek ideas about         possible, cool <a href="http://www.readwriteweb.com/archives/10_ideas_for_web_of_data_apps.php"> linked data apps</a> <a href="#BPD1">[1]</a>. This rise in visibility has been accomplished         by much manic and excited discussion on <a href="http://lists.w3.org/Archives/Public/public-lod/">various</a> <a href="http://lists.w3.org/Archives/Public/semantic-web/">mailing</a> <a href="http://sourceforge.net/mailarchive/forum.php?forum_name=dbpedia-discussion"> lists</a>.</p>
<p>On the other hand, we have seen much wringing of hands and gnashing of         teeth for why linked data is not being used more and why the broader         issue of the semantic Web is not seeing more uptake. This depressive         &#8220;<a href="http://lists.w3.org/Archives/Public/semantic-web/2010Mar/0160.html">call         to arms</a>&#8221; has sometimes felt like ravings with blame being given to         the poor state of apps and user interfaces to badly linked data to the         difficulty of publishing same. Actually using linked data for anything         productive (other than single sources like <a href="http://dbpedia.org/About">DBpedia</a>) still appears to be an issue.</p>
<p>Meanwhile, among others, <a href="http://www.openlinksw.com/blog/%7Ekidehen/">Kingsley Idehen</a>,         ubiquitous voice on the Twitter <a href="http://twitter.com/search?q=%23linkeddata">#linkeddata</a> channel,         has been promoting the separation of identity of linked data from the         notion of the semantic Web. He is also trying to <a href="http://www.openlinksw.com/blog/%7Ekidehen/?1624&amp;title=Data%203.0%20%28a%20Manifesto%20for%20Platform%20Agnostic%20Structured%20Data%29%20Update%203"> change the narrative</a> away from the association of linked data with         RDF, instead advocating &#8220;Data 3.0&#8243; and the <a href="http://en.wikipedia.org/wiki/Entity-attribute-value_model">entity-attribute-value</a> (EAV) model understanding of structured data.</p>
<p>As someone less engaged in these topics since my own statements about         linked data over the past couple of years <a href="#BPD2">[2]</a>, I have my own         distanced-yet-still-biased view of what all of this crisis of         confidence is about. I think I have a diagnosis for what may be causing         this <a href="http://en.wikipedia.org/wiki/Bipolar_disorder">bipolar         disorder</a> of linked data <a href="#BPD3">[3]</a>.</p>
<h3>The Semantic Web Boogie Man</h3>
<p>A fairly universal response from enterprise prospects when raising the         topic of the semantic Web is, &#8220;That was a big deal of about a decade         ago, wasn&#8217;t it? It didn&#8217;t seem to go anywhere.&#8221; And, actually, I think         both proponents and keen observers agree with this general sentiment.         We have seen the original advocate, Tim Berners-Lee, float the <a href="http://en.wikipedia.org/wiki/Giant_Global_Graph">Giant Global         Graph</a> balloon, and now <a href="http://blog.ted.com/2010/03/the_year_open_d.php">Linked Data</a>.         Others have touted <a href="../462/how-shall-we-call-web-30-instead-mike-please-indulge-us/"> Web 3.0</a> or <a href="http://webofdata.wordpress.com/">Web of         Data</a> or, frankly, <a href="http://bnode.org/blog/2008/03/04/semantic-web-aliases">dozens of         alternatives</a>. Linked data, which began as a set of techniques for         publishing RDF, has emerged as a potential marketing hook and saviour         for the tainted original semantic Web term.</p>
<p>And therein, I think, lies the rub and the answer to the bipolar         disorder.</p>
<p>If one looks at the <a href="http://www.w3.org/DesignIssues/LinkedData.html">original         principles</a> for putting linked data on the Web or <a href="../846/when-linked-data-rules-fail/">subsequent         interpretations</a>, it is clear that linked data (lower case) is merely a set of techniques. Useful techniques, for sure; but really a simple approach to exposing data using the Web with URLs as the naming convention for objects and their relationships. These techniques provide (1) methods to access data on the Web and (2) specifying the relationships to link the data (resources). The first part is mechanistic and not really of further concern here. And, while any predicate can be used to specify a data (resource) relationship, that relationship should also be discoverable with a URL (dereferencable) to qualify as linked data. Then, to actually be semantically useful, that relationship (predicate) should also have a precise definition and be part of a coherent schema. (Note, this last sentence is actually not part of the &#8220;standard&#8221; principles for linked data, which itself is a <a href="../846/when-linked-data-rules-fail/">problem</a>.)</p>
<p>When used right, these techniques can be powerful and useful. But, poor         choices or execution in how relationships are specified often leads to         saying little or nothing about semantics. Most linked data uses a         woefully small vocabulary of data relationships, with even a smaller         set ever used for setting linkages <span style="font-weight: bold; font-style: italic;">across</span> existing linked         data sets <a href="#BPD4">[4]</a>. Linked data techniques are a part of the foundation to         overall best practices, but not the total foundation. As I have argued         for some time, linked data alone does not speak to issues of <a href="../431/umbel-making-linked-data-classy/">context</a> nor <a href="../450/when-is-content-coherent/">coherence</a>.</p>
<p>To speak semantically, linked data is not a synonym for the semantic         Web nor is it the <a style="font-family: monospace;" href="http://events.linkeddata.org/ldow2010/papers/ldow2010_paper09.pdf">sameAs</a> the semantic Web. But, many proponents have tried to characterize it as         such. The general tenor is to blow the horns hard anytime some large         data set is &#8220;exposed&#8221; as linked data. (No matter whether the data is         incoherent, lacks a schema, or is even <a href="../846/when-linked-data-rules-fail/">poorly         described and defined</a>.) Heralding such events, followed by no         apparent usefulness to the data, causes confusion to reign supreme and         disappointment to naturally occur.</p>
<p>The semantic Web (or semantic enterprise or semantic government or         similar expressions) is a vision and an ideal. It is also a fairly         complete one that potentially embraces machines and agents working in         the background to serve us and make us more productive. There is an         entire stack of languages and techniques and methods that enable schema         to be described and non-conforming data to be interoperated. Now, of         course this ideal is still a work in progress. Does that make it a         failure?</p>
<p>Well, maybe so, if one sees the semantic Web as marketing or branding.         But, who said we had to present it or understand it as such?</p>
<p>The issue is not one of marketing and branding, but the lack of         benefits. Now, maybe I have it all wrong, but it seems to me that the         argument needs to start with what &#8220;linked data&#8221; and the &#8220;semantic Web&#8221;         can do for me. What I actually call it is secondary. Rejecting the         branding of the semantic Web for linked data or Web 3.0 or any other         somesuch is still dressing the emperor in new clothes.</p>
<h3>A Nicely Progressing Continuum, Thank You!</h3>
<p>For a couple of years now I have tried in various posts to present         linked data in a broader framework of structured and semantic Web data.         I first tried to capture this continuum in a diagram from <a href="../?p=391">July 2007</a>:</p>
<div style="margin: 18px 0px;">
<table class="center_ok" style="text-align: left; width: 622px;" border="0">
<tbody>
<tr>
<td colspan="4"><img style="width: 599px; height: 205px; margin-left: 15px; vertical-align: top;" src="../wp-content/themes/ai3/images/2007Posts/070720_web_transition.jpg" alt="Transition in Web Structure" /></td>
</tr>
<tr>
<td style="border-bottom: 1px solid; font-weight: bold; text-align: center;">Document Web</td>
<td style="border-bottom: 1px solid; font-weight: bold; text-align: center;" colspan="2">Structured Web</td>
<td style="border-bottom: 1px solid; font-weight: bold; text-align: center;">Semantic Web</td>
</tr>
<tr>
<td style="width: 150px;"></td>
<td style="width: 150px;"></td>
<td style="border-bottom: 1px solid; width: 150px; font-weight: bold; text-align: center;">Linked Data</td>
<td style="width: 150px;"></td>
</tr>
<tr>
<td>
<ul>
<li> <small>Document-centric</small></li>
<li> <small>Document resources</small></li>
<li> <small>Unstructured data and semi-structured data</small></li>
<li> <small>HTML<br />
</small></li>
<li> <small>URL-centric</small></li>
<li> <small><span style="font-style: italic;">circa</span> 1993</small></li>
</ul>
</td>
<td>
<ul>
<li> <small>Data-centric</small></li>
<li> <small>Structured data<br />
</small></li>
<li> <small>Semi-structured data and structured data</small></li>
<li> <small>XML, JSON, RDF, etc<br />
</small></li>
<li> <small>URI-centric</small></li>
<li> <small><span style="font-style: italic;">circa</span> 2003</small></li>
</ul>
</td>
<td>
<ul>
<li> <small>Data-centric</small></li>
<li> <small>Linked data<br />
</small></li>
<li> <small>Semi-structured data and structured data</small></li>
<li> <small>RDF, RDF-S<br />
</small></li>
<li> <small>URI-centric</small></li>
<li> <small><span style="font-style: italic;">circa</span> 2006<br />
</small></li>
</ul>
</td>
<td>
<ul>
<li> <small>Data-centric</small></li>
<li> <small>Linked data<br />
</small></li>
<li> <small>Semi-structured data and structured data</small></li>
<li> <small>RDF, RDF-S, OWL<br />
</small></li>
<li> <small>URI-centric</small></li>
<li> <small><span style="font-style: italic;">circa</span> ???<br />
</small></li>
</ul>
</td>
</tr>
</tbody>
</table>
</div>
<p>Now, three years later, I think the transitional phase of linked data         is reaching an end. OK, we have figured out one useful way to publish         large datasets staged for possible interoperability. Sure, we have         billions of triples and assertions floating out there. But what are we         to do with them? And, is any of it any good?</p>
<h3>The Reality of a Heterogeneous World</h3>
<p>I think Kingsley is right in one sense to point to EAV and structured         data. We, too, have not met a structured data format we did not         like. There are hundreds of attribute-value pair models of even         more generic nature that also belong to the conversation.</p>
<p>One of my most popular posts on this blog has been, <a style="font-style: italic;" href="../471/structs-naive-data-formats-and-the-abox/"> ‘Structs’: Naïve Data Formats and the ABox</a>, from         January 2009. Today, we have a multitude of popular structured data         formats from XML to JSON and even spreadsheets (CSV). Each form has its         advocates, place and reasons for existence and popularity (or not).         This inherent diversity is a fact and fixture of any discussion of         data. It is a major reason why we developed the <a href="http://openstructs.org/iron">irON</a> (<span style="font-style: italic;">instance record</span> and <span style="font-style: italic;">object notation</span>) non-RDF vocabulary to         provide a bridge from such forms to RDF, which is accessible on the Web         via URIs. irON clearly shows that entities can be usefully described         and consumed in either RDF or non-RDF serialized forms.</p>
<p>Though RDF and linked data is a great form for expressing this         structured information, other forms can convey the same meaning as         well. Of the billions of linked data triples exposed to date, surely         more than 99% are of this instance-level, &#8220;ABox&#8221; type of data <a href="#BPD5">[5]</a>. And,         more telling, of all of the structured data that is publicly obtainable         on the Web, my wild guess is that less than 0.0000000001% of that is         even linked RDF data <a href="#BPD6">[6]</a>.</p>
<p>Neither linked data nor RDF alone will &#8212; today or in the near future         &#8212; play a pivotal or essential role for instance data. The real         contribution from RDF and the semantic Web will come from connecting         things together, from interoperation and federation and conjoining.         This is the provenance of the TBox and is a role barely touched by         linked data. Publishing data as linked data helps tremendously in         simplifying ingest and guiding the eventual connections, but the making         of those connections, testing for their quality and reliability, are         steps beyond the linked data ken or purpose.</p>
<h3>Promoting Linked Data to its Level of Incompetence</h3>
<p>It seems, then, that we see two different forces and perspectives at         work, each contributing in its own way to today&#8217;s bipolar nature of         linked data.</p>
<p>On the manic side, we see the celebration for the release of each         large, linked data set. This perspective seems to care most about         volumes and numbers, with less interest in how and whether the data is         of quality or useful. This perspective seems to believe &#8220;post the data,         and the public will come.&#8221; This same perspective is also quite         parochial with respect to the unsuitability of non-linked data, be it         microdata, microformats or any of the older junk.</p>
<p>On the depressed side, linked data has been seen as a more palatable         packaging for the disappointments and perceived failures or slow         adoption of the earlier semantic Web phrasing. When this perspective         sees the lack of structure, defensible connections and other quality         problems with linked data as it presently exists, despair and         frustration ensue.</p>
<p>But both of these perspectives very much miss the mark. Linked data         will never become the universal technique for publishing structured         data, and should not be expected to be such. Numbers are never a         substitute for quality. And linked data lacks the standards, scope and         investment made in the semantic Web to date. Be patient; don&#8217;t despair;         structured data and the growth of semantics and useful metadata is         proceeding just fine.</p>
<p>Unrealistic expectations or wrong roles and metrics simply confuse the         public. We are fortunate that most potential buyers do not frequent the         community&#8217;s various mailing lists. Reduced expectations and an         understanding of linked data&#8217;s natural role is perhaps the best way to         bring back balance.</p>
<h3>Linked Data&#8217;s Natural Role</h3>
<p>We have consciously moved our communications focus from speaking         internally to the community to reaching out to the broader enterprise         public. There is much of education, clarification and dialog that is         now needed with the buying public. The time has moved past software         demos and toys to workable, pragmatic platforms, and the methodologies         and documentation necessary to support them. This particular missive         speaking to the founding community is (perhaps many will Hurray!)         likely to become even more rare as we continue to focus outward.</p>
<p>As Structured Dynamics has stated many times, we are committed to         linked data, presenting our information as such, and providing better         tools for producing and consuming it. We have made it one of the         <a href="../859/seven-pillars-of-the-open-semantic-enterprise/"> seven foundations</a> to our <a href="http://structureddynamics.com/products.html">technology stack</a> and         <a href="http://mike2.openmethodology.org/wiki/Open_SEAS_Framework">methodology</a>.</p>
<p>But, linked data on its own is inadequate as an interoperability         standard. Many practitioners don&#8217;t publish it right, characterize it         right, or link to it right. That does not negate its benefits, but it         does make it a poor candidate to install on the semantic Web throne.</p>
<p>Linked data based on RDF is perhaps the first citizen amongst all         structured data citizens. It is an expressive and readily consumed         means for publishing and relating structured instance data and one that         can be easily interoperated. It is a natural citizen of the Web.</p>
<p>If we can accept and communicate linked data for these strengths, for         what it naturally is &#8212; a useful set of techniques and best practices         for enabling data that can be easily consumed &#8212; we can rest easy at         night and not go crazy. Otherwise, bring on the Prozac.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="BPD1"></a> [1] Actually, in my opinion, the suggested listing of apps from these         discussions is distinctly unimpressive and not compelling. As argued in         the main body of the post, I think this is because linked data is         really just a technique or best practice, and not a basis alone for         enabling compelling apps. As initial developers of such apps as the         <a href="http://umbel.structureddynamics.com/explorer.php?concept=http%3A%2F%2Fumbel.org%2Fumbel%2Fsc%2FMolecule"> UMBEL concept explorer</a> or <a href="http://dataviewer.zitgist.com/?uri=http%3A//fgiasson.com">Dataviewer</a>,         <a href="http://structureddynamics.com/">Structured Dynamics</a> understands the use of linked data and has a defensible basis to         comment on applications. Our own applications intimately integrate         linked data, but only as one of <a href="../859/seven-pillars-of-the-open-semantic-enterprise/"> seven foundations</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="BPD2"></a> [2] Here are some of my relevant posts over the past year discussing         the role of linked data: <a style="font-style: italic;" href="../802/moving-beyond-linked-data/">Moving Beyond         Linked Data</a> (Sept. 20, 2009); <a style="font-style: italic;" href="../825/fresh-perspectives-on-the-semantic-enterprise/"> Fresh Perspectives on the Semantic Enterprise</a> (Sept. 28, 2009);         <a style="font-style: italic;" href="../837/the-law-of-linked-data/">The Law of         Linked Data</a> (Oct. 11, 2009); <a style="font-style: italic;" href="../846/when-linked-data-rules-fail/">When Linked         Data Rules Fail</a> (Nov. 16, 2009).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="BPD3"></a> [3] The current bipolar discussion reminds me of the &#8220;<a href="http://en.wikipedia.org/wiki/Six_phases_of_a_big_project">Six Phases         of a Project</a>,&#8221; a copy of which has been a permanent fixture on my         office wall:</p>
<ol>
<li>Enthusiasm</li>
<li>Disillusionment</li>
<li>Panic</li>
<li>Search for the guilty</li>
<li>Punishment of the innocent</li>
<li>Honors &amp; praise for the non-participants.</li>
</ol>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="BPD4"></a> [4] See, for example: Harry Halpin, 2009. &#8220;A Query-Driven         Characterization of Linked Data,&#8221; paper presented at the Linked Data on         the Web (LDOW) 2009 Workshop, April 20, 2009, Madrid, Spain, see         <a href="http://events.linkeddata.org/ldow2009/papers/ldow2009_paper16.pdf">http://events.linkeddata.org/ldow2009/papers/ldow2009_paper16.pdf</a>;         Prateek Jain, Pascal Hitzler, Peter Z. Yehy, Kunal Vermay and Amit P.         Shet, 2010. &#8220;Linked Data is Merely More Data,&#8221; in Dan Brickley, Vinay         K. Chaudhri, Harry Halpin, and Deborah McGuinness, <span style="font-style: italic;">Linked Data Meets Artificial Intelligence,         Technical Report SS-10-07</span>, AAAI Press, Menlo Park, California,         2010, pp. 82-86., see <a href="http://knoesis.wright.edu/library/publications/linkedai2010_submission_13.pdf"> http://knoesis.wright.edu/library/publications/linkedai2010_submission_13.pdf</a>;         among others.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="BPD5"></a> [5] Structured Dynamics&#8217; best practices approach makes explicit splits         between the “<a href="http://en.wikipedia.org/wiki/Abox">ABox</a>” (for instance data)         and “<a href="http://en.wikipedia.org/wiki/Tbox">TBox</a>”         (for ontology schema) in accordance with our <a title="Permanent Link to Thinking ?Inside the Box? with Description Logics" href="../466/thinking-inside-the-box-with-description-logics/"> working definition</a> for <a href="http://en.wikipedia.org/wiki/Description_logics">description         logics</a>, a fundamental underpinning for how we use RDF:</p>
<div class="boxGraySolid">“Description logics and their semantics traditionally split           <span style="font-style: italic;">concepts</span> and their           relationships from the different treatment of <span style="font-style: italic;">instances</span> and their attributes and           roles, expressed as fact assertions. The concept split is known as           the TBox (for <em>terminological</em> knowledge, the basis for           <span style="font-style: italic;">T</span> in <span style="font-style: italic;">TBox</span>) and represents the schema or           taxonomy of the domain at hand. The TBox is the structural and           intensional component of conceptual relationships. The second split           of instances is known as the ABox (for <span style="font-style: italic;">assertions</span>, the basis for <span style="font-style: italic;">A</span> in <span style="font-style: italic;">ABox</span>) and describes the attributes of           instances (and individuals), the roles between instances, and other           assertions about instances regarding their class membership with the           TBox concepts.”</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="BPD6"></a> [6] This topic is deserving of some analysis in its own right, and my         guess is really just that. For example, RSS feeds to mobile devices         alone perhaps account for 2,000 petabytes today; see <a href="http://www.tgdaily.com/hardware-features/49167-8000-petabytes-of-mobile-data-traffic-expected-by-2014"> http://www.tgdaily.com/hardware-features/49167-8000-petabytes-of-mobile-data-traffic-expected-by-2014</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/880/the-bipolar-disorder-of-linked-data/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Ontology-driven Applications Using Adaptive Ontologies</title>
		<link>http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/</link>
		<comments>http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/#comments</comments>
		<pubDate>Mon, 23 Nov 2009 16:24:29 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Description Logics]]></category>
		<category><![CDATA[Ontology Best Practices]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[adaptive ontologies]]></category>
		<category><![CDATA[conStruct]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[ontology-driven applications]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>
		<category><![CDATA[structWSF]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=847</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Ontology-driven Applications Using Adaptive Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=Ontology Best Practices&amp;rft.subject=Structured Dynamics&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-23&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/&amp;rft.language=English"></span>

A Low-risk Path to the Open World, Semantic Enterprise
OK, you&#8217;ve been reading the literature and perhaps have attended a         conference or two. You have heard a lot about semantic technologies,         but have some real questions and concerns:

How do we [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Ontology-driven Applications Using Adaptive Ontologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Description Logics&amp;rft.subject=Ontology Best Practices&amp;rft.subject=Structured Dynamics&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-23&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/&amp;rft.language=English"></span>
<p><img style="border: 0px solid; width: 250px; height: 130px; float: left; margin-right: 10px;" title="Open World" src="../wp-content/themes/ai3/images/2009Posts/091122_world_atlas.png" alt="Open World - from worldatlas.com" /></p>
<h2>A Low-risk Path to the Open World, Semantic Enterprise</h2>
<p>OK, you&#8217;ve been reading the literature and perhaps have attended a         conference or two. You have heard a lot about semantic technologies,         but have some real questions and concerns:</p>
<ul>
<li style="margin-left: 30px;">How do we get started, especially with         smaller proofs-of-concept?</li>
<li style="margin-left: 30px;">Do we need to abandon our past practices         and systems in order to gain semantic advantages?</li>
<li style="margin-left: 30px;">To gain the advantages of         interoperability, do we have to convert everything into RDF or OWL?</li>
<li style="margin-left: 30px;">Are semantic technologies limited to         open or public data; how do we accommodate our proprietary information?</li>
</ul>
<p>Such questions &#8212; and more &#8212; are not infrequent when organizations         first contemplate making the transition to become a semantic         enterprise.</p>
<h3>Overview</h3>
<p>The diagram below shows a general workflow for migrating existing         instance data into the semantic enterprise. The diagram is broken down         into three parts. The first part is to characterize and stage existing         data and information into the underlying structured data framework.         This is what SD (that is, my firm, <a href="http://structureddynamics.com/">Structured Dynamics</a>) does as data         architects using our particular approach to adaptive ontologies. I&#8217;ll         touch on this again in a moment.</p>
<p>Jumping to the right-hand side of the diagram is the access and display         part. It is here that developers or users can make selections from         dropdown lists and so forth to define the &#8220;slices&#8221; of diced results         sets they wish to display. The results of those interactions are         structured data results sets that are pre-staged to &#8220;drive&#8221; various         applications and displays [<a href="#adapto1">1</a>,<a href="#adapto2">2</a>]. These same capabilities can also be         embedded into standard Web end user applications, such as content         management systems.</p>
<p>The third and middle part of the diagram is the critical part, the         pivot point. It is the interface layer between the structured data on         the left and the display and presentation of that data on the right. As provided by SD,         this abstraction layer is the <span style="font-weight: bold;">structWSF</span> Web services framework that         &#8220;bridges&#8221; between the black box of what happens with RDF and semantic         Web structured data characterizations on the left in order to feed, or         &#8220;drive&#8221;, useful services and functions on the right.</p>
<p>We call this general design and architecture &#8220;ontology-driven         applications&#8221;. The bulk of this posting explains each of these three         parts in a bit more detail, organized from left-to-right by these Parts         1 to 3.</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_adaptive_ontology_workflow.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 359px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_adaptive_ontology_workflow.png" alt="Adaptive Ontology Workflow" width="1218" height="728" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<h3>Part 1: Structured Data Instances and Ontologies</h3>
<p>Our approach relies on what we call &#8220;adaptive ontologies&#8221;. These         ontologies set the structural basis for all subsequent data display,         analysis, inferencing, entailments, and the like. We call them         &#8220;adaptive&#8221; because we embrace a set of unique best practices. These         practices enable the ontologies to do the double-duty of first         structuring data and then driving generic applications by properly         informing user interfaces, dropdown lists, menus and the like.</p>
<p>This structuring results in faceting key important dimensions and         attributes of available content. Structured data gets organized.         Unstructured data (text) gets tagged via this structure and integrated         with it.</p>
<p>As Structured Dynamics&#8217; general product schema makes clear (see         the diagram at <a href="#adapto3">[3]</a>), our approach leverages existing assets as         much as possible. Often, this means leaving most existing data         structures in place. These existing assets are staged and converted in         two complementary manners that largely correspond to the conceptual         <span style="font-style: italic;">ABox</span> (instance) and <span style="font-style: italic;">TBox </span>(concepts and schema) split central to         description logics and pivotal to SD&#8217;s methodology <a href="#adapto4">[4]</a>.</p>
<p>Whether transitioning small chunks or big chunks, this staging of         existing data in Part 1 results in an RDF-accessible characterization         of the starting content. Instances and their attributes are represented         via a common notation, generally based on <span style="font-weight: bold;">irON</span> (<span style="font-style: italic;">instance         record</span> and <span style="font-style: italic;">Object Notation</span>) <a href="#adapto5">[5]</a>, that is an         extensible notation and vocabulary for capturing the data characterizations,         attributes and metadata of the candidate instance data (&#8221;records&#8221; in         RDBMS parlance). These instances may either be internal or proprietary         records, or instance data on the Web or in the public domain. By         properly matching same or similar instances to one another, any source         of instance characterization can be meaningfully combined.</p>
<p>This instance notation is extremely lightweight, and really is merely         an RDF representation of data characterizations. In the         characterizations to this point there is not yet any &#8220;world view&#8221;         involved:  we are simply describing instances and their attributes         in a manner akin to key-value pairs. The process to this point is         entirely descriptive.</p>
<p>However, these instance characteristics do contain within them the         semantics as to how to describe these attributes (your &#8220;glad&#8221; is my         &#8220;happy&#8221;), as well as potentially a schematic or conceptual view of how         these instances relate to one another and to the broader world.         Instance characterizations provide the building blocks, that are then         related and made semantically whole via a second &#8220;terminological&#8221;         level.</p>
<p>These terminological, or conceptual, relationships (the <span style="font-style: italic;">TBox</span> <a href="#adapto4">[4]</a>),         reside at a different level from simply decribing things. Rather,         these schema &#8212; what in this context are best known         as <span style="font-style: italic;">ontologies</span> &#8212; provide         a precise language and means for describing conceptual relationships.         If these structural relationships are done well, they are <a href="../450/when-is-content-coherent/">coherent</a>:         the hip bone is connected to the thigh bone and not to the ear.         Coherence is a matter of a consistent world view that &#8220;hangs together&#8221;         when analyzed via powerful logical techniques available via description         logics and other broader mechanisms of the semantic enterprise.</p>
<p>Thus, as we transition from the existing, the operational workflow         splits the input data stream into two pathways:</p>
<ul>
<li>Instances, and their descriptive characteristics, and</li>
<li>Conceptual relationships, or ontologies.</li>
</ul>
<p>A sequential flow of these steps and splits is provided by this diagram         below that shows: 1) the conceptual structure of the concept ontology;         as 2) matched with the instances and their descriptive attributes that         populate that schema.</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_ontology_build_methodology.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 355px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_ontology_build_methodology.png" alt="Ontology and Instance Build Methodology" width="1232" height="728" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>A key point is that &#8212; while a proper starting ontology is essential to         our process and proofs-of-concept &#8212; it can be grown and scaled         incrementally. We leverage as much existing starting structure as         possible and can readily bound the scope to meet budget and delivery         imperatives.</p>
<p>The concepts and entities that occur within these structures help         inform our fairly simple tagging system, <a style="font-weight: bold;" href="http://structureddynamics.com/scones.html">scones</a> <a href="#adapto3">[3]</a>. (There are         also benefits from &#8220;triangulating&#8221; between entity or instance         identification and concept identification that helps inform         disambiguation nearly for free; see further <a href="#adapto6">[6])</a>. It is also possible         to integrate these initial proof-of-concept approaches with third-party         tools (<span style="font-style: italic;">e.g.</span>, <a href="http://www.opencalais.com/">Calais</a>, Expert System (<a href="http://www.expertsystem.net/page.asp?id=1515&amp;idd=200">Cogito</a>),         etc.) to improve unstructured content characterization.</p>
<p>These approaches are pretty straightforward for any organization         wanting to test the idea of becoming a semantic enterprise. Real         benefits &#8212; such as concept retrievals overcoming the limitations of         standard keyword search &#8212; can be demonstrated from even small starting         ontologies and structures. Given the inherent connectedness of the         data, it is possible to expand the scope and usefulness of the         information incrementally within fixed and manageable budgets.</p>
<h3>Part 2: structWSF: A Web-oriented Services API and Framework</h3>
<p>A pivotal part of SD&#8217;s infrastructure software is <span style="font-weight: bold;">structWSF</span> <a href="#adapto7">[7]</a>, our platform-independent Web         services middleware. <span style="font-weight: bold;">structWSF</span> is an abstraction layer that provides the APIs, search endpoints, and         specific Web services for accessing, querying or getting results sets         from the underlying structured data and ontologies.</p>
<p><span style="font-weight: bold;">structWSF</span> has a standard set of access and retrieval services including browse, full-text search, CRUD, direct record retrievals, and the like. It is embedded within an access and permissions service that acts at the level of registered datasets. Then, based on the requested protocol, <span style="font-weight: bold;">structWSF</span> returns the         filtered results set. These results sets can be delivered as XML, JSON,         or any of the other formats already available <a href="#adapto7">[7]</a>. They can readily and         dynamically populate HTML pages and forms in any deployment framework.         For specific purposes, these results sets can also be returned as         pre-staged, properly formatted results streams for driving specific         applications.</p>
<p>As an API, the <span style="font-weight: bold;">structWSF</span> Web         services can be interacted with and driven via standard HTTP requests.         Alternatively, these requests can come from simple to complicated Web         apps that create the API queries based on user interface choices such         as selections from dropdown lists or clicking on various listed         options. An interactive demo of this approach is shown by SD&#8217;s         <span style="font-weight: bold;">conStruct</span> application <a href="#adapto8">[8]</a>, though even simpler Web pages or forms may         drive the query interface.</p>
<p>Queries and requests to <span style="font-weight: bold;">structWSF</span> may also include a parameter for         results sets to be returned in particular formats. SD&#8217;s <span style="font-weight: bold;">irON</span> protocol <a href="#adapto5">[5]</a> supports requests or results         in CSV, XML or JSON, in addition to other flavors including multiple         serializations of RDF.</p>
<p>In this manner, only a simple converter need be added to the         <span style="font-weight: bold;">structWSF</span> Web services stack in         order to &#8220;drive&#8221; a new application with a particularly formatted         results set stream.</p>
<p><span style="font-weight: bold;">structWSF</span> thus acts as a         single, uniform Web interface to all of the &#8220;black box&#8221; nuances of the         structured data system organized by the adaptive ontologies. Further,         virtually any data structure may be ingested and converted from         external sources via an import service and made part of the underlying canonical structure,         making the framework perfect for data federation <a href="#adapto9">[9]</a>. Lastly, the         dataset nature of the framework, and its neutrality to underlying data         stores or content management systems, also makes <span style="font-weight: bold;">structWSF</span> an excellent framework for one or         many nodes to share information and collaborate across the Web <a href="#adapto10">[10]</a>.</p>
<p>The following diagram shows how a diverse, Web-based network, involving         a diversity of Web portals and data gateways and hubs, can work via the         <span style="font-weight: bold;">structWSF</span> framework to         establish a complete collaboration network. Via datasets and         differential access rights and permissions, virtually any combination         of potential interactions can be supported:</p>
<div style="margin: 10px; text-align: center;"><a href="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_collaboration_network.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 483px;" title="Click to expand" src="http://mkbergman.com/wp-content/themes/ai3/images/2009Posts/091122_collaboration_network.png" alt="Example Collaboration Network" width="746" height="601" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>These potentials are really fundamentally new, and we ourselves are         still trying to find the language and analogies to best explain them. <span style="font-weight: bold;">structWSF</span> was         initially designed as a platform-independent layer between the         structured data representation of existing assets and the         ontology-driven applications that interact with them. We are now finding that deployment in         a broader Web-based context provides additional exciting         prospects for integrating various regional offices or to         enable direct collaboration with customers, partners or         suppliers.</p>
<h3>Part 3: Ontology-driven Applications</h3>
<p>The basic design of <span style="font-weight: bold;">structWSF</span> is to provide a middleware layer that fulfills one or more of these         broad user interaction modes:</p>
<ul>
<li>To create, update, delete or otherwise manage data records</li>
<li>To browse or view existing records or record sets, based on simple         to possible complex selection or filtering criteria, or</li>
<li>To take one of these results sets and progress it through a         workflow of some nature, involving specialized analysis, applications,         or visualization.</li>
</ul>
<p>SD has developed generic applications in these areas (with many more         possible), the operations of which are guided by the instructions and         nature of the underlying data that feeds them. We have proven it is         possible to adopt data characterization practices within those         ontologies so as to stage or &#8220;drive&#8221; such generic applications.</p>
<p>In the case of a standard structured data display (say, a simple table         like a Wikipedia <a href="http://en.wikipedia.org/wiki/Category:Infobox_templates">infobox</a>,         for example), such generic design includes templates tailored to         various instance types (say, locational information presenting on a map         versus people information warranting a image and vital statistics).         Alternatively, in the generic design for some specialized application         (say, <a href="http://www.adobe.com/software/flash/about/">Adobe         Flash</a>), the information output of the results set may need to         contain certain formats and attributes.</p>
<p>SD&#8217;s &#8220;ontology-driven apps&#8221;, then, are really informed structured         results sets that are outputted in a form suitable to various intended         applications. This output form can include a variety of serializations,         formats or metadata. This flexibility of output that is tailored to and         responsive to particular generic applications is what makes our         ontologies &#8220;adaptive&#8221;.</p>
<p>Expressed in this manner, &#8220;ontology-driven apps&#8221; seem neither         remarkably profound nor clever. They are simply attentive to their         intended uses.</p>
<p>Using this structure, then, it is possible to either &#8220;drive&#8221; queries         and results sets selections via direct HTTP request or via simple         dropdown selections on HTML forms (that is, from <span style="font-style: italic;">right</span> to <span style="font-style: italic;">left</span> as shown on the first diagram). Similarly, it is possible with a         single parameter change to drive either a visualization app or a         structured table template from the equivalent query request (that is, from <span style="font-style: italic;">left</span> to <span style="font-style: italic;">right</span> on the first diagram).</p>
<p>&#8220;Ontology-driven apps&#8221; through SD&#8217;s architecture design thus provide two         profound benefits.  First, the entire system can be driven via         simple selections or interactions without the need for any programming         or technical expertise. And, second, simple additions of new and minor         output converters can work to power entirely new applications available         to the system. If, say, Adobe graphics applications need to change         tomorrow for Microsoft Silverlight, that switch is easy and can be         made transparent to the designer.</p>
<h3>The Complete Picture: Embrace the Open World</h3>
<p>The ability to develop these systems incrementally and the ability to         integrate with external, public data is fundamentally dependent on         the <a href="http://en.wikipedia.org/wiki/Open_world_assumption">open world         assumption</a>. The open world assumption is a different logic premise         than what many enterprises are used to; relational database systems,         for example, embrace the alternate <a href="http://en.wikipedia.org/wiki/Closed_world_assumption">closed world         premise</a>.</p>
<p>Open world does not necessarily mean open data and it does not mean open source.         Open world is merely a way to think about the information we have and         how we act on it. An open world assumption accepts that we never have         all necessary information and lacking that information does not itself lead         to any conclusions.</p>
<p>Some enterprise circumstances &#8211; say a complete enumeration of customers or products or even controlled engineering or design environments &#8212; may warrant a closed world approach. In those circumstances, the domain of inquiry is well bounded and we can get relatively complete information about it. Engineering an oil drilling platform or launching the Space Shuttle in fact demands that.</p>
<p>But, in most real world circumstances, there is much we don&#8217;t know and         we interact in complex and external environments. Open world is the         proper logic premise for these circumstances. These circumstances also         happen to be the very basis in which most most knowledge workers         and analysts reside.</p>
<p>Open world frameworks provide some         incredibly important benefits if the circumstances of their use apply:</p>
<ul>
<li>Domains can be analyzed and inspected incrementally</li>
<li>Schema can be incomplete and developed and refined incrementally</li>
<li>The data and the structures within these open world frameworks can         be used and expressed in a piecemeal or incomplete manner</li>
<li>We can readily combine data with partial characterizations with other data having complete characterizations</li>
<li>Systems built with open world frameworks are flexible and robust;         as new information or structure is gained, it can be incorporated         without negating the information already resident, and</li>
<li>Open world systems can readily bridge or embrace closed world         subsystems.</li>
</ul>
<p>One might argue, as we believe, that the biggest impediment to the semantic enterprise is the mind shift necessary to start thinking about and accepting the open world premise. Again, this perspective is not applicable to all problems and domains.  But, where it is, much can be left in place and leveraged with semantic technologies, so long as the enterprise begins to look at these existing assets through a different open-world lens.</p>
<h3>Summary</h3>
<p>So, let&#8217;s return to the rhetorical questions that began this posting.</p>
<p>It should now be clear that it is possible to start small in testing         the transition to a semantic enterprise. These efforts can be done incrementally and         with focus on early, high-value applications and domains.</p>
<p>Further, we need not abandon past practices. There is much that can be done to leverage existing assets. Indeed, those prior investments are often the requisite starting basis to inform semantic initiatives. However, in leveraging those assets, it is important that the enterprise begin to embrace and understand the open world assumption.</p>
<p>We also see that RDF and OWL, while important behind the scenes as a         canonical data model and languages for organizing this information,         need not be exposed as such to most users. Most instance data can be         expressed as is with the data languages of choice such as XML, JSON or         whatever.</p>
<p>We also see these technologies are neutral to the question of open         or public sources. The techniques can equivalently be applied to         internal, closed, proprietary data and structures. Moreover, the         technologies can themselves be used as a basis for bringing external         information into the enterprise.</p>
<p>Without a doubt, some of the early years in describing semantic         technologies were burdened with some unfortunate bad information and         lack of sophistication. Today&#8217;s semantic Web is nimble, agile, and         ready to be deployed immediately at low cost and risk. So, jump on in! We think you&#8217;ll find the water to be just fine.</p>
<div class="boxYellowDotted">This post is Part V of an occasional <span style="color: #993300; font-weight: bold;">AI3</span> series on         <a href="../category/ontologies/">ontology</a> <a href="../category/ontology-best-practices/">best         practices</a>.</div>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto1"></a> [1] These selections and requests need not occur <span style="font-style: italic;">only</span> via user interfaces or HTML forms,         but also programmatically via API or direct Web services calls.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto2"></a> [2] There are two main classes of visualizations possible with our         systems:  1) navigations or explorers of the concept space, which         is a particularly open challenge for large, graph-based knowledge bases         (see, for example, our Subject Concept Explorer using the <a href="http://umbel.org/">UMBEL</a> <a href="http://umbel.structureddynamics.com/explorer.php?concept=http://umbel.org/umbel/sc/FinancialAccount"> Financial Account concept</a>, and click on the bubbles); or 2)         conventional data visualizations or graphics or mappings of instance         data. Both are shown as workflow boxes on the first diagram above.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto3"></a> [3] See <a href="http://structureddynamics.com/products.html">http://structureddynamics.com/products.html</a> for a general descriptive illustration of Structured Dynamics&#8217; product         stack. There is also a longer <a href="http://www.slideshare.net/mkbergman/structured-dynamicss-semantic-technologies-product-stack"> slideshow</a>, from which this diagram is drawn as slide #37.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto4"></a> [4] We use the reference to <em><a href="http://en.wikipedia.org/wiki/Abox">ABox</a></em> and <em><a href="http://en.wikipedia.org/wiki/Tbox">TBox</a></em> in accordance with         our <a title="Permanent Link to Thinking ?Inside the Box? with Description Logics" href="../466/thinking-inside-the-box-with-description-logics/"> working definition</a> for <a href="http://en.wikipedia.org/wiki/Description_logics">description         logics</a>:</p>
<div class="boxGraySolid">&#8220;Description logics and their semantics traditionally split           <span style="font-style: italic;">concepts</span> and their           relationships from the different treatment of <span style="font-style: italic;">instances</span> and their attributes and           roles, expressed as fact assertions. The concept split is known as           the TBox (for <em>terminological</em> knowledge, the basis for           <span style="font-style: italic;">T</span> in <span style="font-style: italic;">TBox</span>) and represents the schema or           taxonomy of the domain at hand. The TBox is the structural and           intensional component of conceptual relationships. The second split           of instances is known as the ABox (for <span style="font-style: italic;">assertions</span>, the basis for <span style="font-style: italic;">A</span> in <span style="font-style: italic;">ABox</span>) and describes the attributes of           instances (and individuals), the roles between instances, and other           assertions about instances regarding their class membership with the           TBox concepts.&#8221;</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto5"></a> [5] For the specification and a use case of <span style="font-weight: bold;">irON</span> using the CSV (<span style="font-weight: bold;">commON</span>) serialization, see <a href="http://openstructs.org/iron">http://openstructs.org/iron.</a></div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto6"></a> [6] Via this approach we now can assess concept matches in addition to entity matches. This means we can triangulate between the two assessments to aid disambiguation. Because of these logical segmentations, we also have multiple “clusters” (that is, either the <span style="font-style: italic;">concept</span>,         <span style="font-style: italic;">type</span>, <span style="font-style: italic;">superType</span> or <span style="font-style: italic;">dimension</span>) upon which to do our disambiguation evaluations, either between concepts and entities or within the various concept clusters. We can do so via either multiple <a href="http://en.wikipedia.org/wiki/Vector_space_model">semantic         vectors</a> (for statistical-based methods) or multiple <a href="http://en.wikipedia.org/wiki/Features_%28pattern_recognition%29">features</a> (for <a href="http://en.wikipedia.org/wiki/Machine_learning">machine         learning</a> methods). In other words, because of logical segmentation, we have increased the informational power of our concept graph. See further <a href="../759/supertypes-and-logical-segmentation-of-instances/">http://www.mkbergman.com/759/supertypes-and-logical-segmentation-of-instances/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto7"></a> [7] See <a href="http://openstructs.org/structwsf/architecture">http://openstructs.org/structwsf/architecture</a>; also, the available export         formats are shown at <a href="http://constructscs.com/documentation/instructions/export">http://constructscs.com/documentation/instructions/export</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto8"></a> [8] There is an online demo of <span style="font-weight: bold;">conStruct</span> using the <a href="../new-version-sweet-tools-sem-web/">Sweet Tools</a> database of semantic Web and -related tools at <a href="http://constructscs.com/conStruct/browse/">http://constructscs.com/conStruct/browse/</a>; for background on this use case, see <a href="../845/a-most-un-common-way-to-author-datasets/">http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto9"></a> [9] See, for example, <a href="../496/structwsf-a-framework-for-data-mixing/">http://www.mkbergman.com/496/structwsf-a-framework-for-data-mixing/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="adapto10"></a> [10] See, for example, <a href="../497/structwsf-a-framework-for-collaboration-networks/"> http://www.mkbergman.com/497/structwsf-a-framework-for-collaboration-networks/</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/847/ontology-driven-applications-using-adaptive-ontologies/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>A Most un-commON Way to Author Datasets</title>
		<link>http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/</link>
		<comments>http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/#comments</comments>
		<pubDate>Thu, 12 Nov 2009 02:19:54 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Semantic Web Tools]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[case study]]></category>
		<category><![CDATA[commON]]></category>
		<category><![CDATA[conStruct]]></category>
		<category><![CDATA[CSV]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[spreadsheet]]></category>
		<category><![CDATA[structured data]]></category>
		<category><![CDATA[structWSF]]></category>
		<category><![CDATA[Sweet Tools]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=845</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=A Most un-commON Way to Author Datasets&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=Structured Web&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-11&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/&amp;rft.language=English"></span>

A Case Study of Turning Spreadsheets into Structured Data Powerhouses
In a former life, I had the nickname of &#8216;Spreadsheet King&#8217; (perhaps         among others that I did not care to hear). I had gotten the nick         because of my aggressive [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=A Most un-commON Way to Author Datasets&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Ontologies&amp;rft.subject=Semantic Web&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=Structured Web&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-11&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/&amp;rft.language=English"></span>
<p><a href="http://openstructs.org/iron"><img style="border: 0px solid; width: 235px; height: 125px; float: left; margin-right: 10px;" title="irON - instance record and Object Notation" src="../wp-content/themes/ai3/images/iron_logo_235.png" alt="irON - instance record and Object Notation" hspace="5" vspace="5" align="left" /></a></p>
<h2>A Case Study of Turning Spreadsheets into Structured Data Powerhouses</h2>
<p>In a former life, I had the nickname of &#8216;Spreadsheet King&#8217; (perhaps         among others that I did not care to hear). I had gotten the nick         because of my aggressive use of spreadsheets for financial models,         competitors tracking, time series analyses, and the like. However, in         all honesty, I have encountered many others in my career much more         knowledgeable and capable with spreadsheets than I&#8217;ll ever be. So,         maybe I was really more like a minor duke or a court jester than true         nobility.</p>
<p>Yet, pro or amateur, there are perhaps 1 billion spreadsheet users         worldwide <a href="#commON1">[1]</a>, making spreadsheets undoubtedly the most prevalent data         authoring environment in existence. And, despite moans and wails about         how spreadsheets can lead to chaos, spaghetti code, or violations of         internal standards, they are here to stay.</p>
<p>Spreadsheets often begin as simple notetaking environments. With the         addition of new findings and more analysis, some of these worksheets         may evolve to become full-blown datasets. Alternatively, some         spreadsheets start from Day One as intended datasets or modeling         environments. Whatever the case, clearly there is much accumulated         information and data value &#8220;locked up&#8221; in existing spreadsheets.</p>
<p>How to &#8220;unlock&#8221; this value for sharing and collaboration was a major         stimulus to development of the <span style="font-weight: bold;">commON</span> serialization of <span style="font-weight: bold;">irON</span> (<span style="font-style: italic;">instance record</span> and <span style="font-style: italic;">Object Notation</span>) <a href="#commON2">[2]</a>. I recently published         a <a href="http://openstructs.org/iron/common-swt-annex">case study</a> <a href="#commON3">[3]</a> that describes the reasons and benefits of dataset authoring in a         spreadsheet, and provides working examples and code based on         <span style="font-style: italic;">Sweet Tools</span> <a href="#commON4">[4]</a> to aid users         in understanding and using the <span style="font-weight: bold;">commON</span> notation. I summarize portions of         that study herein.</p>
<div class="boxGreenDotted" style="margin: 5px 0pt 5px 10px; width: 240px; float: right; text-align: center;">This is the second article of a two-part series related to the recent       <span style="font-style: italic;">Sweet Tools</span> <a href="../844/sweet-tools-shatters-the-sound-barrier/">update</a>.</div>
<h3>Background on <span style="font-style: italic;">Sweet Tools</span> and         irON</h3>
<p>The dataset that is the focus of this <a href="http://openstructs.org/iron/common-swt-annex">use case</a>,         <a href="../844/sweet-tools-shatters-the-sound-barrier/"><span style="font-style: italic;">Sweet Tools</span></a>, began as an         informal tracking spreadsheet about four years ago. I began it as a way         to learn about available tools in the semantic Web and -related spaces.         I began publishing it and others found it of value so I continued to         develop it.</p>
<p>As it grew over time, however, it gained in structure and size.         Eventually, it became a reference dataset, with which many other people         desired to use and interact. The current version has well over 800         tools listed, characterized by many structured data attributes such as         type, programming language, description and so forth. As it has grown,         a formal controlled vocabulary has also evolved to bring consistency to         the characterization of many of these attributes.</p>
<p>It was natural for me to maintain this listing as a spreadsheet, which         was also reinforced when I was one of the first to adopt an <a href="../326/converting-sweet-tools-to-an-exhibit/">Exhibit         presentation</a> of the data based on a Google spreadsheet about three         years back. Here is a partial view of this spreadsheet as I maintain it         locally:</p>
<div style="margin: 10px; text-align: center;"><a href="http://openstructs.org/sites/openstructs.org/files/images/swt_main_screen.png"> <img class="center_ok" style="border: 0px solid; width: 740px; height: 356px;" title="Click to expand" src="http://openstructs.org/sites/openstructs.org/files/images/swt_main_screen.png" alt="Sweet Tools Main Spreadsheet Screen" width="1279" height="615" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>When we began to develop <span style="font-weight: bold;">irON</span> in earnest as a simple (&#8221;naïve&#8221;) dataset authoring framework, it was         clear that a comma-separated value, or <a href="http://en.wikipedia.org/wiki/Comma-separated_values">CSV</a> <a href="#commON5">[5]</a>,         option should join the other two serializations under consideration,         XML and JSON. CSV, though less expressive and capable as a data format         than the other serializations, still has an <a href="http://en.wikipedia.org/wiki/Attribute-value_pair">attribute-value         pair</a> (also known as key-value pairs and many other variants <a href="#commON6">[6]</a>)         orientation. And, via spreadsheets, datasets can be easily authored and         inspected, while also providing a rich functional environment including         sorting, formatting, data validation, calculations, macros, etc.</p>
<p>As a dataset very familiar to us as <span style="font-weight: bold;">irON</span>&#8217;s editors, and directly relevant to         the semantic Web, <span style="font-style: italic;">Sweet Tools</span> provided a perfect prototype case study for helping to guide the         development of <span style="font-weight: bold;">irON</span>, and         specifically what came to be known as the <span style="font-weight: bold;">commON</span> serialization for <span style="font-weight: bold;">irON</span>. The <span style="font-style: italic;">Sweet Tools</span> dataset is relatively large         for a speciality source, has many different types and attributes, and         is characterized by text, images, URLs and similar.</p>
<p>The premise was that if <span style="font-style: italic;">Sweet         Tools</span> could be specified and represented in <span style="font-weight: bold;">commON</span> sufficiently to be parsed and         converted to interoperable RDF, then many similar instance-oriented         datasets could likely be so as well. Thus, as we tried and refined         notation and vocabulary, we tested applicability against the CSV         representation of <span style="font-style: italic;">Sweet Tools</span> in addition to other CSV, JSON and XML datasets.</p>
<h3>Dataset Authoring in a Spreadsheet</h3>
<p>A large portion of the <a href="http://openstructs.org/iron/common-swt-annex">case study</a> describes         the many advantages of authoring small datasets within spreadsheets.         The useful thing about the CSV format is that these full functional         capabilities of the spreadsheet are available during authoring or later         updates and modifications, but, when exported, the CSV provides a         relatively clean format for processing and parsing.</p>
<p>So, some of the reasons for small dataset authoring in a spreadsheet         include:</p>
<ul>
<li> <span style="font-style: italic;">Formatting and on-sheet           management</span> -  the first usefulness of a spreadsheet comes           from being able to format and organize the records. Records can be           given background colors to highlight distinctions (new entries, for           example); live URL links can be embedded; contents can be wrapped and           styled within cells; and the column and row heads can be &#8220;frozen&#8221;,           useful when scrolling large workspaces</li>
<li> <span style="font-style: italic;">Named blocks and sorting</span> &#8211;           named blocks are a powerful feature of modern spreadsheets, useful           for data manipulation, printing and internal referencing by formulas           and the like.  Sorting with named blocks is especially important           as an aid to check consistency of terminology, records completeness,           duplicates checks, missing value checks, and the like. Named blocks           can also be used as references in calculations. All of these features           are real time savers, especially when datasets grow large and           consistency of treatment and terminology is important</li>
<li> <span style="font-style: italic;">Multiple sheets and consolidated           access</span> &#8211; <span style="font-weight: bold;">commON</span> modules can be specified on a single worksheet or multiple worksheets           and saved as individual CSV files; because of its size and relative           complexity, the <span style="font-style: italic;">Sweet Tools</span> dataset is maintained on multiple sheets. Multi-worksheet           environments help keep related data and notes consolidated and more           easily managed on local hard drives</li>
<li> <span style="font-style: italic;">Completeness and counts</span> - the spreadsheet <span style="font-style: italic;">counta</span> function is useful to sum counts           for cell entries by both column and row, a useful aid to indicate if           an attribute or type value is missing or if a record is           incomplete.  Of course, similar helps and uses can be found for           many of the hundreds of embedded functions within a spreadsheet</li>
<li> <span style="font-style: italic;">Controlled vocabularies and data           entry validation</span> &#8211; quality datasets often hinge on consistency           and uniform values and terminology; the data validation utilities           within spreadsheets can be applied to Boolean, ranges and mins and           maxes, and to controlled vocabulary lists. Here is an example for           <span style="font-style: italic;">Sweet Tools</span>, enforcing           proper tool category assignments from a 50-item pick list:</li>
</ul>
<div style="margin: 10px;"><img class="center_ok" style="border: 0px solid; width: 609px; height: 373px;" title="Controlled Vocabularies and Data Entry Validation" src="http://openstructs.org/sites/openstructs.org/files/images/swt_validation.png" alt="Controlled Vocabularies and Data Entry Validation" width="609" height="373" /></div>
<ul>
<li> <span style="font-style: italic;">Specialized functions and           macros</span> &#8211; <span>all</span> functionality of           spreadsheets may be employed in the development of <span style="font-weight: bold;">commON</span> datasets. Then, once employed,           only the values embedded within the sheets are then exported as CSV.</li>
</ul>
<h3>Staging <span style="font-style: italic;">Sweet Tools</span> for commON</h3>
<p>The next major section of the <a href="http://openstructs.org/iron/common-swt-annex">case study</a> deals         with the minor conventions that must be followed in order to stage         spreadsheets for <span style="font-weight: bold;">commON</span>. Not         much of the specific <span style="font-weight: bold;">commON</span> vocabulary or notation is discussed below; for details, see <a href="#commON7">[7]</a>.</p>
<p>Because you can create multiple worksheets within a spreadsheet, it is         not necessary to modifiy existing worksheets or tabs. Rather, if you         are reluctant or can not change existing information, merely create         parallel duplicate sheets of the source information. These duplicate         sheets have as their sole purpose export to <span style="font-weight: bold;">commON</span> CSV. You can maintain your         spreadsheet as is while staging for <span style="font-weight: bold;">commON</span>.</p>
<p>To do so, use the simple <span style="font-style: italic;">=</span> formula to create cross-references between the existing source         spreadsheet tab and the target <span style="font-weight: bold;">commON</span> CSV export tab. (You can also do         this for complete, highlighted blocks from source to target sheet.)         Then, by adding the few minor conventions of <span style="font-weight: bold;">commON</span>, you have now created a staged         export tab without modifying your source information in the slightest.</p>
<p>In standard form and for Excel and Open Office, single quotes, double         quotes and commas when entered into a spreadsheet cell are         automatically &#8216;<a href="http://en.wikipedia.org/wiki/Escape_character">escaped</a>&#8216; when         issued as CSV. <span style="font-weight: bold;">commON</span> allows         you to specify your own delimiter for lists (the standard is the pipe         &#8216;|&#8217; character) and what the parser recognizes as the &#8216;escape&#8217; character         (&#8217;\&#8217; is the standard). However, you probably should not change for most         conditions.</p>
<p>The standard <span style="font-weight: bold;">commON</span> parsers and         converters are UTF-8 compatible. If your source content has unusual         encodings, try to target UTF-8 as your canonical spreadsheet output.</p>
<p>In the <a href="http://openstructs.org/iron/iron-specification"><span style="font-weight: bold;">irON</span> specification</a> there are a         small number of defined modules or processing sections. In <span style="font-weight: bold;">commON</span>, these         modules are denoted by the double-ampersand character sequence         (&#8217;<span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;</span>&#8216;),         and apply to lists of instance records (<span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;recordList</span>),         dataset specifications and associated metadata describing the dataset         (<span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;dataset</span>),         and mappings of attributes and types to existing schema (<span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;linkage</span>).         Similarly, attributes and types are denoted by a single ampersand         prefix (<span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;attributeName</span>).</p>
<p>In <span style="font-weight: bold;">commON</span>, any or all of the         modules can occur within a single CSV file or in multiple files. In any         case, the start of one of these processing modules is signaled by the         module keyword and <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;keyword</span> convention.</p>
<h4>The RecordList Module</h4>
<p>The first spreadsheet figure above shows a <span style="font-style: italic;">Sweet Tools</span> example for the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;recordList</span> module. The module begins with that keyword, indicating one of more         instance records will follow. Note that the first line after the         <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;recordList</span> keyword is devoted to the listing of attributes and types for the         instance records (designated by the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;attributeName</span> convention in the columns for the first row after the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;recordList</span> keyword is encountered).</p>
<p>The <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;recordList</span> format can also include the <span style="font-style: italic;">stacked</span> style (see similar Dataset example         below) in addition to the single <span style="font-style: italic;">row</span> style shown above.</p>
<p>At any rate, once a worksheet is ready with its instance records         following the straightforward <span style="font-weight: bold;">irON</span> and <span style="font-weight: bold;">commON</span> conventions, it can then be saved as         a CSV file and appropriately named. Here is an example of what this         &#8220;vanilla&#8221; CSV file now looks like when shown again in a spreadsheet:</p>
<div style="margin: 10px; text-align: center;"><a href="http://openstructs.org/sites/openstructs.org/files/images/swt_csv_spreadsheet_view.png"> <img class="center_ok" style="border: 0px solid; width: 740px; height: 342px;" title="Click to expand" src="http://openstructs.org/sites/openstructs.org/files/images/swt_csv_spreadsheet_view.png" alt="Spreadsheet View of the CSV File" width="1271" height="587" /></a><span><br />
</span> <span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>Alternatively, you could open this same file in a text editor. Here is         how this exact same instance record view looks in an editor:</p>
<div style="margin: 10px; text-align: center;"><a href="http://openstructs.org/sites/openstructs.org/files/images/swt_csv_editor_view.png"> <img class="center_ok" style="border: 0px solid; width: 740px; height: 389px;" title="Click to expand" src="http://openstructs.org/sites/openstructs.org/files/images/swt_csv_editor_view.png" alt="Editor View of the CSV Record File" width="1251" height="657" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>Note that the CSV format separates each column by the comma separator,         with escapes shown for the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;description</span> attribute when it includes a comma-separated clause. Without word wrap,         each record in this format occupies a single row (though, again, for         the <span style="font-style: italic;">stacked</span> style, multiple         entries are allowed on individual rows so long as a new instance record         <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;id</span> is not encountered in the first column).</p>
<h4>The Dataset Module</h4>
<p>The <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;dataset</span> module defines the dataset parameters and provides very flexible         metadata attributes to describe the dataset <a href="#commON8">[8]</a>. Note the dataset         specification is exactly equivalent in form to the instance record         (<span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;recordList</span>)         format, and also allows the single <span style="font-style: italic;">row</span> or <span style="font-style: italic;">stacked</span> styles (see these <a href="http://openstructs.org/iron/iron-specification#mozTocId223991">instance         record examples</a>), with this one being the <span style="font-style: italic;">stacked</span> style:</p>
<div style="margin: 10px; text-align: center;"><a href="http://openstructs.org/sites/openstructs.org/files/images/swt_dataset.png"> <img class="center_ok" style="border: 0px solid; width: 740px; height: 105px;" title="Click to expand" src="http://openstructs.org/sites/openstructs.org/files/images/swt_dataset.png" alt="The Dataset Module" width="1579" height="223" /></a><br />
<span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<h4>The Linkage Module</h4>
<p>The <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;&amp;linkage</span> module is used to map the structure of the instance records to some         structural schema, which can also include external ontologies. The         module has a simple, but specific structure.</p>
<p>Either attributes (presented as the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;attributeList</span>)         or types (presented as the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;typeList</span>)         are listed sequentially by row until the listing is exhausted <a href="#commON8">[8]</a>. By         convention, the second column in the listing is the targeted         <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;mapTo</span> value. Absent a prior <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;prefixList</span> value, the <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">&amp;mapTo</span> value needs to be a full URL to the corresponding attribute or type in         some external schema:</p>
<div style="margin: 10px;"><img class="center_ok" style="border: 0px solid; width: 537px; height: 595px;" title="The Linkage Module" src="http://openstructs.org/sites/openstructs.org/files/images/swt_linkage.png" alt="The Linkage Module" width="537" height="595" /></div>
<p>Notice in the case of <span style="font-style: italic;">Sweet         Tools</span> that most values are from the actual COSMO mini-ontology         underlying the listing. These need to be listed as well, since absent         the specifications in <span style="font-weight: bold;">commON</span> the system has NO knowledge of linkages and mappings.</p>
<h4>The Schema (structure) Module</h4>
<p>In its current state of development, <span style="font-weight: bold;">commON</span> does not support a spreadsheet-based         means for specifying the schema structure (lightweight ontology)         governing the datasets <a href="#commON2">[2]</a>. Another <span style="font-weight: bold;">irON</span> serialization, <span style="font-weight: bold;">irJSON</span>, does. Either via this <span style="font-weight: bold;">irJSON</span> specification or via an offline         ontology, a link reference is presently used by <span style="font-weight: bold;">commON</span> (and, therefore, <span style="font-style: italic;">Sweet Tools</span> for this case study) to         establish the governing structure of the input instance record         datasets.</p>
<p>A spreadsheet-based schema structure for <span style="font-weight: bold;">commON</span> has been designed and tested in         prototype form. <span style="font-weight: bold;">commON</span> should         be enhanced with this capability in the near future <a href="#commON8">[8]</a>.</p>
<h4>Saving and Importing</h4>
<p>If the modules are spread across more than one worksheet, then each         worksheet must be saved as its own CSV file. In the case of         <span style="font-style: italic;">Sweet Tools</span>, as exhibited by         its reference current spreadsheet, <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">sweet_tools_20091110.xls</span>,         three individual CSV files get saved. These files can be named whatever         you would like. However, it is essential that the names be remembered         for later referencing.</p>
<p>My own naming convention is to use a format of <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">appname_date_modulename.csv</span> because it sorts well in a file manager accommodating multiple versions         (dates) and keeps related files clustered. The <span style="font-style: italic;">appname</span> in the case of <span style="font-style: italic;">Sweet Tools</span> is generally <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">swt</span>.         The <span style="font-style: italic;">modulename</span> is generally         the <span style="font-style: italic;">dataset</span>, <span style="font-style: italic;">records</span>, or <span style="font-style: italic;">linkage</span> convention. I tend to use the         <span style="font-style: italic;">date</span> specification in the         YYYYMMDD format. Thus, in the case of the <span style="font-style: italic;">records</span> listings for <span style="font-style: italic;">Sweet Tools</span>, its filename could be         something like:  <span style="font-family: Courier New,Courier,monospace; font-weight: bold;">swt_20091110_records.csv</span>.</p>
<p>Once saved, these files are now ready to be imported into a         <span style="font-weight: bold;">structWSF</span> <a href="#commON9">[9]</a> instance, which         is where the CSV parsing and conversion to interoperable RDF occurs<a href="#commON8"> [8]</a>. In this case study, we used the Drupal-based <span style="font-weight: bold;">conStruct SCS</span> system <a href="#commON10">[10]</a>. <span style="font-weight: bold;">conStruct</span> exposes the <span style="font-weight: bold;">structWSF</span> Web services via a user interface         and a user permission and access system. The actual case study write-up         offers more details about the import process.</p>
<h3>Using the Dataset</h3>
<p>We are now ready to interact with the <span style="font-style: italic;">Sweet Tools</span> structured dataset using         <span style="font-weight: bold;">conStruct</span> (assuming you have a         Drupal installation with the <span style="font-weight: bold;">conStruct</span> modules) <a href="#commON10">[10]</a>.</p>
<h4>Introduction to the App</h4>
<p>The screen capture below shows a couple of aspects of the system:</p>
<ul>
<li>First, the left hand panel (according to how this specific Drupal         install was themed) shows the various tools available to <span style="font-weight: bold;">conStruct</span>.  These include (with links         to their documentation) <a href="http://constructscs.com/documentation/instructions/search">Search</a>,         <a href="http://constructscs.com/documentation/instructions/browse">Browse</a>,         <a href="http://constructscs.com/documentation/instructions/view-record">View         Record</a>, <a href="http://constructscs.com/documentation/instructions/import">Import</a>,         <a href="http://constructscs.com/documentation/instructions/export">Export</a>,         <a href="http://constructscs.com/documentation/instructions/datasets"> Datasets</a>, <a href="http://constructscs.com/documentation/instructions/create-record">Create           Record</a>, <a href="http://constructscs.com/documentation/instructions/update-record">Update           Record</a>, <a href="http://constructscs.com/documentation/instructions/delete-record">Delete           Record</a> and <a href="http://constructscs.com/documentation/instructions/settings">Settings</a><a href="#commON11"> [11]</a>;</li>
<li>The Browse tree in the main part of the screen shows the full         mini-ontology that classifies <span style="font-style: italic;">Sweet         Tools</span>. Via simple inferencing, clicking on any parent link         displays all children projects for that category as well <span style="font-style: italic;">(click to expand)</span>:</li>
</ul>
<div style="margin: 10px; text-align: center;"><a href="http://openstructs.org/sites/openstructs.org/files/images/swt_drupal_browse.png"> <img class="center_ok" style="border: 0px solid; width: 740px; height: 1907px;" title="Click to expand" src="http://openstructs.org/sites/openstructs.org/files/images/swt_drupal_browse.png" alt="conStruct (Drupal) Browse Screen for Sweet Tools" width="1176" height="3031" /></a><span style="font-style: italic; font-size: 90%;">(click to         expand)</span></div>
<p>One of the absolutely cool things about this framework is that all         tools, inferencing, user interfaces and data structure are a direct         result of the ontology(ies) underlying the system (plus the         <span style="font-weight: bold;">irON</span> instance ontology, as         well). This means that switching datasets or adding datasets causes the         entire system structure to now reflect those changes — without         lifting a finger!!</p>
<h4>Some Sample Uses</h4>
<p>Here are a few sample things you can do with these generic tools driven         by the <em>Sweet Tools</em> dataset:</p>
<ul>
<li> <a href="http://constructscs.com/conStruct/browse/">Browsing the           ontology tree</a> (then, Browse by Kind)</li>
<li>Viewing an <a href="http://constructscs.com/conStruct/view/?uri=http%3A%2F%2Fpurl.org%2Fontology%2Fswt%2Firon&amp;dataset=http%3A%2F%2Fconstructscs.com%2Fwsf%2Fdatasets%2F181%2F"> instance record</a></li>
<li>Viewing a <a href="http://constructscs.com/conStruct/ontology/view/?uri=http%3A%2F%2Fpurl.org%2Fontology%2Fcosmo%23KRBrowser"> Class Type Report</a></li>
<li>Viewing an <a href="http://constructscs.com/conStruct/ontology/view/?uri=http%3A%2F%2Fpurl.org%2Fontology%2Firon%23description"> Attribute Report</a></li>
<li> <a href="http://constructscs.com/conStruct/search/?filter_types_3=http%3A%2F%2Fpurl.org%2Fontology%2Fcosmo%23KRBrowser&amp;filter_attributes_4=http%3A%2F%2Fpurl.org%2Fontology%2Fcosmo%23status&amp;query=new&amp;filter=on"> Searching by facet</a> (check the tabs)</li>
<li>Doing a <a href="http://constructscs.com/conStruct/search/">multi-value filtering</a> (make selections from the various tabs),</li>
<li> <a href="http://constructscs.com/conStruct/export/">Exporting           stuff</a> in a variety of formats.</li>
</ul>
<p>Note, if you access this <span style="font-weight: bold;">conStruct</span> instance you will do so as a         <span style="font-style: italic;">demo</span> user. Unfortunately, as such, you may not be able to see all of the write and update tools, which in this case are reserved for curators or admins. Recall that <span style="font-weight: bold;">structWSF</span> has a comprehensive <a href="../497/structwsf-a-framework-for-collaboration-networks/"> user access and permissions layer</a>.</p>
<h4>Exporting in Alternative Formats</h4>
<p>Of course, one of the real advantages of the <span style="font-weight: bold;">irON</span> and <span style="font-weight: bold;">structWSF</span> designs is to enable different         formats to be interchanged and to interoperate. Upon submission, the         <span style="font-weight: bold;">commON</span> format and its datasets         can then be exported in these alternate formats and serializations <a href="#commON8">[8]</a>:</p>
<ul>
<li>commON</li>
<li>irJSON</li>
<li>irXML</li>
<li>N-Triples/CSV</li>
<li>N-Triples/TSV</li>
<li>RDF+N3</li>
<li>RDF+XML</li>
</ul>
<p>As should be obvious, one of the real benefits of the <span style="font-weight: bold;">irON</span> notation &#8212; in addition to easy         dataset authoring &#8212; is the ability to more-or-less treat RDF, CSV, XML         and JSON as interoperable data formats.</p>
<h3>The Formal Case Study</h3>
<p>The formal <span style="font-style: italic;">Sweet Tools</span> case       study based on <span style="font-weight: bold;">commON</span>, with       sample download files and PDF, is available from <a style="font-style: italic;" href="http://openstructs.org/iron/common-swt-annex">Annex: A commON Case Study       using Sweet Tools, Supplementary Documentation</a> <a href="#commON3">[3]</a>.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON1" name="commON1"></a> [1] In 2003, <a href="http://www.microsoft.com/presspass/press/2003/oct03/10-13vstoofficelaunchpr.mspx"> Microsoft estimated</a> its worldwide users of the Excel spreadsheet,         which then had about a 90% market share globally, at 400 million.         Others at that time estimated unauthorized use to perhaps double that         amount. There has been significant growth since then, and online         spreadsheets such as Google Docs and Zoho have also grown wildly. This         surely puts spreadsheet users globally into the 1 billion range.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON2" name="commON2"></a> [2] See Frédérick Giasson and         Michael Bergman, eds., <span style="font-style: italic;">Instance         Record and Object Notation (irON) Specification, Specification         Document</span>, version 0.82, 20 October 2009.  See <a href="http://openstructs.org/iron/iron-specification">http://openstructs.org/iron/iron-specification</a>.         Also see the <a href="http://openstructs.org/iron"><span style="font-weight: bold;">irON</span> Web site</a>, Google <a href="http://groups.google.com/group/iron-notation">discussion group</a>,         and <a href="http://code.google.com/p/iron-notation/">code distribution         site</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON3" name="commON3"></a> [3] Michael Bergman, 2009.         <span style="font-style: italic;">Annex: A commON Case Study using         Sweet Tools, Supplementary Documentation</span>, prepared by Structured         Dynamics LLC, November 10, 2009. See <a href="http://openstructs.org/iron/common-swt-annex">http://openstructs.org/iron/common-swt-annex</a>.         It may also be downloaded in PDF <a href="http://openstructs.org/sites/openstructs.org/files/downloads/common-case-study.pdf"> <img style="border: 0px solid; width: 13px; height: 16px;" src="http://openstructs.org/sites/openstructs.org/files/icons/pdfdoc.gif" alt="" /></a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON4" name="commON4"></a> [4] See Michael K. Bergman&#8217;s         <a href="http://mkbergman.com/">AI3:::Adaptive Information</a> blog,         <a href="../new-version-sweet-tools-sem-web/"><span style="font-style: italic;"> Sweet Tools (Sem Web)</span></a>. In addition, the <span style="font-weight: bold;">commON</span> version of <span style="font-style: italic;">Sweet Tools</span> is available at the <a href="http://constructscs.com/conStruct/browse/?browse=true&amp;attribute=all&amp;type=all&amp;dataset=http%3A%2F%2Fconstructscs.com%2Fwsf%2Fdatasets%2F122%2F&amp;page=0"> <span style="font-weight: bold;">conStruct</span> site</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON5" name="commON5"></a> [5] The CSV mime type is defined in         <span style="font-style: italic;">Common Format and MIME Type for         Comma-Separated Values (CSV) Files</span> [<a href="http://www.rfc-editor.org/rfc/rfc4180.txt">RFC 4180</a>]. A useful         overview of the CSV format is provided by <a title="http://www.creativyst.com/Doc/Articles/CSV/CSV01.htm" rel="nofollow" href="http://www.creativyst.com/Doc/Articles/CSV/CSV01.htm">The Comma Separated Value (CSV) File Format</a>. Also, see         that author&#8217;s related CTX reference for a discussion of how schema and         structure can be added to the basic CSV framework; see <a href="http://www.creativyst.com/Doc/Std/ctx/ctx.htm">http://www.creativyst.com/Doc/Std/ctx/ctx.htm</a>,         especially the section on the comma-delimited version (<a href="http://www.creativyst.com/Doc/Std/ctx/ctx.htm#CTC">http://www.creativyst.com/Doc/Std/ctx/ctx.htm#CTC</a>).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON6" name="commON6"></a> [6] An <a href="http://en.wikipedia.org/wiki/Attribute-value_system">attribute-value         system</a> is a basic knowledge representation framework comprising a         table with columns designating &#8220;attributes&#8221; (also known as <span style="font-style: italic;">properties</span>, <span style="font-style: italic;">predicates</span>, <span style="font-style: italic;">features</span>, <span style="font-style: italic;">parameters</span>, <span style="font-style: italic;">dimensions</span>, <span style="font-style: italic;">characteristics</span> or <span style="font-style: italic;">independent variables</span>) and rows         designating &#8220;objects&#8221; (also known as <span style="font-style: italic;">entities</span>, <span style="font-style: italic;">instances</span>, <span style="font-style: italic;">exemplars</span>, <span style="font-style: italic;">elements</span> or <span style="font-style: italic;">dependent variables</span>). Each table cell         therefore designates the value (also known as <span style="font-style: italic;">state</span>) of a particular attribute of a         particular object. This is the basic table presentation of a         spreadsheet or relational data table.</p>
<p>Attribute-values can also be presented as pairs in a form of an         <a href="http://en.wikipedia.org/wiki/Associative_array">associative         array</a>, where the first item listed is the attribute, often followed         by a separator such as the colon, and then the value. JSON and many         simple data struct notations follow this format. This format may also         be called <span style="font-style: italic;">attribute-value         pairs</span>, <span style="font-style: italic;">key-value pairs</span>,         <span style="font-style: italic;">name-value pairs</span>, <span style="font-style: italic;">alists</span> or others. In these cases the         &#8220;object&#8221; is implied, or is introduced as the name of the array..</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON7" name="commON7"></a> [7] See especially <a style="font-style: italic;" href="http://openstructs.org/iron/iron-specification#mozTocId603499">SUB-PART         3: commON PROFILE</a> in, Frédérick Giasson and Michael Bergman, eds.,         <span style="font-style: italic;">Instance Record and Object Notation         (irON) Specification, Specification Document</span>, version 0.82, 20         October 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON8" name="commON8"></a> [8] As of the date of this case         study, some of the processing steps in the <span style="font-weight: bold;">commON</span> pipeline are manual. For example,         the parser creates an intermediate N3 file that is actually submitted         to the <span style="font-weight: bold;">structWSF</span>. Within a week         or two of publication, these capabilities should be available as a         direct import to a <span style="font-weight: bold;">structWSF</span> instance. However, there is one exception to this:  the         specification for the schema structure. That module has been         prototyped, but will not be released with the first <span style="font-weight: bold;">commON</span> upgrade. That enhancement is likely         a few weeks off from the date of this posting. Please check the         <a href="http://groups.google.com/group/iron-notation"><span style="font-weight: bold;">irON</span></a> or <a style="font-weight: bold;" href="http://groups.google.com/group/structwsf">structWSF</a> discussion groups for announcements.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="commON9" name="commON9"></a> [9] <a style="font-weight: bold;" href="http://openstructs.org/">structWSF</a> is a platform-independent         Web services framework for accessing and exposing structured RDF data,         with generic tools driven by underlying data structures. Its central         perspective is that of the dataset. Access and user rights are granted         around these datasets, making the framework enterprise-ready and         designed for collaboration. Since a <span style="font-weight: bold;">structWSF</span> layer may be placed over         virtually any existing datastore with Web access &#8212; including large         instance record stores in existing relational databases &#8212; it is also a         framework for Web-wide deployments and interoperability.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="commON10"></a>[10] <a style="font-weight: bold;" href="http://constructscs.com/">conStruct SCS</a> is a structured content         system built on the Drupal content management framework. <span style="font-weight: bold;">conStruct</span> enables structured data and its         controlling vocabularies (ontologies) to drive applications and user         interfaces. It is based on RDF and SD&#8217;s <span style="font-weight: bold;">structWSF</span> platform-independent Web services         framework [6]. In addition to user access control and management and a         general user interface, <span style="font-weight: bold;">conStruct</span> provides Drupal-level CRUD, data         display templating, faceted browsing, full-text search, and import and         export over structured data stores based on RDF.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="commON11"></a> [11] More Web services are being         added to <span style="font-weight: bold;">structWSF</span> on a fairly         constant basis, and the existng ones have been through a number of         upgrades.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/845/a-most-un-common-way-to-author-datasets/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Structured Dynamics&#8217; Product Stack</title>
		<link>http://www.mkbergman.com/842/structured-dynamics-product-stack/</link>
		<comments>http://www.mkbergman.com/842/structured-dynamics-product-stack/#comments</comments>
		<pubDate>Mon, 02 Nov 2009 22:54:24 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Information Automation]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Semantic Web Tools]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[UMBEL]]></category>
		<category><![CDATA[Web-oriented Architecture]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[conStruct]]></category>
		<category><![CDATA[scones]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>
		<category><![CDATA[slideshow]]></category>
		<category><![CDATA[structWSF]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=842</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Structured Dynamics&#8217; Product Stack&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Information Automation&amp;rft.subject=Linked Data&amp;rft.subject=Ontologies&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=UMBEL&amp;rft.subject=Web-oriented Architecture&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-02&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/842/structured-dynamics-product-stack/&amp;rft.language=English"></span>

A New Slide Show Consolidates, Explains Recent Developments
Much has been happening on the Structured Dynamics front of late. Besides welcoming Steve Ardire as a senior advisor to the company, we also have been issuing a steady stream of new products from our semantic Web pipeline.
This new slide show attempts to capture these products and relate [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Structured Dynamics&#8217; Product Stack&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Information Automation&amp;rft.subject=Linked Data&amp;rft.subject=Ontologies&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=UMBEL&amp;rft.subject=Web-oriented Architecture&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-11-02&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/842/structured-dynamics-product-stack/&amp;rft.language=English"></span>
<p><a href="http://structureddynamics.com/"><img style="border: 0px solid; width: 260px; height: 60px; float: left; margin-right: 10px;" title="Structured Dynamics LLC" src="../wp-content/themes/ai3/images/sd_logo_260.png" alt="Structured Dynamics LLC" hspace="5" vspace="5" align="left" /></a></p>
<h2>A New Slide Show Consolidates, Explains Recent Developments</h2>
<p>Much has been happening on the <a href="http://structureddynamics.com">Structured Dynamics</a> front of late. Besides welcoming <a href="http://www.linkedin.com/in/sardire">Steve Ardire</a> as a senior advisor to the company, we also have been issuing a steady stream of new <a href="http://structureddynamics.com/products.html">products</a> from our semantic Web pipeline.</p>
<p>This new slide show attempts to capture these products and relate them to the various layers in Structured Dynamics&#8217; enterprise product stack:</p>
<div class="center_ok center">
<div id="__ss_2406783" class="center_ok" style="width: 425px; text-align: left;"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" title="Structured Dynamics's Semantic Technologies Product Stack" href="http://www.slideshare.net/mkbergman/structured-dynamicss-semantic-technologies-product-stack">Structured Dynamics&#8217;s Semantic Technologies Product Stack</a><object style="margin:0px" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=sdproductstack20091102-091102163620-phpapp01&amp;stripped_title=structured-dynamicss-semantic-technologies-product-stack" /><param name="allowfullscreen" value="true" /><embed style="margin:0px" type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=sdproductstack20091102-091102163620-phpapp01&amp;stripped_title=structured-dynamicss-semantic-technologies-product-stack" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<div style="font-size: 11px; font-family: tahoma,arial; height: 26px; padding-top: 2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/mkbergman">mkbergman</a>.</div>
</div>
</div>
<p>The show indicates the role of <a href="http://structureddynamics.com/scones.html">scones</a>, <a href="http://openstructs.org/iron">irON</a>, <a href="http://openstructs.org/structwsf">structWSF</a>, <a href="http://umbel.org/">UMBEL</a>, <a href="http://constructscs.com/">conStruct</a> and others and how they leverage existing information assets to enable the semantic enterprise. And, oh, by the way, all of this is done via Web-accessible <a href="http://structureddynamics.com/linked_data.html">linked data</a> and our practical <a href="http://structureddynamics.com/technology.html">technologies</a>.</p>
<p>Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/842/structured-dynamics-product-stack/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>irON: Semantic Web for Mere Mortals</title>
		<link>http://www.mkbergman.com/838/iron-semantic-web-for-mere-mortals/</link>
		<comments>http://www.mkbergman.com/838/iron-semantic-web-for-mere-mortals/#comments</comments>
		<pubDate>Mon, 19 Oct 2009 00:12:26 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Bibliographic Knowledge Network]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Semantic Web Tools]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[irON]]></category>
		<category><![CDATA[CSV]]></category>
		<category><![CDATA[JSON]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[spreadsheets]]></category>
		<category><![CDATA[XML]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=838</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=irON: Semantic Web for Mere Mortals&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Bibliographic Knowledge Network&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-10-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/838/iron-semantic-web-for-mere-mortals/&amp;rft.language=English"></span>

New Cross-Scripting Frameworks for XML, JSON and Spreadsheets
On behalf of Structured         Dynamics, I am pleased to announce our release into the open source         community of irON — the         instance record and [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=irON: Semantic Web for Mere Mortals&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Bibliographic Knowledge Network&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Dynamics&amp;rft.subject=irON&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2009-10-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/838/iron-semantic-web-for-mere-mortals/&amp;rft.language=English"></span>
<p><a href="http://openstructs.org/iron/iron-specification"><img style="border: 0px solid; width: 235px; height: 125px; float: left; margin-right: 10px;" title="instance record and Object Notation" src="../wp-content/themes/ai3/images/iron_logo_235.png" alt="instance record and Object Notation" hspace="5" vspace="5" align="left" /></a></p>
<h2>New Cross-Scripting Frameworks for XML, JSON and Spreadsheets</h2>
<p>On behalf of <a href="http://structureddynamics.com/">Structured         Dynamics</a>, I am pleased to announce our release into the open source         community of <span style="font-style: italic; font-weight: bold;">irON</span> — the         <span style="font-style: italic;">instance record</span> and         <span style="font-style: italic;">Object Notation</span> — and         its family of frameworks and tools <a href="#ia1">[1]</a>. With <span style="font-weight: bold; font-style: italic;">irON</span>, you can now         author and conduct business solely in the formats and tools most         familiar and comfortable to you, all the while enabling your data to         interact with the semantic Web.</p>
<p><span style="font-weight: bold; font-style: italic;">irON</span> is an         abstract notation and associated vocabulary for specifying RDF triples         and schema in non-RDF forms. Its purpose is to allow users and tools in         non-RDF formats to stage interoperable datasets using RDF. The notation         supports writing RDF and schema in <a href="http://en.wikipedia.org/wiki/JSON">JSON</a> (<span style="font-weight: bold; font-style: italic;">irJSON</span>), <a href="http://en.wikipedia.org/wiki/Xml">XML</a> (<span style="font-weight: bold; font-style: italic;">irXML</span>) and         comma-delimited (<a href="http://en.wikipedia.org/wiki/Comma-separated_values">CSV</a>) formats         (<span style="font-weight: bold; font-style: italic;">commON</span>).</p>
<p>The surprising thing about <span style="font-weight: bold; font-style: italic;">irON</span> is that — by         following its simple conventions and vocabulary — you will be         authoring and creating interoperable RDF datasets without doing much         different than your normal practice.</p>
<p>This first specification for the <a href="http://openstructs.org/iron/iron-specification"><span style="font-weight: bold; font-style: italic;">irON</span> notation</a> includes guidance for creating instance records         (including in bulk), linkages to existing ontologies and schema, and         schema definitions. In this newly published <a href="http://openstructs.org/iron/iron-specification"><span style="font-weight: bold; font-style: italic;">irON</span> specificatiion</a>, profiles and examples are also provided for each of         the <span style="font-weight: bold; font-style: italic;">irXML</span>,         <span style="font-weight: bold; font-style: italic;">irJSON</span> and         <span style="font-weight: bold; font-style: italic;">commON</span> serializations. The <span style="font-weight: bold; font-style: italic;">irON</span> release also         includes a number of parsers and converters of the specification into         RDF <a href="#ia2">[2]</a>. Data ingested in the <span style="font-weight: bold; font-style: italic;">irON</span> frameworks can         also be exported as RDF and staged as <a href="http://en.wikipedia.org/wiki/Linked_data">linked data</a>.</p>
<div class="boxRedDotted"><strong>UPDATE</strong>: Fred Giasson <a href="http://fgiasson.com/blog/index.php/2009/10/20/common-and-irjson-php-parsers-released/">announced</a> on his blog today (10/20) the release of the <em><strong>irJSON </strong></em>and <em><strong>commON</strong></em> parsers.</div>
<h3>Background and Rationale</h3>
<p>The objective of <span style="font-weight: bold; font-style: italic;">irON</span> is to make it easy         for data owners to author, read and publish data. This means the         starting format should be a human readable, easily writable means for         authoring and conveying instance records (that is, instances and their         attributes and assigned values) and the datasets that contain them.         Among other things, this means that <span style="font-style: italic; font-weight: bold;">irON</span>&#8217;s notation does         not use RDF &#8220;<a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">triples</a>&#8220;,         but rather the native notations of the host serializations.</p>
<p><span style="font-weight: bold; font-style: italic;">irON</span> is         premised on these considerations and observations:</p>
<ul>
<li> <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> (Resource Description Framework) is a powerful canonical data model           for data interoperability <a href="#ia3">[3]</a></li>
<li>However, most existing data is not written in RDF and many authors         and publishers prefer other formats for various reasons</li>
<li>Many formats that are easier to author and read than RDF are         variants of the attribute-value pair construct <a href="#ia4">[4]</a>, which can readily         be expressed as RDF, and</li>
<li>A common abstract notation for converting to RDF would also enable         non-RDF formats to become somewhat interchangeable, thus allowing the         strengths of each to be combined.</li>
</ul>
<p>The <span style="font-weight: bold; font-style: italic;">irON</span> notation and vocabulary is designed to allow the conceptual structure         (&#8221;schema&#8221;) of datasets to be described, to facilitate easy description         of the instance records that populate those datasets, and to link         different structures for different schema to one another. In these         manners, more-or-less complete RDF data structures and instances can be         described in alternate formats and be made interoperable. <span style="font-weight: bold; font-style: italic;">irON</span> provides a simple         and naïve information exchange notation expressive enough to describe         most any data entity.</p>
<p>The notation also provides a framework for extending existing schema.         This means that <span style="font-weight: bold; font-style: italic;">irON</span> and its three         serializations can represent many existing, common data formats and         standards, while also providing a vehicle for extending them. Another         intent of the specification is to be sparse in terms of requirements.         For instance, this reserved vocabulary is fairly minimal and optional         in most all cases. The <span style="font-weight: bold; font-style: italic;">irON</span> specification         supports skeletal submissions.</p>
<h3>irON Concepts and Vocabulary</h3>
<p>The aim of <span style="font-weight: bold; font-style: italic;">irON</span> is to describe         instance <span style="font-style: italic;">records</span>. An instance         record is simply a means to represent and convey the information         (”attributes”) describing a given instance. An instance is         the thing at hand, and need not represent an individual; it could, for         example, represent the entire holdings or collection of books in a         given library. Such instance records are also known as the         <em>ABox</em> <a href="#ia5">[5]</a>. The simple design of <span style="font-weight: bold; font-style: italic;">irON</span> is in keeping with         the limited roles and work associated with this <span style="font-style: italic;">ABox</span> role.</p>
<p><span style="font-style: italic;">Attributes</span> provide descriptive         characteristics for each instance. Every attribute is matched with a         value, which can range from descriptive text strings to lists or         numeric values. This design is in keeping with simple attribute-value         pairs where, in using the terminology of RDF triples, the         <em>subject</em> is the instance itself, the <em>predicate</em> is the         attribute, and the <em>object</em> is the value. <span style="font-weight: bold; font-style: italic;">irON</span> has a vocabulary         of about 40 reserved attribute terms, though only two are ever         required, with a few others strongly recommended for interoperability         and interface rendering purposes.</p>
<p>A <span style="font-style: italic;">dataset</span> is an aggregation of         instance records used to keep a reference between the instance records         and their source (provenance). It is also the container for         transmitting those records and providing any metadata descriptions         desired. A dataset can be split into multiple dataset slices. Each         slice is written to a file serialized in some way. Each slice of a         dataset shares the same <span style="font-family: Courier New,Courier,monospace;">&lt;id&gt;</span> of the         dataset.</p>
<p>Instances can also be assigned to <span style="font-style: italic;">types</span>, which provide the set or         classificatory structure for how to relate certain kinds of things         (instances) to other kinds of things. The organizational relationships         of these types and attributes is described in a <span style="font-style: italic;">schema</span>. <span style="font-weight: bold; font-style: italic;">irON</span> also has         conventions and notations for describing the <span style="font-style: italic;">linkage</span> of attributes and types in a given         dataset to existing schema. These linkages are often mapped to         established ontologies.</p>
<p>Each of these <span style="font-weight: bold; font-style: italic;">irON</span> concepts of         <span style="font-style: italic;">records</span>, <span style="font-style: italic;">attributes</span>, <span style="font-style: italic;">types</span>, <span style="font-style: italic;">datasets</span>, <span style="font-style: italic;">schema</span> and <span style="font-style: italic;">linkages</span> share similar notations with         keywords signaling to the <span style="font-weight: bold; font-style: italic;">irON</span> parsers and         converters how to interpret incoming files and data. There are also         provisions for metadata, name spaces, and local and global references.</p>
<p>In these manners, <span style="font-weight: bold; font-style: italic;">irON</span> and its three         serializations can capture virtually the entire scope and power of RDF         as a data model, but with simpler and familiar terminology and         constructs expected for each serialization.</p>
<h3>The Three Serializations</h3>
<p>For different reasons and for different audiences, the formats of XML,         JSON and CSV (spreadsheets) were chosen as the representative formats         across which to formulate the abstract <span style="font-weight: bold; font-style: italic;">irON</span> notation.</p>
<p><a href="http://en.wikipedia.org/wiki/Xml">XML</a>, or eXtensible         Markup Language, has become the leading data exchange format and syntax         for modern applications. It is frequently adopted by industry groups         for standards and standard exchange formats. There is a rich diversity         of tools that support the language, importantly including capable         parsers and query languages. There is also a serialization of RDF in         XML. As implemented in the <span style="font-weight: bold; font-style: italic;">irON</span> notation, we call         this serialization <span style="font-weight: bold; font-style: italic;">irXML</span>.</p>
<p><a href="http://en.wikipedia.org/wiki/JSON">JSON</a>, the JavaScript         Object Notation, has become very popular as a Web 2.0 data exchange         format and is often the format of choice to drive JavaScript         applications. There is a growing richness of tools that support JSON,         including support from leading Web and general scripting languages such         as JavaScript, Python, Perl, Ruby and PHP. JSON is relatively easy to         read, and is also now growing in popularity with lightweight databases,         such as CouchDB. As implemented in the <span style="font-weight: bold; font-style: italic;">irON</span> notation, we call         this serialization <span style="font-weight: bold; font-style: italic;">irJSON</span>.</p>
<p><a href="http://en.wikipedia.org/wiki/Comma-separated_values">CSV</a>,         or comma-separated values, is a format that has been in existence for         decades. It was made famous by Microsoft as a spreadsheet exchange         format, which makes CSV very useful since spreadsheets are the most         prevalent data authoring environment in existence. CSV is less         expressive and capable as a data format than the other <span style="font-weight: bold; font-style: italic;">irON</span> serializations,         yet still has a attribute-value pair orientation. And, via         spreadsheets, datasets can be easily authored and inspected, while also         providing a rich functional environment including sorting, formatting,         data validation, calculations, macros, etc. As implemented in the         <span style="font-weight: bold; font-style: italic;">irON</span> notation, we call this serialization <span style="font-weight: bold; font-style: italic;">commON</span>.</p>
<p>The following diagram shows how these three formats relate to         <span style="font-weight: bold; font-style: italic;">irON</span> and         then the canonical RDF target data model:</p>
<div><img class="center_ok" style="width: 547px; height: 619px;" title="Data transformations path" src="../wp-content/themes/ai3/images/2009Posts/data_transform_path.png" alt="Data transformations path" width="547" height="619" /></div>
<p>We have used the unique differences amongst XML, JSON and CSV to guide         the embracing abstract notations within <span style="font-weight: bold; font-style: italic;">irON</span>. Note the         round-tripping implications of the framework.</p>
<p>One exciting prospect for the design is how, merely by following the         simple conventions within <span style="font-weight: bold; font-style: italic;">irON</span>, each of these         three data formats — and RDF !! — can be used more-or-less         interchangeably, and can be used to extend existing schema within their         domains.</p>
<h3>Links, References and More</h3>
<p>This first release of <span style="font-weight: bold; font-style: italic;">irON</span> is in version 0.8.         Updates and revisions are likely with use. Here are some key links for         <span style="font-weight: bold; font-style: italic;">irON</span>:</p>
<ul>
<li>The <a href="http://openstructs.org/iron/iron-specification"><span style="font-weight: bold; font-style: italic;">irON</span> specification</a>, also available in <a href="http://openstructs.org/sites/openstructs.org/files/downloads/irON_specification_v8.pdf">download</a> as a PDF <a href="http://openstructs.org/sites/openstructs.org/files/downloads/irON_specification_v8.pdf"><img src="http://www.mkbergman.com/wp-content/themes/ai3/images/pdfdoc.gif" alt="" width="13" height="16" /></a></li>
<li>The <span style="font-weight: bold; font-style: italic;">irON</span> <a href="http://code.google.com/p/iron-notation/">code and vocabulary release         site</a>, and</li>
<li>The <a href="http://groups.google.com/group/iron-notation">Google         discussion group</a> for the <span style="font-weight: bold; font-style: italic;">irON</span> notation.</li>
</ul>
<p>Mid-week, the parsers and converters for <span style="font-weight: bold;">structWSF</span> <a href="#ia6">[6]</a> will be released and         announced on Fred Giasson&#8217;s <a href="http://fgiasson.com/blog">blog</a>.</p>
<p>In addition, within the next week we will be publishing a case study of         converting the <a style="color: #820000; font-weight: bold;" href="../new-version-sweet-tools-sem-web/">Sweet         Tools</a> semantic Web and -related tools dataset to <span style="font-weight: bold; font-style: italic;">commON</span>.</p>
<p><span>The <span style="font-weight: bold; font-style: italic;">irON</span> specification and         notation</span> by <a rel="cc:attributionURL" href="http://openstructs.org/iron/iron-specification">Structured Dynamics         LLC</a> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/3.0/us/">Creative Commons         Attribution-Share Alike 3.0</a>. <span style="font-weight: bold; font-style: italic;">irON</span>&#8217;s parsers or         converters are available under the <a href="http://www.apache.org/licenses/LICENSE-2.0.html">Apache License,         Version 2.0</a>.</p>
<h3>Editors&#8217; Notes</h3>
<p><span style="font-weight: bold; font-style: italic;">irON</span> is an         important piece in the semantic enterprise puzzle that we are building         at <a href="http://structureddynamics.com/">Structured Dynamics</a>. It         reflects our belief that knowledge workers should be able to author and         create interoperable datasets without having to learn the arcana of         RDF. At the same time we also believe that RDF is the appropriate data         model for interoperability. <span style="font-weight: bold; font-style: italic;">irOn</span> is an expression         of our belief that many data formats have appropriate places and uses;         there is no need to insist on a single format.</p>
<p>We would like to thank <a href="http://www.stat.berkeley.edu/%7Epitman/">Dr. Jim Pitman</a> for his         advocacy of the importance of human-readable and easily authored         datasets and formats. Via his leadership of the Bibliographic Knowledge         Network (BKN) project and our contractual relationship with it <a href="#ia7">[7]</a>, we         have learned much regarding the BKN&#8217;s own format, BibJSON. Experience         with this format has been a catalytic influence in our own work on         <span style="font-weight: bold; font-style: italic;">irON</span>.</p>
<p style="margin-left: 40px;">— <span style="font-style: italic;">Mike Bergman</span> and         <span style="font-style: italic;">Fred Giasson</span>, editors</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia1" name="ia1"></a> [1] Please <a href="http://structureddynamics.com/products.html">see here</a> for how         <span style="font-weight: bold; font-style: italic;">irON</span> fits         within Structured Dynamics&#8217; vision and family of products.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia2" name="ia2"></a> [2] Presently parsers and converters are         available for the <span style="font-weight: bold; font-style: italic;">irJSON</span> and <span style="font-weight: bold; font-style: italic;">commON</span> serializations,         and will be released this week. We have tentatively spec&#8217;ed the         <span style="font-weight: bold; font-style: italic;">irXML</span> converter, and would welcome working with another party to finalize a         converter. Absent an immediate contribution from a third party,         contractual work will likely result in our completing the <span style="font-weight: bold; font-style: italic;">irXML</span> converter within         the reasonable future.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia3" name="ia3"></a> [3] A pivotal premise of <span style="font-weight: bold; font-style: italic;">irON</span> is the         desirability of using the RDF data model as the canonical basis for         interoperable data. RDF provides a data model capable of representing         any extant data structure and any extant data format. This flexibility         makes RDF a perfect data model for federating across disparate data         sources. For a detailed discussion of RDF, see Michael K. Bergman,         2009. &#8220;Advantages and Myths of RDF,&#8221; in <span style="font-style: italic;">AI3 blog</span>, April 8, 2009. See <a href="../483/advantages-and-myths-of-rdf/">http://www.mkbergman.com/483/advantages-and-myths-of-rdf/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia4" name="ia4"></a> [4] An <a href="http://en.wikipedia.org/wiki/Attribute-value_system">attribute-value         system</a> is a basic knowledge representation framework comprising a         table with columns designating &#8220;attributes&#8221; (also known as <span style="font-style: italic;">properties</span>, <span style="font-style: italic;">predicates</span>, <span style="font-style: italic;">features</span>, <span style="font-style: italic;">parameters</span>, <span style="font-style: italic;">dimensions</span>, <span style="font-style: italic;">characteristics</span> or <span style="font-style: italic;">independent variables</span>) and rows         designating &#8220;objects&#8221; (also known as <span style="font-style: italic;">entities</span>, <span style="font-style: italic;">instances</span>, <span style="font-style: italic;">exemplars</span>, <span style="font-style: italic;">elements</span> or <span style="font-style: italic;">dependent variables</span>). Each table cell         therefore designates the value (also known as <span style="font-style: italic;">state</span>) of a particular attribute of a         particular object. This is the basic table presentation of a         spreadsheet or relational data table.</p>
<p>Attribute-values can also be presented as pairs in the form of an         <a href="http://en.wikipedia.org/wiki/Associative_array">associative         array</a>, where the first item listed is the attribute, often followed         by a separator such as the colon, and then the value. JSON and many         simple data struct notations follow this format. This format may also         be called <span style="font-style: italic;">attribute-value         pairs</span>, <span style="font-style: italic;">key-value pairs</span>,         <span style="font-style: italic;">name-value pairs</span>, <span style="font-style: italic;">alists</span> or others. In these cases the         &#8220;object&#8221; is implied, or is introduced as the name of the array.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia5" name="ia5"></a> [5]We use the reference to the &#8220;<a href="http://en.wikipedia.org/wiki/Abox">ABox</a>&#8221; and “<a href="http://en.wikipedia.org/wiki/Tbox">TBox</a>” in accordance with         this <a title="Permanent Link to Thinking ?Inside the Box? with Description Logics" href="../466/thinking-inside-the-box-with-description-logics/"> working definition</a> for <a href="http://en.wikipedia.org/wiki/Description_logics">description         logics</a>:</p>
<div class="boxGraySolid">&#8220;Description logics and their semantics traditionally split           <span style="font-style: italic;">concepts</span> and their           relationships from the different treatment of <span style="font-style: italic;">instances</span> and their attributes and           roles, expressed as fact assertions. The concept split is known as           the TBox (for <em>terminological</em> knowledge, the basis for           <span style="font-style: italic;">T</span> in <span style="font-style: italic;">TBox</span>) and represents the schema or           taxonomy of the domain at hand. The TBox is the structural and           intensional component of conceptual relationships. The second split           of instances is known as the ABox (for <span style="font-style: italic;">assertions</span>, the basis for <span style="font-style: italic;">A</span> in <span style="font-style: italic;">ABox</span>) and describes the attributes of           instances (and individuals), the roles between instances, and other           assertions about instances regarding their class membership with the           TBox concepts.&#8221;</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia6" name="ia6"></a> [6] <a href="http://openstructs.org/">structWSF</a> is a platform-independent Web         services framework for accessing and exposing structured RDF data, with         generic tools driven by underlying data structures. Its central         perspective is that of the dataset. Access and user rights are granted         around these datasets, making the framework enterprise-ready and         designed for collaboration. Since a structWSF layer may be placed over         virtually any existing datastore with Web access &#8212; including large         instance record stores in existing relational databases &#8212; it is also a         framework for Web-wide deployments and interoperability.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a id="ia7" name="ia7"></a> [7] BKN is a project to develop a suite of         tools and services to encourage formation of virtual organizations in         scientific communities of various types. BKN is a project started in         September 2008 with funding by the <a href="http://www.nsf.gov/crssprgm/cdi/">NSF Cyber-enabled Discovery and         Innovation (CDI) Program</a> (Award # 0835851). The major participating         organizations are the <a href="http://www.bibkn.org/drupal/conStruct/datasets/99/resource/bkncentral_AIM"> American Institute of Mathematics (AIM)</a>, <a href="http://www.bibkn.org/drupal/conStruct/datasets/99/resource/bkncentral_Harvard"> Harvard University</a>, <a href="http://www.bibkn.org/drupal/conStruct/datasets/99/resource/bkncentral_Stanford"> Stanford University</a> and the <a href="http://www.bibkn.org/drupal/conStruct/datasets/99/resource/bkncentral_Berkeley"> University of California, Berkeley</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/838/iron-semantic-web-for-mere-mortals/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
