<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AI3:::Adaptive Information &#187; Adaptive Innovation</title>
	<atom:link href="http://www.mkbergman.com/category/adaptive-innovation/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mkbergman.com</link>
	<description>Mike Bergman on the semantic Web and structured Web</description>
	<lastBuildDate>Tue, 24 Jan 2012 15:52:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>In the Midst of an Evolutionary Explosion</title>
		<link>http://www.mkbergman.com/965/in-the-midst-of-an-evolutionary-explosion/</link>
		<comments>http://www.mkbergman.com/965/in-the-midst-of-an-evolutionary-explosion/#comments</comments>
		<pubDate>Tue, 19 Jul 2011 04:00:07 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Innovation]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[Charles Sanders Peirce]]></category>
		<category><![CDATA[grand challenges]]></category>
		<category><![CDATA[information extraction]]></category>
		<category><![CDATA[language translation]]></category>
		<category><![CDATA[natural language understanding]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=965</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=In the Midst of an Evolutionary Explosion&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-07-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/965/in-the-midst-of-an-evolutionary-explosion/&amp;rft.language=English"></span>
A Decade of Remarkable Advances in Ten Grand IT Challenges I&#8217;ve been in the information theory and technology game for quite some time, but believe nothing has matched the pace of advances of the past ten years. As one example, it was a mere eight years ago that I was sitting in a room with [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=In the Midst of an Evolutionary Explosion&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Semantic Web&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-07-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/965/in-the-midst-of-an-evolutionary-explosion/&amp;rft.language=English"></span>
<h2><a><img style="float: left; margin-right: 10px;" title="Photo courtesy of levelofhealth.com" src="../wp-content/themes/ai3/images/2011Posts/110717_technological-advances.jpg" alt="Photo courtesy of levelofhealth.com" width="220" height="219" /></a>A Decade of Remarkable Advances in Ten Grand IT Challenges</h2>
<p>I&#8217;ve been in the information theory and technology game for quite some         time, but believe nothing has matched the pace of advances of the past         ten years. As one example, it was a mere eight years ago that I was         sitting in a room with language translation vendors contemplating         automated translation techniques for US intelligence agencies. The         prospects finally looked doable, but the success of large-scale         translation was not assured.</p>
<p>At about that same time, and the years until just recently, a whole slew         of <a href="http://en.wikipedia.org/wiki/Grand_Challenge">Grand         Challenges</a> <a href="#explode1">[1]</a> in computing hung out there: tantalizing yet not proven. These areas ranged from information extraction and <a href="http://en.wikipedia.org/wiki/Natural_language_processing">natural         language understanding</a> to speech recognition and automated         reasoning.</p>
<p>But things have been changing fast, and with a subtle steadiness that         has caused it to go largely unremarked. Sure, all of us have been aware         of the huge changes on the Web and search engine ubiquity and social         networking. But some of the fundamentally hard problems in computing         have also gone through some remarkable (but largely unremarked)         advances.</p>
<p>We now have smart phones that speak instructions to us while we         instruct them by voice in turn. Virtually all information conceivable         is now indexed and made available through the Web; structure is now         rapidly characterizing that information, making it even more useful to         discover and organize. We can translate documents online with         acceptable accuracy into more than 60 languages <a href="#explode2">[2]</a>. We can get         directions to or see satellite views of virtually any place on earth.         We have in fact become accustomed to new technology magic on a nearly         daily basis, so much so that the pace of these advances seems to be a         constant, blunting our perspective of just how rapid these advances         have been progressing.</p>
<p>These advances are perhaps not the realization of <a href="http://en.wikipedia.org/wiki/Artificial_intelligence">artificial         intelligence</a> as articulated in the 1950s to 1980s, but are         contributing to a machine-based ability to do tasks useful to humans         heretofore impossible and at scales unimaginable. As Google and IBM&#8217;s         <a href="http://www-03.ibm.com/innovation/us/watson/what-is-watson/the-next-grand-challenge.html"> Watson</a> are showing, statistics (among other techniques) applied to         massive knowledge bases or text corpora are breaking down all of the         Grand Challenges of symbolic computing. The image that is emerging is         less one of intelligent machines working autonomously than it is of         computers working interactively or semi-automatically with humans to         address previously unsolvable problems.</p>
<p>By using a perspective of the decade past, we also demark the seminal         paper on the <a href="http://en.wikipedia.org/wiki/Semantic_Web">semantic Web</a> by         Berners-Lee, Hendler and Lassila from May 2001 <a href="#explode3">[3]</a>. Yet, while this         semantic Web vision has been a contributor to the success of the Grand         Challenge advances of the past ten years, I think we can also say that         it has not been the key or even a primary driver. That day may still         yet come. Rather, I think we have to look to natural language and         statistics surrounding large-scale corpora as the more telling drivers.</p>
<h3>Ten Grand Challenge Advances</h3>
<p>Over the past ten years there have been significant advances on at         least ten Grand Challenges in symbolic computation. As the concluding         section notes, these advances can be traced in most part to broader         advances in natural language processing, the logical and <a href="http://en.wikipedia.org/wiki/Semiotics">semiotic</a> bases for         interoperability, and standards (nominally in the semantic Web) for         embracing them. Here are these ten areas of advance, all achieved over         the past ten years:</p>
<h4>#1 Information Extraction</h4>
<p><a href="http://en.wikipedia.org/wiki/Information_extraction">Information         extraction</a> (IE) uses various forms of natural language processing         (NLP) to identify structured information within unstructured or         semi-structured documents. These documents are presented in         machine-readable form (including straight text, various document         formats or HTML) with the various types of information &#8220;tagged&#8221; or         prompted for inclusion. Information types that can be extracted with         one of the various techniques include entities, relations, topics,         categories, and so forth. Once tagged or extracted, the information in         the documents can now be included and linked to standard structured         information (as might come from conventional databases) or to structure         in other documents.</p>
<p>Most recently, a large number of online services and open source         systems have also become available with strengths in one or more of         these extraction types <a href="#explode4">[4]</a>. Some current examples include <a href="http://developer.yahoo.com/search/content/V1/termExtraction.html">Yahoo!         Term Extraction</a>, <a href="http://www.opencalais.com/">OpenCalais</a>, <a href="http://www.beliefnetworks.net/">BeliefNetworks</a>, <a href="http://www.openamplify.com/">OpenAmplify</a>, <a href="http://www.alchemyapi.com/">Alchemy API</a>, <a href="http://www.evri.com/">Evri</a>, <a href="http://extractiv.com/">Extractiv</a>, <a href="http://cogcomp.cs.illinois.edu/page/software_view/4">Illinois         Tagger</a>, and about 80 others <a href="#explode4">[4]</a>.</p>
<h4>#2 Machine Translation</h4>
<p><a href="http://en.wikipedia.org/wiki/Machine_translation">Machine         translation</a> is the automatic translation of machine-readable text         from one human language to another. Accurate and acceptable machine         translation requires applying different types of knowledge including         grammar, semantics, facts about the real world, etc. Various approaches         have been developed and refined over time.</p>
<p>Especially helpful has been the availability of huge corpora in         multiple languages to which large-scale statistical analysis may be         applied (as is the case of Google&#8217;s <a href="http://translate.google.com/">machine translation</a>) or human editing         and refinement (as is the case with the more than <a href="http://s23.org/wikistats/wikipedias_html">280 language versions of         Wikipedia</a>).</p>
<p>While it is true none of these systems have 100% accuracy (even human         translators show much variation), the more advanced ones are truly         impressive with remaining ambiguities flagged for resolution by         semi-automatic means.</p>
<h4>#3 Sentiment Analysis</h4>
<p>Though <a href="http://en.wikipedia.org/wiki/Sentiment_analysis">sentiment         analysis</a> is strictly speaking a subset of information extraction,         it has the more demanding and useful task of extracting subjective         information, often across a group of documents or texts. Sentiment         analysis can be applied to online reviews to determine the &#8220;polarity&#8221;         about specific objects, and it is especially useful for identifying         public opinion trends or evaluating social media for ranking, polling         or marketing purposes.</p>
<p>Because of its greater difficulty and potential high value, many of the         leading sentiment analysis capabilities remain proprietary. Some         capable <a href="http://www.google.com/search?q=sentiment+analysis+open+source">open         source versions</a> are available nonetheleless. There is also an         <a href="http://smm.streamcrab.com/">interesting online application</a> using Twitter feeds.</p>
<h4>#4 Disambiguation</h4>
<p>Many words have more than one meaning. <a href="http://en.wikipedia.org/wiki/Word_sense_disambiguation">Word sense         disambiguation</a> uses either machine learning, dictionaries         (gazetteers) of known entities and concepts, ontologies or linguistic         databases such as <a href="http://en.wikipedia.org/wiki/WordNet">WordNet</a>, or combinations thereof to evaluate         ambiguous terms or phrases and resolve them based on context. Some         systems need to be &#8220;trained&#8221; or some work automatically or others are         based on evaulation and prompting (semi-automatic) to complete the         disambiguation process.</p>
<p>State-of-the-art systems have greater than 90% precision <a href="#explode5">[5]</a>. Most of         the leading <a href="http://en.wikipedia.org/wiki/List_of_natural_language_processing_toolkits"> open source NLP toolkits</a> have quite capable disambiguation modules,         and even better proprietary systems exist.</p>
<h4>#5 Speech Synthesis and Recognition</h4>
<p><a href="http://en.wikipedia.org/wiki/Speech_synthesis">Speech         synthesis</a> is the conversion of text to spoken speech and has been         around for quite some time. <a href="http://en.wikipedia.org/wiki/Speech_recognition">Speech         recognition</a> is a far more difficult task in that a given sound clip         or real-time spoken speech of a person must be converted to a textual         representation, which itself can then be acted upon such as navigating         or making selections. Speech recognition is made difficult because of         individual voice differences, the variations of human languages and speech patterns, and the         need to <a href="http://en.wikipedia.org/wiki/Speech_segmentation">segment speech</a> into a sequence of words. (In most spoken languages, the sounds         representing successive letters blend into each other, so the         conversion of the modulated wave form to discrete characters or tokens         can be a very difficult process.)</p>
<p>Crude systems of a decade ago required much training with a specific         speaker&#8217;s voice to show much effectiveness. Today, the range and         ability to use these systems without training has markedly improved.</p>
<p>Until recently, improvements largely were driven by military and         intelligence requirements. Today, however, with the ubiquity of smart         phones and speech interfaces, the consumer market is greatly         accelerating progress.</p>
<h4>#6 Image Recognition</h4>
<p><a href="http://en.wikipedia.org/wiki/Computer_vision#Recognition">Image         recognition</a> is the ability to determine whether or not an         electronic image contains some specific object, feature, or activity,         and then to extract the image data associated with it. Today, under         specific circumstances and for specific tasks, this can be done by         computer. However, for the general case of arbitrary objects in         arbitrary situations this challenge has not yet been fully met. The         systems of today work best for simple geometric objects (<em>e.g</em>., polyhedra), human faces, printed or         hand-written characters, or vehicles, and in specific situations,         typically described in terms of well-defined illumination, background,         and orientation of the object relative to the camera.</p>
<p>Auto license recognition at intersections, face recognition by security         cameras, and greatly expanded and improved character recognition         systems (machine vision) represent some of the current         state-of-the-art. Again, smart phone apps are helping to drive         advances.</p>
<h4>#7 Interoperability Standards and Methods</h4>
<div style="float: right; margin-left: 10px; text-align: center;"><a href="../wp-content/themes/ai3/images/2006Posts/060524a_DataFederationPyramid.gif"> <img src="../wp-content/themes/ai3/images/2006Posts/060524a_DataFederationPyramid.gif" alt="" width="400" height="283" /></a><br />
<strong><em>Rapid Progress in Climbing the Data           Federation Pyramid</em></strong></div>
<p>Most of the previous advances are related to extracting structured         information or mapping or deriving additional structured information.         Once obtained, of course, the next challenge is in how to relate that         information together; that is, how to make it interoperate.</p>
<p>We have been steadily climbing a data federation pyramid <a href="#explode6">[6]</a> &#8212; and at         an impressively accelerating rate since the adoption of the Internet         and Web. These network innovations gave us a common basis and protocols         for connecting distributed devices. That, in turn, has freed us to         concentrate on the standards for data representation and         interoperability.</p>
<p><a href="http://en.wikipedia.org/wiki/XML">XML</a> first provided a         means for a common data serialization that encouraged various         communities and industries to devise exchange vocabularies. <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> provided a means for a common data model, one that was both simple and         extensible at the same time <a href="#explode7">[7]</a>. <a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a> built upon         that basis to enable us to build common domain models (see next).</p>
<p>There are alternatives to the semantic Web standards of RDF and OWL such as <a href="http://en.wikipedia.org/wiki/Common_logic">common logic</a> and there are many competing data exchange formats to XML. None of these standards is essential on its own and all have their communities and advocates. However, because they are standards and they share common network bases, it has also been relatively easy to convert amongst the various available protocols. We are nearly at a global level where everything is connected, machine-readable, and in structured form.</p>
<h4>#7 Common Domain Models</h4>
<p>Semantics in machine-readable form means that we can more confidently         link and combine available information. We are seeing a veritable         explosion of domain models to represent various domains and viewpoints         in consensual, interoperable form. What this means is that we are now         gaining the computing vocabularies and grammars &#8212; along with shared         community models (world views) &#8212; to get this stuff to work together.</p>
<p>Five years ago we called this phenomena mashups, but no one uses that         term any longer because these information brewpots are everywhere,         including in our very hands when we interact with the apps on our smart         phones. This glue of domain models is generally as invisible to us as         is the glue in laminates or the resin in plastics. But they are the         strength and foundations nonetheless that enable much of the computing         magic unfolding around us.</p>
<h4>#9 Virtual Apps (Cloud Computing)</h4>
<p>Once the tyranny of physical separation was shattered between data       and machine by the network, the rationale for keeping the data with the       app or even the user with the app disappeared. Cloud computing may seem       mysterious or sound to have some high-octave hum, but it really is       nothing more than saying that the Web enables us to treat all of our       computing resources as virtual. Data can be anywhere; machines and hard       drives can be anywhere; and applications can be anywhere.</p>
<p>And, virtualness brings benefits in and of itself. Whole computing         environments can be installed or removed nearly instantaneously. Peak         computing demands can be met with virtual headrooms. Backup and         rollover and redundancy practices and strategies can change. Web         services mean tailored capabilities can be invoked from anywhere and         integrated for local needs. Massive computing resources and server         farms can be as accessible to the individual as they are to prior         computing behemoths. Combined with continued advances in underlying         computing hardware and chips, the computing power available to any user         is rising exponentially. There is now even more power in the power         curve.</p>
<h4>#10 Big Data</h4>
<p>One hears stories of Google or the National Security Agency having         access and managing servers measured in the hundreds of thousands.         Entirely new operating systems and computing environments &#8212; many with         roots in open source &#8212; such as virtual operating systems and <a href="http://en.wikipedia.org/wiki/Mapreduce">MapReduce</a> approaches like         <a href="http://en.wikipedia.org/wiki/Hadoop">Hadoop</a> have been         innovated to deal with the current era of <a href="http://en.wikipedia.org/wiki/Big_data">&#8220;big data&#8221;</a>.</p>
<p>MapReduce is a framework for processing huge datasets using a large         number of servers. The &#8220;map&#8221; step partitions the problem into tractable         sub-problems, organized in a tree structure. The &#8220;reduce&#8221; step then         takes the answers to all the sub-problems and combines them to produce         the final output.</p>
<p>Such techniques enable analysis of datasets of a size impossible         before. This has enabled the development of statistics and analytical         techniques that have been able to make correlations and find patterns         for some of the Grand Challenge tasks noted before that simply could         not be addressed within previous limits. The &#8220;big data&#8221; approach is         providing a brute force alternative to previously intractable problems.</p>
<h3>Why Such Progress?</h3>
<p>Declining hardware costs and increasing performance (such as from         <a href="http://en.wikipedia.org/wiki/Moore%27s_law">Moore&#8217;s Law</a>),         combined with the adoption of the Internet + Web network, set the         fertile conditions for these unprecedented advances in computing&#8217;s         Grand Challenges. But the <a href="http://en.wikipedia.org/wiki/Adaptive_radiation">adaptive         radiation</a> in innovations now occurring has its own dynamics. In         computing terms, we are seeing the equivalent of the <a href="http://en.wikipedia.org/wiki/Cambrian_explosion">Cambrian         explosion</a> in evolutionary history.</p>
<p>The dynamics driving this computing explosion are based largely, I         believe, on the statistics of information retrieval and extraction         needed to cope with the scale of documents on the Web. That, in turn,         has impelled innovations in big data and distributed architectures and         designs that have pried open previously closed computing lockboxes. As         data from everywhere and from every provenance pours into the system,         means for handling and interoperating with it have become imperatives.         These forces, in turn, have been channeled and are being met through         the open and standards-based approaches that helped lead to the         development of the Internet and its infrastructure in the first place.</p>
<p>These powerful evolutionary forces in computing are clearly evident in         the ten Grand Challenge advances above. But the challenges above are         also silent on another factor, underpinning the interoperability         initiatives, that is only now just becoming evident and exerting its         own powerful force. That is the workable, intellectual foundations for         interoperability itself.</p>
<p>Clearly, as the advances in the Grand Challenges show, we are seeing         immense exposures of new structured information and impressive means         for accessing and managing it on a global, distributed scale.  Yet         all of this data and this structure begs the question of how to get the         information to work together. Further, the sources and         viewpoints and methods by which all of this data has been created also         puts a huge premium on means to deal with the         diversity. Though not evident, and perhaps not even known to many of         the innovators and practitioners, there has been a growing intellectual         force shaping our foundational views about the nature of things and         their representations. This force has been, I believe, one of those<a href="http://en.wikipedia.org/wiki/Root_cause"> root cause</a> drivers helping to show the way to interoperability.</p>
<p>John Sowa, despite his unending criticism of the semantic Web in favor         of <a href="http://en.wikipedia.org/wiki/Common_logic">common         logic</a>, has nonetheless been a very positive evangelist for the 19th         century American logician and philosopher, <a href="http://en.wikipedia.org/wiki/Charles_Sanders_Peirce">Charles Sanders         Peirce</a>. Sowa points out that the entire 20th century largely         neglected Peirce&#8217;s significant contributions in many areas and some         philosophers appropriated Peircean insights without proper attribution         <a href="#explode8">[8]</a>. Indeed, Peirce has only come to wider attention within the         past decade or so. Much of his voluminous lifetime writings have still         not yet been committed to publication.</p>
<p>Among many notable contributions, Peirce was passionate about signs and         their triadic representations, in a field known as <a href="http://en.wikipedia.org/wiki/Semiotics">semiotics</a>. The         philosophical and logical basis of his triangle of signs deserves your         attention, which can not be adequately treated here <a href="#explode9">[9]</a>. However, as         summarized by Sowa <a href="#explode8">[8]</a>, &#8220;A semiotic view of language and logic gets to         the heart of the philosophical controversies and their practical         implications for linguistics, artificial intelligence, and related         subjects.&#8221;</p>
<p>In essence, Peirce&#8217;s triadic logic of semiotics helps clarify         philosophical questions about things, how they are perceived and how         they are named that has vexed philosophers at least since the time of         Aristotle. What Peirce was able to put forward was a testable logic for         how things and the names of things can be understood and related to one         another, via logical statements or structures. These, in turn, can be         symbolized and formalized into logical constructs that can capture the         structure of natural language as well as more structured data.</p>
<p>The clarity of Peirce&#8217;s logic of signs is an underlying factor, I         believe, for why we are finally seeing our way clear to how         to capture, represent and relate information from a diversity of         sources and viewpoints that is defensible and interoperable <a href="#explode10">[10]</a>. As we         plumb Peircean logics further, I believe we will continue to gain         additional insights and methods for combining and relating information.         The next phase of our advances on these Grand Challenges is likely to         be fueled more by connections and interoperability than in basic         extraction or representation.</p>
<h3>The Widening Explosion</h3>
<p>We are not seeing the vision of artificial intelligence unfold as posed         three decades ago. Nor are we seeing the <a href="http://en.wikipedia.org/wiki/AI-complete">AI-complete</a> type of problems being solved in their entirety <a href="#explode11">[11]</a>. Rather, we are seeing impressive but incomplete approaches. Full automation and autonomy are not yet at hand, and may be so far in the future as to never be. But we are nevertheless seeing advances across the board in all Grand Challenge areas.</p>
<p>What is emerging is a practical achievement of the Grand Challenges, the scale and scope of which is unprecedented in symbolic computing. As we see Peircean logic continue to take hold and interoperability grow in usefulness and stature, I think it fair to say we can look back in ten years to describe where we stand today as having been in the midst of an evolutionary explosion.</p>
<hr size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode1"></a>[1] Grand Challenges were United States policy objectives for         high-performance computing and communications research set in the late         1980s. According to <a href="http://nsf.ptfsinc.com/cgi-bin/koha/opac-detail.pl?biblionumber=23787"> &#8220;A Research and Development Strategy for High Performance         Computing&#8221;</a>, <em>Executive Office of         the President, Office of Science and Technology Policy</em>, 29 pp.,         November 20, 1987, &#8220;A grand challenge is a fundamental problem in         science or engineering, with broad applications, whose solution would         be enabled by the application of high performance computing resources         that could become available in the near future.&#8221;</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode2"></a>[2] For example, as of July 17, 2011, Google offered <a href="http://translate.google.com/">63 different</a> source or target         languages for translation.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode3"></a>[3] Tim Berners-Lee, James Hendler and Ora Lassila, 2001. <a rel="nofollow" href="http://www.sciam.com/article.cfm?id=the-semantic-web&amp;print=true">&#8220;The Semantic Web&#8221;</a>. <em>Scientific American         Magazine</em>; see <a href="http://www.scientificamerican.com/article.cfm?id=the-semantic-web">http://www.scientificamerican.com/article.cfm?id=the-semantic-web</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode4"></a>[4] Go to <a href="../sweet-tools/">Sweet         Tools</a>, and enter the search &#8216;information extraction&#8217; to see a list         of about 85 tools.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode5"></a>[5] See, for example, Roberto Navigli, 2009. &#8220;Word Sense         Disambiguation: A Survey,&#8221; <em>ACM         Computing Surveys</em>, 41(2), 2009, pp. 1–69. See <a href="http://www.dsi.uniroma1.it/%7Enavigli/pubs/ACM_Survey_2009_Navigli.pdf"> http://www.dsi.uniroma1.it/~navigli/pubs/ACM_Survey_2009_Navigli.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode6"></a>[6] M.K. Bergman, 2006. “Climbing the Data Federation Pyramid,”         <em>AI3:::Adaptive Information</em> blog, May 25, 2006; see <a href="../229/climbing-the-data-federation-pyramid/">http://www.mkbergman.com/229/climbing-the-data-federation-pyramid/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode7"></a>[7] M. K. Bergman, 2009. “Advantages and Myths of RDF,” <em>AI3:::Adaptive Information</em> blog, April 8,         2009. See <a href="../483/advantages-and-myths-of-rdf/">http://www.mkbergman.com/483/advantages-and-myths-of-rdf/</a></div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode8"></a>[8] John Sowa, 2006. &#8220;Peirce&#8217;s Contributions to the 21st Century&#8221;, in         H. Schärfe, P. Hitzler, &amp; P. Øhrstrøm, eds., <em>Conceptual         Structures: Inspiration and Application</em>, LNAI 4068, Springer,         Berlin, 2006, pp. 54-69. See <a href="http://www.jfsowa.com/pubs/csp21st.pdf">http://www.jfsowa.com/pubs/csp21st.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode9"></a>[9] See, as a start, the Wikipedia article on <a href="http://en.wikipedia.org/wiki/Charles_Sanders_Peirce">Charles Sanders         Peirce</a> (pronounced &#8220;purse&#8221;), as well as the <a href="http://www.cspeirce.com/">Arisbe</a> collection of his assembled         papers (to date). Also see John Sowa, 2010. &#8220;The Role of Logic and         Ontology in Language and Reasoning,&#8221; from Chapter 11 of <em>Theory and         Applications of Ontology: Philosophical Perspectives</em>, edited by R.         Poli &amp; J. Seibt, Berlin: Springer, 2010, pp. 231-263. See <a href="http://www.jfsowa.com/pubs/rolelog.pdf">http://www.jfsowa.com/pubs/rolelog.pdf</a>.         Sowa also says, &#8220;Although formal logic can be studied independently of         natural language semantics, no formal ontology that has any practical         application can ever be developed and used without acknowledging its         intimate connection with NL semantics.&#8221;</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode10"></a>[10] While Peirce&#8217;s logic and clarity of conceptual relationships is         compelling, I find reading his writings quite demanding.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="explode11"></a>[11] In the field of artificial intelligence, the most difficult         problems are informally known as <em>AI-complete</em> or <em>AI-hard</em>, meaning that the difficulty of         these computational problems is equivalent to solving the central         artificial intelligence problem of making computers as intelligent as         people. Computer vision, autonomous robots and understanding natural         language are amongst challenges recognized by consensus as being         AI-complete. However, practical advances on the Grand Challenges were         never defined as needing to meet the AI-complete criterion. Indeed, it         is even questionable whether such a hurdle is even worthwhile or         meaningful on its own.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/965/in-the-midst-of-an-evolutionary-explosion/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Leveraging Intangible Assets Using Semantic Technologies</title>
		<link>http://www.mkbergman.com/958/leveraging-intangible-assets-using-semantic-technologies/</link>
		<comments>http://www.mkbergman.com/958/leveraging-intangible-assets-using-semantic-technologies/#comments</comments>
		<pubDate>Tue, 10 May 2011 21:13:44 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Information]]></category>
		<category><![CDATA[Adaptive Innovation]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[company valuations]]></category>
		<category><![CDATA[GDP]]></category>
		<category><![CDATA[information economy]]></category>
		<category><![CDATA[information management]]></category>
		<category><![CDATA[intangible assets]]></category>
		<category><![CDATA[Knowledge Capital]]></category>
		<category><![CDATA[Market value]]></category>
		<category><![CDATA[semantic technologies]]></category>
		<category><![CDATA[tangible assets]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=958</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Leveraging Intangible Assets Using Semantic Technologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Semantic Enterprise&amp;rft.subject=Semantic Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-05-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/958/leveraging-intangible-assets-using-semantic-technologies/&amp;rft.language=English"></span>
Exposing $4.7 Trillion Annually in Undervalued Information Something strange began to happen with company valuations beginning twenty to thirty years ago. Book values increasingly began to diverge &#8212; go lower &#8212; from stock prices or acquisition prices. Between 1982 and 1992 the ratio of book value to market value decreased from 62% to 38% for [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Leveraging Intangible Assets Using Semantic Technologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Information&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Semantic Enterprise&amp;rft.subject=Semantic Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-05-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/958/leveraging-intangible-assets-using-semantic-technologies/&amp;rft.language=English"></span>
<h2><a><img style="border: 0px solid; width: 225px; height: 264px; float: left; margin-right: 10px;" title="Deciphering Information Assets" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_hieroglyphs.jpg" alt="Deciphering Information Assets" hspace="5" vspace="5" align="left" /></a> Exposing $4.7 Trillion Annually in Undervalued Information</h2>
<p>Something strange began to happen with company valuations beginning twenty to thirty years ago. Book values increasingly began to diverge &#8212; go lower &#8212; from stock prices or acquisition prices. Between 1982 and 1992 the ratio of book value to market value decreased from 62% to 38% for public US companies <a href="#asset1">[1]</a>. The why of this mystery has largely been solved, but what to do about it has not. Significantly, semantic technologies and approaches offer both a rationale and an imperative for how to get the enterprises&#8217; books back in order. In the process, semantics may also provide a basis for more productive management and increased valuations for enterprises as well.</p>
<p>The mystery of diverging value resides in the importance of information in an information economy. Unlike the historical and traditional ways of measuring a company&#8217;s assets &#8212; based on the tangible factors of labor, capital, land and equipment &#8212; information is an <span style="font-style: italic;">intangible</span> asset. As such, it is harder to see, understand and evaluate than other assets. Conventionally, and still the more common accounting practice, <a href="http://en.wikipedia.org/wiki/Intangible_asset">intangible assets </a>are divided into goodwill, legal (intellectual property and trade secrets) and competitive (know-how) intangibles. But &#8212; given that intangibles now equal or exceed the value of tangible assets in advanced economies &#8212; we will focus instead on the information component of these assets.</p>
<p>As used herein, <span style="font-style: italic;">information</span> is taken to be any data that is presented in a form useful to recipients (as contrasted to the more technical definition of Shannon and Weaver <a href="#asset2">[2]</a>). While it is true that the there is always a question of whether the collection or development of information is a cost or represents an investment, that &#8220;information&#8221; is of growing importance and value to the enterprise is certain.</p>
<p>The importance of this information focus can be demonstrated by two telling facts, which I elaborate below. First, only five to seven percent of existing information is adequately used by most enterprises. And, second, the global value of this information amounts to perhaps a range of <span class="double_u">$2.0 trillion to $7.4 trillion</span> annually (yes, trillions with a T)! It is frankly unbelievable that assets of such enormous magnitude are so poorly understood, exploited or managed.</p>
<p>Amongst all corporate resources and assets, information is surely the least understood and certainly the least managed. We value what we measure, and measure what we value. To say that we little measure information &#8212; its generation, its use (or lack thereof) or its value &#8212; means we are attempting to manage our enterprises with one eye closed and one arm tied behind our backs. Semantic approaches offer us one way, perhaps the best way, to bring understanding to this asset and then to leverage its value.</p>
<h3>The Seven “Laws” of Information</h3>
<p>More than a decade ago Moody and Walsh put forward a seminal paper on the seven &#8220;laws&#8221; of information <a href="#asset3">[3]</a>. Unlike other assets, information has some unique characteristics that make understanding its importance and valuing it much more difficult than other assets. Since I think it a shame that this excellent paper has received little attention and few citations, let me devote some space to covering these &#8220;laws&#8221;.</p>
<p>Like traditional factors of production &#8212; land, labor, capital &#8212; it is critical to understand the nature of this asset of &#8220;information&#8221;. As the laws below make clear, the nature of &#8220;information&#8221; is totally unique with respect to other factors of production. Note I have taken some liberty and done some updating on the wording and emphasis of the Moody and Walsh &#8220;laws&#8221; to accommodate recent learnings and understandings.</p>
<h4>Law #1: Information Is (Infinitely) Shareable</h4>
<p>Information is not friable and can not be depleted. Using or consuming it has no direct affect on others using or consuming it and only using portions of it does not undermine the whole of it. Using it does not cause a degradation or loss of function from its original state. Indeed, information is actually not &#8220;consumed&#8221; at all (in the conventional sense of the term); rather, it is &#8220;shared&#8221;. And, absent other barriers, information can be shared infinitely. The access and<br />
use to information on the Web demonstrates this truth daily.</p>
<p>Thus, perhaps the most salient characteristic of information as an asset is that it can be shared between any number of people, business areas and organizations without loss of value to any party (absent the importance of confidentiality or secrecy, which is another information factor altogether). The sharability or maintenance of value irrespective of use makes information quite different to how other assets behave. There is no dilution from use. As Moody and Walsh put it, &#8220;from the firm’s perspective, value is therefore cumulative rather than apportioned across different users.&#8221;</p>
<p>In practice, however, this very uniqueness is also a cause of other organizational challenges. Both personal and institutional barriers get erected to limit sharing since &#8220;knowledge is power.&#8221; One perverse effect of information hoarding or lack of institutional support for sharing is to force the development anew of similar information. When not shared, existing information becomes a cost, and one that can get duplicated many times over.</p>
<h4>Law #2: The Value of Information Increases With Use</h4>
<p>Most resources degrade with use, such as equipment wearing out. In contrast, the per unit value of information increases with use. The major cost of information is in its capture, storage and maintenance. The actual variable costs of using the information (particularly digital information) is, in essence, zero. Thus, with greater use, the per unit cost of information drops.</p>
<p>There is a corollary to this that also goes to the heart of the question of information as an asset. From an accounting point of view, something can only be an asset if it provides future economic value. If information is not used, it cannot possibly result in such benefits and is therefore not an asset. Unused information is really a liability, because no value is extracted from it. In such cases the costs of capture, storage and maintenance are incurred, but with no realized<br />
value. Without use, information is solely a cost to the enterprise.</p>
<p>The additional corollary is that awareness of the information&#8217;s existence is an essential requirement in order to obtain this value. As Moody and Walsh state, &#8220;information is at its highest &#8216;potential&#8217; when everyone in the organization knows where it is, has access to it and knows how to use it. Information is at its lowest &#8216;potential&#8217; when people don’t even know it is there.&#8221;</p>
<p>A still further corollary is the importance of information literacy. Awareness of information without an understanding of where it fits or how to take advantage of it also means its value is hidden to potential users. Thus, in addition to awareness, training and documentation are important factors to help ensure adequate use. Both of these factors<br />
may seem like additional costs to the enterprise beyond capture, storage and maintenance, but &#8212; without them &#8212; no or little value will be leveraged and the information will remain a sunk cost.</p>
<h4>Law #3: Information is Perishable</h4>
<p>Like most other assets, the value of information tends to depreciate over time<a href="#asset4"> [4]</a>. Some information has a short shelf life (such as Web visitations); other has a long shelf life (patents, contracts and many trade secrets). Proper valuation of information should take into account such differences in operational life, analysis or decision life, and statutory life. Operational shelf life tends to be the shortest.</p>
<p>In these regards, information is not too dissimilar from other asset types. The most important point is to be cognizant of use and shelf differences amongst different kinds of information. This consideration is also traded off against the declining costs of digital information storage.</p>
<h4>Law #4: The Value of Information Increases With Accuracy</h4>
<p>A standard dictum is that the value of information increases with accuracy. The caveat, however, is that some information, because it is not operationally dependent or critical to the strategic interests of the firm, actually can become a cost when capture costs exceed value. Understanding such <a href="http://en.wikipedia.org/wiki/Pareto_principle">Pareto principles</a> is an important criterion in evaluating information approaches. Generally, information closest to the transactional or business purpose of the organization will demand higher accuracy.</p>
<p>Such statements may sound like platitudes &#8212; and are &#8212; in the absence of an understanding of information dependencies within the firm. But, when certain kinds of information are critical to the enterprise, it is just as important to know the accuracy of the information feeding that &#8220;engine&#8221; as it is for oil changes or maintenance schedules for physical engines. Thus an understanding of accuracy requirements in information should be a deliberate management focus for critical business functions. It is the rare firm that attends to such imperatives today.</p>
<h4>Law #5: The Value of Information Increases in Combination</h4>
<p>A unique contribution from semantic approaches &#8212; and perhaps the one resulting in the highest valuation benefit &#8212; arises from the increased value of connecting the information. We have come to understand this intimately as the &#8220;network effect&#8221; from interconnected nodes on a network. It also arises when existing information is connected as well.</p>
<p>Today&#8217;s enterprise information environment is often described by many as unconnected &#8220;silos&#8221;. Scattered databases and spreadsheets and other information repositories litter the information landscape. Not only are these sources unconnected and isolated, but they also may describe similar information in different and inconsistent ways.</p>
<p>As I have described previously in <a href="http://www.mkbergman.com/837/the-law-of-linked-data/"><span style="font-style: italic;">The Law of Linked Data</span></a> <a href="#asset5">[5]</a>, existing information can act as nodes that &#8212; once connected to one another &#8212; tend to produce a similar network effect to what physical networks exhibit with increasing numbers of users. Of course, the nature of the information that is being connected and its centrality to the mission of the enterprise will greatly affect the value of new connections. But, based on evidence to date, the value of information appears to go up somewhere between a quadratic and exponential function for the number of new connections. This value is especially evident in know-how and competitive areas.</p>
<h4>Law #6: More Is Not Necessarily Better</h4>
<p>Information overload is a well-known problem. On the other hand, lack of appropriate information is also a compelling problem. The question of information is thus one of relevancy. Too much irrelevant information is a bad thing, as is too little relevant information.</p>
<p>These observations lead to two use considerations. First, means to understand and focus information capture on relevant information is critical. And, second, information management systems should be purposefully designed with user interfaces for easy filtering of irrelevant information.</p>
<p>The latter point sounds straightforward, but, in actuality, requires a semantic underpinning to the enterprise&#8217;s information assets. This requirement is because relevancy is in the eye of the beholder, and different users have different terms, perspectives, and world views by which information evaluation occurs. In order for useful filtering, information must be presented in similar terms and perspectives relevant to those users. Since multiple studies affirm that information decision-makers seek more information beyond their overload points<a href="#asset3"> [3]</a>, it is thus more useful to provide relevant access and filtering methods that can be tailored by user rather than &#8220;top down&#8221; information restrictions.</p>
<h4>Law #7: Information is Self-propagating</h4>
<p>With access and connections, information tends to beget more information. This propagation results from summations, analysis, unique combinations and other ways that basic datum get recombined into new datum. Thus, while the first law noted that information can not be consumed (or depleted) by virtue of its use, we can also say that information tends to reproduce and expand itself via use and inspection.</p>
<p>Indeed, knowledge itself is the result of how information in its native state can be combined and re-organized to derive new insights. From a valuation standpoint, it is this very understanding that leads to such things as competitive intelligence or new know-how. In combination with insights from connections, this propagating factor of information is the other leading source of intangible asset valuations.</p>
<p>This law also points to the fact that information <span style="font-style: italic;">per se</span> is not a scarce resource. (Though its availability may be scarce.) Once available, techniques like data mining, analysis, visualization and so forth can be rich sources for generating new information from existing holdings of data.</p>
<h3>Information as an Asset and How to Value</h3>
<p>These &#8220;laws&#8221; &#8212; or perspectives &#8212; help to frame the imperatives for how to judge information as an asset and its resulting value. The methodological and conceptual issues of how to explicitly account for information on a company&#8217;s books are, of course, matters best left to economists and professional accountants. With the growing share of information in relation to intangible assets, this would appear to be a matter of great importance to national policy. Accounting for R&amp;D efforts as an asset versus a cost, for example, has been estimated to add on the order of 11 percent to US national GDP estimates<a href="#asset9"> [9]</a>.</p>
<p>The mere generation of information is not necessarily an asset, as the &#8220;laws&#8221; above indicate. Some of the information has no value and some indeed represents a net sunk cost. What we can say, however, is that valuable information that is created by the enterprise but remains unused or is duplicated means that what was potentially an asset has now been turned into a cost &#8212; sometimes a cost repeated many-fold.</p>
<p>Information that <span style="font-weight: bold; font-style: italic;">is</span> used is an asset, intangible or not. Here, depending on the nature of the information and its use, it can be valued on the basis of cost (historical cost or what it cost to develop it), market value (what others will pay for it), or utility (what is its present value as benefits accrue into the future). Traditionally the historical cost method has been applied to information. Yet, since information can both be sold and still retained by the organization, it may have both market value and utility value, with its total value being the sum.</p>
<p>In looking at these factors, Moody and Walsh propose a number of new guidelines in keeping with the &#8220;laws&#8221; noted above<a href="#asset3"> [3]</a>:</p>
<ul>
<li>Operational information should be measured as the cost of collection using data entry costs</li>
<li>Management information should be valued based on what it cost to extract the data from operational systems</li>
<li>Redundant data should be considered to have zero value (Law #1)</li>
<li>Unused data should be considered to have zero value (Law #2)</li>
<li>The number of users and number of accesses to the data should be used to multiply the value of the information (Law #2). When information is used for the first time, it should be valued at its cost of collection; subsequent uses should add to this value (perhaps on a depreciated basis; see below)</li>
<li>The value of information should be depreciated based on its “shelf life” (Law #3)</li>
<li>The value of information should be discounted by its accuracy relative to what is considered to be acceptable (Law #4)</li>
<li>And, as an added factor, information that is effectively linked or combined should have its value multiplied (Law #5), though the actual multiplier may be uncertain <a href="#asset5">[5]</a>.</li>
</ul>
<p>The net result of thinking about information in this more purposeful way is to encourage more accurate valuation methods, and to provide incentives for more use and re-use, particularly in combined ways. Such methods can also help distinguish what information is of more value to the organization, and therefore worthy of more attention and investment.</p>
<h3>The Growing Importance of Intangible Information</h3>
<p>The emerging discrepancy between market capitalizations and book values began to get concerted academic attention in the 1990s. To be sure, perceptions by the market and of future earnings potential can always  color these differences. The simple occurrence of a discrepancy is not itself proof of erroneous or inaccurate valuations. (And, the corollary is that the degree of the discrepancy is not sufficient alone to estimate the intangible asset &#8220;gap&#8221;, a logical error made by many proponents.) But, the fact that these discrepancies had been growing and appeared to be based (in part) on structural changes linked to intangibles was creating attention.</p>
<p>Leonard Nakamura, an economist with the Federal Reserve Board in Philadelphia, published a working paper in 2001 entitled, <a title="L.I. Nakamura, 'What is the US Gross Investment in Intangibles'" href="http://www.phil.frb.org/files/wps/2001/wp01-15.pdf"><em>&#8220;What is the U.S. Gross investment in Intangibles?  (At Least) One Trillion Dollars a Year!&#8221;</em></a> <a href="#asset6">[6]</a>. This was one of the first attempts to measure intangible investments, which he defined as private expenditures on assets that are intangible and necessary to the creation and sale of new or improved products and processes, including designs, software, blueprints, ideas, artistic expressions, recipes, and the like. Nakamura acknowledged his work as being preliminary. But he did find direct and indirect empirical evidence to show that US private firms were investing at least $1 trillion annually (as of 2000, the basis year for the data) in intangible assets.  Private expenditures, labor and corporate operating margins were the three measurement methods.  The study also suggested that the capital stock of intangibles in the US has an equilibrium market value of at least $5 trillion.</p>
<p>Another key group &#8212; Carol Corrado, Charles Hulten, and Daniel Sichel, known as &#8220;CHS&#8221; across their many studies &#8212; also began to systematically evaluate the extent and basis for intangible assets and its discrepancy <a href="#asset7">[7]</a>.  They estimated that spending on long-lasting knowledge capital &#8212; not just intangibles broadly &#8212; grew relative to other major components of aggregate demand during the 1990s. CHS was the first to show that by the turn of the millenium that fixed US investment in intangibles was at least as large as business investment in traditional, tangible capital.</p>
<p>By later in the decade, Nakamura was able to gather and analyze time series data that showed the steady increase in the contributions of intangibles<a href="#asset8"> [8]</a>:</p>
<div><a title="OSF workflow.png" href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_intangibles_trends.png"><br />
<img class="center_ok" style="border: 0px solid; width: 600px; height: 374px;" title="Trends in Intangibles Values" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_intangibles_trends.png" alt="Trends in Intangibles Values" /></a></div>
<p>One can see the cross-over point late in the decade. Investment in intangibles he now estimates to be on the order of 8% to 10% of GDP annually in the US.</p>
<p>Roughly at the same time the National Academies in the US was commissioned to investigate the policy questions of intangible assets. The resulting major study<a href="#asset9"> [9]</a> contains much relevant information. But it, too, contained an update by CHS on their slightly different approach to analyzing the growing role of intangible assets:</p>
<div><a title="OSF workflow.png" href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_intangibles_trends_nas.png"><br />
<img class="center_ok" style="border: 0px solid; width: 600px; height: 386px;" title="Trends in Intangibles Values - Version 2" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_intangibles_trends_nas.png" alt="Trends in Intangibles Values - Version 2" /></a></div>
<p>This CHS analysis shows similar trends to what Nakamura found, though the degree of intangible contributions is estimated as higher (~14% of annual GDP today), with investments in intangibles exceeding tangible assets somewhat earlier.</p>
<p>Surveys of more than 5,000 companies in 25 companies confirmed these trends from a different perspective, and also showed that most of these assets did not get reflected in financial statements. A large portion of this value was due to &#8220;brands&#8221; and other market intangibles <a href="#asset10">[10]</a>. The total &#8220;undisclosed&#8221; portion appeared to equal or exceed total<br />
reported assets. Figures for the US indicated there might be a cumulative basis of intangible assets of $9.2 trillion <a href="#asset11">[11]</a>.</p>
<p>In parallel, these groups and others began to decompose the intangible asset growth by country, sector, or asset type. The specific component of &#8220;information&#8221; received a great deal of attention. Uday Apte, Uday Karmarkar and Hiranya Nath, in particular, conducted a couple of important studies during this decade <a href="#asset12">[12</a>,<a href="#asset13">13]</a>. For example, they found nearly two-thirds of recent US GDP was due to information or knowledge industry contributions, a percentage that had been growing over time. They also found that a secondary sector of information internal to firms itself constituted well over 40% of the information economy, or some 28% of the entire economy. So the information activities internal to organizations and institutions represent a very large part of the economy.</p>
<p>The specific components that can constitute the informational portion of intangible assets has also been looked at by many investigators, importantly including key accounting groups. FASB, for example, has specific guidance on treatment of intangible assets in SFAS 141 <a href="#asset14">[14]</a>. Two-thirds of the 90 specific intangible items listed by the American Institute of Certified Public Accountants are directly related to information (as opposed to contracts, brands or goodwill), as shown in <a href="#asset15">[15]</a>. There has also been some good analysis by CHS on breakdowns by intangible assets categories<a href="#asset16"> [16]</a>. There are also considerable differences by country on various aspects of these measures (for example,<a href="#asset10"> [10]</a>). For example, according to OECD figures from 2002, expenditures for knowledge (R&amp;D, education and software) ranged from nearly 7 percent (Sweden) to below 2 percent (Greece) in OECD countries, with the average of about 4 percent and the US at over 6 percent<a href="#asset17"> [17]</a>.</p>
<h3>. . . Plus Too Much Information Goes Unused</h3>
<p>The common view is that a typical organization only uses 5 to 7 percent of the information it already has on hand<a href="#asset18"> [18]</a>, and 20% to 25% of a knowledge worker&#8217;s time is spent simply trying to find information <a href="#asset19">[19]</a>. To probe these issues more deeply, I began a series of analyses in 2004 looking at how much money was being spent on preparing documents within US companies, and how much of that investment was being wasted or not re-used<a href="#asset20"> [20]</a>. One key finding from that study was that the information within documents in the US represent about a third of total gross domestic product, or an amount equal at the time of the study to about $3.3 trillion annually (in 2010 figures, that would be closer to $4.7 trillion). This level of investment is consistent with the results of Apte <span style="font-style: italic;">et al.</span> and others as noted above.</p>
<p>However, for various reasons &#8212; mostly due to lack of awareness and re-use &#8212; some 25% of those trillions of dollar spent annually on document creation costs are wasted. If we could just find the information and re-use it, massive benefits could accrue, as these breakdowns in key areas show:</p>
<table style="text-align: left; margin-left: auto; margin-right: auto;" border="0" cellspacing="0" cellpadding="4">
<tbody>
<tr>
<td style="background-color: #cccccc; width: 363px; text-align: center;"><strong>U.S.</strong> <strong>FIRMS</strong></td>
<td style="background-color: #cccccc; width: 86px; text-align: center;"><strong>$ Million</strong></td>
<td style="background-color: #cccccc; width: 82px; text-align: center;"><strong>%</strong></td>
</tr>
<tr>
<td style="vertical-align: top;">Cost to Create Documents</td>
<td style="vertical-align: top; text-align: right;">$3,261,091</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits to Finding Missed or Overlooked Documents</td>
<td style="vertical-align: top; text-align: right;">$489,164</td>
<td style="vertical-align: top; text-align: right;">63%</td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits to Improved Document Access</td>
<td style="vertical-align: top; text-align: right;">$81,360</td>
<td style="vertical-align: top; text-align: right;">10%</td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Re-finding Web Documents</td>
<td style="vertical-align: top; text-align: right;">$32,967</td>
<td style="vertical-align: top; text-align: right;">4%</td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Proposal Preparation and Wins</td>
<td style="vertical-align: top; text-align: right;">$6,798</td>
<td style="vertical-align: top; text-align: right;">1%</td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Paperwork Requirements and Compliance</td>
<td style="vertical-align: top; text-align: right;">$119,868</td>
<td style="vertical-align: top; text-align: right;">15%</td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Reducing Unauthorized Disclosures</td>
<td style="vertical-align: top; text-align: right;">$51,187</td>
<td style="vertical-align: top; text-align: right;">7%</td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;"><strong>Total Annual Benefits</strong></td>
<td style="vertical-align: top; text-align: right;">$781,314</td>
<td style="vertical-align: top; text-align: right;">100%</td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top;"></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center;"><strong>PER LARGE FIRM</strong></td>
<td style="vertical-align: top; text-align: center;"><strong>$ Million</strong></td>
<td style="vertical-align: top;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Cost to Create Documents</td>
<td style="vertical-align: top; text-align: right;">$955.6</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits to Finding Missed or Overlooked Documents</td>
<td style="vertical-align: top; text-align: right;">$143.3</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits to Improving Document Access</td>
<td style="vertical-align: top; text-align: right;">$23.8</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Re-finding Web Documents</td>
<td style="vertical-align: top; text-align: right;">$9.7</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Proposal Preparation and Wins</td>
<td style="vertical-align: top; text-align: right;">$2.0</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Paperwork Requirements and Compliance</td>
<td style="vertical-align: top; text-align: right;">$35.1</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;">Benefits of Reducing Unauthorized Disclosures</td>
<td style="vertical-align: top; text-align: right;">$15.0</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: right;"></td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
<tr>
<td style="vertical-align: top;"><strong>Total Annual Benefits</strong></td>
<td style="vertical-align: top; text-align: right;">$229.0</td>
<td style="vertical-align: top; text-align: right;"></td>
</tr>
</tbody>
</table>
<p style="text-align: center; padding-top: 10px;">Table. Mid-range Estimates for the Annual Value of Documents, U.S. Firms, 2002 <a href="#asset20">[20]</a></p>
<p>The total benefit from improved document access and use to the U.S economy is on the order of 8% of GDP. For the 1,000 largest U.S. firms, benefits from these improvements can approach nearly $250 million annually per firm (2002 basis). About three-quarters of these benefits arise from <strong><em>not</em></strong> re-creating the intellectual capital already invested in prior document creation. About one-quarter of the benefits are due to reduced regulatory non-compliance or paperwork, or better competitiveness in obtaining solicited grants and contracts.</p>
<p>This overall value of document use and creation is quite in line with the analyses of intangible assets noted above, and which arose from totally different analytical bases and data. This triangulation brings confidence that true trends in the growing importance of information have been identified.</p>
<h3>How Big is the Information Asset Gap?</h3>
<p>These various estimates can now be combined to provide an assessment of just how large the &#8220;gap&#8221; is for the overlooked accounting and use of information assets:</p>
<table class="center_ok" style="border-collapse: collapse; width: 810px; height: 227px;" border="0" cellspacing="0" cellpadding="4">
<tbody>
<tr>
<td style="text-align: center; background-color: #cccccc;"></td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;">GDP ($T)</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" colspan="2">Intangible %</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" colspan="2">Info Contrib %</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" colspan="2">Info Assets ($T)</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" colspan="2">Unused Info ($T)</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" colspan="2">Total ($T)</td>
</tr>
<tr>
<td style="text-align: center; background-color: #cccccc;" width="130"></td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52"></td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Lo</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Hi</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Lo</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Hi</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Lo</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Hi</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Lo</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Hi</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Lo</td>
<td style="text-align: center; background-color: #cccccc; font-weight: bold;" width="52">Hi</td>
</tr>
<tr>
<td>US</td>
<td align="right">$14.72</td>
<td style="text-align: center;">9%</td>
<td style="text-align: center;">14%</td>
<td style="text-align: center;">33%</td>
<td style="text-align: center;">67%</td>
<td align="right">$0.44</td>
<td align="right">$1.38</td>
<td align="right">$0.30</td>
<td align="right">$1.21</td>
<td align="right">$0.74</td>
<td align="right">$2.60</td>
</tr>
<tr>
<td>European Union</td>
<td align="right">$15.25</td>
<td style="text-align: center;">8%</td>
<td style="text-align: center;">12%</td>
<td style="text-align: center;">33%</td>
<td style="text-align: center;">50%</td>
<td align="right">$0.40</td>
<td align="right">$0.92</td>
<td align="right">$0.31</td>
<td align="right">$1.26</td>
<td align="right">$0.72</td>
<td align="right">$2.17</td>
</tr>
<tr>
<td>Remaining Advanced</td>
<td align="right">$10.17</td>
<td style="text-align: center;">8%</td>
<td style="text-align: center;">12%</td>
<td style="text-align: center;">33%</td>
<td style="text-align: center;">50%</td>
<td align="right">$0.27</td>
<td align="right">$0.61</td>
<td align="right">$0.21</td>
<td align="right">$0.84</td>
<td align="right">$0.48</td>
<td align="right">$1.45</td>
</tr>
<tr>
<td>Rest of World</td>
<td align="right">$34.32</td>
<td style="text-align: center;">2%</td>
<td style="text-align: center;">6%</td>
<td style="text-align: center;">10%</td>
<td style="text-align: center;">25%</td>
<td align="right">$0.07</td>
<td align="right">$0.51</td>
<td align="right">$0.00</td>
<td align="right">$0.71</td>
<td align="right">$0.07</td>
<td align="right">$1.22</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>Total</td>
<td align="right">$74.46</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td align="right">$1.18</td>
<td align="right">$3.42</td>
<td align="right">$0.83</td>
<td align="right">$4.02</td>
<td align="right">$2.00</td>
<td align="right">$7.44</td>
</tr>
<tr>
<td><small>Notes (see endnotes)</small></td>
<td style="text-align: center;"><a href="#asset21"><small>[21]</small></a></td>
<td style="text-align: center;" colspan="2"><a href="#asset22"><small>[22]</small></a></td>
<td style="text-align: center;" colspan="2"><a href="#asset23"><small>[23]</small></a></td>
<td align="right"></td>
<td align="right"></td>
<td colspan="2">
<div style="text-align: center;"><a href="#asset24"><small>[24]</small></a></div>
</td>
<td align="right"></td>
<td align="right"></td>
</tr>
</tbody>
</table>
<p>Depending, these estimates can either be viewed as being too optimistic about the importance of information assets<a href="#asset25"> [25] </a>or too conservative <a href="#asset26">[26]</a>. The breadth of the ranges of these values is itself an expression of the uncertainty in the numbers and the analysis.</p>
<p>The analysis shows that, globally, the value of unused and unaccounted information assets may be on the order of  $2.0 trillion to $7.4 trillion annually, with a mid-range value of $4.7 trillion. Even considering uncertainties, these are huge, huge numbers by any account. For the US alone, this range is $750 billion to $2.6 trillion annually. The analysis from the prior studies <a href="#asset20">[20]</a> would strongly suggest the higher end of this range is more likely than the lower. Similarly large gaps likely occur within the European Union and within other advanced nations. For individual firms, depending on size, the benefits of understanding and closing these gaps can readily be measured in the millions to billions <a href="#asset27">[27]</a>.</p>
<p>At the high end, these estimates suggest that perhaps as much as 10% of global expenditures is wasted and unaccounted for due to information-related activities. This is roughly equivalent to adding a half of the US economy to the global picture.</p>
<p>In the concluding section, we touch on why such huge holes may appear in the world&#8217;s financial books. Clearly, though, even with uncertain and heroic assumptions, the magnitude of this gap is huge, with compelling needs to understand and close it as soon as possible.</p>
<h3>The Relationship to Semantic Technologies</h3>
<p>The seven Moody and Walsh information &#8220;laws&#8221; provide the clues to the reasons why we are not properly accounting for information and why we inadequately use it:</p>
<ul>
<li>We don&#8217;t know what information we have and can not find it</li>
<li>What we have we don&#8217;t connect</li>
<li>We misallocate resources for generating, capturing and storing information, because we don&#8217;t understand its value and potential</li>
<li>We don&#8217;t manage the use of information or its re-use</li>
<li>We duplicate efforts</li>
<li>We inadequately leverage what information we have and miss valuable (that is, can be &#8220;valuated&#8221;) insights that could be gained.</li>
</ul>
<p>Fundamentally, because information is not understood in our bones as central to the well-being of our enterprises, we continue to view the generation, capture and maintenance of information as a &#8220;cost&#8221; and not an &#8220;asset&#8221;.</p>
<p>I have maintained for some time an interactive <a href="http://www.mkbergman.com/temp-exhibit/">information timeline</a><a href="#asset28"> [28] </a>that attempts to encompass the entire human history of information innovations. For tens of thousands of years steady &#8212; yet slow &#8212; progress in the ways to express and manage information can be seen in this timeline. But, then, beginning with electricity and then digitization, the pace of innovation explodes.</p>
<p>The same timeframe that sees the importance of intangible assets appear on national and firm accounts is when we see the full digitization of information and its ability to be communicated and linked over digital networks. A very insightful figure by Rama Hoetzlein for his thesis in 2007, which I have modified and enhanced, captures this evolution with some estimated dates as is shown below (click to expand) <a href="#asset29">[29]</a>:</p>
<div><a title="OSF workflow.png" href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_knowledge_evolution.png"><br />
<img class="center_ok" style="border: 0px solid; width: 600px; height: 682px;" title="Knowledge System Trends throughout History" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_knowledge_evolution.png" alt="Knowledge System Trends throughout History" /></a></div>
<p>The first insight this figure provides is that all forms of information are now available in digital form. This includes unstructured (images and documents), semi-structured (mark-up and &#8220;tagged&#8221; information) and structured (database and spreadsheet) information. This information can now be stored and communicated over digital networks with broadly accepted protocols.</p>
<p>But the most salient insight is that we now have the means through semantic technologies and approaches to interrelate all of this information together. Tagging and extraction methods enable us to generate metadata for unstructured documents and content. Data models based on predicate logic and semantic logics give us the flexible means to express the relationships and connections between information. And all of this can be stored and manipulated through graph-based datastores and languages such that we can draw inferences and gain insights. Plus, since all of this is now accessible via the Web and browsers, virtually any user can access, use and leverage this information.</p>
<p>This figure and its dates not only shows where we have come as a species in our use and sophistication with information, but how we need to bring it all together using semantics to complete our transition to a knowledge economy.</p>
<p>The very same metadata and semantic tagging capabilities that enable us to interrelate the information itself also provides the techniques by which we can monitor and track usage and provenance. It is through these additional semantic methods that we can finally begin to gain insight as to what information is of what value and to whom. Tapping this information will complete the circle for how we can also begin to properly valuate and then manage and optimize our information assets.</p>
<h3>Conclusion</h3>
<p>With our transition to an information economy, we now see that intangible assets exceed the value of tangible ones. We see that the information component of these intangibles represent one-third to two-thirds of these intangibles. In other words, information makes up from 17% to more than one-third of an individual firm&#8217;s value in modern economies. Further, we see that at least 25% of firm expenditures on information is wasted, keeping it as a cost and negating its value as an asset.</p>
<p>The &#8220;factories&#8221; of the modern information economy no longer produce pins with the fixed inputs of labor and capital as in the time of Adam Smith. They rather produce information and knowledge and know-how. Yet our management and accounting systems seem fixed in the techniques of yesteryear. The quaint idea of total factor productivity as a &#8220;residual&#8221; merely belies our ignorance about the causes of economic growth and firm value. These are issues that should rightly occupy the attention of practitioners in the disciplines of accounting and management.</p>
<div class="boxYellowDotted"><span style="font-style: italic;">Why industrial-era accounting methods have been maintained in the present information age is for students of corporate power politics to debate. It should suffice to remind us that when industrialization induced a shift from the extraction of funds from feudal land possessions to earning profits on invested capital, most of the assumptions about how to measure performance had to change. When the expenses for acquiring information capabilities cease to be an arbitrary budget allocation and become the means for gaining Knowledge Capital, much of what is presently accepted as management of information will have to shift from a largely technological view of efficiency to an asset management perspective</span><a href="#asset30"> [30]</a>.</div>
<p>Accounting methods grounded in the early 1800s that are premised on only capital assets as the means to increase the productivity of labor no longer work. Our engines of innovation are not physical devices, but ideas, innovation and knowledge; in short, information. Capable executives recognize these trends, but have yet to change management practices to address them<a href="#asset31"> [31]</a>.</p>
<p>As managers and executives of firms we need not await wholesale modernization of accounting practices to begin to make a difference. The first step is to understand the role, use and importance of information to our organizations. Looking clearly at the seven information &#8220;laws&#8221; and what that means about tracking and monitoring is an immediate way to take this step. The second step is to understand and evaluate seriously the prospects for semantic approaches to make a difference today.</p>
<p>We have now sufficiently climbed the data federation pyramid<a href="#asset32"> [32]</a> to where all of our information assets are digital; we have network protocols to link it; we have natural language and extraction techniques for making documents first-class citizens along side structured data; and we have logical data models and sound semantic technologies for tying it all together.</p>
<p>We need to reorganize our &#8220;factory&#8221; floors around these principles, just as prime movers and unit electric drives altered our factories of the past. We need to reorganize and re-think our work processes and what we measure and value to compete in the 21st century. It is time to treat information as seriously as it has become an integral part of our enterprises. Semantic technologies and approaches provide just the path to do so.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset1"></a>[1] Baruch Lev and Jürgen H. Daum, 2003. &#8220;Intangible Assets and the Need for a Holistic and More Future-oriented Approach to Enterprise Management and Corporate Reporting,&#8221; prepared for the 2003 PMA Intellectual Capital Symposium, 2nd October 2003, Cranfield Management Development Centre, Cranfield University, UK; see <a href="http://www.juergendaum.de/articles/pma_ic_symp_jdaum_final.pdf">http://www.juergendaum.de/articles/pma_ic_symp_jdaum_final.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset2"></a>[2] Claude E. Shannon and Warren Weaver, 1949. <span style="font-style: italic;">The Mathematical Theory of Communication</span>. The University of Illinois Press, Urbana, Illinois, 1949. ISBN 0-252-72548-4.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset3"></a>[3] Daniel Moody and Peter Walsh, 1999. &#8220;Measuring The Value Of Information: An Asset Valuation Approach,&#8221; paper presented at the <span style="font-style: italic;">Seventh European Conference on Information Systems (ECIS’99)</span>, Copenhagen Business School, Frederiksberg, Denmark, 23-25 June, 1999. See <a style="text-decoration: underline; color: #0000cc;" href="http://wwwinfo.deis.unical.it/zumpano/2004-2005/PSI/lezione2/ValueOfInformation.pdf">http://wwwinfo.deis.unical.it/zumpano/2004-2005/PSI/lezione2/ValueOfInformation.pdf</a>. A precursor paper that is also quite helpful and cited much in Moody and Walsh is R. Glazer, 199. &#8220;Measuring the Value of Information: The Information Intensive Organisation&#8221;, <span style="font-style: italic;">IBM Systems Journal</span>, Vol 32, No 1, 1993.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset4"></a>[4] Some trade secrets could buck this trend if the value of the underlying enterprise that relies on them increases.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset5"></a>[5] M.K. Bergman, 2009. &#8220;The Law of Linked Data,&#8221; post in <span style="font-style: italic;">AI3:::Adaptive Information</span> blog, October 11, 2009. See <a href="http://www.mkbergman.com/837/the-law-of-linked-data/">http://www.mkbergman.com/837/the-law-of-linked-data/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset6"></a>[6]  Leonard Nakamura, 2001. <span style="font-style: italic;">What is the U.S. Gross Investment in Intangibles?  (At Least) One Trillion Dollars a Year!</span>,<br />
Working Paper No. 01-15, Federal Reserve Bank of Philadelphia, October 2001; see <a href="http://www.phil.frb.org/files/wps/2001/wp01-15.pdf">http://www.phil.frb.org/files/wps/2001/wp01-15.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset7"></a>[7] Carol A. Corrado, Charles R. Hulten, and Daniel E. Sichel, 2004. <span style="font-style: italic;">Measuring Capital and Technology: An Expanded Framework</span>. Federal Reserve Board, August 2004. <a href="http://www.federalreserve.gov/pubs/feds/2004/200465/200465pap.pdf">http://www.federalreserve.gov/pubs/feds/2004/200465/200465pap.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset8"></a>[8] Leonard I. Nakamura, 2009. <span style="font-style: italic;">Intangible Assets and National Income Accounting: Measuring a Scientific Revolution</span>, Working Paper No. 09-11, Federal Reserve Bank of Philadelphia, May 8, 2009; see <a style="text-decoration: underline; color: #0000cc;" href="http://www.philadelphiafed.org/research-and-data/publications/working-papers/2009/wp09-11.pdf">http://www.philadelphiafed.org/research-and-data/publications/working-papers/2009/wp09-11.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset9"></a>[9] <span class="org">Christopher Mackie, Rapporteur, 2009. </span><span style="font-style: italic;">Intangible Assets: Measuring and Enhancing Their Contribution to Corporate Value and Economic Growth: Summary of a Workshop</span>, prepared by <span class="org">the Board on Science, Technology, and Economic Policy (<a href="http://sites.nationalacademies.org/PGA/step/index.htm">STEP</a>)</span> <span class="org">Committee on National Statistics (<a href="http://www7.nationalacademies.org/cnstat/">CNSTAT</a>), </span><span class="org">ISBN: 0-309-14415-9, 124 pages; see</span> <span class="org"><a href="http://www.nap.edu/openbook.php?record_id=1274">http://www.nap.edu/openbook.php?record_id=1274 </a>(available for PDF download with sign-in).</span></div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset10"></a>[10] Brand Finance, 2006. <span style="font-style: italic;">Global Intangible Tracker 2006: An Annual Review of the World’s Intangible Value</span>, paper published by Brand Finance and The Institute of Practitioners in Advertising, London, UK, December 2006. See  <a href="http://www.brandfinance.com/images/upload/9.pdf">http://www.brandfinance.com/images/upload/9.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset11"></a>[11] Kenan Patrick Jarboe and Roland Furrow, 2008. <span style="font-style: italic;">Intangible Asset Monetization: The Promise and the Reality</span>, Working Paper #03 from the Athena Alliance, April 2008. See <a href="http://www.athenaalliance.org/pdf/IntangibleAssetMonetization.pdf">http://www.athenaalliance.org/pdf/IntangibleAssetMonetization.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset12"></a>[12] Uday M. Apte and Hiranya K. Nath, 2004, <em><a title="Apte and Nath, 'Size, Structure and Growth of the US Information Economy'" href="http://www.anderson.ucla.edu/documents/areas/ctr/bit/ApteNath.pdf.pdf">&#8220;</a></em><a title="Apte and Nath, 'Size, Structure and Growth of the US Information Economy'" href="http://www.anderson.ucla.edu/documents/areas/ctr/bit/ApteNath.pdf.pdf">Size, Structure and Growth of the US Information Economy</a><em><a title="Apte and Nath, 'Size, Structure and Growth of the US Information Economy'" href="http://www.anderson.ucla.edu/documents/areas/ctr/bit/ApteNath.pdf.pdf">,&#8221;</a></em> UCLA Anderson School of Management on Business and Information Technologies, December 2004; see  <span style="font-style: italic;">http://www.anderson.ucla.edu/documents/areas/ctr/bit/ApteNath.pdf.pdf</span>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset13"></a>[13] Uday M. Apte, Uday S. Karmarkar and Hiranya K Nath, 2008. &#8220;Information Services in the US Economy: Value, Jobs and Management Implications,&#8221; <span style="font-style: italic;">California Management Review</span>, Vol. 50, No.3, 12-30, 2008.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset14"></a>[14] See the Financial Accounting Standards Board—SFAS 141; see <a href="http://www.gasb.org/pdf/fas141r.pdf">http://www.gasb.org/pdf/fas141r.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset15"></a>[15] See further, AICPA Special Committee on Financial Reporting, 1994. <span style="font-style: italic;">Improving Business Reporting—A Customer Focus: Meeting the Information Needs of Investors and Creditors</span>. See  <a href="http://www.aicpa.org/InterestAreas/AccountingAndAuditing/Resources/EBR/DownloadableDocuments/Jenkins%20Committee%20Report.pdf">http://www.aicpa.org/InterestAreas/AccountingAndAuditing/Resources/EBR/DownloadableDocuments/Jenkins%20Committee%20Report.pdf</a>.&nbsp;</p>
<table style="text-align: left; width: 100%; margin-left: 20px;" border="0" cellspacing="0" cellpadding="2">
<tbody>
<tr>
<td style="vertical-align: top; width: 25%;"><span style="font-size: xx-small;">Blueprints </span>&nbsp;</p>
<p><span style="font-size: xx-small;">Book libraries</span></p>
<p><span style="font-size: xx-small;">Broadcast licenses</span></p>
<p><span style="font-size: xx-small;">Buy-sell agreements</span></p>
<p><span style="font-size: xx-small;">Certificates of need</span></p>
<p><span style="font-size: xx-small;">Chemical formulas</span></p>
<p><span style="font-size: xx-small;">Computer software</span></p>
<p><span style="font-size: xx-small;">Computerized databases</span></p>
<p><span style="font-size: xx-small;">Contracts</span></p>
<p><span style="font-size: xx-small;">Cooperative agreements</span></p>
<p><span style="font-size: xx-small;">Copyrights</span></p>
<p><span style="font-size: xx-small;">Credit information files</span></p>
<p><span style="font-size: xx-small;">Customer contracts</span></p>
<p><span style="font-size: xx-small;">Customer and client lists</span></p>
<p><span style="font-size: xx-small;">Customer relationships</span></td>
<td style="vertical-align: top; width: 25%;"><span style="font-size: xx-small;">Designs and drawings </span>&nbsp;</p>
<p><span style="font-size: xx-small;">Development rights</span></p>
<p><span style="font-size: xx-small;">Employment contracts</span></p>
<p><span style="font-size: xx-small;">Engineering drawings</span></p>
<p><span style="font-size: xx-small;">Environmental rights</span></p>
<p><span style="font-size: xx-small;">Film libraries</span></p>
<p><span style="font-size: xx-small;">Food flavorings and recipes</span></p>
<p><span style="font-size: xx-small;">Franchise agreements</span></p>
<p><span style="font-size: xx-small;">Historical documents</span></p>
<p><span style="font-size: xx-small;">Heath maintenance organization enrollment lists</span></p>
<p><span style="font-size: xx-small;">Know-how</span></p>
<p><span style="font-size: xx-small;">Laboratory notebooks</span></p>
<p><span style="font-size: xx-small;">Literary works</span></p>
<p><span style="font-size: xx-small;">Management contracts</span></p>
<p><span style="font-size: xx-small;">Manual databases</span></td>
<td style="vertical-align: top; width: 25%;"><span style="font-size: xx-small;">Manuscripts </span>&nbsp;</p>
<p><span style="font-size: xx-small;">Medical charts and records</span></p>
<p><span style="font-size: xx-small;">Musical compositions</span></p>
<p><span style="font-size: xx-small;">Newspaper morgue files</span></p>
<p><span style="font-size: xx-small;">Noncompete covenants</span></p>
<p><span style="font-size: xx-small;">Patent applications</span></p>
<p><span style="font-size: xx-small;">Patents (both product and process)</span></p>
<p><span style="font-size: xx-small;">Patterns</span></p>
<p><span style="font-size: xx-small;">Prescription drug files</span></p>
<p><span style="font-size: xx-small;">Prizes and awards</span></p>
<p><span style="font-size: xx-small;">Procedural manuals</span></p>
<p><span style="font-size: xx-small;">Product designs</span></p>
<p><span style="font-size: xx-small;">Proposals outstanding</span></p>
<p><span style="font-size: xx-small;">Proprietary computer software</span></p>
<p><span style="font-size: xx-small;">Proprietary processes</span></td>
<td style="vertical-align: top; width: 25%;"><span style="font-size: xx-small;">Proprietary products </span>&nbsp;</p>
<p><span style="font-size: xx-small;">Proprietary technology</span></p>
<p><span style="font-size: xx-small;">Publications</span></p>
<p><span style="font-size: xx-small;">Royalty agreements</span></p>
<p><span style="font-size: xx-small;">Schematics and diagrams</span></p>
<p><span style="font-size: xx-small;">Shareholder agreements</span></p>
<p><span style="font-size: xx-small;">Solicitation rights</span></p>
<p><span style="font-size: xx-small;">Subscription lists</span></p>
<p><span style="font-size: xx-small;">Supplier contracts</span></p>
<p><span style="font-size: xx-small;">Technical and specialty libraries</span></p>
<p><span style="font-size: xx-small;">Technical documentation</span></p>
<p><span style="font-size: xx-small;">Technology-sharing agreements</span></p>
<p><span style="font-size: xx-small;">Trade secrets</span></p>
<p><span style="font-size: xx-small;">Trained and assembled workforce</span></p>
<p><span style="font-size: xx-small;">Training manuals</span></td>
</tr>
</tbody>
</table>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset16"></a>[16] See, for example, Carol Corrado, Charles Hulten and Daniel Sichel, 2009. &#8220;Intangible Capital and U.S. Economic Growth,&#8221; <span style="font-style: italic;">Review of Income and Wealth Series </span><span style="font-weight: bold;">55</span>, Number 3, September 2009; see <a href="http://www.conference-board.org/pdf_free/IntangibleCapital_USEconomy.pdf">http://www.conference-board.org/pdf_free/IntangibleCapital_USEconomy.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset17"></a>[17] As stated in Kenan Patrick Jarboe, 2007. <span style="font-style: italic;">Measuring Intangibles: A Summary of Recent Activity</span>, Working Paper #02 from the Athena Alliance, April 2007. See <a href="http://www.athenaalliance.org/pdf/MeasuringIntangibles.pdf">http://www.athenaalliance.org/pdf/MeasuringIntangibles.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset18"></a>[18] The 5% estimate comes from Graham G. Rong, Chair at MIT Sloan CIO Symposium, as reported in the <a href="http://semanticweb.com/word-to-cios-digital-business-means-the-semantic-web_b19679">SemanticWeb.com</a> on May 5, 2011. (Rong also touted the use of semantic technologies to overcome this lack of use.) A similar 7% estimate comes from Pushpak Sarkar, 2002. &#8220;Information Quality in the Knowledge-Driven Enterprise,&#8221; <span style="font-style: italic;">InfoManagement Direct</span>, November 2002. See <a href="http://www.information-management.com/infodirect/20021115/6045-1.html">http://www.information-management.com/infodirect/20021115/6045-1.html</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset19"></a>[19] M.K. Bergman, 2005. &#8220;Search and the &#8217;25% Solution&#8217;,&#8221; <span style="font-style: italic;">AI3:::Adaptive Innovation</span> blog, September 14, 2005. See <a href="http://www.mkbergman.com/121/search-and-the-25-solution/">http://www.mkbergman.com/121/search-and-the-25-solution/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset20"></a>[20] M.K. Bergman, 2005.  <a href="http://www.mkbergman.com/82/untapped-assets-the-3-trillion-value-of-us-enterprise-documents/">&#8220;Untapped Assets: the $3 Trillion Value of U.S. Documents</a>,&#8221; <em>BrightPlanet Corporation White Paper,</em> July 2005, 42 pp. Also available  <a href="http://www.mkbergman.com/82/untapped-assets-the-3-trillion-value-of-us-enterprise-documents/">online</a> and in <a href="http://www.mkbergman.com/wp-content/themes/ai3/files/DocValue/DocumentsValue050712.pdf">PDF</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset21"></a>[21] From the CIA, 2011. <span style="font-style: italic;">The World Factbook</span>; accessed online at  <a href="https://www.cia.gov/library/publications/the-world-factbook/index.html">https://www.cia.gov/library/publications/the-world-factbook/index.html </a>on May 9, 2011. The &#8220;remaining advanced&#8221; countries are Australia, Canada, Iceland, Israel, Japan, Liechtenstein, Monaco, New Zealand, Norway, Puerto Rico, Singapore. South Korea, Switzerland, Taiwan.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset22"></a>[22] The range of estimates is drawn from the Nakamura <a href="#asset8">[8]</a> and CHS<a href="#asset9"> [9] </a>studies, with each respectively providing the lower and upper bounds. These values have been slightly decremented for non-US advanced countries, and greatly reduced for non-advanced ones.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset23"></a>[23] The high range is based on the categorical share of intangible asset categories (60 of 90) from the AIPCA work<a href="#asset15"> [15]</a>; the lower range is from the one-third of GDP estimates from <a href="#asset20">[20]</a>.These values have been slightly decremented for non-US advanced countries, and greatly reduced for non-advanced ones.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset24"></a>[24] For unused information assets, the high range is based on the one-third of GDP and 25% &#8220;waste&#8221; estimates from<a href="#asset20"> [20]</a>; the low range halves each of those figures. These values have been slightly decremented for non-US advanced countries, and greatly reduced for non-advanced ones (and zero for the low range).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset25"></a>[25] Reasons for the estimates to be too optimistic are information as important as goodwill; branding; intellectual basis of cited resources is indeed real; considerable differences by country and sector (see <a href="#asset10">[10]</a> and<a href="#asset16"> [16]</a>).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset26"></a>[26] Reasons for the estimates to be too conservative: no network effects; greatly discounted non-advanced countries; share is growing (but older estimates used); considerable differences by country and sector (see <a href="#asset10">[10]</a> and<a href="#asset16"> [16]</a>).</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset27"></a>[27] For some discussion of individual firm impacts and use cases see <a href="#asset10">[10]</a> and <a href="#asset20">[20]</a>, among others.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset28"></a>[28] See the <a href="http://www.mkbergman.com/temp-exhibit/">Timeline of Information History</a>, and its supporting documentation at M.K. Bergman, 2008. &#8220;Announcing the ‘Innovations in Information’ Timeline,&#8221; <span style="font-style: italic;">AI3:::Adaptive Information</span> blog, July 6, 2008; see  <a href="http://www.mkbergman.com/421/announcing-the-innovations-in-information-timeline/">http://www.mkbergman.com/421/announcing-the-innovations-in-information-timeline/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset29"></a>[29] This figure is a modification of the original published by Rama C. Hoetzlein, 2007. <em>Quanta &#8211; The Organization of Human Knowedge: Systems for Interdisciplinary Research</em>, Master&#8217;s Thesis, University of California Santa Barbara, June 2007; see <a href="http://www.rchoetzlein.com/quanta/">http://www.rchoetzlein.com/quanta/ </a>(p 112). I adapted this figure to add logics, data and metadata to the basic approach, with color coding also added.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset30"></a>[30] From Paul A. Strassmann, 1998. &#8220;The Value of Knowledge Capital,&#8221; <a href="http://www.yourdon.com/ap/apsummary.html">American Programmer</a>, March 1998. See  <a href="http://www.strassmann.com/pubs/valuekc/">http://www.strassmann.com/pubs/valuekc/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset31"></a>[31] For example, according to<a href="#asset11"> [11]</a>, in a 2003 Accenture survey of senior managers across industries, 49 percent of respondents said that intangible assets are their primary focus for delivering long-term shareholder value, but only 5 percent stated that they had an organized system to track the performance of these assets. Also, according sources cited in Gio Wiederhold, Shirley Tessler, Amar Gupta and David Branson Smith, 2009. &#8220;The Valuation of Technology-Based Intellectual Property in Offshoring Decisions,&#8221; in <span style="font-style: italic;">Communications for the Association of Information Systems (CAIS)</span> <span style="font-weight: bold;">24</span>, May 2009 (see <a href="http://ilpubs.stanford.edu:8090/951/2/Article_07-270.pdf">http://ilpubs.stanford.edu:8090/951/2/Article_07-270.pdf): </a>Owners and stockholders acknowledge that IP valuation of technological assets is not routine within many organizations. A 2007 study performed by Micro Focus and INSEAD highlights the current state of affairs: Of the 250 chief information officers (CIOs) and chief finance officers (CFOs) surveyed from companies in the U.S., UK, France, Germany, and Italy, less than 50 percent had attempted to value their IT assets, and more than 60 percent did not assess the value of their software.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="asset32"></a>[32] M.K. Bergman, 2006. &#8220;Climbing the Data Federation Pyramid,&#8221; <span style="font-style: italic;">AI3:::Adaptive Information</span> blog, May 25, 2006; see  <a href="http://www.mkbergman.com/229/climbing-the-data-federation-pyramid/">http://www.mkbergman.com/229/climbing-the-data-federation-pyramid/</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/958/leveraging-intangible-assets-using-semantic-technologies/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Making Connections Real</title>
		<link>http://www.mkbergman.com/941/making-connections-real/</link>
		<comments>http://www.mkbergman.com/941/making-connections-real/#comments</comments>
		<pubDate>Mon, 31 Jan 2011 07:18:34 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Innovation]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[UMBEL]]></category>
		<category><![CDATA[concepts]]></category>
		<category><![CDATA[cyc]]></category>
		<category><![CDATA[interoperability]]></category>
		<category><![CDATA[ontology mapping]]></category>
		<category><![CDATA[OWL 2]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=941</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Making Connections Real&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Ontologies&amp;rft.subject=UMBEL&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-01-31&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/941/making-connections-real/&amp;rft.language=English"></span>
Refining UMBEL&#8217;s Linking and Mapping Predicates with Wikipedia We are only days away from releasing the first commercial version 1.00 of UMBEL (Upper Mapping and Binding Exchange Layer) [1]. To recap, UMBEL has two purposes, both aimed to promote the interoperability of Web-accessible content. First, it provides a general vocabulary of classes and predicates for [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Making Connections Real&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Ontologies&amp;rft.subject=UMBEL&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-01-31&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/941/making-connections-real/&amp;rft.language=English"></span>
<h2><a href="http://umbel.org/"><img style="border: 0px solid; width: 206px; height: 100px; float: left; margin-right: 10px;" title="UMBEL Vocabulary and Reference Concept Ontology" src="http://www.mkbergman.com/wp-content/themes/ai3/images/umbel_medium.png" alt="UMBEL Vocabulary and Reference Concept Ontology" /></a> <a href="http://en.wikipedia.org/"><img style="border: 0px solid; float: right; margin-left: 10px;" src="http://en.wikipedia.org/images/wiki-en.png" alt="Wikipedia" /></a> Refining UMBEL&#8217;s Linking and Mapping Predicates with Wikipedia</h2>
<p>We are only days away from releasing the first commercial version 1.00         of <a href="http://umbel.org">UMBEL</a> (<em>Upper Mapping and Binding         Exchange Layer</em>) <a href="#connect1">[1]</a>. To recap, UMBEL has two purposes, both aimed         to promote the interoperability of Web-accessible content. First, it         provides a general vocabulary of classes and predicates for describing         domain ontologies and external datasets. Second, UMBEL is a coherent         framework of 28,000 broad subjects and topics (the &#8220;reference         concepts&#8221;), which can act as binding nodes for mapping relevant         content.</p>
<p>This last iteration of development has focused on the real-world test         of mapping UMBEL to Wikipedia <a href="#connect2">[2]</a>. The result, to be more fully         described upon release, has led to two major changes. It has acted to         expand the size of the core UMBEL reference concepts to about 28,000.         And it has led to adding to and refining the mapping predicates         necessary for UMBEL to fulfill its purpose as a reference structure for         external resources. This latter change is the focus of this post.</p>
<p>There is a huge diversity of organizational structure and world views         on the Web; the linking and mapping predicates to fulfill this purpose         must also capture that diversity. Relations between things on the Web         can range from the exact and identity, to the approximate, descriptive         and casual <a href="#connect3">[3]</a>. The 16 K direct mappings that have now been made         between UMBEL and Wikipedia (resulting in the linkage of more than 2         million Wikipedia pages) provide a real-world test for how to capture         this diversity. The need is to find the range of predicates that can         reflect and capture quality, accurate mappings. Further, because         mappings also can be aided with a variety of techniques from the manual         to the automatic, it is important to characterize the specific mapping         methods used whenever a linking predicate is assigned. Such         qualifications can help to distinguish mapping trustworthiness, plus         enable later segregation for the application of improved methods as         they may arise.</p>
<p>As a result, the UMBEL Vocabulary now has a pretty well vetted and         diverse set of linking and mapping predicates. Guidelines for how these         differ, how they are used, and how they are qualified is described         next.</p>
<h2>A Comparison of Mapping Predicates</h2>
<p>Properties for linking and mapping need to differ more than in name or         intended use. They must represent differences that affect inferences         and reasoners, and can be acted upon by specific utilities via user         interfaces and other applications. Furthermore, the diversity of         mapping predicates should capture the types of diverse mappings and         linkages possible between disparate sources.</p>
<p>Sometimes things are individuals or instances; other times they are         classes or groupings of similar things. Sometimes things are of the         same kind, but not exactly aligned. Sometimes things are unlike, but         related in a common way. (Everything in Britain, for example, is a         British &#8220;thing&#8221; even though they may be as different as trees, dead         kings or cathedrals.) Sometimes we want to say something about a thing,         such as an animal&#8217;s fur color or age, as a way to further characterize         it, and so on.</p>
<p>The <a href="http://www.w3.org/TR/owl2-overview/">OWL 2</a> language         and existing semantic Web languages give us some tools and existing         vocabulary to capture some of this diversity. How these options, plus         new predicates defined for UMBEL&#8217;s purposes, compare is shown by this         table:</p>
<table style="margin-bottom: 10px; border-collapse: collapse;" border="0" cellpadding="4">
<tbody>
<tr>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Property</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Relative Strength</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Usage</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Standard Reasoner?</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Inverse Property?</strong></td>
<td style="border-bottom: 1px solid; width: 134pt; text-align: center; background-color: #ffffcc;" colspan="2" width="178"><strong>Kind of Thing</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Symmetrical?</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Transitive?</strong></td>
<td style="text-align: center; background-color: #ffffcc;" rowspan="2"><strong>Reflexive?</strong></td>
</tr>
<tr>
<td style="text-align: center; background-color: #ffffcc;"><strong>It is</strong></td>
<td style="text-align: center; background-color: #ffffcc;"><strong>It Relates to</strong></td>
</tr>
<tr>
<td><code>owl:equivalentClass</code></td>
<td style="text-align: center;">10</td>
<td style="text-align: center;">equivalence</td>
<td style="text-align: center;">X</td>
<td style="text-align: center;">N/A</td>
<td style="text-align: center;">class</td>
<td style="text-align: center;">class</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
</tr>
<tr>
<td><code>owl:sameAs</code></td>
<td style="text-align: center;">9</td>
<td style="text-align: center;">identity</td>
<td style="text-align: center;">X</td>
<td style="text-align: center;">N/A</td>
<td style="text-align: center;">individual</td>
<td style="text-align: center;">individual</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
</tr>
<tr>
<td><code>rdfs:subClassOf</code></td>
<td style="text-align: center;">8</td>
<td style="text-align: center;">subset</td>
<td style="text-align: center;">X</td>
<td style="text-align: center;"></td>
<td style="text-align: center;">class</td>
<td style="text-align: center;">class</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
</tr>
<tr>
<td><code>umbel:correspondsTo</code></td>
<td style="text-align: center;">7</td>
<td style="text-align: center;">~equivalence</td>
<td style="text-align: center;">+ / -</td>
<td style="text-align: center;"></td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">RefConcept</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">yes</td>
</tr>
<tr>
<td style="vertical-align: top;"><code>skos:narrowerTransitive</code></td>
<td style="vertical-align: top; text-align: center;">6</td>
<td style="vertical-align: top; text-align: center;">hierarchical</td>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: center;">X</td>
<td style="vertical-align: top; text-align: center;">skos:Concept</td>
<td style="vertical-align: top; text-align: center;">skos:Concept</td>
<td style="vertical-align: top; text-align: center;">no</td>
<td style="vertical-align: top; text-align: center;">yes</td>
<td style="vertical-align: top; text-align: center;">no</td>
</tr>
<tr>
<td style="vertical-align: top;"><code>skos:broaderTransitive</code></td>
<td style="vertical-align: top; text-align: center;">6</td>
<td style="vertical-align: top; text-align: center;">hierarchical</td>
<td style="vertical-align: top;"></td>
<td style="vertical-align: top; text-align: center;">X</td>
<td style="vertical-align: top; text-align: center;">skos:Concept</td>
<td style="vertical-align: top; text-align: center;">skos:Concept</td>
<td style="vertical-align: top; text-align: center;">no</td>
<td style="vertical-align: top; text-align: center;">yes</td>
<td style="vertical-align: top; text-align: center;">no</td>
</tr>
<tr>
<td><code>rdf:type</code></td>
<td style="text-align: center;">5</td>
<td style="text-align: center;">membership</td>
<td style="text-align: center;">X</td>
<td style="text-align: center;"></td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">class</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">no</td>
</tr>
<tr>
<td><code>umbel:isAbout</code></td>
<td style="text-align: center;">4</td>
<td style="text-align: center;">topical</td>
<td style="text-align: center;"></td>
<td style="text-align: center;">X</td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">RefConcept</td>
<td style="text-align: center;">perhaps</td>
<td style="text-align: center;">not likely</td>
<td style="text-align: center;">not likely</td>
</tr>
<tr>
<td><code>umbel:isLike</code></td>
<td style="text-align: center;">3</td>
<td style="text-align: center;">similarity</td>
<td style="text-align: center;"></td>
<td style="text-align: center;"></td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">yes</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">not likely</td>
</tr>
<tr>
<td><code>umbel:relatesToXXX</code></td>
<td style="text-align: center;">2</td>
<td style="text-align: center;">relationship</td>
<td style="text-align: center;"></td>
<td style="text-align: center;"></td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">SuperType</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">not likely</td>
</tr>
<tr>
<td><code>umbel:isCharacteristicOf</code></td>
<td style="text-align: center;">1</td>
<td style="text-align: center;">attribute</td>
<td style="text-align: center;"></td>
<td style="text-align: center;">X</td>
<td style="text-align: center;">anything</td>
<td style="text-align: center;">RefConcept</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">no</td>
<td style="text-align: center;">no</td>
</tr>
</tbody>
</table>
<p>I discuss each of these predicates below. But, first, let&#8217;s discuss         what is in this table and how to interpret it <a href="#connect4">[4]</a>.</p>
<ul>
<li> <em>Relative strength</em> &#8211; an arbitrary value that is meant to           capture the inferencing power (<em>entailments</em>) embodied in the           predicate. Identity (equivalence), class implications, and specific           predicate properties that can be acted upon by reasoners are given           higher relative power </li>
<li> <em>Standard reasoner?</em> &#8211; indicates whether standard reasoners<a href="#connect5"> [5]</a> draw inferences and entailments from the specific property. A &#8220;+ / -&#8221;           indication indicates that reasoners do not recognize the specific           property per se, but can act upon the predicates (such as symmetric,           transitive or reflexive) used to define the predicate </li>
<li> <em>Inverse property?</em> &#8211; indicates whether there is an inverse           property used within UMBEL that is not listed in the table. In such           cases, the predicate shown is the one that treats the external entity           as the subject </li>
<li> <em>It is a kind of thing</em> &#8211; is the same as <em>domain</em>; it means           the kind of thing to which the subject belongs </li>
<li> <em>It relates to a kind on thing</em> &#8211; is the same as <em>range</em>;           it means the kind of thing to which the object of the subject belongs </li>
<li> <em>Symmetrical?</em> &#8211; describes whether the predicate for an           <span style="font-style: italic;">s &#8211; p &#8211; o</span> (<span style="font-style: italic;">subject &#8211; predicate &#8211; object</span>)           relationship can also apply in the o &#8211; p &#8211; s manner </li>
<li> <em>Transitive?</em> &#8211; is whether the predicate interlinks two           individuals A and C whenever it interlinks A with B and B with C for           some individual B </li>
<li> <em>Reflexive?</em> &#8211; By that is meant whether the subject has itself           as a member. In a reflexive closure between subject and object the           subject is fully included as a member. Equivalence, subset, greater           than or equal to, and less than or equal to relationships are           reflexive; not equal, less than or greater than relationships are           not. </li>
</ul>
<p>The <em>Usage</em> metric is described for each property below.</p>
<h2>Individual Predicates Discussion</h2>
<p>To further aid the understanding of these properties, we can also group         them into equivalence, membership, approximate or descriptive         categories.</p>
<h3>Equivalent Properties</h3>
<p>Equivalent properties are the most powerful available since they entail         all possible axioms between the resources.</p>
<h4>owl:equivalentClass</h4>
<p>Equivalent class means that two classes have the same members; each is         a sub-class of the other. The classes may differ in terms of         annotations defined for each of them, but otherwise they are         axiomatically equivalent.</p>
<p>An <code>owl:equivalentClass</code> assertion is the most powerful         available because of its ability to &#8216;<span style="font-style: italic;">Explode the Domain</span>&#8216; <a href="#connect6">[6]</a>. Because of its         entailments, <code>owl:equivalentClass</code> should be used with great         care.</p>
<h4>owl:sameAs</h4>
<p>The <code>owl:sameAs</code> assertion claims two instances to be an         identical individual. This assertion also carries with it strong         entailments of symmetry and reflexivity.</p>
<p><code>owl:sameAs</code> is often misapplied <a href="#connect7">[7]</a>. Because of its         entailments, it too should be used with great care. When there are         doubts about claiming this strong relationship, UMBEL has the         <code>umbel:isLike</code> alternative (see below).</p>
<h3>Membership and Hierarchical Properties</h3>
<p>Membership properties assert that an instance is a member of a class.</p>
<h4>rdfs:subClassOf</h4>
<p>The <code>rdfs:subClassOf</code> asserts that one class is a subset of         another class. This assertion is transitive and reflexive. It is a key         means for asserting hierarchical or taxonomic structures in an         ontology. This assertion also has strong entailments, particularly in         the sense of members having consistent general or more specific         relationships to one another.</p>
<p>Care must be exercised that full inclusivity of members occurs when         asserting this relationship. When correctly asserted, however, this is         one of the most powerful means to establish a reasoning structure in an         ontology because of its transitivity.</p>
<h4>skos:narrowerTransitive/skos:broaderTransitive</h4>
<p>Both of these predicates work on <code>skos:Concept</code> (recall that         <code>umbel:RefConcept</code> is itself a <code>subClassOf</code> a         <code>skos:Concept</code>). The predicates state a hierarchical link         between the two concepts that indicates one is in some way more general         (&#8220;broader&#8221;) than the other (&#8220;narrower&#8221;) or <span style="font-style: italic;">vice versa</span>. The particular application of         <code>skos:broaderTransitive</code> (or its complement) is used to         infer the transitive closure of the hierarchical links, which can then         be used to access direct or indirect hierarchical links between         concepts.</p>
<p>The transitive relationship means that there may be intervening         concepts between the two stated resources, making the relationship an         ancestral one, and not necessarily (though it is possible to be so) a         direct parent-child one.</p>
<h4>rdf:type</h4>
<p>The <code>rdf:type</code> assertion assigns instances (individuals) to         a class. While the idea is straightforward, it is important to         understand the intensional nature of the target class to ensure that         the assignment conforms to the intended class scope. When this         determination can not be made, one of the more approximate UMBEL         predicates (see below) should be used.</p>
<h3>Approximation Properties</h3>
<p>For one reason or another, the precise assertions of the equivalent or         membership properties above may not be appropriate. For example, we         might not know sufficiently an intended class scope, or there might be         ambiguity as to the identity of a specific entity (is it Jimmy Johnson         the football coach, race car driver, fighter, local plumber or someone         else?). Among other options &#8212; along a spectrum of relatedness &#8212; is         the desire to assign a predicate that is meant to represent the same         kind of thing, yet without knowing if the relationship is an         equivalence (identity, or <code>sameAs</code>), a subset, or merely         just a member of relationship. Alternatively, we may recognize that we         are dealing with different things, but want to assert a relationship of         an uncertain nature.</p>
<p>This section presents the UMBEL alternatives for these different kinds         of approximate predicates <a href="#connect4">[4]</a>.</p>
<h4>umbel:correspondsTo</h4>
<p>The most powerful of these approximate predicates in terms of alignment         and entailments is the <code>umbel:correspondsTo</code> property. This         predicate is the recommended option if, after looking at the source and         target knowledge bases <a href="#connect8">[8]</a>, we believe we have found the best         equivalent relationship, but do not have the information or assurance         to assign one of the relationships above. So, while we are sure we are         dealing with the same kind of thing, we may not have full confidence to         be able to assign one of these alternatives:</p>
<pre>   rdfs:subClassOf
   owl:equivalentClass
   owl:sameAs
   superClassOf
</pre>
<p>Thus, with respect to existing and commonly used predicates, we want an         umbrella property that is generally equivalent or so in nature, and if         perhaps known precisely might actually encompass one of the above         relations, but we don&#8217;t have the certainty to choose one of them nor         perhaps assert full &#8220;sameness&#8221;. This is not too dissimilar from the         rationale being tested for the <code>x:coref</code> predicate in         relation to <code>owl:sameAs</code> from the UMBC Ebiquity group<a href="#connect9"> [9</a>,<a href="#connect10">10]</a>.</p>
<p>The property <code>umbel:correspondsTo</code> is thus used to assert a         close correspondence between an external class, named entity,         individual or instance with a Reference Concept class. It asserts this         correspondence through the basis of both its subject matter and         intended scope.</p>
<p>This property may be reified with the <code>umbel:hasMapping</code> property to describe the &#8220;degree&#8221; of the assertion.</p>
<h4>umbel:isAbout</h4>
<p>In most uses, the most prevalent linking property to be used is the         <code>umbel:isAbout</code> assertion. This predicate is useful when         tagging external content with metadata for alignment with an         UMBEL-based reference ontology. The reciprocal assertion,         <code>umbel:isRelatedTo</code> is when an assertion within an UMBEL         vocabulary is desired to an external ontology. Its application is where         the reference vocabulary itself needs to refer to an external topic or         concept.</p>
<p>The <code>umbel:isAbout</code> predicate does not have the same level         of confidence or &#8220;sameness&#8221; as the <code>umbel:correspondsTo</code> property. It may also reflect an assertion that is more like         <code>rdf:type</code>, but without the confidence of class membership.</p>
<p>The property <code>umbel:isAbout</code> is thus used to assert the         relation between an external named entity, individual or instance with         a Reference Concept class. It can be interpreted as providing a topical         assertion between an individual and a Reference Concept.</p>
<p>This property may be reified with the <code>umbel:hasMapping</code> property to describe the &#8220;degree&#8221; of the assertion.</p>
<h4>umbel:isLike</h4>
<p>The property <code>umbel:isLike</code> is used to assert an associative         link between similar individuals who may or may not be identical, but         are believed to be so. This property is not intended as a general         expression of similarity, but rather the likely but uncertain same         identity of the two resources being related.</p>
<p>This property may be considered as an alternative to         <code>sameAs</code> where there is not a certainty of sameness, and/or         when it is desirable to assert a degree of overlap of sameness via the         <code>umbel:hasMapping</code> reification predicate. This property can         and should be changed if the certainty of the sameness of identity is         subsequently determined.</p>
<p>It is appropriate to use this property when there is strong belief the         two resources refer to the same individual with the same identity, but         that association can not be asserted at the present time with full         certitude.</p>
<p>This property may be reified with the <code>umbel:hasMapping</code> property to describe the &#8220;degree&#8221; of the assertion.</p>
<h4>umbel:relatesToXXX</h4>
<p>At a different point along this relatedness spectrum we have         <em>unlike</em> things that we would like to relate to one another. It         might be an attribute, a characteristic or a functional property about         something that we care to describe. Further, by nature of the thing we         are relating, we may also be able to describe the kind of thing we are         relating. The UMBEL <code>SuperTypes</code> (among many other options)         gives us one such means to characterize the thing being related.</p>
<p>UMBEL presently has 31 predicates for these assertions relating to a         <code>SuperType</code> <a href="#connect11">[11]</a>. The various properties designated by         <code>umbel:relatesToXXX</code> are used to assert a relationship         between an external instance (object) and a particular (XXX)         <code>SuperType</code>. The assertion of this property does not entail         class membership with the asserted <code>SuperType</code>. Rather, the         assertion may be based on particular attributes or characteristics of         the object at hand. For example, a British person might have an         <code>umbel:relatesToXXX</code> asserted relation to the         <code>SuperType</code> of the geopolitical entity of Britain, though         the actual thing at hand (person) is a member of the Person class         <code>SuperType</code>.</p>
<p>This predicate is used for filtering or clustering, often within user         interfaces. Multiple <code>umbel:relatesToXXX</code> assertions may be         made for the same instance.</p>
<p>Each of the 32 UMBEL <code>SuperTypes</code> has a matching predicate         for external topic assignments (<code>relatesToOtherOrganism</code> shares two <code>SuperTypes</code>, leading to 31 different         predicates):</p>
<table class="center_ok" style="margin-bottom: 10px; border-collapse: collapse; width: 600px;" border="1" cellpadding="4">
<tbody>
<tr>
<td style="text-align: center; background-color: #ffffcc;"><strong>SuperType</strong></td>
<td style="text-align: center; background-color: #ffffcc;"><strong>Mapping Predicate</strong></td>
<td style="text-align: center; background-color: #ffffcc;"><strong>Comments</strong></td>
</tr>
<tr>
<td valign="top">NaturalPhenomena</td>
<td valign="top"><code>relatesToPhenomenon</code></td>
<td valign="top">This predicate relates an external entity to the                 <code>SuperType</code> (ST) shown. It indicates there is a                 relationship to the ST of a verifiable nature, but which is                 undetermined as to strength or a full rdf:type relationship</td>
</tr>
<tr>
<td valign="top">NaturalSubstances</td>
<td valign="top"><code>relatesToSubstance</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Earthscape</td>
<td valign="top"><code>relatesToEarth</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Extraterrestrial</td>
<td valign="top"><code>relatesToHeavens</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Prokaryotes</td>
<td rowspan="2"><code>relatesToOtherOrganism</code></td>
<td rowspan="2">same as above</td>
</tr>
<tr>
<td valign="top">ProtistsFungus</td>
</tr>
<tr>
<td valign="top">Plants</td>
<td valign="top"><code>relatesToPlant</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Animals</td>
<td valign="top"><code>relatesToAnimal</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Diseases</td>
<td valign="top"><code>relatesToDisease</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">PersonTypes</td>
<td valign="top"><code>relatesToPersonType</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Organizations</td>
<td valign="top"><code>relatesToOrganizationType</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">FinanceEconomy</td>
<td valign="top"><code>relatesToFinanceEconomy</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Society</td>
<td valign="top"><code>relatesToSociety</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Activities</td>
<td valign="top"><code>relatesToActivity</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Events</td>
<td valign="top"><code>relatesToEvent</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Time</td>
<td valign="top"><code>relatesToTime</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Products</td>
<td valign="top"><code>relatesToProductType</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">FoodorDrink</td>
<td valign="top"><code>relatesToFoodDrink</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Drugs</td>
<td valign="top"><code>relatesToDrug</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Facilities</td>
<td valign="top"><code>relatesToFacility</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Geopolitical</td>
<td valign="top"><code>relatesToGeoEntity</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Chemistry</td>
<td valign="top"><code>relatesToChemistry</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">AudioInfo</td>
<td valign="top"><code>relatesToAudioMusic</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">VisualInfo</td>
<td valign="top"><code>relatesToVisualInfo</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">WrittenInfo</td>
<td valign="top"><code>relatesToWrittenInfo</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">StructuredInfo</td>
<td valign="top"><code>relatesToStructuredInfo</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">NotationsReferences</td>
<td valign="top"><code>relatesToNotation</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Numbers</td>
<td valign="top"><code>relatesToNumbers</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Attributes</td>
<td valign="top"><code>relatesToAttribute</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">Abstract</td>
<td valign="top"><code>relatesToAbstraction</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">TopicsCategories</td>
<td valign="top"><code>relatesToTopic</code></td>
<td valign="top">same as above</td>
</tr>
<tr>
<td valign="top">MarketsIndustries</td>
<td valign="top"><code>relatesToMarketIndustry</code></td>
<td valign="top">same as above</td>
</tr>
</tbody>
</table>
<p>This property may be reified with the <code>umbel:hasMapping</code> property to describe the &#8220;degree&#8221; of the assertion.</p>
<h3>Descriptive Properties</h3>
<p>Descriptive properties are annotation properties.</p>
<h4><span id="umbel:isCharacteristicOf" class="mw-headline">umbel:isCharacteristicOf</span></h4>
<p>Two annotation properties are used to describe the attribute         characteristics of a <code>RefConcept</code>, namely         <code>umbel:hasCharacteristic</code> and its reciprocal,         <code>umbel:isCharacteristicOf</code>. These properties are the means         by which the external properties to describe things are able to be         brought in and used as lookup references (that is, metadata) to         external data attributes. As annotation properties, they have weak         semantics and are used for accounting as opposed to reasoning purposes.</p>
<p>These properties are designed to be used in external ontologies to         characterize, describe, or provide attributes for data records         associated with a given <code>RefConcept</code>. It is via this         property or its inverse, <code>umbel:hasCharacteristic</code>, that         external data characterizations may be incorporated and modeled within         a domain ontology based on the UMBEL vocabulary.</p>
<h2>Qualifying the Mappings</h2>
<p>The choice of these mapping predicates may be aided with a variety of         techniques from the manual to the automatic. It is thus important to         characterize the specific mapping methods used whenever a linking         predicate is assigned. Following this best practice allows us to         distinguish mapping trustworthiness, plus to also enable later         segregation for the application of improved methods as they may arise.</p>
<p>UMBEL, for its current mappings and purposes, has adopted the following         controlled vocabulary for characterizing the         <code>umbel:hasMapping</code> predicate; such listings may be readily         modified for other domains and purposes when using the UMBEL         vocabulary. This controlled vocabulary is based on instances of the         <code>Qualifier</code> class. This class represents a set of         descriptions to indicate the method used when applying an approximate         mapping predicate (see above):</p>
<table class="center_ok" style="margin-bottom: 10px; border-collapse: collapse; width: 600px;" border="1" cellpadding="4">
<tbody>
<tr>
<td style="text-align: center; background-color: #ffffcc;"><strong>Qualifier</strong></td>
<td style="text-align: center; background-color: #ffffcc;"><strong>Description</strong></td>
</tr>
<tr>
<td>Manual &#8211; Nearly Equivalent</td>
<td>The two mapped concepts are deemed to be nearly an                 <code>equivalentClass</code> or <code>sameAs</code> relationship, but not 100% so</td>
</tr>
<tr>
<td>Manual &#8211; Similar Sense</td>
<td>The two mapped concepts share much overlap, but are not the                 exact same sense, such as an action as related to the thing it                 acts upon</td>
</tr>
<tr>
<td>Heuristic &#8211; ListOf Basis</td>
<td>Type assignment based on Wikipedia ListOf category; not                 currently used</td>
</tr>
<tr>
<td>Heuristic &#8211; Not Specified</td>
<td>Heuristic mapping method applied; script or technique not                 otherwise specified</td>
</tr>
<tr>
<td>External &#8211; OpenCyc Mapping</td>
<td>Mapping based on existing OpenCyc assertion</td>
</tr>
<tr>
<td>External &#8211; DBOntology Mapping</td>
<td>Mapping based on existing DBOntology assertion</td>
</tr>
<tr>
<td>External &#8211; GeoNames Mapping</td>
<td>Mapping based on existing GeoNames assertion</td>
</tr>
<tr>
<td>Automatic &#8211; Inspected SV</td>
<td>Mapping based on automatic scoring of concepts using <a rel="nofollow" href="http://code.google.com/p/semanticvectors/">Semantic Vectors</a>, with specific alignment choice                 based on hand selection</td>
</tr>
<tr>
<td>Automatic &#8211; Inspected S-Match</td>
<td>Mapping based on automatic scoring of concepts using <a rel="nofollow" href="http://semanticmatching.org/">S-Match</a>, with                 specific alignment choice based on hand selection; not                 currently used</td>
</tr>
<tr>
<td>Automatic &#8211; Not Specified</td>
<td>Mapping based on automatic scoring of concepts using a script                 or technique not otherwise specified; not currently used</td>
</tr>
</tbody>
</table>
<p>Again, as noted, for other domains and other purposes this listing can         be modified at will.</p>
<h2>Status of Mappings</h2>
<p>Final aspects of these mappings are now undergoing a last round of         review. A variety of sources and methods have been applied, to be more         fully documented at time of release.</p>
<p>Some of the final specifics and counts may be modified slightly by the         time of actual release of UMBEL v 1.00, which should occur in the next         week or so. Nonetheless, here are some tentative counts for a select         portion of these predicates in the internal draft version:</p>
<table class="center_ok" style="margin-bottom: 10px; border-collapse: collapse; width: 320px;" border="1" cellpadding="4">
<tbody>
<tr>
<td style="text-align: center; background-color: #ffffcc;"><strong>Item or Predicate</strong></td>
<td style="text-align: center; background-color: #ffffcc;"><strong>Count</strong></td>
</tr>
<tr>
<td style="vertical-align: top;">Total UMBEL Reference Concepts</td>
<td style="vertical-align: top; text-align: right;">27,917</td>
</tr>
<tr>
<td style="vertical-align: top;"><code>owl:equivalentClass</code> <br />
 (external OpenCyc, PROTON, DBpedia)</td>
<td style="vertical-align: top; text-align: right;">28,618</td>
</tr>
<tr>
<td><code>umbel:correspondsTo</code> <br />
 (direct mappings to Wikipedia)</td>
<td style="text-align: right;">16,884</td>
</tr>
<tr>
<td><code>rdf:type</code></td>
<td style="text-align: right;">876,125</td>
</tr>
<tr>
<td><code>umbel:relatesToXXX                 <br />
 (31 variations)                 <br />
 </code></td>
<td style="text-align: right;">3,059,023</td>
</tr>
<tr>
<td>Unique Wikipedia Pages Mapped</td>
<td style="text-align: right;">2,130,021</td>
</tr>
</tbody>
</table>
<p>All of these assignments have also been hand inspected and       vetted.</p>
<h2>Major Progress Towards a Gold Standard</h2>
<p>To date, in various steps and in various phases, the inspection of         Wikipedia, its categories, and its match with UMBEL has perhaps         incurred more than 5,000 hours (or nearly a three person-year         equivalence) of expert domain and semantic technology review <a href="#connect12">[12]</a>. As         noted, about 60% (16,884 of 27,917) of UMBEL concepts have now been         directly mapped to Wikipedia and inspected for accuracy.</p>
<p>Wikipedia provides the most demanding and complete mapping target         available for testing the coverage of UMBEL&#8217;s reference concepts and         the adequacy of its vocabulary. As a result, we have added to and         refined the mapping and linking predicates used in the UMBEL         vocabulary, and added a <code>Qualifier</code> class to record the         mapping process, as this post overviews. We have added the         <code>SuperType</code> class to better organize and disambiguate large         knowledge bases <a href="#connect13">[13]</a>. And, in this mapping process, we have expanded         UMBEL&#8217;s reference concepts by about 33% to improve coverage, while         remaining consistent with its origins as a faithful subset of the         venerable Cyc knowledge structure <a href="#connect14">[14]</a>.</p>
<p>A side benefit that has emerged from these efforts &#8212; with a huge         potential upside &#8212; is the valuable combination of UMBEL and Wikipedia         as a &#8220;<a href="http://en.wikipedia.org/wiki/Gold_standard_%28test%29">gold         standard</a>&#8221; for aligning and mapping knowledge bases. Such a standard         is critically needed. For example, in reviewing many of the existing         Wikipedia mappings claimed as accurate, we found misplacement errors         that averaged 15.8% <a href="#connect15">[15]</a>. Having a baseline of vetted mappings will aid         future mappings. Moreover, having a complete conceptual infrastructure         over Wikipedia will enable new and valuable reasoning and inference         services.</p>
<p>The results from the UMBEL v 1.00 mapping are promising and very much         useful today, but by no means complete. Future versions will extend the         current mappings and continue to refine its accuracy and completeness <a href="#connect16"> [16]</a>. What we can say, however, is that a coherent organization and         conceptual schema &#8212; namely, UMBEL &#8212; overlaid on the richness of the         instance data and content of Wikipedia, can produce immediate and         useful benefits. These benefits apply to semantic search, semantic         annotation and tagging, reasoning, discovery, inferencing, organization         and comparisons.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect1"></a>[1] UMBEL has been under development since March 2007, with its first         release in July 2008 and its last release (v 0.80) in November 2010.         Throughout its releases we have reserved incrementing the vocabulary         and its ontology to version 1.00 until it was deemed &#8220;commercial&#8221;. This         is the first version to meet this test. We&#8217;d like to thank our partner,         <a href="http://ontotext.com">Ontotext</a>, and the <a href="http://render-project.eu/">RENDER</a> project for providing assistance         and resources to bring the system to this point.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect2"></a>[2] The basic approach is to use the <a rel="nofollow" href="http://dbpedia.org/About">DBpedia</a> representation of Wikipedia, since its         extractors have already done a great job in preparing structured data.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect3"></a>[3] M.K. Bergman, 2010. &#8220;The Nature of Connectedness on the Web,&#8221;         <em>AI3:::Adaptive Information</em> blog, November 22, 2010; see <a rel="nofollow" href="http://www.mkbergman.com/935/the-nature-of-connectedness-on-the-web/">http://www.mkbergman.com/935/the-nature-of-connectedness-on-the-web/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect4"></a>[4] A good starting reference for some of these concepts is Pascal         Hitzler et al., eds., 2009. <em>OWL 2 Web Ontology Language Primer</em>,         a W3C Recommendation, 27 October 2009; see <a rel="nofollow" href="http://www.w3.org/TR/owl2-primer/">http://www.w3.org/TR/owl2-primer/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect5"></a>[5] Such as the <a rel="nofollow" href="http://en.wikipedia.org/wiki/Reasoner">semantic reasoners</a> FaCT++, Racer, Pellet, Hermit, etc..</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect6"></a>[6] Fred Giasson first coined this phrase; see F. Giasson, 2008.         &#8220;Exploding the Domain: UMBEL Web Services by Zitgist,&#8221; blog posting on         April 20, 2008; see <a class="external free" rel="nofollow" href="http://fgiasson.com/blog/index.php/2008/04/20/exploding-the-domain-umbel-web-services-by-zitgist/">http://fgiasson.com/blog/index.php/2008/04/20/exploding-the-domain-umbel-web-services-by-zitgist/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect7"></a>[7] Among many, many references, see a fairly comprehensive discussion         of this issue at <a rel="nofollow" href="http://ontologydesignpatterns.org/wiki/Community:Overloading_OWL_sameAs">http://ontologydesignpatterns.org/wiki/Community:Overloading_OWL_sameAs</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect8"></a>[8] This predicate is designed for the circumstance of aligning two         different ontologies or knowledge bases based on node-level         correspondences, but without entailing the actual ontological         relationships and structure of the object source. For example, the         <code>umbel:correspondsTo</code> predicate is used to assert close         correspondence between UMBEL Reference Concepts and Wikipedia         categories or pages, yet without entailing the actual Wikipedia         category structure.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect9"></a>[9] Jennifer Sleeman and Tim Finin, 2010. &#8220;Learning Co-reference         Relations for FOAF Instances,&#8221; <em>Proceedings of the Poster and         Demonstration Session at the 9th International Semantic Web         Conference</em>, November 2010; see <a class="external free" rel="nofollow" href="http://ebiquity.umbc.edu/_file_directory_/papers/522.pdf">http://ebiquity.umbc.edu/_file_directory_/papers/522.pdf</a>.</div>
<div style="font-size: 90%;">
<div style="margin: 10px 0pt;"><a name="connect10"></a>[10] For example, in the words of Tim Finin of the Ebiquity group:</div>
<div class="boxGrayDotted"><span style="font-style: italic;">&#8220;The solution we are currently           exploring is to define a new property to assert that two RDF           instances are co-referential when they are believed to describe the           same object in the world. The two RDF descriptions might be           incompatible because they are true at different times, or the sources           disagree about some of the facts, or any number of reasons, so           merging them with</span> <code style="font-style: italic;">owl:sameAs</code> <span style="font-style: italic;">may lead to contradictions. However, virtually           merging the descriptions in a co-reference engine is fine &#8212; both           provide information that is useful in disambiguating future           references as well as for many other purposes.&#8221;</span></div>
<div style="margin: 10px 0pt;">See quote on <a rel="nofollow" href="http://www.semanticoverflow.com/questions/1095/alternatives-to-owlsameas-for-linked-data">http://www.semanticoverflow.com/questions/1095/alternatives-to-owlsameas-for-linked-data</a>.</div>
</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect11"></a>[11] The same vocabulary construct can be applied to other domain         ontologies based on the UMBEL Vocabulary.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect12"></a>[12] The efforts with Wikipedia have been ongoing to a certain degree         since the inception of UMBEL. As one example, we have been maintaining         a comprehensive tracking of the use of Wikipedia for mapping and         semantic technology purposes, called <a href="http://www.mkbergman.com/sweetpedia/">SWEETpedia</a>, for many years.         Applying these techniques to both UMBEL and Wikipedia has been most         active over the past 18 months.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect13"></a>[13] See the <a style="font-style: italic;" href="http://umbel.org/specifications/annex-g">UMBEL Annex G: UMBEL         SuperTypes Documentation</a>, which will also be slightly updated upon         the new UMBEL v 1.00 release.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect14"></a>[14] See the &#8216;Use of OpenCyc&#8217; section in the <a style="font-style: italic;" rel="nofollow" href="http://umbel.org/specifications/full-specification">UMBEL Specifications</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect15"></a>[15] These sources and error rates will be detailed in a paper after         the pending new release of UMBEL.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="connect16"></a>[16] Fortunately, returns on the time investment will accelerate since         basic lessons and techniques have now been learned.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/941/making-connections-real/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Declining IT Innovation in the Enterprise</title>
		<link>http://www.mkbergman.com/940/declining-it-innovation-in-the-enterprise/</link>
		<comments>http://www.mkbergman.com/940/declining-it-innovation-in-the-enterprise/#comments</comments>
		<pubDate>Mon, 17 Jan 2011 22:31:47 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Adaptive Innovation]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Semantic Enterprise]]></category>
		<category><![CDATA[Software and Venture Capital]]></category>
		<category><![CDATA[enterprise apps]]></category>
		<category><![CDATA[information technologies]]></category>
		<category><![CDATA[open source software]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=940</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Declining IT Innovation in the Enterprise&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Open Source&amp;rft.subject=Semantic Enterprise&amp;rft.subject=Software and Venture Capital&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-01-17&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/940/declining-it-innovation-in-the-enterprise/&amp;rft.language=English"></span>
Reasons for and Implications from Innovation Moving to Consumers Today, the headlines and buzz for information technologies centers on smartphones, social networks, cloud computing, tablets and everything Internet. Very little is now discussed about IT in the enterprise. This declining trend began about 15 years ago, and has been accelerating over time. Letting the air [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Declining IT Innovation in the Enterprise&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Adaptive Innovation&amp;rft.subject=Open Source&amp;rft.subject=Semantic Enterprise&amp;rft.subject=Software and Venture Capital&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-01-17&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/940/declining-it-innovation-in-the-enterprise/&amp;rft.language=English"></span>
<h2><a><img style="border: 0px solid; width: 260px; height: 185px; float: left; margin-right: 10px;" title="The Hollowing Out of Enterprise IT" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110117_EmptySuit.jpg" alt="The Hollowing Out of Enterprise IT" hspace="5" vspace="5" align="left" /></a>Reasons for and Implications from Innovation Moving to Consumers</h2>
<p>Today, the headlines and buzz for information technologies centers on         smartphones, social networks, cloud computing, tablets and everything         Internet. Very little is now discussed about IT in the enterprise. This         declining trend began about 15 years ago, and has been accelerating         over time. Letting the air out of the enterprise IT balloon has some         profound reasons and implications. It also has some lessons and         guidance related to semantic approaches and technologies and their         adoption by enterprises.</p>
<h3>A Brief Look at Sixty Years of Enterprise IT</h3>
<p>One can probably clock the start of enterprise <a href="http://en.wikipedia.org/wiki/Information_technology">information         technology</a> (IT) to the first use of mainframe computers in the         early 1950s <a href="#it1">[1]</a>, or sixty years ago. The earliest mainframes were huge         and expensive machines that required their own specially         air-conditioned rooms because of the heat they generated. The first use         of &#8220;information technology&#8221; as a term occurred in a <a href="http://www.hbr.org/">Harvard Business Review</a> article from 1958 <a href="#it2"> [2]</a>.</p>
<p>Until the late 1960s computers were usually supplied under lease, and         were not purchased<a href="#it3"> [3]</a>. Service and all software were generally bundled         into the lease amount without separate charge and with source code         provided. Then, in 1969, <a href="http://en.wikipedia.org/wiki/IBM">IBM</a> led an industry change by         starting to charge separately for (mainframe) software and services,         and ceasing to supply source code <a href="#it3">[3]</a>. At about the same time <a href="http://en.wikipedia.org/wiki/Integrated_circuit">integrated         circuits</a> enabled computer sizes to be reduced, with the <a href="http://en.wikipedia.org/wiki/Minicomputer">minicomputers</a> such as         from <a href="http://en.wikipedia.org/wiki/Digital_Equipment_Corporation">DEC</a> causing a marked expansion in number of potential customers. Enterprise         apps became a huge business, with software licensing and maintenance         fees achieving a peak of 70% of IT vendor total revenues by the         mid-1990s <a href="#it4">[4]</a>. However, since that peak, enterprise software as a         portion of vendor revenues has been steadily eroding.</p>
<p>One of the earliest enterprise applications was in transaction systems         and their underlying database management software. The relational         database management system (<a href="http://en.wikipedia.org/wiki/Oracle_Corporation">RDBMS</a>) was         initially developed at IBM. <a href="http://en.wikipedia.org/wiki/Oracle_Corporation">Oracle</a>, based on         early work for the CIA in the late 1970s and its innovation to write in         the C programming language, was able to port the RDBMS to multiple         operating systems. These efforts, along with those of other notable         vendors (most of which like <a href="http://en.wikipedia.org/wiki/Informix">Informix</a> no longer exist),         led to the RDBMS becoming more or less the <span style="font-style: italic;">de facto</span> standard for data management within the enterprise by         the 1980s. Today Oracle is the largest supplier of RDBMS software         globally, and other earlier database system designs such as <a href="http://en.wikipedia.org/wiki/Network_model">network databases</a> or         <a href="http://en.wikipedia.org/wiki/Object_database">object         databases</a> fell out of favor <a href="#it5">[5]</a>.</p>
<p>In 1975, the <a href="http://en.wikipedia.org/wiki/Altair_8800">Altair         8800</a> was introduced to electronics hobbyists as the first         microcomputer, followed then by <a href="http://en.wikipedia.org/wiki/IBM_Personal_Computer">Apple II</a> and         the <a href="http://en.wikipedia.org/wiki/IBM_Personal_Computer">IBM         PC</a> in 1981, among others. Rapidly a slew of new applications became         available to the individual, including spreadsheets, small databases,         graphics programs and word processors. These apps were a boon to         individual productivity and the IBM PC in particular brought         credibility and acceptance within the enterprise (along with the growth         of <a href="http://en.wikipedia.org/wiki/Microsoft">Microsoft</a>).         <a href="http://en.wikipedia.org/wiki/Novell">Novell</a> and <a href="http://en.wikipedia.org/wiki/Local_area_network">local area         networks</a> also pointed the way to a more distributed computing         future. By the late 1980s virtually every knowledge worker within         enterprises had some degree of computer literacy.</p>
<p>The apogee for enterprise software and apps occurred in the 1990s, with         whole classes of new applications (most denoted by <a href="http://en.wikipedia.org/wiki/List_of_computing_and_IT_abbreviations">three-letter         acronyms</a>) such as enterprise resource planning (<a href="http://en.wikipedia.org/wiki/Enterprise_resource_planning">ERP</a>),         business intelligence (<a href="http://en.wikipedia.org/wiki/Business_intelligence">BI</a>), customer         relationship management (<a href="http://en.wikipedia.org/wiki/Customer_relationship_management">CRM</a>),         enterprise information systems (<a href="http://en.wikipedia.org/wiki/Enterprise_Information_System">EIS</a>)         and the like coming to the fore. These systems also began as         proprietary software, which resulted in the &#8220;<a href="http://en.wikipedia.org/wiki/Stovepipe_system">stovepiping</a>&#8221; or         creating of <a href="http://en.wikipedia.org/wiki/Information_silo">information silos</a>.         In reaction and with great market acceptance, vendors such as <a href="http://en.wikipedia.org/wiki/SAP_AG">SAP</a> arose to provide         comprehensive, enterprise-wide solutions, though often at high cost and         with significant failure rates.</p>
<p>More significantly, the 1990s also saw the innovation of the <a href="http://en.wikipedia.org/wiki/Worldwide_web">World Wide Web</a> with         its basis in <a href="http://en.wikipedia.org/wiki/Hypertext">hypertext</a> links on the         <a href="http://en.wikipedia.org/wiki/Internet">Internet</a>. Greatly         facilitated by the <a href="http://en.wikipedia.org/wiki/Mosaic_%28web_browser%29">Mosaic</a> Web         browser, the basis of the commercial <a href="http://en.wikipedia.org/wiki/Netscape_%28web_browser%29">Netscape         browser</a>, and the <a href="http://en.wikipedia.org/wiki/Hypertext_Markup_Language">HTML</a> markup language and <a href="http://en.wikipedia.org/wiki/Hypertext_Transfer_Protocol">HTTP</a> transport protocol, millions began experiencing the benefit of creating         Web pages and interconnecting. By the mid-1990s, enterprises were on         the Web in force, bringing with them larger content volumes, dynamic         databases and <a href="http://en.wikipedia.org/wiki/Enterprise_portal">enterprise         portals</a>. The ability for anyone to become a publisher led to a         focus and attention on the new medium that led to still further         innovations in <a href="http://en.wikipedia.org/wiki/Online_advertising">e-commerce</a> and         <a href="http://en.wikipedia.org/wiki/Online_advertising">online         advertising</a>. New languages and uses of Web pages and applications         emerged, creating a convergence of design, media, content and         interactivity. Venture capital and new startups with valuations         independent of revenues led to a frenzy of hype and eventually the         <a href="http://en.wikipedia.org/wiki/Dot-com_bubble">dot com crash</a> of 2000.</p>
<p>The growth companies of the past 15 years have not had the traditional         focus on enterprises, but on the use and development of the Web. From         search (<a href="http://en.wikipedia.org/wiki/Google">Google</a>) to         social interactions (<a href="http://en.wikipedia.org/wiki/Facebook">Facebook</a>) to media and         video (<a href="http://en.wikipedia.org/wiki/Flickr">Flickr</a>,         <a href="http://en.wikipedia.org/wiki/Flickr">YouTube</a>) and to         information (<a href="http://en.wikipedia.org/wiki/Wikipedia">Wikipedia</a>), the engines of         growth have shifted away from the enterprise.</p>
<p>Meanwhile, the challenges of data integration and interoperability that         were such a keen focus going back to initial enterprise computerization         remain. Now, however, these challenges are even greater, as we see         images, documents (unstructured data) and Web pages, markup and         metadata (semi-structured data) become first-class information         citizens. What was a challenge in integrating structured data in the         1980s and 1990s via <a href="http://en.wikipedia.org/wiki/Data_warehouse">data warehousing</a>, has         now become positively daunting for the enterprise with respect to scale         and scope.</p>
<p>The paradox is that as these enterprise needs increased, the         attractiveness of the enterprise from an IT perspective has greatly         decreased. It is these factors we discuss below, with an eye to how Web         architecture, design and opportunities may offer a new path through the         maze of enterprise information interoperability.</p>
<h3>The Current Landscape</h3>
<p>Since 1995 the Gartner Group has been producing its annual Hype Cycle<a href="#it6"> [6]</a>. The clientele for this research is the enterprise, so Gartner&#8217;s         presentation of what&#8217;s hot and what&#8217;s hype and what is being adopted is         a good proxy for the IT state of affairs in enterprises. These graphs         are reproduced below since 2006 (click to expand). Note how many of the         items shown are not very specific to the enterprise:</p>
<div><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110117_GartnerHype_compare.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 472px;" title="Click to expand" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110117_GartnerHype_compare_600.png" alt="Gartner Hype Cycles - 2006 to 2010" /></a></div>
<p>References to architectures and content processing and related topics         were somewhat prevalent in 2006, but have disappeared most recently. In         comparison to the innovations noted under the History discussion, it         appears that the items on Gartner&#8217;s radar are more related to consumer         applications and uses. We no longer see whole new categories of         enterprise-related apps or enterprise architectures.</p>
<p>The kinds of innovations that <span class="double_u">are</span> being         discussed as important to enterprises in the coming year [<a href="#it7">7</a>,<a href="#it8">8</a>] tend to         mostly leverage existing innovations in other areas or to wrinkle         existing approaches. One report from <a href="http://www.constellationrg.com/">Constellation Research</a>, for         example, lists the five core disruptive technologies of social, mobile,         cloud, analytics and unified communications <a href="#it7">[7]</a>. Only analytics could         be described as enterprise focused or driven.</p>
<p>And, even in analytics, the kinds of things being promoted are         self-service reporting or analysis <a href="#it8">[8]</a>. In essence, these opportunities         represent the application of <a href="http://en.wikipedia.org/wiki/Web_2.0">Web 2.0</a> techniques to bring reporting or         analysis directly to the analyst. Though important and long overdue,         such innovations are more derivative than fundamental.</p>
<p>Master data management (<a href="http://en.wikipedia.org/wiki/Master_data_management">MDM</a>) is         another touted area. But, to read analyst&#8217;s predictions in these areas,         it feels like one has stepped into a time warp of technologies and         options from a decade ago. When has XML felt like an innovation?</p>
<p>Of course, there is a whole industry of analysts that makes their         living prognosticating to enterprises about what to expect from         information technologies and how to adopt and embrace them. The general         observations &#8212; across the board &#8212; seem to center on items such as         smartphones and mobile, moving to the cloud for software or platforms         (<a href="http://en.wikipedia.org/wiki/SaaS">SaaS</a>, <a href="http://en.wikipedia.org/wiki/Platform_as_a_service">PaaS</a>), and         collaboration and social networks. As I note below, there is nothing         inherently wrong or unexciting <span style="font-style: italic;">per         se</span> about these trends. But, what does appear true is that the         locus of innovation has shifted from the enterprise to consumers or the         Internet.</p>
<h3>Seven Reasons for a Shift in Innovation</h3>
<p>The shift in innovation away from the enterprise has been structural,         not cyclical. That means that very fundamental forces are at work to         cause this change in innovation focus. It does not mean that innovation         has permanently shifted away from the enterprise (organizations), but         that some form of countervailing structural changes would need to occur         to see a return to the IT focus on the enterprise from prior decades.</p>
<p>I think we can point to seven structural reasons for this shift, many         of which interact with one another. While all of them are bringing         benefits (some yet to be foreseen) to the enterprise, and therefore are         to be lauded, they are not strictly geared to address specific         enterprise challenges.</p>
<h4>#1: The Internet</h4>
<p>As pundits say, &#8220;The Internet changes everything&#8221; <a href="#it9">[9]</a>. For the reasons         noted under the history above, the most important cause for the shift         in innovation away from the enterprise has been the Internet.</p>
<p>One aspect that is quite interesting is the use of Internet-based         technologies to provide &#8220;outsourced&#8221; enterprise applications hosted on         Web servers. Such &#8220;cloud computing&#8221; leverages the technologies and         protocols inherent to the Internet. It shifts hosting, maintenance and         upgrade responsibilities for conventional apps to remote providers.         Initially, of course, this simply shifts locus and responsibility from         in-house to a virtual party. But, it is also the case that such changes         will also promote more subtle shifts in collaboration and interaction         possibilities. There is also the fact that quick upgrades of underlying         infrastructure and application software can also occur.</p>
<p>The implications for existing enterprise IT staff, traditional         providers, and licensing and maintenance approaches are profound. The         Internet and cloud computing will perhaps have a greater effect on         governance, staffing and management than application functionality         <span style="font-style: italic;">per se</span>.</p>
<h4>#2: Consumer Innovations</h4>
<p>The captivating IT-related innovations at present are mobile         (smartphones) and their apps, tablets and e-book readers, Internet TV         and video, and social networks of a variety of stripes. Somewhat like         the phenomenon of when personal computers first appeared, many of these         consumer innovations have applicability to the enterprise, though only         as a side effect.</p>
<p>It is perhaps instructive to look back at the adoption of PCs in the         enterprise to understand the possible effect of these new consumer         innovations. Central IT was never able to control and manage the         proliferation of personal computers, and only began to understand years         later what benefits and new governance challenges they brought.         Enterprise leaders will understand how to embrace and extend today&#8217;s         new consumer technologies for the enterprise&#8217;s benefits; laggards will         resist to no avail.</p>
<p>The ubiquity of computing will be enormously impactful on the         enterprise. The understanding of what makes sense to do on a mobile         basis with a small screen and what belongs on the desk or in the office         is merely a glimmer in the current conversation. However, in the end,         like most of the other innovations noted in this analysis, the         enterprise will largely be a reactive player to these innovations. Yes,         the implications will be profound, but their inherent basis are not         grounded in unique enterprise challenges. Nonetheless, adapting to them         and changing business practice will be critical to asserting enterprise         leadership.</p>
<h4>#3: Open Source</h4>
<div style="float: right; margin-left: 10px;"><img style="border: 0px solid; width: 476px; height: 357px;" title="Open Source Growth" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110117_open_source_growth.jpg" alt="Open Source Growth" /></div>
<p>Ten years ago open source was largely dismissed in the enterprise.         About five years ago VCs and others began funding new commercial open         source ventures, even while there were still rear guard arguments from         enterprises resisting open source. Meanwhile, as the figure to the         right shows, open source projects were growing exponentially <a href="#it10">[10]</a>.</p>
<p>The shift to open source in the enterprise, still ongoing, has been         rapid. Within 5 years, more than 50% of enterprise software will be         open source<a href="#it11"> [11]</a> . According to an article in Fortune magazine last         year <a href="#it12">[12]</a>, a Forrester Research <a rel="external nofollow" href="http://itmanagement.earthweb.com/osrc/article.php/3898206/LinuxCon-Analysts-Linux-Is-Winning.htm" target="new"> </a>survey found that 48% of enterprise respondents were using open         source operating systems, and 57% were using open source code. A         <a rel="external nofollow" href="http://newsroom.accenture.com/article_display.cfm?article_id=5045" target="new">similar         Accenture survey</a> of 300 large public and private companies found         that half are committed to open source software, with 38% saying they         would begin using open-source software for &#8220;mission-critical&#8221;         applications over the next 12 months.</p>
<p>There are likely many reasons for this shift, including the Internet         itself and its basis in open source. Many of the most successful         companies of the past 15 years including Amazon, Google, Facebook, and         virtually any large Web site has shown excellent performance and         scalability building their IT infrastructure around open source         foundations. Most of the large, existing enterprise IT vendors, notably         including IBM, Oracle, Nokia, Intel, Sun (prior to Oracle), Citrix,         Novell (just acquired by Attachmate) and SAP have bought open source         providers or have visible support for open source initiatives. Even two         of the most vocal proprietary source proponents of the past &#8212; HP and         Microsoft &#8212; have begun to make moves toward open source.</p>
<p>The age of proprietary software based on proprietary standards is dead.         The monopoly rents formerly associated with unique, proprietary         platforms and large-scale enterprise apps are over. Even where software         remains proprietary, it is embracing open standards for data         interchange and APIs. Traditional enterprise apps such as content         management, business intelligence and <a href="http://en.wikipedia.org/wiki/Extract,_transform,_load">ETL</a>, among         all others, are being penetrated by commercial open source offerings         (as examples, <a href="http://en.wikipedia.org/wiki/Alfresco_%28software%29">Alfresco</a>,         <a href="http://en.wikipedia.org/wiki/Pentaho">Pentaho</a> and <a href="http://en.wikipedia.org/wiki/Talend">Talend</a>, respectively). The         shift to services and new business models appears to be an inexorable         force.</p>
<p>Declining profit margins, matched with the relatively high cost of         marketing and sales to enterprises, means attention and focus have been         shifting away from the enterprise. And with these shifts in focus has         come a reduction in enterprise-focused innovation.</p>
<h4>#4: Slow Development Cycles in Enterprise</h4>
<p>It is not unusual to find deployed systems within enterprises as old as         thirty years<a href="#it13"> [13]</a>. So long as they work reasonably well, systems once         installed &#8212; along with their data &#8212; tend to remain in operation until         their platforms or functionality become totally obsolete. This leads to         rather lengthy turnover cycles, and slow development cycles.</p>
<p>Slow cycles in themselves slow innovation. But slow development cycles         are also a disincentive to attract the most capable developers. When         development tends to focus on maintenance and scripts and more routines         of the same nature, the best developers tend to migrate elsewhere (see         next).</p>
<p>Another aspect of slow development cycles is the imperative for new         enterprise IT to relate to and accommodate legacy systems &#8212; again,         including legacy data. This consideration is the source of one of the         negative implications of a shift away from innovation in the         enterprise: the orphaning of existing information assets.</p>
<h4>#5: What&#8217;s Hot: Developers</h4>
<p>Arguably the emphasis on consumer and Internet technologies means that         is where the best developers gravitate. Developing apps for smartphones         or working at one of the cool Internet companies or joining a         passionate community of open source developers is now attracting the         best developers. Open source and Web-based systems also lead to faster         development cycles. The very best developers are often the founders of         the next generation startups and Web and software companies<a href="#it14"> [14]</a>.</p>
<p>While, of course, huge numbers of computer programmers and IT         specialists are hired by enterprises each year, the motivations tend to         be higher pay, better benefits and more job security. The nature of the         work and the bureaucracy and routine of many IT functions require such         compensation. And, because of the other shifts noted elsewhere, even         the software startups that are able to attract the most innovative         developers no longer tend to develop for enterprise purposes.</p>
<p>Computer science students have been declining in industrialized  countries for some time and that is the category of slowest growth in IT  <a href="#it14">[14]</a>.  Meanwhile, existing IT personnel often have expertise in older legacy  systems or have been focused on bug fixes and more prosaic tasks like  report writing. Narrow job descriptions and work activities also keep  many existing IT personnel from getting exposed to or learning about new  trends or innovations, such as the semantic Web.</p>
<p>Declining numbers of new talent, plus declining interest by that talent, combined with (often) narrow and legacy expertise of existing talent, creates a disappointing storm of energy and innovation to address enterprise IT challenges. Enterprises have it within their power to create more exciting career opportunities to overcome these limitations, but unfortunately IT management often also appears challenged to get on top of these structural forces.</p>
<h4>#6: What&#8217;s Hot: Startups</h4>
<p>Open source and Internet-based systems have reduced the capital         necessary for a new startup by an order of magnitude or so over the         past decade. It is now quite possible to get a new startup up and         running for tens to hundreds of thousands of dollars, as opposed to the         millions of years past. This is leading to more startups, more startups         per innovator, and quicker startup and abandonment cycles. Ideas can be         tried quickly and more easily thrown away<a href="#it15"> [15]</a>.</p>
<p>These dynamics are acting to accelerate overall development cycles and         to cause a shift in funding structures and funding amounts by VCs and         angels. The kind of market and sales development typical for many         enterprise sales does not fit well within these dynamics and is a         countervailing force for more capital when all trends point the other         way.</p>
<p>In short, all of this is saying that money goes to where the returns         are, and returns are not of the same basis as decades past in the         enterprise sector. Again, this means a hollowing out of innovation for         enterprises.</p>
<h4>#7: Declining Software Rents and Consolidation</h4>
<p>As an earlier reference noted <a href="#it4">[4]</a>, software revenues as a percent of IT         vendor revenues peaked in about the mid-1990s. As profitability for         these entities began to decline, so did the overall attractiveness of         the sector.</p>
<p>As the next chart shows, coincident with the peak in profitability was         the onset of a consolidation trend in the enterprise IT vendor sector <a href="#it16"> [16]</a>. The chart below shows that three of the largest IT vendors today         &#8212; Oracle, IBM and HP &#8212; began an acquisition spree in the mid-1990s         that has continued until just recently, as many of the existing major         players have already been acquired:</p>
<div><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110117_MajorITacquisitions.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 456px;" title="Click to expand" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110117_MajorITacquisitions.png" alt="Major !T Acquisitions, 1994 - 2010" /></a></div>
<p>Notable acquisitions over this period include: Oracle &#8212; PeopleSoft,         Siebel Systems, MySQL, Hyperion, BEA and Sun; HP &#8212; EDS, 3Com,         VeriFone, Compaq, Palm and Mercury Interactive; IBM &#8212; Lotus, Rational,         Informix, Ascential, FileNet, Cognos and SPSS. Published acquisition         costs exceeded $130 billion, mostly for the larger deals. But terms for         75% of the 262 transactions were not disclosed <a href="#it16">[16]</a>. The total value of         these consolidations likely approaches $200 billion to $300 billion.</p>
<p>Clearly, the market is now favoring large players with large service         components. This consolidation trend does belie one early criticism of         open source <span style="font-style: italic;">v</span> proprietary         software: proprietary software is likely to be better supported. In         theory this might be true, but vanishing suppliers does not bode well         for support either. Over time, we may likely see successful open source         projects showing greater longevity than many IT vendors.</p>
<h3>Positive Implications from the Decline</h3>
<p>This discussion is not a boo-hoo because the heyday of enterprise IT         innovation is past. Much of that innovation was expensive, often failed         to achieve successful adoption, and promoted walled gardens and silos.         As someone who ran companies directly involved in enterprise software         sales, I personally do not miss the meetings, the travel, the suits and         the 18-month sales cycles.</p>
<p>The enterprise has gained much from outside innovation in the past,         from the personal computer to LANs and browsers and the Internet. To be         sure, what we are now seeing with mobile phones has more computing         power than the original Space Shuttle <a href="#it17">[17]</a>, and continued mashup and         social engagement innovations will have unforeseen and manifest         benefits for enterprises. I think this is unalloyed goodness.</p>
<p>We can also see innovations based on the Internet such as the semantic         Web and its languages and standards to promote interoperability.         Breaking these barriers is critically needed by enterprises of the         future. Data models such as RDF <a href="#it18">[18]</a> and open world mindsets that         better accommodate uncertainty and breadth of information<a href="#it19"> [19]</a> can only         be seen as positive. The leverage that will come from these         non-enterprise innovations may in the end prove to be as important as         the enterprise-specific innovations of the past.</p>
<h3>Negative Implications from the Decline</h3>
<p>Yet a shift to Internet and consumer IT innovation leaves some         implications. These concerns have to do with the unique demands and         needs of enterprises. One negative implication is that a diminishing         supplier base may not lead to actual deployments that are         enterprise-ready or -responsive.</p>
<p>The first concern relates to quality and operational integrity. There         is an immense gulf between <a href="http://en.wikipedia.org/wiki/ISO_9000">ISO 9000</a> or <a href="http://en.wikipedia.org/wiki/Six_Sigma">Six Sigma</a> and, for         example, the &#8220;good enough&#8221; of standard search results on the Web.         Consumer apps do not impose the same thresholds for quality as demanded         by paying bosses or paying customers. This is not a value judgment;         simply a reality. I see it reflected in the quality of tools and code         for many new innovations today on the Web.</p>
<p>Proofs-of-concept and &#8220;cool&#8221; demos work well for academic theses or         basic intros to new concepts. The 20% that gets you 80% goes a long way         to point the way to new innovation; but the 80% to get to the last 20%         is where enterprises bet their money. Unfortunately, in too many         instances, that gap is not being filled. The last 20% is hard work,         often boring, and certainly not as exciting as the next Big Thing. And,         as the trends above try to explicate, there are also diminishing         rewards for living in that territory.</p>
<p>A similar and second concern pervades data interoperability. Data         interoperability has been the central challenge of enterprise IT for at         least three decades. As soon as we were able to interconnect systems         and bridge differences in operating systems and data schema, the Holy         Grail has been breaking information barriers and silos. The initial         attempts with proprietary data warehouses or enterprise-wide ERP         systems were wrongly trying to apply closed solutions to inherently         open problems. But, now, finally when we have the open approaches and         standards in hand for bridging these gaps, the attractiveness of doing         so for the enterprise seems to have vanished.</p>
<p>For example, we see demos, tools and algorithms being published all         over the place that show promising advances or improvements in the semantic Web or linked data (among other areas; see <a href="#it20">[20]</a>). Some of these automated         techniques sound wonderful, but real systems require the hard slog of         review and manual approval. Quality matters. If Technique A, say, shows an improvement         over Technique B of 5%, that is worth touting. But even at 98% percent         accuracy, we will still find 20,000 errors in a population of 1 million         items. Such errors will simply not work in having trains run on time,         seats be available on airplanes, or inventory get to their required         destinations.</p>
<p>What can work from the standpoint of linkage or interoperability on the         Web according to consumer standards will simply not fly for many         enterprises. But, where are the rewards for tackling that hard slog?</p>
<p>Another concern is security and differential access. Open Web systems,         bless their hearts, do not impose the same access and need to know         restrictions as information systems within enterprises. If we are to         adopt Web-based approaches to the next-generation enterprise &#8212; a         position we strongly advocate &#8212; then we are also going to need to         figure out how to marry these two world views. Again, there appears to         be an effort-reward mismatch here.</p>
<h3>What Lessons Might be Drawn?</h3>
<p>These observations are not meant to be a polemic, but a statement of         more-or-less current circumstances. Since its widescale adoption, the         major challenge &#8212; and opportunity &#8212; of enterprise IT has been how to         leverage the value within the enterprise&#8217;s existing digital information         assets. That challenge is augmented today with the availability of         literally a whole world of external digital knowledge. Yet, the energy         and emphasis for innovation to address these challenges has seemingly         shifted to consumers and away from the enterprise.</p>
<p>Economics abhors a vacuum. I think two responses may be likely to this         circumstance. The first is that new vendors will emerge to address         these gaps, but with different cost structures and business models. I&#8217;d         like to think my own firm, <a href="http://structureddynamics.com">Structured Dynamics</a>, is one of         these entities. How we are addressing this opportunity and differences         in our business model we will discuss at a later time. In any case, any         such new player will need to take account of some of the structural         changes noted above.</p>
<p>Another response can come from enterprises themselves, using and         working the same forces of change noted earlier. Via collaboration and         open source, enterprises can band together to contribute resources,         expertise and people to develop open source infrastructures and         standards to address the challenges of interoperability. We already see         exemplars of such responses in somewhat related areas via initiatives         such as <a href="http://en.wikipedia.org/wiki/Eclipse_%28software%29">Eclipse</a>,         <a href="http://en.wikipedia.org/wiki/Apache_Software_Foundation">Apache</a>,         <a href="http://en.wikipedia.org/wiki/W3C">W3C</a>, <a href="http://en.wikipedia.org/wiki/OASIS_%28organization%29">OASIS</a> and         others. By leveraging the same tools of collaboration and open data and         systems and the Internet, enterprises can band together and ensure         their own self-interests are being addressed.</p>
<p>One advantage of this open, collaborative approach is that it is         consistent with the current innovation trends in IT. But the real         advantage is that it works and is needed. Without it, it is unclear how         the enterprise IT challenge &#8212; especially in data interoperability &#8212;         will be met.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it1"></a>[1] Though calculating machines and others extend back to <a href="http://en.wikipedia.org/wiki/Babbage">Charles Babbage</a> and more         relevant efforts during World War II, the first <a href="http://en.wikipedia.org/wiki/UNIVAC">UNIVAC</a> was delivered to the         US Census Bureau in 1951, and the first IBM to the US Defense         Department in 1953. Many installations followed thereafter. See, for         example, <a href="http://www.computinghistorymuseum.org/teaching/lectures/pptlectures/9-MainframeComputers.ppt"> Lectures in the History of Computing: Mainframes</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it2"></a>[2] As provided by <a href="http://dictionary.oed.com/">&#8220;information         technology&#8221;</a> (subscription required), <span style="font-style: italic;">Oxford English Dictionary (2 ed.)</span>, Oxford         University Press, 1989, <a href="http://dictionary.oed.com/">http://dictionary.oed.com/</a>, retrieved         12 January 2011.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it3"></a>[3] See further the Wikipedia entry on <a href="http://en.wikipedia.org/wiki/Proprietary_software">proprietary         software</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it4"></a>[4] M.K. Bergman, 2006. &#8220;Redux: Enterprise Software Licensing on Life         Support,&#8221; <span style="font-style: italic;">AI3:::Adaptive         Information</span> blog, June 2, 2006. See <a href="http://www.mkbergman.com/111/the-death-of-enterprise-software-licensing/"> http://www.mkbergman.com/111/the-death-of-enterprise-software-licensing/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it5"></a>[5] The combination of distributed network systems and table-oriented         designs such as Google&#8217;s <a href="http://en.wikipedia.org/wiki/BigTable">BigTable</a> and related open         source <a href="http://en.wikipedia.org/wiki/Hadoop">Hadoop</a>, plus         many scripting languages, is leading to the resurgence of new database         designs including <a href="http://en.wikipedia.org/wiki/NoSQL">NoSQL</a>, <a href="http://en.wikipedia.org/wiki/Column-oriented_DBMS">columnar</a>, etc.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it6"></a>[6] The <a href="http://www.gartner.com/">Gartner</a> <a href="http://en.wikipedia.org/wiki/Hype_cycle">Hype Cycle</a> is a graphical         representation of the maturity, adoption and application of         technologies. It proceed through five phases beginning with a         technology trigger and then, if successful, ultimately adoption. The         peak of the curve represents the biggest &#8220;hype&#8221; for the innovation.The         information in these charts is courtesy of Gartner. The sources for the         charts are summary Gartner reports for <a href="http://www.gartner.com/it/page.jsp?id=1447613">2010</a>, <a href="http://www.gartner.com/it/page.jsp?id=1124212">2009</a>, <a href="http://www.gartner.com/it/page.jsp?id=739613">2008</a>, and 2006. 2007         was skipped to provide a bit longer time horizon for comparison         purposes.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it7"></a>[7] As summarized by Klint Finley, 2011. &#8220;<a href="http://www.readwriteweb.com/enterprise/2011/01/enterprise-distruption-2011.php">How         Will Technology Disrupt the Enterprise in 2011?</a>,&#8221; <span style="font-style: italic;">ReadWriteWeb Enterprise</span> blog, January 4,         2011.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it8"></a>[8] Jaikumar Vijayan, 2011. &#8220;<a href="http://news.idg.no/cw/art.cfm?id=4CB6C917-1A64-6A71-CE089BD3A08271AD">Self-service         BI, SaaS, Analytics will Dominate in 2011</a>,&#8221; in <span style="font-style: italic;">Computerworld Online</span>, January 3, 2011.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it9"></a>[9] According to Google on January 12, 2011, there were 251,000 uses of         this exact phrase on the Web.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it10"></a>[10] Amit Deshpande and Dirk Riehle, 2008. &#8220;The Total Growth of Open         Source,&#8221; in <span style="font-style: italic;">Proceedings of the Fourth         Conference on Open Source Systems (OSS 2008)</span>, Springer Verlag,         pp 197-209; see <a href="http://dirkriehle.com/wp-content/uploads/2008/03/oss-2008-total-growth-final-web.pdf"> http://dirkriehle.com/wp-content/uploads/2008/03/oss-2008-total-growth-final-web.pdf</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it11"></a>[11] See <a href="http://futureofopensource.drupalgardens.com/2010-survey-results">http://futureofopensource.drupalgardens.com/2010-survey-results</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it12"></a>[12] See <a href="http://tech.fortune.cnn.com/2010/08/16/how-corporate-america-went-open-source/"> http://tech.fortune.cnn.com/2010/08/16/how-corporate-america-went-open-source/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it13"></a>[13] For example, according to James Mullarney in 2005, &#8220;<a href="http://www.insursys.com/pdf/White%20Paper%20-%20Dealing%20with%20the%20Legacy%20of%20Legacy.pdf">How         to Deal with the Legacy of Legacy Systems</a>,&#8221; the average age of IT         systems in the insurance industry was 23 years. In that same year,         according to <a href="http://www.logicalminds.co.uk/news/view_news_item.asp?news_item_id=3">Logical         Minds</a>, a survey by HAL Knowledge Systems showed the average age of         applications running core business processes to be 15 years old, with         almost 30 per cent of companies maintaining software that is 25 years         old or older.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it14"></a>[14] For general IT employment trends, see the Bureau of Labor         Statistics; for example, <a href="http://www.bls.gov/oco/ocos303.htm">http://www.bls.gov/oco/ocos303.htm</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it15"></a>[15] See, for example, Paul Graham, 2010. &#8220;<a href="http://www.paulgraham.com/superangels.html">The New Funding         Landscape</a>,&#8221; <span style="font-style: italic;">Blog</span> post,         October 2010.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it16"></a>[16] This chart was constructed from these sources: Oracle &#8212; <a href="http://en.wikipedia.org/wiki/List_of_acquisitions_by_Oracle">http://en.wikipedia.org/wiki/List_of_acquisitions_by_Oracle</a>;         IBM &#8212; <a href="http://en.wikipedia.org/wiki/List_of_mergers_and_acquisitions_by_IBM">http://en.wikipedia.org/wiki/List_of_mergers_and_acquisitions_by_IBM</a>;         and HP &#8212; <a href="http://en.wikipedia.org/wiki/List_of_acquisitions_by_Hewlett-Packard">http://en.wikipedia.org/wiki/List_of_acquisitions_by_Hewlett-Packard</a>.         Of course, other acquisitions occurred by other players over this         period as well.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it17"></a>[17] Current smartphones may have around 2 GHz in processing power and         1 GB of RAM; see for example, this <a href="http://news.thomasnet.com/fullstory/Android-Based-Smartphone-is-optimized-for-mobile-computing-589587"> Motorola press release</a>. By comparison to the Shuttle, see <a href="http://en.wikipedia.org/wiki/Space_Shuttle#Flight_systems">http://en.wikipedia.org/wiki/Space_Shuttle#Flight_systems</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it18"></a>[18] M. K. Bergman, 2009. “<a href="http://www.mkbergman.com/483/advantages-and-myths-of-rdf/">Advantages         and Myths of RDF</a>,” <span style="font-style: italic;">AI3:::Adaptive         Information blog</span>, April 8, 2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it19"></a>[19] M. K. Bergman, 2009. “<a href="http://www.mkbergman.com/852/the-open-world-assumption-elephant-in-the-room/">The         Open World Assumption: Elephant in the Room</a>,” <span style="font-style: italic;">AI3:::Adaptive Information blog</span>, Dec. 21,         2009.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="it20"></a>[20] See, for example, the <strong><span style="color: #993300;"><a href="http://www.mkbergman.com/new-version-sweet-tools-sem-web/">Sweet Tools</a></span></strong> listing of 900 semantic Web and -related tools on this <span style="font-style: italic;">AI3:::Adaptive Information blog</span>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/940/declining-it-innovation-in-the-enterprise/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

