<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AI3:::Adaptive Information &#187; Open Source</title>
	<atom:link href="http://www.mkbergman.com/category/open-source/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mkbergman.com</link>
	<description>Mike Bergman on the semantic Web and structured Web</description>
	<lastBuildDate>Tue, 24 Jan 2012 15:52:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>The State of Tooling for Semantic Technologies</title>
		<link>http://www.mkbergman.com/991/the-state-of-tooling-for-semantic-technologies/</link>
		<comments>http://www.mkbergman.com/991/the-state-of-tooling-for-semantic-technologies/#comments</comments>
		<pubDate>Mon, 12 Dec 2011 14:29:36 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Semantic Web Tools]]></category>
		<category><![CDATA[Structured Web]]></category>
		<category><![CDATA[#linkeddata]]></category>
		<category><![CDATA[#semweb]]></category>
		<category><![CDATA[new release]]></category>
		<category><![CDATA[semantic component]]></category>
		<category><![CDATA[semantic framework]]></category>
		<category><![CDATA[semantic technologies]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Sweet Tools]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=991</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=The State of Tooling for Semantic Technologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-12-12&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/991/the-state-of-tooling-for-semantic-technologies/&amp;rft.language=English"></span>
Number of Semantic Web Tools Passes 1000 for First Time; Many Other Changes We have been maintaining Sweet Tools, AI3‘s listing of semantic Web and -related tools, for a bit over five years now. Though we had switched to a structWSF-based framework that allows us to update it on a more regular, incremental schedule [1], [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=The State of Tooling for Semantic Technologies&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=Structured Web&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-12-12&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/991/the-state-of-tooling-for-semantic-technologies/&amp;rft.language=English"></span>
<h2><a><img style="border: 0px solid; width: 230px; height: 273px; float: left; margin-right: 10px;" title="State of SemWeb Tools - 2011" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/sw-earth-2011.png" alt="State of SemWeb Tools - 2011" align="left" /></a>Number of Semantic Web Tools Passes 1000 for First Time; Many Other Changes</h2>
<p>We have been maintaining <strong><a href="../sweet-tools">Sweet Tools</a></strong>, <strong>AI3</strong>‘s listing of semantic Web and -related tools, for a bit over five years now. Though we had switched to a <a href="http://openstructs.org/structwsf">structWSF-based framework</a> that allows us to update it on a more regular, incremental schedule <a href="#tool1">[1]</a>, like all databases, the listing needs to be reviewed and cleaned up on a periodic basis. We have just completed the most recent cleaning and update. We are also now committing to do so on an annual basis.</p>
<p>Thus, this is the inaugural &#8216;<em>State of Tooling for Semantic Technologies</em>&#8216; report, and, boy, is it a humdinger. There have been more changes &#8212; and more important changes &#8212; in this past year than in all four previous years combined. I think it fair to say that semantic technology tooling is now reaching a mature state, the trends of which likely point to future changes as well.</p>
<p>In this past year more tools have been added, more tools have been dropped (or abandoned), and more tools have taken on a professional, sophisticated nature. Further, for the first time, the number of semantic technology and -related tools has passed 1000. This is remarkable, given that more tools have been abandoned or retired than ever before.</p>
<div class="boxRedDotted" style="margin: 10px 0pt 10px 10px; width: 185px; font-size: 1.2em; float: right; text-align: center;">Click <a href="../sweet-tools/">here</a> to browse the <a href="../sweet-tools/">Sweet Tools</a> listing. There is also a <a href="../sweet-tools-simple-list/">simple listing</a> of URL links and categories only.</div>
<p>We first present our key findings and then overall statistics. We conclude with a discussion of observed trends and implications for the near term.</p>
<h3>Key Findings</h3>
<p>Some of the key findings from the <em>2011 State of Tooling for Semantic Technologies</em> are:</p>
<ul>
<li>As of the date of this article, there are 1010 tools in the Sweet Tools listing, the first it has passed 1000 total tools</li>
<li>A total of 158 new tools have been added to the listing in the last six months, an increase of 17%</li>
<li>75 tools have been abandoned or retired, the most removed at any period over the past five years</li>
<li>A further 6%, or 55 tools, have been updated since the last listing</li>
<li>Though open source has always been an important component of the listing, it now constitutes more than 80% of all listings; with dual licenses, open source availability is about 83%. Online systems contribute another 9%</li>
<li>Key application areas for growth have been in SPARQL, ontology-related areas and linked data</li>
<li>Java continues to dominate as the most important language.</li>
</ul>
<p>Many of these points are elaborated below.</p>
<h3>The Statistical Picture</h3>
<p>The updated Sweet Tools listing now includes nearly 50 different tools categories. The most prevalent categories, each with over 6% of the total, are information extraction, general RDF tools, ontology tools, browser tools (RDF, OWL), and parsers or converters. The relative share by category is shown in this diagram (click to expand):</p>
<div style="margin: 10px 0px;"><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_applications.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 398px;" title="Click to expand" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_applications.png" alt="Sweet Tools Applications" /></a></div>
<p>Since the last listing, the fastest growing categories have been SPARQL, linked data, knowledge bases and all things related to ontologies. The relative changes by tools category are shown in this figure:</p>
<div style="margin: 10px 0px;"><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_applications_change.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 270px;" title="Click to expand" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_applications_change.png" alt="Last Year Sweet Tools Applications" /></a></div>
<p>Though it is true that some of this growth is the result of discovery, based on our own tool needs and investigations, we have also been monitoring this space for some time and serendipity is not a compelling explanation alone. Rather, I think that we are seeing both an increase in practical tools (such as for querying), plus the trends of linked data growth matched with greater sophistication in areas such as ontologies and the OWL language.</p>
<p>The languages these tools are written in have also been pretty constant over the past couple of years, with Java remaining dominant. Java has represented half of all tools in this space, which continues with the most recent tools as well (see below). More than a dozen programming or scripting languages have at least some share of the semantic tooling space (click to expand):</p>
<div style="margin: 10px 0px;"><img class="center_ok" style="border: 0px solid; width: 609px; height: 541px;" title="Sweet Tools Languages" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_languages_all.png" alt="Sweet Tools Languages" /></div>
<p>With only 160 new tools it is hard to draw firm trends, but it does appear that some languages (Haskell, XSLT) have fallen out of favor, while popularity has grown for Flash/Flex (from a small base), Python and Prolog (with the growth of logic tools):</p>
<div style="margin: 10px 0px;"><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_languages_change.png"> <img class="center_ok" style="border: 0px solid; width: 600px; height: 258px;" title="Click to expand" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_languages_change.png" alt="Last Year Change in Sweet Tools Languages" /></a></div>
<p>PHP will likely continue to see some emphasis because of relations to many content management systems (WordPress, Drupal, etc.), though both Python and Ruby seem to be taking some market share in that area.</p>
<h3>New Tools</h3>
<p>The newest tools added to the listing show somewhat similar trends. Again, Java is the dominant language, but with much increased use of JavaScript and Python and Prolog:</p>
<div style="margin: 10px 0px;"><img class="center_ok" style="border: 0px solid; width: 582px; height: 544px;" title="Sweet Tools Languages" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/112811_languages_new.png" alt="Sweet Tools Languages" /></div>
<p>The higher incidence of Prolog is likely due to the parallel increase in reasoners and inference engines associated with ontology (OWL) tools.</p>
<p>The increase in comprehensive tool suites and use of Eclipse as a development environment would appear to secure Java&#8217;s dominance for some time to come.</p>
<h3>Trends and Observations</h3>
<p>These dry statistics tend to mask the feel one gets when looking at most of the individual tools across the board. Older academic and government-funded project tools are finally getting cleaned out and abandoned. Those tools that remain have tended to get some version upgrades and improved Web sites to accompany them.</p>
<p>The general feel one gets with regard to semantic technology tooling at the close of 2011 has these noticeable trends:</p>
<ul>
<li>A three-tiered environment &#8211; the tools seem to segregate into: 1) a bottom tier of tools (largely) developed by individuals or small groups, now most often found on Google Code or Github; 2) a middle-tier of (largely) government-funded projects, sometimes with multiple developers, often older, but with no apparent driving force for ongoing improvements or commercialization; and 3) a top-tier of more professional and (often) commercially-oriented tools. The latter category is the most noticeable with respect to growth and impact</li>
<li>Professionalism &#8211; the tools in the apparent top tiers feel to have more professionalism and better (and more attractive) packaging. This professionalism is especially true for the frameworks and composite applications. But, it also applies to many of the EU-funded projects from Europe, which has always been a huge source of new tool developments</li>
<li>More complete toolsets &#8211; similarly, the upper levels of tools are oriented to pragmatic problems and problem-solving, which often means they embody multiple functions and more complete tooling environments. This category actually appears to be the most visible one exhibiting growth</li>
<li>Changing nature of academic releases &#8211; yet, even the academic releases seem to be increasing in professionalism and completeness. Though in the lowest tier it is still possible to see cursory or experimental tool releases, newer academic releases (often) seem to be more strategically oriented and parts of broader programmatic emphases. Programs like <a href="http://aksw.org/About">AKSW</a> from the University of Leipzig or the <a title="Outgoing link (in new window)" href="http://dbpedia.org/resource/Freie_Universit%C3%A4t_Berlin" target="_blank">Freie Universität Berlin</a> or Finland&#8217;s Semantic Computing Research Group (<a href="http://www.seco.tkk.fi/">SeCo</a>), among many others, tend to be exemplars of this trend</li>
<li>Rise of commercial interests and enterprise adoption &#8211; the growing maturity of semantic technologies is also drawing commercial interest, and the incubation of new start-ups by academic and research institutions acts to reinforce the above trends. Promising projects and tools are now much more likely to be spun off as potential ventures, with accompanying better packaging, documentation and business models</li>
<li>Multiple languages and applications &#8211; with this growing complexity and sophistication has also come more complicated apps, combining multiple languages and functions. In fact, for some time the Sweet Tools listing has been justifiably criticized by some as overly &#8220;simplifying&#8221; the space by classifying tools under (largely) single applications or single languages. By the 2012 survey, it will likely be necessary to better classify the tools using multiple assignments</li>
<li>Google code over SourceForge for open source (and an increase in Github, as well) &#8211; virtually all projects on SourceForge now feel abandoned or less active. The largest source of open source projects in the semantic technology space is now clearly Google Code. Though of a smaller footprint today, we are also seeing many of the newer open source projects also gravitate to Github. Open source hosting environments are clearly in flux.</li>
</ul>
<p>I have said this before, and been wrong about it before, but it is hard to see the tooling growth curve continue at its current slope into the future. I think we will see many individual tools spring up on the open source hosting sites like Google and Github, perhaps at relatively the same steady release rate. But, old projects I think will increasingly be abandoned and older projects will not tend to remain available for as long a time. While a relatively few established open source standards, like Solr and Jena, will be the exception, I think we will see shorter shelf lives for most open source tools moving forward. This will lead to a younger tools base than was the case five or more years ago.</p>
<p>I also think we will continue to see the dominance of open source. Proprietary software has increasingly been challenged in the enterprise space. And, especially in semantic technologies, we tend to see many open source tools that are as capable as proprietary ones, and generally more dynamic as well. The emphasis on open data in this environment also tends to favor open source.</p>
<p>Yet, despite the professionalism, sophistication and complexity trends, I do not yet see massive consolidation in the semantic technology space. While we are seeing a rapid maturation of tooling, I don&#8217;t think we have yet seen a similar maturation in revenue and business models. While notable semantic technology start-ups like <a href="http://en.wikipedia.org/wiki/Powerset_%28company%29">Powerset</a> and <a href="http://en.wikipedia.org/wiki/Siri_%28software%29">Siri</a> have been acquired and are clear successes, these wins still remain much in the minority.</p>
<hr align="left" size="1" width="33%" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="tool1"></a>[1] Please use the comments section of this post for suggesting new or overlooked tools. We will incrementally add them to the Sweet Tools listing. Also, please see the About tab of the <a href="http://www.mkbergman.com/sweet-tools/">Sweet Tools</a> results listing for prior releases and statistics.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/991/the-state-of-tooling-for-semantic-technologies/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Fred&#8217;s Hair is on Fire</title>
		<link>http://www.mkbergman.com/981/freds-hair-is-on-fire/</link>
		<comments>http://www.mkbergman.com/981/freds-hair-is-on-fire/#comments</comments>
		<pubDate>Tue, 18 Oct 2011 04:19:34 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Open Semantic Framework]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Software Development]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[#opensource]]></category>
		<category><![CDATA[#startup]]></category>
		<category><![CDATA[semantics]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=981</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Fred&#8217;s Hair is on Fire&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Open Semantic Framework&amp;rft.subject=Open Source&amp;rft.subject=Software Development&amp;rft.subject=Structured Dynamics&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-10-17&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/981/freds-hair-is-on-fire/&amp;rft.language=English"></span>
Today&#8217;s Post is a Testimony to the Value of Vacations My partner, Fred Giasson, today posted the second part of his series on open source. Since returning from a well-earned vacation a few weeks back &#8212; after more than three years without a break &#8212; Fred has been writing and developing up a storm. As [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Fred&#8217;s Hair is on Fire&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Open Semantic Framework&amp;rft.subject=Open Source&amp;rft.subject=Software Development&amp;rft.subject=Structured Dynamics&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-10-17&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/981/freds-hair-is-on-fire/&amp;rft.language=English"></span>
<h2><a href="http://structureddynamics.com/"><img style="margin-right: 10px; float: left;" title="Structured Dynamics" src="../wp-content/themes/ai3/images/sd_logo_black.png" alt="Structured Dynamics" width="225" /></a>Today&#8217;s Post is a Testimony to the Value of Vacations</h2>
<p>My partner, <a href="http://fgiasson.com/blog/">Fred Giasson</a>, today posted the <a href="http://fgiasson.com/blog/index.php/2011/10/17/open-sources-projects-as-a-pool-of-resources/">second part</a> of his <a href="http://fgiasson.com/blog/index.php/2011/10/05/unnatural-open-source/">series on open source</a>. Since returning from a well-earned vacation a few weeks back &#8212; after more than three years without a break &#8212; Fred has been writing and developing up a storm. As someone said to me last week, &#8220;Fred&#8217;s on fire!&#8221; I could not agree more.</p>
<p>I think Fred&#8217;s post speaks for itself as to why and how <a href="http://structureddynamics.com/">Structured Dynamics</a> has made a conscious choice to embrace open source. The major reason he puts forth &#8212; to bootstrap the company without the need for external investment &#8212; is unusual in itself. But one thing he is silent about is why this is a compelling reason. I&#8217;ll comment on that.</p>
<p>Fred and I have both worked for others dependent on their capital for our ventures (a few more times in my case). Capital is great for expansion and operations, but it can be deadly when visions requiring patience are in play. Structured Dynamics is only now a bit more than halfway through its five-year plan. While semantics technologies are exciting with a world of upside potential, they have also been incubated in academic labs with (as yet) a general lack of practical deployment. The promise is there, but often the delivery and maturation have been lacking. We are committed to play a visible role in correcting that.</p>
<p>The approach Fred outlines was not perhaps easily available to new startups a decade ago. But now, with open source and the Internet, costs of entry and ongoing development have dropped markedly. Yet, surprisingly, the idea of financing a startup via revenues is still not talked about sufficiently &#8212; let alone often used as an actual basis for building a company.</p>
<p>I&#8217;ve been fortunate to be able to partner with a young, world-class technologist whose maturity exceeds that of individuals many years his senior. He understands that in order to achieve important visions that the stewardship of those ideas can not be left to venture capitalists committed solely or mostly to gaming terms or near-term returns. We&#8217;re placing our bets on the paying customer and our own judgment.</p>
<p>So, it is great to see Fred continue his phenomenal development productivity since he returned from Hawaii. The benefit of his vacation is that we are also now getting his insights on his blog again.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/981/freds-hair-is-on-fire/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Of Flagpoles and Fishes</title>
		<link>http://www.mkbergman.com/969/of-flagpoles-and-fishes/</link>
		<comments>http://www.mkbergman.com/969/of-flagpoles-and-fishes/#comments</comments>
		<pubDate>Mon, 15 Aug 2011 08:25:48 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Software and Venture Capital]]></category>
		<category><![CDATA[Structured Dynamics]]></category>
		<category><![CDATA[enterprise software]]></category>
		<category><![CDATA[Inbound marketing]]></category>
		<category><![CDATA[market penetration strategies]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[new paradigm]]></category>
		<category><![CDATA[permission marketing]]></category>
		<category><![CDATA[sales practices]]></category>
		<category><![CDATA[software license fees]]></category>
		<category><![CDATA[substantive marketing]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=969</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Of Flagpoles and Fishes&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Open Source&amp;rft.subject=Software and Venture Capital&amp;rft.subject=Structured Dynamics&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-08-15&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/969/of-flagpoles-and-fishes/&amp;rft.language=English"></span>
The New Paradigm of &#8216;Substantive Marketing&#8217; for Innovative IT This decade has clearly marked a sea change in the move of enterprise software from proprietary to open source, as I have recently discussed [1]. It is instructive that only a mere six years ago I was in heated fights with my then Board about open [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Of Flagpoles and Fishes&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Open Source&amp;rft.subject=Software and Venture Capital&amp;rft.subject=Structured Dynamics&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-08-15&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/969/of-flagpoles-and-fishes/&amp;rft.language=English"></span>
<h2><a><img title="World's Tallest Flagpole; see ref [9]" src="../wp-content/themes/ai3/images/2011Posts/110815_tallest_flagpole.jpg" alt="World's Tallest Flagpole; see ref [9]" hspace="5" vspace="5" align="left" /></a>The New Paradigm of &#8216;Substantive Marketing&#8217; for Innovative IT</h2>
<p>This decade has clearly marked a sea change in the move of enterprise         software from proprietary to open source, as I have recently <a href="../940/declining-it-innovation-in-the-enterprise/"> discussed</a> <a href="#flagpole1">[1]</a>. It is instructive that only a mere six years ago I         was in heated fights with my then Board about open source; today, that         seems so quaint and dated.</p>
<p>Also during this period many have noted how open source has changed the capital required to begin a new software startup <a href="#flagpole2">[2]</a>. Open         source both provides the tooling and the components for cobbling         together specialty apps and extensions. Six and seven and even eight         figure startup costs common just a decade ago have now dropped to four         or five figures. When we see the explosion of hundreds of thousands of         smartphone apps we are seeing the glowing residue of these additional         sea changes. Dropping startup costs by one to three orders of magnitude         is truly democratizing innovation.</p>
<p>But something else has been going on that is changing the face of         enterprise software (besides consolidation, another factor I also         recently <a href="../940/declining-it-innovation-in-the-enterprise/"> commented on</a>). And that factor is &#8220;marketing&#8221;. Much less commentary         is made about this change, but it, too, is greatly lowering costs and         fundamentally changing market penetration strategies. That topic &#8212; and         my personal experience with it &#8212; is the focus of this article.</p>
<h3>The Obsolete Recent Past</h3>
<p>Besides the few remaining big providers of enterprise software &#8212; like         IBM, Oracle, HP, SAP &#8212; most vendors have totally remade their sales         practices of just a few years ago. Large sales forces with big         commissions and a year to two year sales cycles can no longer be         justified when software license fees and the percentage maintenance         annuities that flow from them are dropping rapidly. Today&#8217;s mantras are         doing more with less and doing it faster, hardly consistent with the         traditional enterprise software model. Sure, big enterprises,         especially big government and big business, have large sunk costs in         legacy systems that will continue to be milked by existing vendors. But         the flow is constricting with longer-term trends clear to see. The old         enterprise software model is obsolete.</p>
<p>Even if it were not dying, it is hard to square huge investments in         sales and marketing when product development has become inexpensive and         agile. The proliferation of three-letter marketing acronyms for         branding &#8220;new&#8221; product areas and standard formulas for product hype of         just a few years ago also feels old and dated. Cozy relationships with         conventional trade press pundits and market analysts seem to be         diminishing in importance, possibly because the authoritativeness of         their influence is also diminishing. It is harder to justify market         firm subscription costs when priority budget items are being cut and         new information outlets have emerged.</p>
<p>In response to this, many developers have forsaken the enterprise         market for the consumer one. Indeed enterprises themselves are looking         more and more to the consumer sector and commodity apps for innovation         and answers. But, still, problems unique to enterprises remain and how         to effectively reach them in this brave new world is today&#8217;s marketing         problem for enterprise software vendors.</p>
<p>Most entities today, when opining about these challenges, tend to         emphasize the need for &#8220;laser focus&#8221; and &#8220;rifle-shot&#8221; targeting of         prospects. The advice takes the form of: 1) emphasize well-defined         verticals; 2) know your market well; and 3) target and go after your         likely prospects. Prospect data mining and targeted ad analysis are the         proferred elixirs.</p>
<p>But, there is little evidence such refined methods for prospect         identification and targeting are really working. Like politicians doing         focus groups and opinion polling to capture the desired &#8220;message&#8221; of         their potential electorates, these are all still &#8220;push&#8221; models of         marketing. Yet we are swamped with pushed messages and marketing         everywhere we turn. The model is failing.</p>
<p>Besides message overload, there are two issues with laser targeting.         First, despite all that we try to know about ready buyers (for         enterprise software), we really don&#8217;t know if any particular individual is truly needful,         in a position to buy, has the authority to buy, or is the right         advocate to make the internal sell. Second, though the idea of &#8220;laser&#8221;         carries with it the image of focus and not flailing, it is in fact         expensive to identify the targets and send a focused message their way.         Because of these issues, decay rates for laser prospects throughout         conventional sales pipelines continue to rise.</p>
<h3>A New Marketing Paradigm<img style="width: 225px; height: 207px;" title="New Paradigm Roadsign" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110815_new_paradigm_sign.jpg" alt="New Paradigm Roadsign" align="right" /></h3>
<p>There has always been the phenomenon of the &#8220;<a href="http://www.youtube.com/watch?v=InENM6fwIwE">fish jumping into the         boat</a>&#8220;; that is, the unanticipated inbound inquiry from a previously         unknown prospect leading to a surprisingly swift sale. But we have seen         this phenomenon increase markedly in recent years. <a href="http://structureddynamics.com/">Structured Dynamics</a>&#8216; current         customer base &#8212; including recurring customers &#8212; comes almost         exclusively from this source. As we have noted this trend in comparison         with more targeted outreach, we have spent much time trying to         understand why it is occurring and how we can leverage what Peter         Drucker called the &#8220;unexpected success&#8221; <a href="#flagpole3">[3]</a>.</p>
<p>What we are seeing, I believe, is a shift from sales to marketing, and         within marketing from direct or outbound marketing to a new paradigm of         marketing. Others have likened this to <a href="http://en.wikipedia.org/wiki/Inbound_marketing">inbound marketing</a> <a href="#flagpole4">[4]</a> or <a href="http://en.wikipedia.org/wiki/Content_marketing">content         marketing</a> <a href="#flagpole5">[5]</a> or <a href="http://en.wikipedia.org/wiki/Permission_marketing">permission         marketing</a> <a href="#flagpole6">[6]</a>. What we are seeing at Structured Dynamics bears many         resemblances to parts of what is claimed for these other approaches,         but not all. And, it is also true that what we are seeing may pertain         mostly to innovative IT for emerging enterprise markets, and not a         generalized paradigm suitable to other products or markets.</p>
<p>For lack of a better term, what we are seeing we can term &#8220;substantive         marketing&#8221;. By this we mean offering valuable content and         solutions-oriented systems for free and without restriction. This         shares aspects with content marketing. Then, in keeping with the trend         for buyers doing their own research and analysis to fulfill their own         needs, similar to the premises of inbound or permission marketing,         potential consumers can make their own judgments as to relevance and         value of our offerings.</p>
<p>Sometimes, of course, some prospects find our approaches and solutions         lacking. Sometimes, they may grab what we have offered for free and use         them on their own without compensation to us. But where the match is         right &#8212; and we need to be honest with both ourselves and the customer         when it is <em><strong>not</strong></em> &#8212; we can better         spend the customer&#8217;s limited time and resources to tailor our generic         solutions to their specific needs. In doing so, we offer higher value         (tailored services) while learning better about another spectrum of         consumer need that can virtuously enhance our substantive offerings for         the next prospect.</p>
<p>So, let&#8217;s decompose these components further to see what they can tell         us about this new practice of substantive marketing and how to use it         as an engine for moving forward.</p>
<p><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110815_substantive_marketing.png"><img class="center_ok" style="border: 0px solid; width: 600px; height: 524px;" title="Substantive Marketing" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110815_substantive_marketing.png" alt="Substantive Marketing" /></a></p>
<h4>The Virtuous Cycle Begins with Substantive Solutions</h4>
<p>The premise of substantive marketing is to offer square-deal value to         the marketplace in the form of solutions-based content. Like content         marketing that offers &#8220;the creation or sharing of content for the         purpose of engaging current and potential consumer bases&#8221; <a href="#flagpole5">[5]</a>,         substantive marketing goes even further. The whole basis and premise of         the approach is to provide substantive content, in one of more of these         areas, preferably all:</p>
<ul>
<li>Knowledge &#8212; this substantive area includes papers, commentary,         survey results or listings of tools and references useful to the target         market</li>
<li>Analysis &#8212; this content area includes unique analysis of market         trends, data, technologies or reviews that pertain to the target market</li>
<li>Code &#8212; this area relates to the provision of open source code and         tools, preferably under licenses that allow users to use the software         without restriction (two examples are the <a href="http://en.wikipedia.org/wiki/Apache_License">Apache 2 license</a> and         the <a href="http://en.wikipedia.org/wiki/MIT_License">MIT license</a>)</li>
<li>Documentation &#8212; a critical substantive area is the documentation         in how to install, use, modify or customize these tools, including a         prejudice to <a href="http://en.wikipedia.org/wiki/API">APIs</a> and         tutorial information</li>
<li>Methodologies, workflows and best practices &#8212; it is important to         also discuss how to properly operate and utilize these tools and         information. Taking care to document lessons learned and best practices         also helps the user community avoid common mistakes and to speed         adoption and utility, and</li>
<li>Demos &#8212; this area involves setting up (and sharing code and         procedures for same) demos that show how the code and its methods         actually work. Demos also become first use cases to aid the new user in         learning and setting up the code bases.</li>
</ul>
<p>Further, this substantive content is offered without strings,         restrictions or customer fill-in forms. The content is not a come on or         a teaser. We are not trying to gather leads or prospect names, because         we have no intent to dun them with emails or follow-ups.</p>
<p>This substantive content is as complete as can be to enable new users         to adopt the information and tools in their current state without         further assistance. (In some cases, the information also educates the         marketplace in order to prepare future customers for adoption.) Most         importantly, this substantive content is offered for free, either open source         (for code) or <a href="http://en.wikipedia.org/wiki/Creative_Commons_licenses">creative         commons</a> for documentation and other content. In return, it is fair         to request &#8212; and we do &#8212; attribution when this material is used.</p>
<p>We have previously termed this complete panoply of substantive content         a <em><a href="../882/listening-to-the-enterprise-total-open-solutions-part-1/"> total open solution</a></em><a href="#flagpole7"> [7]</a>.         Some might find the provision of such robust information crazy: How can         we give away the store of our proprietary knowledge and systems? But we         find this kind of thinking old school. In an open source world where so         much information is now available online, with a bit of effort         customers can find this information anyway. Rather, our mindset is that         customers do not want to pay again for what has already been done, but         are willing to pay for what can be done with that knowledge for their         own specific problems. Offering the complete storehouse of our         knowledge in fact signals our interest in only charging the customer         for new answers, new value or new formulations. The customers we like         to work with feel they are getting an honest, square deal.</p>
<h4>Flagpole Venues Help Increase Awareness</h4>
<p>Consider your substantive content to be your flag, a unique banner for         conveying and packaging your specific brand. It is thus important to         find appropriate flagpoles &#8212; in the virtual territories that your         customers visit &#8212; for raising this content high for them to see. Since         the role of these flagpoles is to create awareness in potential         prospects &#8212; who you do not likely know individually or even by group         in advance &#8212; it makes sense to raise your offerings up on many         flagpoles and on the highest flagpoles. Visibility is the object of the         approach.</p>
<p>This approach is distinctly not leafletting or cramming links or emails         into as many spaces as possible. The idea of substantive marketing is         to fly valuable content high enough that desirous potential customers         can discover and then inspect the information on their own, and only if         they so choose. In this regard, substantive marketing resembles         permission marketing <a href="#flagpole6">[6]</a>.</p>
<p>Being visible helps ensure that the needful, questing prospect that you         would never have been able to target on your own is able to see and be         aware of your offerings. And, since they are seeking information and         answers, your collateral needs to be of a similar nature. Solutions and         substance are what they are seeking; what you have run up the flagpole         should respond to that.</p>
<p>The mindset here is to respect your prospective customers and to allow         them to chose to receive and inspect your offerings, but only if they         so choose. If flown in the right venues with the right visibility,         customers will see your flags and inspect them if they meet their         requirements.</p>
<p>Some of the venues at which you can raise your flags include:</p>
<ul>
<li>Blogs &#8212; this venue is especially helpful, since you have complete         control over content, message, voice and packaging</li>
<li>Social networks &#8212; the value of social networks is now accepted,         and should be a core component of any visibility strategy. However, it         is also important to make sure that your contributions are driven by         substance and value and do not become part of the cacophonous         background noise</li>
<li>Vertical media &#8212; there are always existing outlets well-read and         -respected by your customer propects. Establishing relationships and         value with these third-party outlets can extend your reach</li>
<li>Web sites &#8212; this venue includes your standard Web sites, of         course. But, you should also consider setting up specific         project-related sites or sites dedicated to documentation (c.f., our         <a href="http://techwiki.openstructs.org/index.php/Main_Page">TechWiki</a> site         of 300+ technical articles) or to methodologies (the excellent           <a href="http://mike2.openmethodology.org/">MIKE2.0</a> site is one           great example) or to other ways by which particular content (such as           tools with the <a href="../sweet-tools/">Sweet           Tools</a> site) can raise another flag</li>
<li>User forums &#8212; user discussion groups and forums also become their         own attractants for like-interested prospects, and</li>
<li>Conferences and tradeshows &#8212; while potentially valuable, presence         at conferences and tradeshows must be carefully evaluated. Since         participation and opportunity costs are high, the venues should be         clearly relevant to your market space with likely decision makers in         attendance.</li>
</ul>
<p>The observant reader will have already concluded that each of these         venues develops slowly, and therefore raising visibility is generally a         slow-and-steady game that requires patience. Start-up vendors backed by         venture firms or those looking for quick visibility and cashout will         not find this approach suitable. On the other hand, customer prospects         looking for answers and self-sustaining solutions are not much         interested in flash in the pan vendors, either.</p>
<h4>A Model Responsive to the Changing Nature of Customer Prospects</h4>
<p>The real drivers for this changing paradigm come from customer         prospects. Sophisticated buyers of enterprise IT and instrumental         change agents within organizations share most if not all of these         characteristics:</p>
<ul>
<li>They are inundated with marketing messages and jaded about hype and         &#8220;pushed&#8221; messages</li>
<li>They are generally knowledgeable about their needs and problem         spaces and about approximate technologies. They are eager and desirous         of learning independently and know that their recommendations affect         their personal reputations and standing within their enterprises</li>
<li>With the many volatile external and internal changes, including         staff reductions and fluid assignments, leadership for new technology         adoption can come from many different and unknown corners of the         organization; it is extremely difficult to identify and target         prospects</li>
<li>The economic and competitive environment places a premium on         affordability and low-risk evaluations of new technologies</li>
<li>Lock-ins of any kind &#8212; be it to specific vendors or technologies         &#8212; are understood as inherently risky. This understanding is raising         the importance of open and standards-based approaches</li>
<li>Being the subject of a pushy sales effort is distasteful and a         negative to an eventual sale. Education and learning, however, is         respected</li>
<li>Because of all that is at stake, honesty with no bullshit is highly         appreciated. If you as a vendor do not offer an appropriate solution or         have fulfillment weaknesses, tell the prospect so. Further, tell them         who can supply the solution. One never knows when and where the next         problem may arise, and providing trustworthy advice can lead to later         engagements.</li>
</ul>
<p>More often than not we find our customers to have already installed and         used our existing substantive materials for some time before they         approach us about further work. They appreciate the tutorial         information and have taught themselves much in advance. By the time we         engage, both parties are able to cost-effectively focus on what is         truly missing and needed and to deliver those answers in a quick way.         Re-engagements tend to occur when a next set of gaps or challenges         arise.</p>
<p>Though it may sound trite or even unbelievable to those who have not         yet experienced such a relationship, the square deal value offered by         substantive marketing can really lead to true partnerships and trust         between vendor and customer. We experience it daily with our customers,         and <em>vice versa</em>. We also think         this is the adaptive approach that our new environment demands.</p>
<h4>The Free Path to Open Source and Solutions</h4>
<p>Once prospects learn of our substantive offerings, many may decide         independently that what we have is not suitable. Others may simply         download and use the information on their own, for which we often never         know let alone receive revenue. We are completely fine with this, as         shown for three different cases.</p>
<p>First, some of these prospects need no more than what we already have.         This increases our user base, increases our visibility and often         results in contributions to our forums and documentation.</p>
<p>Then, some of these prospects come to learn they need or want more than         what our current offerings provide, leading to two possible forks. In one         fork, the second case, they may have sufficient skills internally or         with other suppliers to extend the system on their own. Some of this         flows back to an improved code base or improved installation or         documentation bases.</p>
<p>In the other fork, the third case, they may decide to engage us in         tailoring a solution for them. That case is the only one of the three that         leads to a direct revenue path.</p>
<p>In all three cases we win, and the customer wins. Maybe enterprise         software vendors of decades past rue this reality of lower margins and         shared benefits; we agree that the absolute profit potential of         substantive marketing is much less. But we gladly accept the more         enjoyable work and steady revenue relationships resulting from these         changes. We are not engaged in some pollyann-ish altruism here, but in         a steely-eyed honest brokering that best serves our own self-interest         (and fairly that of the customer, as well).</p>
<h4>A Square Deal Baseline for Tailored Services</h4>
<p>Great IT product does not come from idle musings or dreamed up         functionality. It comes solely and directly from solving customer         problems. Only via customers can software be refined and made more         broadly usable.</p>
<p>A slipstream of those who have previously become aware and tested our         offerings will choose to engage our services. This generally takes the         form of an inbound call, where the prospect not only qualifies itself,         but also establishes the terms and conditions for the sale. They have         chosen to select us; they are fish that have jumped into the boat.</p>
<p>To again quote Peter Drucker, &#8220;. . . the aim of marketing is to make         selling superfluous. The aim of marketing is to know and understand the         customer so well that the product or service fits him and sells itself.         Ideally, marketing should result in a customer who is ready to buy. All         that should be needed then is to make the product or service available         . . .&#8221; <a href="#flagpole8">[8]</a>. This is precisely what I meant earlier about the shift in         emphasis from sales to marketing.</p>
<p>Even at this point there may be mismatches in needs and our skills and         availabilities. If such is the case, we do not hesitate to say so, and         attempt to point the prospect in another direction (from which we also         gain invaluable market knowledge). If there is indeed a match, we then         proceed to try to find common ground on schedule and budget.</p>
<p>Paradoxically, this square deal and honesty about the readiness and         weaknesses of our offerings often leads to forgiveness from our         customers. For example, for some time we have lacked automated         installation scripts that would make it easier for prospects to install         our <a href="http://openstructs.org/open-semantic-framework">open         semantic framework</a>. But, because of compensating value in other         areas, such gaps can be overlooked and tackled later on (indeed, as a         current customer is now funding). By not pretending to be everything to         everyone, we can offer what we do have without embarrassment and get on         with the job of solving problems.</p>
<p>For larger potential engagements, we typically suggest a fixed price         initial effort to develop an implementation plan. The interviews and         research to support this typical 4- to 6-weeks effort (generally in the $5         K to $10 K range, depending) then result in a detailed fulfillment         proposal, with firm tasks, budget and schedule, specific to that customer&#8217;s requirements. Just as we respect our         prospects&#8217; time and budget, we expect the same and do not conduct these         detailed plans without compensation. With respect to fulfillment         contracts, we cap contract amount and limit milestone payments to         pre-set percentages or time expended, whichever is lower.</p>
<p>This approach ensures we understand the customer&#8217;s needs and have         budgeted and tasked accordingly. Capped contracts also put the onus on         us the contractor to understand our own effort and tasking structures         and realities, which leads to better future estimating. For the         customer, this approach caps risk and potential exposure, and ensures         milestones are being met no matter the time expenditures by us, the         contractor. This approach extends our square-deal basis to also embrace         risks and payments.</p>
<h4>New (and Open Source) Developments Fuel the Substance Pipeline</h4>
<p>Thus, when customers engage us, they spend almost solely on new functionality         specifically tailored to their needs. In doing so, we suggest they         agree to release the new developments they fund as open source. We         argue &#8212; and customers predominantly agree &#8212; that they are already         benefitting from lower overall costs because other customers have         funded sharable, open source before them. We point out that the new         customers that follow them will also be independently creating new         functionality, to which they will also later benefit.</p>
<p>(This argument does not apply to specific customer data or ontologies,         which are naturally proprietary to the customer. Also, if the customer         wants to retain intellectual ownership of extensions, we charge higher         development fees.)</p>
<p>Once these new developments are completed, they are fed back into a new         baseline of valuable content and code. From this new baseline the cycle         of substantive marketing can be augmented anew and perpetuated.</p>
<h3>Three Guidelines to Leverage Substantive Marketing</h3>
<p>All of these points can really be boiled down to three guidelines for         how to make substantive marketing effective:</p>
<ul>
<li>First, whatever your domain or market, provide useful and         substantive content. The content you offer is indeed your marketing         collateral. Prospective customers can gauge from it directly whether it         meets their needs, appears sound and workable, and has value. If you         have little of substance to offer, this paradigm is not for you</li>
<li>Second, plant many flagpoles and raise your flags high in         territories your market prospects are likely to visit. This is a         process that requires thoughtfulness and patience. Thoughtfulness,         because that is how you determine where to plant your flags. If you         yourself are a consumer of what you offer, it is easier to find those         venues. And patience, because it takes time to stack valuable content         upon valuable content in order to raise visibility</li>
<li>And, third, be honest and respectful. Help your prospect work         within available budget to achieve the most possible at lowest risk.         And help them find others, if need be, who might be better able than         you to truly solve their problems.</li>
</ul>
<p>What we are finding &#8212; as we continue to refine our understanding of         this new paradigm &#8212; is that through substantive marketing the fish are         finding us and they sometimes jump into the boat. We like our         enterprise customers to pre-qualify themselves and already be &#8220;sold&#8221;         once they knock on the door. One never knows when that phone might ring         or the email might come in. But when it does, it often results in a         collaborative customer as a partner who is a joy to work with to solve         exciting new problems.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole1"></a>[1] M.K. Bergman, 2011. &#8220;Declining IT Innovation in the Enterprise,&#8221; in<em> AI3:::Adaptive Innovation </em> blog, January 17, 2011. See <a href="../940/declining-it-innovation-in-the-enterprise/"> http://www.mkbergman.com/940/declining-it-innovation-in-the-enterprise/</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole2"></a>[2] Paul Graham has been the most prominent observer of this scene; see         P. Graham, 2008. &#8220;Why There Aren&#8217;t Any More Googles,&#8221; April 2008 (see         <a href="http://www.paulgraham.com/googles.html">http://www.paulgraham.com/googles.html</a>)         and subsequent articles.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole3"></a>[3] See esp. Peter F. Drucker, 1985. <em>Innovation and Entrepreneurialship: Practice and         Principals</em>, Harper &amp; Row, New York, NY, 277 pp.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole4"></a>[4] <a href="http://en.wikipedia.org/wiki/Inbound_marketing">Inbound         marketing</a> is a marketing strategy that focuses on getting found by         customers. According to <a title="David Meerman Scott" href="http://en.wikipedia.org/wiki/David_Meerman_Scott">David Meerman Scott</a>, inbound marketers         &#8220;earn their way in&#8221; (via publishing helpful information on a blog etc.)         in contrast to outbound marketing where they used to have to &#8220;buy, beg,         or bug their way in&#8221; (via paid advertisements, issuing press releases         in the hope they get picked up by the trade press, or paying         commissioned sales people, respectively). <a title="Brian Halligan" href="http://en.wikipedia.org/wiki/Brian_Halligan">Brian Halligan</a>, cofounder and CEO of <a title="HubSpot" href="http://en.wikipedia.org/wiki/HubSpot">HubSpot</a>,         claims he first coined the term of inbound marketing.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole5"></a>[5] <a href="http://en.wikipedia.org/wiki/Content_marketing" target="_blank">Content marketing</a> is an umbrella term encompassing         all marketing formats that involve the creation or sharing of content         for the purpose of engaging current and potential consumer bases. In         contrast to traditional marketing methods that aim to increase sales or         awareness through interruption techniques, content marketing subscribes         to the notion that delivering high-quality, relevant and valuable         information to prospects and customers drives profitable consumer         action. See also Holger Shulze, 2011. <em>B2B Content Marketing Trends</em> slideshow,         see <a href="http://www.slideshare.net/hschulze/b2b-content-marketing-report">http://www.slideshare.net/hschulze/b2b-content-marketing-report</a>.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole6"></a>[6] <a href="http://en.wikipedia.org/wiki/Seth_Godin">Seth Godin</a> coined the term <a href="http://en.wikipedia.org/wiki/Permission_marketing">permission         marketing</a> wherein marketers obtain permission before advancing to         the next step in the purchasing process. It is mostly used by online         marketers, notably email marketers and search marketers, as well as         certain direct marketers who send a catalog in response to a request.         Godin contrasts this approach to traditional &#8220;interruption marketing&#8221;         where messages are sent without prior permission.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole7"></a>[7] See the three-part series, M.K. Bergman, 2010. &#8220;Listening to the         Enterprise: Total Open Solutions,&#8221; &#8220;<a href="../882/listening-to-the-enterprise-total-open-solutions-part-1/">Part         1</a>,&#8221; &#8220;<a href="../883/listening-to-the-enterprise-total-open-solutions-part-2/">Part         2</a>&#8221; and &#8220;<a href="../884/listening-to-the-enterprise-total-open-solutions-part-3/">Part         3</a>,&#8221;<em> AI3:::Adaptive         Information</em> blog, May 12 &#8211; 31, 2010.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole8"></a>[8] Peter F. Drucker, 1974. <em>Management: Tasks, Responsibilities,         Practices</em>. New York, NY: Harper &amp; Row. pp. 864. ISBN         0-06-011092-9.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="flagpole9"></a>[9] The intro photo is of the world&#8217;s tallest flagpole (at 165 m), in         Dushanbe, Tajikistan. The photo is courtesy of <em>CentralAsiaOnline.com.</em></div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/969/of-flagpoles-and-fishes/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>A New Best Friend: Gephi for Large-scale Networks</title>
		<link>http://www.mkbergman.com/968/a-new-best-friend-gephi-for-large-scale-networks/</link>
		<comments>http://www.mkbergman.com/968/a-new-best-friend-gephi-for-large-scale-networks/#comments</comments>
		<pubDate>Mon, 08 Aug 2011 09:27:47 +0000</pubDate>
		<dc:creator>Mike Bergman</dc:creator>
				<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Semantic Web Tools]]></category>
		<category><![CDATA[UMBEL]]></category>
		<category><![CDATA[cytoscape]]></category>
		<category><![CDATA[Gephi]]></category>
		<category><![CDATA[graph]]></category>
		<category><![CDATA[graph analysis]]></category>
		<category><![CDATA[graph visualization]]></category>
		<category><![CDATA[graphing programs]]></category>
		<category><![CDATA[network]]></category>

		<guid isPermaLink="false">http://www.mkbergman.com/?p=968</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=A New Best Friend: Gephi for Large-scale Networks&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Ontologies&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=UMBEL&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-08-08&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/968/a-new-best-friend-gephi-for-large-scale-networks/&amp;rft.language=English"></span>
Visualization + Analysis Pushes Aside Cytoscape Though I never intended it, some posts of mine from a few years back dealing with 26 tools for large-scale graph visualization have been some of the most popular on this site. Indeed, my recommendation for Cytoscape for viewing large-scale graphs ranks within the top 5 posts all time [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=A New Best Friend: Gephi for Large-scale Networks&amp;rft.aulast=Bergman&amp;rft.aufirst=Mike&amp;rft.subject=Ontologies&amp;rft.subject=Open Source&amp;rft.subject=Semantic Web Tools&amp;rft.subject=UMBEL&amp;rft.source=AI3:::Adaptive Information&amp;rft.date=2011-08-08&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://www.mkbergman.com/968/a-new-best-friend-gephi-for-large-scale-networks/&amp;rft.language=English"></span>
<h2><a><img style="border: 0px solid; width: 250px; height: 246px; float: left; margin-right: 10px;" title="Geshi Network" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110808_geshi.png" alt="Geshi Network" hspace="5" vspace="5" align="left" /></a>Visualization + Analysis Pushes Aside Cytoscape</h2>
<p>Though I never intended it, some posts of mine from a few years back         dealing with <a href="../414/large-scale-rdf-graph-visualization-tools/"> 26 tools for large-scale graph visualization</a> have been some of the         most popular on this site. Indeed, my <a href="../415/cytoscape-hands-down-winner-for-large-scale-graph-visualization/"> recommendation for Cytoscape</a> for viewing large-scale graphs ranks         within the top 5 posts all time on this site.</p>
<p>When that analysis was done in January 2008 <a href="http://structureddynamics.com/">my company</a> was in the midst         of needing to process the large <a href="http://www.umbel.org/">UMBEL</a> vocabulary, which now consists of         28,000 concepts. Like anything else, need drives research and demand,         and after reviewing <a href="../414/large-scale-rdf-graph-visualization-tools/"> many graphing programs</a>, we <a href="../415/cytoscape-hands-down-winner-for-large-scale-graph-visualization/"> chose Cytoscape</a>, then provided some ongoing guidelines in its use         for semantic Web purposes. We have continued to use it productively in         the intervening years.</p>
<p>Like for any tool, one reviews and picks the best at the time of need.         Most recently, however, with growing customer usage of large ontologies and the development of our         own <a href="../959/intro-to-structontology/">structOntology</a> editing and managing framework, we have begun to butt up against the         limitations of large-scale graph and network analysis. With this post,         we announce our new favorite tool for semantic Web network and graph         analysis &#8212; <a href="http://gephi.org/">Gephi</a> &#8212; and explain its         use and showcase a current example.</p>
<h3>The Cytoscape Baseline and Limitations</h3>
<p>Three and one-half years ago when I first wrote about <a href="../415/cytoscape-hands-down-winner-for-large-scale-graph-visualization/"> Cytoscape</a>, it was at version 2.5. Today, it is at version 2.8, and         many aspects have seen improvement (including its Web site). However,         in other respects, development has slowed. For example, version         3.x was first discussed more than three years ago; it is still not         available today.</p>
<p>Though the system is open source, Cytoscape has also largely been         developed with external grant funds. Like other similarly funded         projects, once and when grant funds slow, development slows as well.         While there has clearly been an active community behind Cytoscape, it         is beginning to feel tired and a bit long in the tooth. From a semantic         Web standpoint, some of the limitations of the current Cytoscape         include:</p>
<ul>
<li>Difficult conversion of existing ontologies &#8212; Cytoscape requires         creating a CSV input; there was an earlier <a href="http://bioinformatics.org/rdfscape/">RDFscape</a> plug-in that held         great promise to bridge the software into the RDF and semantic Web         sphere, but it has not remained active</li>
<li>Network analysis &#8212; one of the early and valuable generalized         network analysis plug-ins was <a href="http://med.bioinf.mpi-inf.mpg.de/netanalyzer/">NetworkAnalyzer</a>;         however, that component has not seen active development in three years,         and dynamic new generalized modules suitable for social network         analysis (<a href="http://en.wikipedia.org/wiki/Social_network_analysis">SNA</a>) and         <a href="http://en.wikipedia.org/wiki/Small-world_network">small-world         networks</a> have not been apparent</li>
<li>Slow performance and too-frequent crashes &#8212; Cytoscape has always         had a quirky interface and frequent crashes; later versions are a bit         more stable, but usability remains a challenge</li>
<li>Largely supported by the biomedical community &#8212; from the         beginning, Cytoscape was a project of the biomedical community. Most         plug-ins still pertain to that space. Because of support for <a href="http://www.obofoundry.org/">OBO</a> (Open Biomedical and Biological         Ontologies) formats and a lack of uptake by the broader semantic Web         community, RDF- and OWL-based development has been keenly lacking</li>
<li>Aside from PDFs, poor ability to output large graphs in a viewable         manner</li>
<li>Limited layout support &#8212; and poor performance for many of those         included with the standard package.</li>
</ul>
<p>Undoubtedly, were we doing semantic technologies in the biomedical         space, we might well develop our own plug-ins and contribute to the         Cytoscape project to help overcome some of these limitations. But,         because I am a tools geek (see my <a href="../sweet-tools/">Sweet Tools listing</a> with         nearly 1000 semantic Web and -related tools), I decided to check out         the current state of large-scale visualization tools and see if any had         made progress on some of our outstanding objectives.</p>
<h3>Choosing Geshi and Using It</h3>
<p>There are three classes of graph tools in the semantic technology         space:</p>
<ol>
<li>Ontology navigation and discovery, to which the <a href="http://techwiki.openstructs.org/index.php/SRelationBrowser_Examples"> Relation Browser</a> and <a href="http://www.visualdataweb.org/relfinder.php">RelFinder</a> are           notable examples</li>
<li>Ontology structure visualization (and sometimes editing), such as         the <a href="http://graphviz.org/">GraphViz</a> (OWLViz) or <a href="http://protegewiki.stanford.edu/wiki/OntoGraf">OntoGraf</a> tools used         in <a href="http://protege.stanford.edu/">Protégé</a> (or the nice         <a href="http://www.thechiselgroup.org/flexviz">FlexViz</a>, again used         by the OBO community), and</li>
<li>Large-scale graph visualization in order to gain a complete picture         and macro relationships in the ontology.</li>
</ol>
<p>One could argue that the first two categories have received the most         current development attention. But, I would also argue that the third         class is one of the most critical:  to understand where one is in         a large knowledge space, much better larger-scale visualization and         navigation tools are needed. Unfortunately, this third category is also         the one that appears to be receiving the least development attention.         (To be sure, large-scale graphs pose computational and performance         challenges.)</p>
<p>In the nearly four years since my <a href="../414/large-scale-rdf-graph-visualization-tools/"> last major survey of 26 tools</a> in this category, the new entrants         appear <a href="http://en.wikipedia.org/wiki/Social_network_analysis_software">quite         limited</a>. I&#8217;ve surely overlooked some, but the most notable are         <a href="http://www.franz.com/agraph/gruff/">Gruff</a>, <a href="http://ophid.utoronto.ca/navigator/">NAViGaTOR</a>, <a href="http://networkx.lanl.gov/">NetworkX</a> and <a href="http://gephi.org/">Gephi</a> <a href="#gephi1">[1]</a>. Gruff actually appears to belong         most in Category #2; I could find no examples of graphs on the scale of         thousands of nodes. NAViGaTOR is biomedical only. NetworkX has no direct         semantic graph importing and &#8212; while apparently some RDF libraries can         be used for manipulating imports &#8212; alternative workflows were too         complex for me to tackle for initial evaluation. This leaves Gephi as         the only potential new candidate.</p>
<p>From a clean Web site to well-designed intro tutorials, first         impressions of Gephi are strongly positive. The real proof, of course,         was getting it to perform against my real use case tests. For that, I         used a &#8220;big&#8221; ontology for a current client that captures about 3000         different concepts and their relationships and more than 100         properties. What I recount here &#8212; from first installing the program         and plug-ins and then setting up, analyzing, defining display         parameters, and then publishing the results &#8212; took me less than a day         from a totally cold start. The Gephi program and environment is         surprisingly easy to learn, aided by some great tutorials and online info         (see concluding section).</p>
<p>The critical enabler for being able to use Gephi for this source and         for my purposes is the <a href="http://gephi.org/plugins/semanticwebimport/">SemanticWebImport</a> plug-in, recently developed by Fabien Gandon and his team at <a href="http://www.inria.fr/en/">Inria</a> as part of the <a href="http://www-sop.inria.fr/edelweiss/">Edelweiss</a> project <a href="#gephi2">[2]</a>. Once         the plug-in is installed, you need only open up the SemanticWebImport         tab, give it the URL of your source ontology, and pick the Start button         (middle panel):</p>
<div><a title="SemWeb Plug-in for Gephi" href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_intangibles_trends.png"> </a><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110808_gephi_semwebimport.png"><img class="center_ok" style="border: 0px solid; width: 600px; height: 488px;" title="SemWeb Plug-in for Gephi" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110808_gephi_semwebimport.png" alt="SemWeb Plug-in for Gephi" /></a>Note the SemanticWebImport tool also has the ability (middle panel) to         issue queries to a SPARQL endpoint, the results of which return a         results graph (partial) from the source ontology. (This feature is not         further discussed herein.) This ontology load and display capability         worked without error for the five or six OWL 2 ontologies I initially         tested against the system.&nbsp;</p>
</div>
<p>Once loaded, an ontology (graph) can be manipulated with a conventional         IDE-like interface of tabs and views. In the right-hand panels above we         are selecting various network analysis routines to run, in this case         Average Degrees. Once one or more of these analysis options is run, we can use the results to then cluster or         visualize the graph; the upper left panel shows highlighting the         Modularity Class, which is how I did the community (clustering)         analysis of our big test ontology. (When run you can also assign         different colors to the cluster families.) I also did some filtering of         extraneous nodes and properties at this stage and also instructed the         system via the ranking analysis to show nodes with more link         connections as larger than those nodes with fewer links.</p>
<p>At this juncture, you can also set the scale for varying such display         options as linear or some power function. You can also select different         graph layout options (lower left panel). There are many layout plug-in         options for Gephi. The layout plugin called <a rel="nofollow" href="http://gephi.org/2010/openord-new-layout-plugin-the-fastest-algorithm-so-far/">OpenOrd</a>, for instance, is reported to be able to         scale to millions of nodes.</p>
<p>At this point I played extensively with the combination of filters,          analysis, clusters, partitions and rankings (as may be separately          applied to nodes and edges) to: 1) begin to understand the gross          structure and characteristics of the big graph; and 2) refine the          ultimate look I wanted my published graph to have.</p>
<p>In our example, I ultimately chose the standard Yifan Hu layout in order to         get the communities (clusters) to aggregate close to one another on the         graph. I then applied the Parallel Force Atlas layout to organize the nodes and make the         spacings more uniform. The parallel aspect of this force-based layout         allows these intense calculations to run faster. The result of these         two layouts in sequence is then what was used for the results displays.</p>
<p>Upon completion of this analysis, I was ready to publish the graph. One         of the best aspects of Gephi is its flexibility and control over         outputs. Via the main Preview tab, I was able to do my final         configurations for the published graph:</p>
<div><a title="Publication Options for Gephi" href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110509_intangibles_trends.png"> </a><a href="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110808_gephi_preview.png"><img class="center_ok" style="border: 0px solid; width: 600px; height: 489px;" title="Publication Options for Gephi" src="http://www.mkbergman.com/wp-content/themes/ai3/images/2011Posts/110808_gephi_preview.png" alt="Publication Options for Gephi" /></a>The graph results from the earlier-worked out filters and clusters and colors are         shown in the right-hand Preview pane. On the left-hand side, many         aspects of the final display are set, such as labels on or off, font         sizes, colors, etc. It is worth looking at the figure above in full         size to see some of the options available.&nbsp;</p>
</div>
<p>Standard output options include either SVG (vector image) or PDFs, as         shown at the lower left, with output size scaling via slider bar. Also,         it is possible to do standard saves under a variety of file formats or         to do targeted exports.</p>
<p>One really excellent publication option is to create a dynamically         zoomable display using the <a href="http://www.seadragon.com/">Seadragon</a> technology via a separate         <a href="http://gephi.org/plugins/seadragon/">Seadragon Web Export</a> plug-in. (However, because of cross-site scripting limitations due to         security concerns, I only use that option for specific sites. See next         section for the <a href="http://zoom.it/">Zoom It</a> option &#8212; based on Seadragon &#8212; to         workaround that limitation.)</p>
<h3>Outputs Speak for Themselves</h3>
<p>I am very pleased with the advances in display and analysis provided by         Gephi. Using the <a href="http://zoom.it/">Zoom It</a> alternative <a href="#gephi3">[3]</a> to embedded Seadragon, we can see our big ontology example with:</p>
<ul>
<li>All 3000 nodes labeled, with connections shown (though you must must zoom to see)         and</li>
<li>When zooming (use scroll wheel or + icon) or panning (via mouse down moves), wait a couple of seconds to get the         clearest image refresh:</li>
</ul>
<div><script src="http://zoom.it/BAs7.js?width=616px&amp;height=614px" type="text/javascript">
</script></div>
<p><strong>Note</strong>: at standard resolution,         if this graph were to be rendered in actual size, it would be larger than         7 feet by 7 feet square at full zoom !!!</p>
<p>To compare output options, you may also;</p>
<ul>
<li> <a href="../network/big_ontology.pdf">Download a           PDF</a> of this big graph, OR</li>
<li> <a href="../network/big_ontology.svg">Download an           SVG</a> (Inkscape readable version) of this big graph.</li>
</ul>
<h3>Still, Some Improvements Would be Welcomed</h3>
<p>It is notable that Gephi still only versions itself as an &#8220;alpha&#8221;.         There is already a robust user community with promise for much more         technology to come.</p>
<p>As an alpha, Gephi is remarkably stable and well-developed. Though         clearly useful as is, I measure the state of Gephi against my complete         list of desired functionality, with these items still missing:</p>
<ul>
<li>Real-time and interactive navigation &#8212; the ability to move through         the graph interactively and to issue queries and discover relationships</li>
<li>Huge node numbers &#8212; perhaps the <a rel="nofollow" href="http://gephi.org/2010/openord-new-layout-plugin-the-fastest-algorithm-so-far/">OpenOrd</a> plug-in somewhat addresses this need. We           will be testing Gephi against <a href="http://www.umbel.org/">UMBEL</a>, which is an order of magnitude           larger than our test big ontology</li>
<li>More node and edge control &#8212; Cytoscape still retains the advantage         in the degree to which nodes and edges can be graphically styled</li>
<li>Full round-tripping &#8212; being able to use Gephi in an edit mode         would be fantastic; the edit functionality is fairly straightforward,         but the ability to round-trip in appropriate formats (OWL, RDF or         otherwise) may be the greater sticking point.</li>
</ul>
<p>Ultimately, of course, as I explained in an earlier presentation on a         <em><a href="http://techwiki.openstructs.org/index.php/Normative_Landscape_of_Ontology_Tools"> Normative Landscape for Ontology Tools</a></em>, we would like to see a         full-blown graphical program tie in directly with the <a href="http://owlapi.sourceforge.net/">OWL API</a>. Some initial attempts         toward that have been made with the non-Gephi <a href="http://glowvis.org/Main_Page">GLOW visualization</a> approach, but it         is still in very early phases with ongoing commitments unknown.         Optimally, it would be great to see a Gephi plug-in that ties directly to the OWL API.</p>
<p>In any event, while perhaps Cytoscape development has stalled a bit for         semantic technology purposes, Gephi and its SemanticWebImport plug-in         have come roaring into the lead. This is a fine toolset that promises         usefulness for many years to come.</p>
<h3>Some Further Gephi Links</h3>
<p>To learn more about Gephi, also see the:</p>
<ul>
<li>Terrific introductory tutorials on <a href="http://gephi.org/users/quick-start/">quick start</a>, <a href="http://gephi.org/users/tutorial-visualization/">visualization</a> and         <a href="http://gephi.org/users/tutorial-layouts/">layouts</a></li>
<li>Fast videos on the use of the <a href="http://wiki.gephi.org/index.php/SemanticWebImport">SemanticWebImport</a> plug-in for <a href="http://www-sop.inria.fr/teams/edelweiss/wiki/wakka.php?wiki=DBpediaGephi"> DBpedia</a> and <a href="http://www-sop.inria.fr/teams/edelweiss/wiki/wakka.php?wiki=BBCinGephi"> BBC programs</a></li>
<li>Gephi <a href="http://wiki.gephi.org/index.php/Main_Page">community         wiki.</a></li>
</ul>
<p>Also, for future developments across the graph visualization spectrum,         check out the Wikipedia <a href="http://www.wikiviz.org/wiki/Tools">general visualization tools</a> listing on a periodic basis.</p>
<hr style="margin: 15px 0px;" size="1" />
<div style="margin: 10px 0pt; font-size: 90%;"><a name="gephi1"></a>[1] The <a href="http://www.r-project.org/">R</a> open source math and statistics package is very rich with apparently some graph visualization capabilities, such as the dedicated network analysis and visualization project <a href="http://statnetproject.org/">statnet</a>. <a href="http://cran.r-project.org/web/packages/rrdf/index.html">rrdf</a> may also provide an interesting path for RDF imports. R and its family of tools may indeed be quite promising, but the commitment necessary to R appears quite daunting. Longer-term, R may represent a more powerful upgrade path for our general toolsets. <a href="http://neo4j.org/">Neo4j</a> is also a rising star in graph databases, with its own visualization components. However, since we did not want to convert our underlying data stores, we also did not test this option.</div>
<div style="margin-top: 10px; font-size: 90%;"><a name="gephi2"></a>[2] Erwan Demairy is the lead developer and committer for <a href="http://wiki.gephi.org/index.php/SemanticWebImport">SemanticWebImport</a>. The first version was released in mid-April 2011.</div>
<div style="margin: 10px 0pt; font-size: 90%;"><a name="gephi3"></a>[3] For presentations like this blog post, the Seadragon JavaScript enforces some security restrictions against cross-site scripting. To overcome that, the option I followed was to:</div>
<div style="font-size: 90%;">
<ul>
<li>Use Gephi&#8217;s SVG export option</li>
<li>Open the SVG in Inkscape</li>
<li>Expand the size of the diagram as needed (with locked dimensions to prevent distortion)</li>
<li>Save As a PNG</li>
<li>Go to <a href="http://zoom.it/">Zoom It</a> and submit the image file</li>
<li>Choose the embed function, and</li>
<li>Embed the link provided, which is what is shown above.</li>
</ul>
</div>
<div style="font-size: 90%;">(Though Zoom.it also accepts SVG files directly, I found performance to be spotty, with many graphical elements dropped in the final rendering.)</div>
]]></content:encoded>
			<wfw:commentRss>http://www.mkbergman.com/968/a-new-best-friend-gephi-for-large-scale-networks/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

