Posted:November 18, 2007

Sweet Tools Listing

Version 11 Shows Semantic Web Tools Announcements to be on a Furious Pace

This AI3 blog maintains Sweet Tools, the largest listing of about 800 semantic Web and -related tools available. Most are open source. Click here to see the current listing!

After a two months hiatus, I have just updated AI3's listing of semantic Web and -related tools to version 11; follow the links below.

This version adds 72 new tools — one of the largest additions ever — since the last update on September 16, bringing the new total to 650 tools.

Like prior versions, this new Sweet Tools listing is provided either as:

Background on prior listings and earlier statistics may be found on these previous posts:

With interim updates periodically over that period.

In thanks to Henry Story, whose chat note prompted me to post this long overdue update, I have also included some of the tools breakdowns he enjoys.

Some Stats

Here is a table of tools categories, showing both the last breakdown from about 9 months ago when there were 500 tools in the listing, and today with 650 tools:

3/11/2007 11/18/2007 Category
11 24 Ontology Mapper/Mediator
3 6 RDF Editor
2 4 RDF Generator
13 23 Query Language or Service
19 32 Search Engine
23 37 Browser (RDF, other)
20 30 RDF (general)
23 33 NLP/Language Processor
15 21 Visualization
29 40 Composite App/Framework
9 12 Ontology Editor
22 28 Wiki- or blog-related
32 39 Ontology (general)
5 6 Semantic Desktop
22 26 Reasoner/Inference Engine
26 30 Annotator
29 33 Database/Datastore
8 9 Data Language
48 53 Miscellaneous
30 33 Parser or Converter
43 45 Information Extraction
22 23 Wrapper (Web data extractor)
3 3 Description or Formal Logics
4 4 Harvester
25 25 Programming Environment
8 8 Validator
3 Chat-related
3 Data Presentation
3 Mashup/Meshup Framework
2 Rules and related
6 12 NOT ACTIVE (???)
500 650

The fastest growing categories are listed first, with RDF, ontologies, and search growing the most. Note some categories have been added and others are being re-classified or delisted as time and familiarity with the listing grows. Certain areas of current interest, such as NLP, also see increased listings as a result. Of course, all such categorizations have a degree of arbitrariness.

As with the last survey, Java and JavaScript are dominant languages, with considerable growth. Python and PHP have also shown higher than average growth. Ruby appears to have stagnated, and many other languages are in second-tier positions:

Sweet Tools by Language

You’ll be pleased to see, Henry, that Java is still holding serve on market share!

Note: Because of comments expirations on prior posts, this entry is now the new location for adding a suggested new tool. Simply provide your information in the comments section, and your tool will be included in the next update. (Hopefully, that will not take another two months!) :)

Posted:October 17, 2007

Wesch Provides Another Winner with Information R/evolution

Michael Wesch, assistant professor of cultural anthroplogy at Kansas State University, initiator of the Digital Ethnography program, and creator of the previous video wonder “Web 2.0 … The Machine is Us/ing Us” (among others), has produced another fantastic 5 and one-half minutes of viewing pleasure (NOTE: may not show properly in IE; click on large white space):

Posted:October 5, 2007

Why not Web 4.0? Heck, why not Web 98.6?

I am again puking from all of the recent references to “Web 3.0″ as a code number for the pending semantic Web. (As if the semantic Web is a version release, as opposed to a process and an approach.)

The purveyors of this pseudo-debate know who they are and should know better. Shame, shame.

To settle my guts, I decided to I recover this posting I made from about a year ago:

I don't generally rant on this blog, but after having just about puked on too many Web 2.0 references, I'm now choking on new Web 3.0 references. (For those not in the know, Web 3.0 is the “real” semantic Web stuff, while Web 2.0 is that oh-so tired tagging and mashup shit.) Well, the escalation now appears clear. We're now seeing a proliferation of Web 3.0 posts today in order to set some distinctions. Don't like 3.0, not enough horsepower? How about I see that and raise you to 4.0? But why end there? After all, high school was so terribly fun and clever. . . . So, the heck with it. I decided to jump right to the jugular:

Hey, ninety eight point six, it's good to have you back again. oh, Hey, ninety eight point six, her lovin' is the medicine that saaaved me, Oh, I love my baby. *

Web 98.6. It's so cool to be on the cutting edge . . . .

* Lyrics for Keith 98.6 (written by G. Fischoff/T.Powers) provided by http://www.smoke420.com/lyrics.

Posted by AI3's author, Mike Bergman Posted on October 5, 2007 at 11:46 am in Semantic Web | Comments (2)
The URI link reference to this post is: http://www.mkbergman.com/406/please-squash-that-web-30-cockroach/
The URI to trackback this post is: http://www.mkbergman.com/406/please-squash-that-web-30-cockroach/trackback/
Posted:September 16, 2007

Sweet Tools Listing

AI3's Sweet Tools Listing Updated to Version 10

This AI3 blog maintains Sweet Tools, the largest listing of about 800 semantic Web and -related tools available. Most are open source. Click here to see the current listing!

AI3's listing of semantic Web and -related tools has just been updated to version 10. This version adds 36 new tools since the last update on June 19, bringing the new total to 578 tools.

This version 10 update of Sweet Tools also includes an upgrade to version 2 of the lightweight Exhibit display (thanks again, MIT's Simile program and David Huynh, plus congratulations on your Ph.D, David!) and is separately provided as a simple table for quick download and copying.

Background on prior listings and earlier statistics may be found on these previous posts:

With interim updates periodically over that period.

Because of comments expirations on prior posts, this entry is now the new location for adding a suggested new tool. Simply provide your information in the comments section, and the tool will be included in the next update.

Posted:September 10, 2007

Astoria is Whistling Past the Graveyard to Irrelevance

I was pleased to see in my blog reader this morning a post from the Microsoft Astoria team on anticipated data formats for its pending formal release. I have been working on modeling Web data models and hoped to see some insight in the piece.

As the project team states,

The goal of Astoria is to make data available to loosely coupled systems for querying and manipulation. In order to do that we need to use protocols that define the interaction model between the producer and the consumer of that data, and of course we have to serialize the data in some form that all the involved parties understand. So protocols and formats are an important topic in our design process.

With that said, the team announced that the first formal Astoria release will support these three formats (with the single HTTP protocol):

  • ATOM / APP
  • JSON, the JavaScript Object Notation, and
  • Web3S, a Microsoft marketing wonder that as far as I know is only used by the MS Live group.

The later is a strange mapping of a tree data model to the record base of Astoria, in the process also abandoning a straight XML implementation in earlier versions.

Also notable for its absence is RDF (Resource Description Framework). The defensive response of the Astoria team to this absence speaks for itself:

The May [announcement on Astoria] included support for RDF. While we got positive comments about the fact we supported it, we didn't see any early user actually using it and we haven't seen a particular popular scenario where RDF was a must-have. So we are thinking that we may not include RDF as a format in the first release of Astoria, and focus on the other 3 formats (which are already a bunch from the development/testing perspective).

My personal take is that while I understand how RDF fits in the picture of the semantic web and related tools, the semantic web goes well beyond a particular format. The point is to have well-defined, derivable semantics from services. I believe that Astoria does this independently of the format being used. That, combined with the fact that we didn't see a strong demand for it, put RDF lower in our priority lists for formats.

There was a funny Glenn Ford movie from 1964 called “Advance to the Rear”. The problem is, this is not a movie, but the largest software company in the world taking two steps back for each one forward. Congratulations on alienating still further many thought leaders on the Web.

This is yet another stunning and lame attempt by Microsoft to replace open standards with proprietary ones. Get a clue, Redmond!