Posted: September 22, 2006

Comprehensive Listing of 175 Semantic Web Tools

This AI3 blog maintains Sweet Tools, the largest available listing of semantic Web and related tools, with about 800 entries. Most are open source. Click here to see the current listing!

For some time I have been assembling a listing of semantic Web-related software applications and tools. My first partial listing had about 50 sources. I recently noted the W3C’s semantic Web wiki listing of about 70 sources. I then came across the EU’s AKT (Advanced Knowledge Technologies) project, which also has about 75 tools compiled. Protégé also has a fairly long list of plugins, but unfortunately it is not well organized. Complicating matters further was the listing of natural language processing tools at the Natural Language Software Registry, another fantastic resource, particularly in the annotation and information extraction arena.

Semantic Web tool sets range from comprehensive engineering environments to specific converters, editors and the like. The entire workflow extends from getting the initial content, through annotating or tagging it according to existing or newly built ontologies and reconciling heterogeneities, to storing and managing the RDF or OWL with subsequent querying and inferencing.
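
To make the last stages of that workflow a little more concrete, here is a minimal Python sketch of storing triples, querying them by pattern, and applying a single RDFS-style inference rule. It is an illustration only and does not reflect how any tool listed below is implemented; the TripleStore class, its method names, and the ex: identifiers are all hypothetical.

```python
RDF_TYPE = "rdf:type"
RDFS_SUBCLASS = "rdfs:subClassOf"


class TripleStore:
    """Hypothetical in-memory store of (subject, predicate, object) triples."""

    def __init__(self):
        self.triples = set()

    def add(self, s, p, o):
        self.triples.add((s, p, o))

    def match(self, s=None, p=None, o=None):
        """Return a sorted list of triples matching the pattern.

        None acts as a wildcard for that position.
        """
        return sorted(
            t for t in self.triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)
        )

    def infer_types(self):
        """Apply one rule until no new triples appear:
        if X rdf:type C and C rdfs:subClassOf D, then X rdf:type D.
        """
        changed = True
        while changed:
            changed = False
            # match() returns a list, so adding to the set here is safe.
            for (x, _, c) in self.match(p=RDF_TYPE):
                for (_, _, d) in self.match(s=c, p=RDFS_SUBCLASS):
                    if (x, RDF_TYPE, d) not in self.triples:
                        self.triples.add((x, RDF_TYPE, d))
                        changed = True


# Store: a few made-up statements (the ex: names are illustrative only).
store = TripleStore()
store.add("ex:Pellet", RDF_TYPE, "ex:Reasoner")
store.add("ex:Reasoner", RDFS_SUBCLASS, "ex:SemanticWebTool")
store.add("ex:SemanticWebTool", RDFS_SUBCLASS, "ex:Software")

# Inference: derive the implied rdf:type statements.
store.infer_types()

# Query: everything that is (now known to be) ex:Software.
tools = [s for (s, _, _) in store.match(p=RDF_TYPE, o="ex:Software")]
print(tools)
```

Real RDF stores index their triples and support full query languages such as SPARQL; the linear scan here is only meant to show the shape of the store, infer, query cycle.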

There are certainly more tools extant, and I made some choices to exclude some marginal tools (SourceForge, for example, has more than 200 semantic Web-related projects, but the vast majority appear moribund, with no actual software to download).

Thus, listed below is today’s most comprehensive list of 175 semantic Web software tools and applications. I am now further characterizing these offline as open source versus proprietary and categorizing them according to SW-related workflow. I may later post those expansions.

I also welcome tool suggestions. I think the ESW tools listing is the best ongoing place for such a compilation, but so far I do not like seeing vendors use hype to characterize their tools in place of the more dispassionate descriptions offered by practitioners.

3store: A core C library that uses MySQL to store its raw RDF data and caches, forming an important part of the infrastructure required to support a range of knowledgeable services
4Suite 4RDF: The 4Suite 4RDF is an open-source platform for XML and RDF processing implemented in Python with C extensions
ActiveRDF: ActiveRDF is a library for accessing RDF data from Ruby programs. It can be used as a data layer in Ruby-on-Rails. You can address RDF resources, classes, properties, etc. programmatically, without queries
Adaptiva: A user-centred ontology building environment, based on using multiple strategies to construct an ontology, minimising user input by using adaptive information extraction
Aduna Metadata Server: The Aduna Metadata Server automatically extracts metadata from information sources, like a file server, an intranet or public web sites. The Aduna Metadata Server is a powerful and scalable store for metadata
AeroText: Entity extraction engine from Lockheed Martin
AJAX Client for SPARQL: AJAX Client for SPARQL is a simple AJAX client that can be used for running SELECT queries against a service and then integrating them with client-side JavaScript code
AKT Research Map: A competence map for members of the AKT project
AKT-Bus: An open, lightweight, Web standards-based communication infrastructure to support interoperability among knowledge services.
AllegroGraph: Franz Inc’s AllegroGraph is a system to load, store and query RDF data. It includes a SPARQL interface and RDFS reasoning. It has a Java and a Prolog interface
Alembic: The Alembic Workbench project from Mitre has as its goal the creation of a natural language engineering environment for the development of tagged corpora
Almo: An ontology-based workflow engine in Java
Altova SemanticWorks: Visual RDF and OWL editor that auto-generates RDF/XML or N-Triples based on visual ontology design
Amilcare: An adaptive information extraction tool designed to support document annotation for the Semantic Web.
ANNIE – Open Source Information Extraction: An open-source robust information extraction system
Aperture: Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (e.g. file systems, web sites, mail boxes) and the file formats (e.g. documents, images) occurring in these systems
Applications of FCA in AKT: Formal Concept Analysis (FCA) is used in a variety of application scenarios in AKT in order to perform concept-based domain analysis and automatically deduce a taxonomy lattice of that domain.
Aqua: AQUA is a system that answers questions written in English. It combines several technologies: natural language processing, logic, information retrieval and ontologies.
ARC: ARC is a lightweight, SPARQL-enabled RDF system for mainstream Web projects. It is written in PHP and has been optimized for shared Web environments
Armadillo: Exploits the redundancies apparent in the Internet, combining many information sources to perform document annotation with minimal human intervention.
ArtEquAkt: A system that automatically extracts information about artists from the web, populates an ontology, then uses the knowledge to generate personalised biographies.
Automatic Support for Enterprise Modelling and Workflow: Knowledge management using multi-modelling techniques and how modelling activities may be assisted with automation based on formal methods.
BBN OWL Validator: BBN OWL Validator
Bibster: A semantics-based bibliographic peer-to-peer system
Bossam: Bossam, a rule-based OWL reasoner (free, well-documented, closed-source)
Brahms: Brahms is a fast main-memory RDF/S store, capable of storing, accessing and querying large ontologies. It is implemented as a set of C++ classes
BuddySpace: Instant messaging with custom map visualizations, semantics of presence (beyond ‘offline’/‘online’/‘away’ status) and value-added web services (group alerts, bots, inferences via personal profiles)
Callisto: The Callisto annotation tool was developed to support linguistic annotation of textual sources for any Unicode-supported language with annotation support from jATLAS
CASD: A tool for producing system architecture diagrams from service and data descriptions.
Cerebra Server: A technology platform that is used by enterprises to build model-driven applications and highly adaptive information integration infrastructure; company recently bought by webMethods
COCKATOO: A knowledge acquisition tool which can be used to produce a set of cases for use with a Case-Based Reasoning system.
COHSE – Conceptual Open Hypermedia Services Environment: COHSE researches methods to improve significantly the quality, consistency and breadth of linking of WWW documents at retrieval and authoring time.
CS AKTiveSpace: CS AKTiveSpace is a smart browser interface for a Semantic Web application that provides ontologically motivated information about the UK computer science research community.
ClassAKT: A text classification web service for classifying documents according to the ACM Computing Classification System.
Compendium: Compendium is a semantic, visual hypertext tool for supporting collaborative domain modelling and real time meeting capture
ConRef: A service discovery system which uses ontology mapping techniques to support different user vocabularies
ConcepTool: A system to model, analyse, verify, validate, share, combine, and reuse domain knowledge bases and ontologies, reasoning about their implication.
Corese: Corese stands for Conceptual Resource Search Engine. It is an RDF engine based on Conceptual Graphs (CG) and written in Java. It enables the processing of RDF Schema and RDF statements within the CG formalism, provides a rule engine and a query engine accepting the SPARQL syntax
cwm: The Closed World Machine (CWM) is a data manipulator, rules processor and query system that mostly uses the Notation 3 textual RDF syntax. It also has incomplete OWL Full support and SPARQL access. It is written in Python
Cypher: Cypher generates RDF and SeRQL representations of natural language statements and phrases
D2R Server: D2R Server turns relational databases into SPARQL endpoints; it is based on Jena’s Joseki
D3E – Digital Document Discourse Environment: D3E enables the easy conversion of websites or structured documents into interactive discussion sites
Deep Query Manager: Search federator from deep Web sources
DOME: A programmable XML editor which is being used in a knowledge extraction role to transform Web pages into RDF, and available as Eclipse plug-ins. DOME stands for DERI Ontology Management Environment
DOSE: A distributed platform for semantic annotation
Drive: Drive is an RDF parser written in C# for the .NET platform
ekoss.org: A collaborative knowledge sharing environment where model developers can submit advertisements
Ellogon: Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, based on the earlier TIPSTER approach
Endeca: Facet-based content organizer and search platform
Eprep: An add-on for the Eprints document archive which uses text extraction to automatically create the bibliographic metadata needed for the submission of a new document.
eServices: The e-Services framework provides advanced scholarly services (in particular visualisations) using distributed metadata.
Euler: Euler is an inference engine supporting logic based proofs. It is a backward-chaining reasoner enhanced with Euler path detection. It has implementations in Java, C#, Python, JavaScript and Prolog. Via N3 it is interoperable with W3C Cwm
ExtrAKT: ExtrAKT is a tool for extracting ontologies from Prolog knowledge bases.
F-Life: F-Life is a tool for analysing and maintaining life-cycle patterns in ontology development.
FaCT++: FaCT++ is an OWL DL Reasoner implemented in C++
Fastr: Fastr is a parser for term and variant recognition. Fastr takes as input a corpus and a list of terms and outputs the indexed corpus in which terms and variants are recognized
Floodsim: A prototype system which demonstrates the benefits of applying semantically rich service descriptions (expressed using Semantic Web technologies) to Web Services.
FOAF-o-matic: Online FOAF generator
FOAM: Framework for ontology alignment and mapping
Foxtrot: Foxtrot is a recommender system which represents user profiles in ontological terms, allowing inference, bootstrapping and profile visualization.
FreeLing: FreeLing is an open source language analysis tool suite. The FreeLing package consists of a library providing language analysis services (such as morphological analysis, date recognition, PoS tagging, etc.). The current version (1.2) of the package provides tokenizing, sentence splitting, morphological analysis, NE detection, date/number/currency recognition, PoS tagging, and chart-based shallow parsing
GATE – General Architecture for Text Engineering: GATE is a stable, robust, and scalable open-source infrastructure which allows users to build and customise language processing components, while it handles mundane tasks like data storage, format analysis and data visualisation.
Gnowsis: A semantic desktop environment
GrOWL: Open source graphical ontology browser and editor
HAWK: OWL repository framework and toolkit
Heart of Gold: Heart of Gold is a middleware for the integration of deep and shallow natural language processing components. It provides a uniform and flexible infrastructure for building applications that use Robust Minimal Recursion Semantics (RMRS) and/or general XML standoff annotation produced by NLP components
HELENOS: A knowledge discovery workbench for the semantic Web
I-X Process Panels: The I-X tool suite supports principled collaborations of human and computer agents in the creation or modification of some product.
Identify Knowledge Base: Identify-Knowledge-Base is a tool for topic identification in knowledge bases
IF-Map: IF-Map is an Information Flow based ontology mapping method. It is based on the theoretical grounds of logic of distributed systems and provides an automated streamlined process for generating mappings between ontologies of the same domain.
ILP for Information Extraction: To overcome the knowledge acquisition bottleneck, we apply Inductive Logic Programming techniques to learn Information Extraction rules.
Internet Reasoning Service: The Internet Reasoning Service provides a number of tools which support the publication, location, composition and execution of heterogeneous web services, specified using semantic web technology
IODT: IBM’s toolkit for ontology-driven development
IsaViz: IsaViz is a visual authoring tool for browsing and authoring RDF models represented as graphs. Developed by Emmanuel Pietriga of W3C and Xerox Research Centre Europe.
Jambalaya: Protégé plug-in for visualizing ontologies
Jastor: Open source Java code generator that emits Java Beans from ontologies
Javascript RDF/Turtle parser: JavaScript RDF/Turtle parser that can be used with Jibbering
Jena: Jena is a Java framework to construct Semantic Web Applications. It provides a programmatic environment for RDF, RDFS, OWL and SPARQL and includes a rule-based inference engine. It also has the ability to be used as an RDF database via its Joseki layer. See the Jena discussion list for more information
Jibbering: Jibbering, a simple JavaScript RDF parser and query thingy
Joseki: Jena’s Joseki layer offers an RDF Triple Store facility with SPARQL interface (see also the entry on Jena)
JRDF: JRDF (Java RDF Binding) is an attempt to create a standard set of APIs and base implementations for RDF using Java. Includes a SPARQL GUI.
KAON: Open source ontology management infrastructure
KAON2: KAON2 is an infrastructure for managing OWL-DL, SWRL, and F-Logic ontologies. It is capable of manipulating OWL-DL ontologies; queries can be formulated using SPARQL
Kazuki: Generates a Java API for working with OWL instance data directly from a set of OWL ontologies
KIM Platform: KIM, from Ontotext, is a software platform for the semantic annotation of text, automatic ontology population, indexing and retrieval, and information extraction
Knowledge Broker: The knowledge broker addresses the problem of knowledge service location in distributed environments.
Kowari: Open source database for RDF and OWL
KRAFT – I-X TIE: Supports collaboration among members of a virtual organisation by integrating workflow and communication technology with constraint solving.
LingPipe: LingPipe is a suite of Java tools designed to perform linguistic analysis on natural language data. LingPipe’s flexibility and included source make it appropriate for research use. Version 1.0 tools include a statistical named-entity detector, a heuristic sentence boundary detector, and a heuristic within-document coreference resolution engine
LinguaStream: LinguaStream is an integrated experimentation environment (IEE) targeted to researchers in Natural Language Processing. LinguaStream allows processing streams to be assembled visually, picking individual components in a “palette” (the standard set contains about fifty components, and is easily extensible using a Java API, a macro-component system, and templates). Some components are specifically targeted to NLP, while others solve various issues related to document engineering (especially to XML processing). Other components are to be used in order to perform computations on the annotations produced by the analysers, to visualise annotated documents, to generate charts, etc.
LinKFactory: Language & Computing’s LinKFactory is an ontology management tool that provides an effective and user-friendly way to create, maintain and extend extensive multilingual terminology systems and ontologies (English, Spanish, French, etc.). It is designed to build, manage and maintain large, complex, language independent ontologies.
Lucene: Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform. It is open source
LuMriX: A commercial search engine using semantic Web technologies
Magpie: Magpie supports the interpretation of web documents through on-the-fly ontologically based enrichment. Semantic services can be invoked either by the user or be automatically triggered by patterns of browsing activity
Melita: Melita is a semi-automatic annotation tool using an Adaptive Information Extraction engine (Amilcare) to support the user in document annotation.
MetaMatrix: Semantic vocabulary mediation and other tools
Metatomix: Commercial semantic toolkits and editors
MindRaider: Open source semantic Web outline editor
MnM: MnM is an annotation tool which provides both automated and semi-automated support for annotating web pages with semantic contents. MnM integrates a web browser with an ontology editor and provides open APIs to link to ontology servers and for integrating information extraction tools
Model Futures OWL Editor: Simple OWL tools, featuring UML (XMI), ErWin, thesaurus and imports
Mulgara: The Mulgara Semantic Store is an Open Source, massively scalable, transaction-safe, purpose-built database for the storage and retrieval of RDF, written in Java. It is an active fork of Kowari
Muskrat-II: Given a set of knowledge bases and problem solvers, the Muskrat system will try to identify which knowledge bases could be combined with which problem solvers to solve a given problem.
MyPlanet: MyPlanet allows users to create a personalised version of a web based newsletter using an ontologically based profile.
Net OWL: Entity extraction engine from SRA International
NMARKUP: NMARKUP helps the user build ontologies by detecting nouns in texts and by providing support for the creation of an ontology based on the entities extracted.
Nokia Semantic Web Server: An RDF-based knowledge portal for publishing both authoritative and third party descriptions of URI-denoted resources
ONTOCOPI: A tool which uncovers Communities of Practice by analysing the connectivity of instances in the 3store knowledge base.
OntoEdit/OntoStudio: Engineering environment for ontologies
OntoMat Annotizer: Interactive Web page OWL and semantic annotator tool
OntoPortal: Enables the authoring and navigation of large semantically-powered portals
OpenLink Data Spaces (ODS): ODS is a distributed collaborative application platform for creating Semantic Web applications such as: blogs, wikis, feed aggregators, etc., with built-in SPARQL support and incorporation of shared ontologies such as SIOC, FOAF, and Atom OWL. ODS is an application of OpenLink Virtuoso and is available in Open Source and Commercial Editions
Oracle Spatial 10g: Oracle Spatial 10g includes an open, scalable, secure and reliable RDF management platform
Oyster: Peer-to-peer system for storing and sharing ontology metadata
OWL Consistency checker: OWL Consistency checker (based on Pellet)
OWL-DL Validator: WonderWeb OWL-DL Validator
OWLJessKB: OWLJessKB is a description logic reasoner for OWL. The semantics of the language is implemented using Jess, the Java Expert System Shell. It currently implements most of the common features of OWL Lite, plus some and minus some
OWLIM: OWLIM is a high-performance semantic repository, packaged as a Storage and Inference Layer (SAIL) for the Sesame RDF database
OWLViz: OWLViz is a visual editor for OWL and is available as a Protégé plug-in
Pellet: Pellet is an open-source Java-based OWL DL reasoner. It can be used in conjunction with both Jena and OWL API libraries; it can also be downloaded and be included in other applications
Piggy Bank: A Firefox-based semantic Web browser
Pike: A dynamic programming (scripting) language similar to Java and C for the semantic Web
pOWL: Semantic Web development platform
Protégé: Open source visual ontology editor written in Java with many plug-in tools
RACER: A collection of projects and tools to be used with the semantic reasoning engine RacerPro
RacerPro: RacerPro is an OWL reasoner and inference server for the Semantic Web’s ValidatorRDF/XML and N3 validator
RDF Gateway: Intellidimension’s RDF Gateway is an RDF Triple database with RDFS reasoning and SPARQL interface
RDF InferEd: Intellidimension’s RDF InferEd is an authoring environment with the ability to navigate and edit RDF documents
RDFLib: RDFLib, an RDF library for Python, including a SPARQL API. The library also contains both in-memory and persistent Graph backends
RDFReactor: Access RDF from Java using inferencing
RDF Server: The RDF server of the PHP RAP environment
RDFStore: RDFStore is an RDF store with Perl and C APIs and SPARQL facilities
RDFSuite: The ICS-FORTH RDFSuite provides open source, high-level, scalable tools for the Semantic Web. This suite includes a Validating RDF Parser (VRP), an RDF Schema Specific DataBase (RSSDB) and a supporting RDF Query Language (RQL)
Redland: The Redland RDF Application Framework is a set of free software libraries that provide support for RDF. It provides parsers for RDF/XML, Turtle, N-Triples, Atom, RSS; has a SPARQL and GRDDL implementation, and has language interfaces to C#, Python, Obj-C, Perl, PHP, Ruby, Java and Tcl
RelationalOWL: Automatically extracts the semantics of virtually any relational database and transforms this information into RDF/OWL
ReTAX+: ReTAX is an aid to help a taxonomist create a consistent taxonomy and in particular provides suggestions as to where a new entity could be placed in the taxonomy whilst retaining the integrity of the revised taxonomy (c.f., problems in ontology modelling).
Refiner++: REFINER++ is a system which allows domain experts to create and maintain their own Knowledge Bases, and to receive suggestions as to how to remove inconsistencies, if they exist.
Seamark Navigator: Siderean’s Seamark Navigator provides a platform to combine Web search pages with product catalog databases, document servers, and other digital information from both inside and outside the enterprise
Semantic Annotation with MnM: MnM is a semantic annotation tool which provides manual, automated and semi-automated support for annotating web pages with ‘semantics’, i.e., machine interpretable descriptions.
Semantical: Open source semantic Web search engine
SemanticWorks: A visual RDF/OWL Editor from Altova
Semantic Mediawiki: Semantic extension to the MediaWiki wiki
Semantic Net Generator: Utility for generating topic maps automatically
SemWeb: SemWeb for .NET supports persistent storage in MySQL, PostgreSQL, and SQLite; has been tested with 10-50 million triples; supports SPARQL
Sesame: Sesame is an open source RDF database with support for RDF Schema inferencing and querying. It offers a wide range of tools to developers to leverage the power of RDF and RDF Schema
SMART: System for Managing Applications based on RDF Technology
SMORE: OWL markup for HTML pages
SPARQL: Query language for RDF
SPARQLer: SPARQL query demo and service
SPARQLette: A SPARQL demo query service
SPARQL JavaScript Library: SPARQL JavaScript Library interfaces to the SPARQL Protocol and interprets the return values as part of an AJAX framework
SWCLOS: A semantic Web processor using Lisp
SWI-Prolog: SWI-Prolog is a comprehensive Prolog environment, which also includes an RDF Triple store. There is also a separate Prolog library to handle OWL
Swish: Swish is a framework for performing deductions in RDF. It has similar features to CWM. It is written for Haskell developers
Swoogle: A semantic Web search engine with 1.5 M resources
SWOOP: A lightweight ontology editor
TopBraid Composer: TopQuadrant’s TopBraid Composer is a complete standards-based platform for developing, testing and maintaining Semantic Web applications
Tucana Suite: Northrop Grumman’s Tucana Suite is an industrial quality version of the Kowari metastore
Turtle: Terse RDF “Triple” language
Visualisations for the CS AKTive Portal: Maps are used to geographically illustrate knowledge from the Triplestore, such as highlighting the locations in the UK that are active in a particular research area.
VisualText: VisualText® is an integrated development environment for building information extraction systems, natural language processing systems, and text analyzers
W3C’s RDF Validator: W3C’s RDF Validator
WebOnto: WebOnto supports the browsing, creation and editing of ontologies through coarse grained and fine grained visualizations and direct manipulation.
Wilbur: Wilbur is a Lisp-based toolkit for Semantic Web programming. It is Nokia Research Center’s toolkit, written in Common Lisp, for programming Semantic Web applications that use RDF
WSMO Studio: A semantic Web service editor compliant with WSMO, available as a set of Eclipse plug-ins
WSMT Toolkit: The Web Service Modeling Toolkit (WSMT) is a collection of tools for use with the Web Service Modeling Ontology (WSMO), the Web Service Modeling Language (WSML) and the Web Service Execution Environment (WSMX)
WSMX: Execution environment for dynamic use of semantic Web services
XML Army Knife: XML Army Knife
XMP: A labeling technology from Adobe that enables data about a file to be embedded as metadata into the file itself.
YARS: YARS (Yet Another RDF Store) is a data store for RDF in Java and allows for querying RDF based on a declarative query language, which offers a somewhat higher abstraction layer than the APIs of RDF toolkits such as Jena or Redland
Zotero: Firefox add-in (in development) that allows the auto-completion of online citations Markup

