SearchMonkey’s Recommended Vocabularies a Useful Resource
I am pleased to report that UMBEL is now included as one of the recommended vocabularies for the Yahoo! SearchMonkey service. Using SearchMonkey, developers and site owners can use structured data to enhance the value of standard Yahoo! search results and customize their presentation, including through “infobars“. SearchMonkey is integral to a concerted effort by Yahoo! to embrace structured data, RDF and the semantic Web.
SearchMonkey was first announced in February 2008 with a beta release in April and then public release in May with 28 supported vocabularies. Then, last October, an additional set of common, external vocabularies were recommended for the system including DBpedia, Freebase, GoodRelations and SIOC. At the same time, some further internal Yahoo! vocabularies and standard Web languages (e.g., OWL, XHTML) were also added.
This is the first vocabulary update since then. Besides UMBEL, the AB Meta and Semantic Tags vocabularies have also been added to this latest revision. (There have also been a few deprecations over time.)
A recommended vocabulary means that its namespace prefix is recognized by SearchMonkey. The namespaces for the recommended vocabularies are reserved. Though site owners may customize and add new SearchMonkey structure, they must be explicitly defined in specific DataRSS feeds.
Structured data may be included in Yahoo! search results from these sources:
- Yahoo! Index — the core Yahoo! search data with limited structure such as the page’s title, summary, file size, MIME type, etc. This structure is only provided by Yahoo!
- Semantic Web Data — including microformats and RDF data embedded in the host page
- Data Feed — A feed of Yahoo! native DataRSS provided by a third party site
- Custom Data Service — Any data extracted from an (X)HTML page or web service and represented within SearchMonkey as DataRSS.
As a recommended vocabulary, UMBEL namespace references can now be embedded and recognized (and then presented) in Yahoo! search results.
The Current Vocabulary Set
Here are the 34 current vocabularies (plus five deprecated) recognized by the system:
|assert||SearchMonkey Assertions (deprecated)||http://search.yahoo.com/searchmonkey/assert/|
|context||SearchMonkey Context (deprecated)||http://search.yahoo.com/searchmonkey/context/|
|country||SearchMonkey Country Datatypes||http://search.yahoo.com/searchmonkey-datatype/country/|
|currency||SearchMonkey Currency Datatypes||http://search.yahoo.com/searchmonkey-datatype/currency/|
|owl||OWL ontology language||http://www.w3.org/2002/07/owl#|
|page||SearchMonkey Page (deprecated)||http://search.yahoo.com/searchmonkey/page/|
|rel||SearchMonkey Relations (deprecated)||http://search.yahoo.com/searchmonkey-relation/|
|tagspace||SearchMonkey Tagspace (deprecated)||http://search.yahoo.com/searchmonkey/tagspace/|
|use||SearchMonkey Use Datatypes||http://search.yahoo.com/searchmonkey-datatype/use/|
|xsd||XML Schema Datatypes||http://www.w3.org/2001/XMLSchema#|
In addition, there are a number of standard datatypes recognized by SearchMonkey, mostly a superset of XSD (XML Schema datatypes).
What is emerging from this Yahoo! initiative is a very useful set of structured data definitions and vocabularies. These same resources can be great starting points for non-SearchMonkey applications as well.
For More Information
There is quite a bit of online material now available for SearchMonkey, with new expansions and revisions also accompanying this most recent release. As some starting points, I recommend: