Posted:July 31, 2017

KBpediaRe-capping the Terminology for the Knowledge Graph

This last part on the updated KBpedia grammar is the final advance article on the knowledge graph prior to the release of version 1.50. In this article, I update and summarize the grammar at the heart of the ontology. I have written previously [1] on the role that Charles Sanders Peirce gave to what he called a speculative grammar, a Firstness in his triadic view of semiosis, or the logic of signs.  The basic idea of a speculative grammar is simple. What are the vocabulary and relationships that may be involved in the understanding of the question or concept at hand? What is the “grammar” for the question at hand that may help guide how to increase our understanding of it? What are the concepts and terms and relationships that populate our domain of inquiry? Of course, in our context, the domain of inquiry is the representation of knowledge in a knowledge graph useful to knowledge-based artificial intelligence, or KBAI.

Here is how Peirce in his own words placed speculative grammar in relation to his theory of logic [2]:

“All thought being performed by means of signs, logic may be regarded as the science of the general laws of signs. It has three branches: (1) Speculative Grammar, or the general theory of the nature and meanings of signs, whether they be icons, indices, or symbols; (2) Critic, which classifies arguments and determines the validity and degree of force of each kind; (3) Methodeutic, which studies the methods that ought to be pursued in the investigation, in the exposition, and in the application of truth.” (CP 2:260)

As I stated in an earlier article on a speculative grammar [1]:

“Effective use of knowledge bases (KBs) for artificial intelligence (AI) would benefit from a definition and organization of KB concepts and relationships specific to those AI purposes. Like any language, the construction of logical statements within KBAI (knowledge-based artificial intelligence) requires basic primitives for how to express these arguments. Just as in human language where we split our words into roughly nouns and verbs and modifiers and conjunctions of the same, we need a similar primitive vocabulary and basic rules of statement construction to actually begin this process. In all language variants, these basic building blocks are known as the grammar of the language. A well-considered grammar is the first step to being able to construct meaningful and coherent statements about our knowledge bases. The context for how to construct this meaningful grammar needs to be viewed through the lens of the KB’s purpose, which, in our specific case, is for artificial intelligence and machine learning.”

As Peirce says in his Logic of Relatives paper [3]:

“The fundamental principles of formal logic are not properly axioms, but definitions and divisions; and the only facts which it contains relate to the identity of the conceptions resulting from those processes with certain familiar ones.” (CP 3.149)

He goes on to say that the “. . . the woof and warp of a thought and all research is symbols, and the life of thought and science is the life inherent in symbols; so that it is wrong to say that a good language is important to good thought, merely; for it is of the essence of it.” (CP 2.220) An understanding of this language for KBpedia is essential to appreciate what the knowledge graph does and how to use it.

The Grammar Begins with the Universal Categories

The upper structure of KBpedia is captured by the KBpedia Knowledge Ontology. KKO is informed by the triadic logic and basic (“universal”) categories of C.S. Peirce. These same triadic universal categories are also the basis for Peirce’s semiosis, which both captures his views on the nature of logic and the basis of signs in representation and communication. These three universal constituents of Peirce’s trichotomy were in his view the most primitive or reduced manner by which to understand and categorize things, concepts and ideas. A key point for our purposes is that ‘threes’ are the fewest by which to model context and perspective, essential to capture the nature of knowledge [4].

Peirce’s trichotomy of the universal categories can be roughly summarized as:

  • Firstness [1ns] — these are possibilities or potentials, the basic forces or qualities that combine together or interact in various ways to enable the real things we perceive in the world, such as matter, life and ideas. These are the unrealized building blocks, or primitives, the essences or attributes or possible juxtapositions
  • Secondness [2ns] — these are the particular realized things or concepts in the world, what we can perceive, point to and describe. A particular is also known as an entity, event, instance or individual
  • Thirdness [3ns] — these are the laws, habits, regularities and continuities that may be generalized from particulars. All generals — what are also known as classes, kinds or types — belong to this category. The process of finding and deriving these generalities also leads to new insights or emergent properties, which continue to fuel knowledge discovery. Insights arising from Thirdness enable us to further explore and understand things, and is a driving force for further categorization.

The ideas of Firstness, Secondness and Thirdness in Peirce’s universal categories are not intended to be either sequential or additive. Rather, each interacts with the others in a triadic whole. Each alone is needed, and each is irreducible.

KKO begins with these three universal categories, all subsumed under the standard owl:Thing root node (a convention when using OWL). The first Firstness of KKO is called the Monad branch. This branch represents the qualities or potentials or possibilities that help define the actual individual things in the ontology, or Particulars, the real entities or events in the ontology. Monads are attributes and similar, such as colors or shapes; they do not exist independently except as concepts or ideas of themselves. They are only embodied when in relation to a specific instance. Particulars, or instances, are the second branch, which is the first Secondness. Particulars are the real individual things of the ontology. The largest branch in terms of nodes for KKO is the third one, the Generals (also called SuperTypes), wherein the general kinds, classes or types of the ontology are located. Generals are where generalizations (or categorizations) of individual things are made into types or classes. The idea of a man (as a representation of male humans) or the idea of a camera (as a representation of devices to capture an image) are generals. In Peircean terms, generals are as real as particulars, even though they are not instantiated into an individual thing. Generals are the largest category in terms of conceptual or classificatory richness in KKO.

Most layers of the the upper KKO are themselves split into threes, with the same ideas of Firstness, Secondness and Thirdness applied to the categories at each level. Categorization is itself a process grounded in these senses of the universal categories. Whenever an anomalous fact is discovered or the scale of a given category gets too large, it is time to create a new category with its own terminology (grammar), instances and generalities. It is through the application of this Peircean mindset that the breadth and scope of the knowledge graph may be grown.

Each level of the KKO may be labeled as to whether its given category is one of Firstness (1, or 1ns), Secondness (2, or 2ns), or Thirdness (3, or 3ns). We can here see the full upper structure of KKO, with its some 170 concepts, each one labeled as to which of the universal categories applies:

Upper KKO Structure

The Upper KBpedia Knowledge Ontology (KKO) Structure

Unfortunately, the graphing software used does not order the nodes by Firstness (1), Secondness (2) or Thirdness (3). (A demo version, viewable in Protege, is labeled such that this ordering is maintained; it will be made available upon KBpedia v 1.50’s release.)

The KKO structure enables us to capture the ideas of constituents and building blocks under the Monads; to place individual instances of entities and events under Particulars; and to generalize or find natural consistencies amongst kinds (or classes) under the Generals. There tends to be some conceptual parallelisms in this structure. The idea about a quality or relation may be placed as an abstract consideration under Monads; an embodied or instantiated quality or relation may be placed as an individual instance under Particulars; or a generalization (or “law”) about such qualities or relations may be placed under Generals. We can reason across all branches and levels of this structure, which means we can reason over things like colors and attributes and relations as easily as types. There is no object or concept, real or imaginary, historical or in the future, which can not be placed into this knowledge representation (KR) structure.

More than 85% of the classificatory structure of KBpedia resides in the generals, or types. These, in turn, are organized according to a set of typologies, or natural classification structures. (The tie-in points to these typologies are shown in the structure above, but the actual typology structures are not.) Unlike the KKO upper structure, each typology is not necessarily organized according to Peirce’s triadic logic. That is because once we come to organize and classify the real things in the world, we are dealing with objects of a more-or-less uniform character (such as animals or products or atomic elements). There are about 80 such typologies in the KBpedia structure, about 30 of which are deemed “core”, meaning they capture the bulk of the classificatory system. Another document presents these 30 “core” typologies in more detail.

The KKO structure in the figure above is quite different than the first versions of the KKO. As this article series has elaborated, our analysis and then additions of relations to KBpedia has resulted in quite a few changes. These relations, categorized as attributes (1ns), relations (2ns) and representations (3ns) are shown under the 2ns of the Predications branch under Generals. While properties for each of these categories are separately provided under object properties, data properties and annotation properties, they are provided here as class types so that we can reason over these different kinds of relations. 

Key Terminology within the Grammar

The complete terminology of the KBpedia ontology is provided through these concepts. However, some of these concepts are more key than others to important structural aspects of the ontology. Thus, besides the universal categories of Firstness, Secondness and Thirdness, here are some of those key concepts, split into three broad groups:

The first group relates to the predicates of the grammar, and includes these three main branches under the Predications node:

  • Attributes are the ways to characterize the entities or things within the knowledge base; while the attribute values and options may be quite complex, the relationship is monadic to the subject at hand. These are intensional properties of the subject
  • Relations are the way we describe connections between two or more things; relations are external-facing, between the subject and another entity or concept; relations set the extensional structure of the knowledge graph, and
  • Representations are icons, denotations, indexes and the metadata of the KB; these can not be inferenced over. But, they can be searched and language features can be processed in other ways.

This first group was the subject of Part III and Part IV in this series.

The second broad group are the individual instances within the ontology. Besides the monadic dyads (which are the reifications of the qualities in the ontology), the two main kinds of instances are:

  • Events are possibly nameable sequences of time, are described in some manner, can be referenced, and may be related to other time sequences or types, and
  • Entities are the basic, real things in our domain of interest; they are nameable things or ideas that have identity, are defined in some manner, can be referenced, and should be related to types; entities (by count) are the bulk of the overall knowledge base.

The understanding of events from a Peircean viewpoint also leads to a better understanding of actions, activities and processes. Events are a bona fide particular, being an expression of time, while entities, which can have tangible form, are a particular expression of space.

The last grouping relates to the terminology and organization of the Generals branch. Here the key concepts are:

  • Types are the hierarchical classification of natural kinds within all of the terms above
  • The Typology structure is not only a natural organization of natural classes, but it enables flexible interaction points with inferencing across its ‘accordion-like’ design (see further [5]).

Understanding the Release Through Its Grammar

When I first adopted this approach of using a speculative grammar, I wrote in part of its advantages [1]:

“In Peirce’s universal categories, Firstness is meant to capture the potentialities of the domain at hand, the speculative grammar; Secondness is meant to capture the particular facts or real things of the domain at hand, the critic; and Thirdness is meant to capture methods for discovering the generalities, laws or emergents within the domain, the methodeutic. This mindset can really be applied to any topic, from signs themselves to logic and to science. The “surprising fact” or new insight arising from Thirdness points to potentially new topics that may themselves become new targets for this logic of semiosis.

“Without the right concepts, terminology, or bounding — that is, the speculative grammar — it is clearly impossible to properly understand or compose the objects or Secondness that populate the domain at hand. Without the right language and concepts to capture the connections and implications of the domain at hand — again, part of its speculative grammar — it is not possible to discover the generalities or the “surprising fact” or Thirdness of the domain.

“The speculative grammar is thus needed to provide the right constructs for describing, analyzing, and reasoning over the given domain. Our logic and ability to understand the focus of our inquiry requires that we describe and characterize the domain of discourse in ways that are properly scoped and related. How well we bound, characterize and signify our problem domains — that is, the speculative grammar — directly relates to how well we can reason and inquire over that space. It very much matters how we describe, relate and define what we analyze and manipulate.

“Let’s take a couple of examples to illustrate this. First, imagine van Leeuwenhoek first discovering “animacules” under his early, advanced microscopes. New terms and concepts like flagella, cells, and vacuoles needed to be coined and systematized in order for further advances in microorganisms to be described. Or, second, imagine “action at a distance” phenomena such as magnetic repulsion or static electricity causing hair to stand on end. For centuries these phenomena were assumed to be caused by atomistic particles too small to see or discover. Only when Hertz was able to prove Maxwell‘s equations of electromagnetism hundreds of years later in the mid-1800s were the concepts and vocabulary of waves and fields sufficiently developed to begin to unravel electromagnetic theory in earnest. Progress required the right concepts and terminology.

“For Peirce, the triadic nature of the sign — and its relation between the sign, its object and its interpretant — was the speculative grammar breakthrough that then allowed him to better describe the process of signmaking and its role in the logic of inquiry and truth-testing (semiosis). Because he recognized it in his own work, Peirce understood a conceptual “grammar” appropriate to the inquiry at hand is essential to further discovery and validation.”

And then [6]:

“Peirce argues persuasively that how we perceive and communicate things requires this irreducible triadic structure. The symbolic nature of Thirdness means that communication and understanding is a continuous process of refinement, getting us closer to the truth, but never fully achieving it. Thirdness is a social and imprecise mode of communication and discovery, conducted by us and other agents separate from the things and phenomena being observed. Though it is a fallibilistic process, it is one that also lends itself to rigor and methods. The scientific method is a premier example of Thirdness in action.”

We are pleased that our next posting will be the announced release of KBpedia v 150.

This series on KBpedia relations covers topics from background, to grammar, to design, and then to implications from explicitly representing relations in accordance to the principals put forth through the universal categories by Charles Sanders Peirce. Relations are an essential complement to entities and concepts in order to extract the maximum information from knowledge bases. This series accompanies the next release of KBpedia (v 150), which includes the relations enhancements discussed.

[1] See further M.K. Bergman, 2016. “A Speculative Grammar for Knowledge Bases“, AI3:::Adaptive Information blog, June 20, 2016.
[2] See the electronic edition of The Collected Papers of Charles Sanders Peirce, reproducing Vols. I-VI, Charles Hartshorne and Paul Weiss, eds., 1931-1935, Harvard University Press, Cambridge, Mass., and Arthur W. Burks, ed., 1958, Vols. VII-VIII, Harvard University Press, Cambridge, Mass. The citation scheme is volume number using Arabic numerals followed by section number from the collected papers, shown as, for example, CP 1.208.
[3] This quote is drawn from Charles Sanders Peirce, 1870. “Description of a Notation for the Logic of Relatives, Resulting from an Amplification of the Conceptions of Boole’s Calculus of Logic”, Memoirs of the American Academy of Arts and Sciences 9 (1870), 317–378 (the “Logic of Relatives”), using the same numbering as from [2].
[4] M.K. Bergman, 2016. “The Irreducible Truth of Threes,” AI3:::Adaptive Information blog, September 27, 2016.
[5] M.K. Bergman, 2016. “Rationales for Typology Designs in Knowledge Bases,” AI3:::Adaptive Information blog, June 6, 2016.
[6] M.K. Bergman, 2016. “Threes All the Way Down to Typologies,” AI3:::Adaptive Information blog, October 3, 2016.
Posted:June 27, 2017

KBpediaCompleting the Schema for the Relations Addition

In the previous part of this series I discussed the relations model that we are adding to the KBpedia knowledge structure for its upcoming version 1.50. This article continues that discussion by fleshing out the concept hierarchy for this relations addition. These changes are taking place within the upper structure of KBpedia, what we call the KBpedia Knowledge Ontology, or KKO. The addition of this relations model affects both the class hierarchy of KKO and its properties structure. Both aspects are discussed below.

The previous parts of this series introducing the new KBpedia v 150 provide the rationale for adding a relations or predications structure to KBpedia, as well as the rationale for grounding this structure in the universal categories and logic of relations developed by Charles Sanders Peirce. Let me expand on those arguments a bit further here.

Recall from the earlier parts in this discussion that we want to adopt a schema for relations because we are interested in assertions and propositions in our knowledge structures. As Peirce notes [1],”The unity to which the understanding reduces impressions is the unity of a proposition. This unity consists in the connection of the predicate with the subject; and, therefore, that which is implied in the copula, or the conception of being, is that which completes the work of conceptions of reducing the manifold to unity.” (CP 1.548). The previous part discussed the rationale for three main classes of relationship types and how they related to Peirce’s views on logic and the theory of signs (semiosis). It is now time to expand on those distinctions.

The Second Level of the Relations Hierarchy

At the top level, relations are defined in KBpedia as belonging to one of three main types: Attributes, which are how we describe and characterize individual things (using the shorthand of A:A); External Relations, which are how an individual thing may relate, interact or situate with regard to external things (using the shorthand of A:B); or Representations [3], which are how we may name or define, indicate or reference, or provide supporting information about an individual thing (using the shorthand of re:A). These three main categories capture every conceivable form of relation. Note that an individual thing may also include classes or types, as well as individuals. Through this means, we can talk about and refer to categories, concepts, classes or types when we need to consider them as things unto themselves.

These three main categories correspond to Peirce’s universal categories of Firstness, Secondness and Thirdness [2]. (Where appropriate, I also sometimes footnote some Peirce quotes relevant to the category at hand.) Using the same trichotomous approach to categorization [2], we can now expand these three main relational categories into a second level of nine categories:

Mid-level KBpedia Relations

Mid-level KBpedia Relations

For Attributes (A:A), the split is into these three categories:

  • Intrinsic — innate characteristics or essences of single entities or events (particulars). Example concepts include oneness, qualities, feelings, inherent, negation, is, has, intensional, naturalness, internal, innateness
  • Adjunctual — events that may occur to a single entities or events (particulars) that help characterize it. Example concepts include birth, death, marriage, events, accidents, surprises, happenings, extrinsic, adjunctual
  • Contextual — circumstances or placements of single entities or events (particulars) that help characterize it. Example concepts include space, time, continuity, contiguous, smooth, otherness, ratings, level, situational (w/ respect to A), sensible, contiguous, all placements thereto, derivative, classificatory, rankings.

For External Relations (A:B), the split is into these three categories:

  • Direct — a simple, direct relationship (no intermediaries) between two different objects (entities, events, or their types, considered as instances). Example concepts include is a, simple without parts, part of, members in types or classes, genealogical roles (parent, child, brother), identity, extensional
  • Copulative — relationships of combination, membership, quantity, action, or circumstance. Example concepts include accidental, real, place, time, situation, quantity, facets, aspects, conjunctive, one-to-many, many-to-one, sum of, contextual, verbs
  • Mediative — true, triadic external relations, such as “A gives B to C”; relationships of relevance, meaning or explanation – namely, thirdness – about subjects and types. Example concepts include concepts, generalities, similarity, genres, aspects, comparison, performance, thought, triadic, agreement/difference, placement in space/time (contiguity), conditional, reasoning, classification.

And, for Representations (re:A), the split is into these three categories:

  • Denotatives — icons or symbols that name or describe the subject. Example concepts include names, labels, images, descriptions, definitions, denotations, icons, designations, proper nouns
  • Indexes — indirect references or pointers that help situate or draw attention to the subject. Example concepts include URIs, identifiers, keys, indices, references, semes, propositions (w/o objects), codes, selections, directional, citations, pronouns [4]
  • Associatives — a situational and contextual assertion of proximity, affiliation or adjacency of the subject with regard to any contiguity. Example concepts include see also, lists, links (incoming + outgoing), associations, likenesses, resemblances [5].

The Third Level of the Relations Hierarchy

For the next level, we can continue with our process of categorization based on the universal categories applied to the nine categories listed above. Here is the schema that results from this approach:

Full Upper Hierarchy of KBpedia Relations

Third Level of the KBpedia Relations Hierarchy

For the mid-level category of Intrinsics, the first of three categories under Attributes, here are the three subsidiary categorizations (representing the Firstness, Secondness and Thirdness, respectively):

  • Qualities — an internal characteristic or aspect of an object; collectively these define intensionally what kind of thing to which the object belongs, though that relationship is not intrinsic
  • Elementals — a contributing part of or integral input or aspect that adds to the understanding about the subject (A)
  • Configurations (forms) — forms or arrangements that are of the nature or perceivable of the subject (A).

For the mid-level category of Adjunctual, here are the three subsidiary categorizations:

  • Quantities — a characteristic of a subject (A) that is expressed as a number quantity
  • Eventuals — chance, accidental or planned occurrences that directly involve subject (A)
  • Extrinsics — external events or circumstances that directly involve subject (A) or help define the nature or reality of subject (A).

For the mid-level category of Contextual, here are the three subsidiary categorizations:

  • Situants — attributes or characteristics that help situate, or place in a locational context, the subject (A)
  • Ratings — an assigned value or characterization that orders subject (A) in relation to other subjects for a given attribute
  • Classifications — a characterization of subject (A) that involves evaluating subject (A) and providing a multi-factor typing, coding or value in relation to a given attribute or set of attributes.

For the mid-level category of Direct, the first of three categories under External Relations, here are the three subsidiary categorizations:

  • Equivalences — a simple, direct relatiionship between a subject (A) and an object (B) that asserts equalness or sameness
  • Parts — a simple, direct relationship where the object (B) is a part of or component of subject (A), including the idea of ‘whole’
  • Descendants — a simple, direct relationship where object (B) is a direct child or parent or subsumption (hyponym) or supersumption (hypernym) to subject (A).

For the mid-level category of Copulative, here are the three subsidiary categorizations:

  • Typings (is B) — this simple relation is for all of the is-a relations to types (B) for subject (A). Identities and common names fit into this category
  • ActionTypes — simple relations of energetics, perception or thought of subject (A) to some other object (B)
  • Conjoins — relations that involve the joining of a subject (A) to an object (B) via an intermediate object.

For the mid-level category of Mediative, here are the three subsidiary categorizations:

  • Comparisons — relations that compare, contrast or size up similarities or differences or overlaps between subject (A) and object (B)
  • Performances — relations of quantity or rank for how subject (A) performed in relation to object (B)
  • Circumstances — relations of subject (A) to external circumstances, situations or contexts.

For the mid-level category of Denotatives, the first of three categories under Representations, here are the three subsidiary categorizations:

  • Media — iconic images or sounds that invoke the identification with a given object or representation. Media in this sense draws attention to the object [6]
  • Labels — symbolic text strings that help to name or draw attention to a particular object
  • Descriptions — text strings that may be longer than labels and provide additional or contextual information or specify attributes about the object, beyond drawing attention.

For the mid-level category of Indexes, here are the three subsidiary categorizations:

  • Pointers — physical or symbolic indicators of a given thing and which draw attention to it
  • Identifiers — generally (unique) symbols or strings that provide a key to the given subject, often within some conventional scheme for generating and recognizing the token assigned
  • Codes — an assigned symbolic token or string that groups the object with similar items; the generation and interpretation of the token is (often) done in relation to an understood schema. Indexes are included in this category.

And, lastly, for the mid-level category of Associatives, here are the three subsidiary categorizations:

  • Lists — an aggregation, either ordered or unordered, of objects similar to one another with respect to given characters or types
  • Relateds (see also) — an indicator of some nature to other objects similar or related to the given object; the criteria and degree or strength of relationship between the items are indeterminate
  • Augments — an indicator to an external factor in relation to the object, which factor itself leads to still further explanations.

These categories, then, complete the three levels underneath the KBpedia Knowledge Ontology’s Predications branch (itself a Secondness under the basic Thirdness of Generals). We have established these as concepts within KKO such that we may reason over the ideas of these categories, and do other conceptual work with them. This conceptualization is helpful in order to consider the ideas of predicates and the relationships between them. However, for mapping properties from external sources, we need a parallel structure in terms of KKO properties.

An Analogous Structure for Properties

We thus created an analogous structure for properties within KBpedia. One useful addition that is forthcoming in the KBpedia v 1.50 release is a mapping of 2500 Wikidata properties to this property schema, which represents more than 95% of all property assignments to the 25 million+ entities within Wikidata. This is one of the first tangible expressions of the benefits of having a fully rational predicate structure within KBpedia.

Attributes, External Relations and Representations comprise OWL properties. In general, Attributes correspond to the OWL datatypes property; External Relations to the OWL object property; and Representations to the OWL annotation properties. These specific OWL terms are not used in our KKO grammar, however, because some attributes may be drawn from controlled vocabularies, such as colors or shapes, that can be represented as one of a list of attribute choices. In these cases, such attributes are defined as object properties. Nonetheless, the mappings of KKO’s grammar to existing OWL properties is quite close. In all cases, our mappings to external sources are done via the subPropertyOf relation in OWL.

Object and Data Properties

Because a given external non-annotation property (that is, all properties except Representations) may either refer to another object (IRI) or to a string or data value, we created parallel listings of object and datatype properties within KKO. Here is the listing for the object properties group as shown by a screen capture from Protégé:

A Matching KBpedia Object Properties Hierarchy

A Matching KBpedia Object Properties Hierarchy

Note that Protégé lists its items alphabetically, rather than our preferred ordering of Firstness, Secondness and Thirdness. You also should note the parallelism to the concept hierarchy discussed above. Not shown is the similar datatype properties structure.

Annotation Properties

As noted, Representations correspond to annotation properties, and that listing is shown in this screen capture:

A Matching KBpedia Annotation Properties Hierarchy

A Matching KBpedia Annotation Properties Hierarchy

In OWL2, annotations may be organized in a subPropertyOf manner, even though inferencing is not possible over this structure. The organization, though, is helpful to understand relationships (and they can independently be reasoned over with the matching concept structure) and may be used in search, SPARQL or external analytic scripts.

We also organize the SKOS and Dublin Core annotation properties used by KKO into the categorical structure of these Representations, as shown by the screen capture above. This organization helps elucidate the structure under the Representations branch using these commonly applied properties.

Some Caveats and Next Part

This completes the hierarchical specification of the Predications branch within KKO. These 39 categories (3 + 9 + 27), following the categorization approach suggested by Peirce’s universal categories, now complete our additions to this next version of KBpedia.

This addition has been under intense and active development for the past year, though it has been in the works for much longer than that. Over that time, the names of categories and, indeed, even the splits themselves, have changed frequently and evolved. Peirce’s Logic of Relatives [7], as one of the two main sources, is itself laid out in a dichotomous structure, though elsewhere in Peirce’s writings — as we showed in the previous parts — these are often expressed in Peirce’s more standard trichotomous structure. I am sure with use and more experience that we will see further refinements to the KBpedia relations schema as time goes on. Please do not be surprised if you see further changes.

We are now nearly complete with the advance material leading up the KBpedia v 1.50 release. We will have one next part in our series recapitulating the terminology and grammar for this new version. The announcement following that will finally release version 1.50.

This series on KBpedia relations covers topics from background, to grammar, to design, and then to implications from explicitly representing relations in accordance to the principals put forth through the universal categories by Charles Sanders Peirce. Relations are an essential complement to entities and concepts in order to extract the maximum information from knowledge bases. This series accompanies the next release of KBpedia (v 150), which includes the relations enhancements discussed.

[1] Peirce citation schemes tend to use an abbreviation for source, followed by volume number using Arabic numerals followed by section number (such as CP 1.208) or page, depending on the source. For CP, see the electronic edition of The Collected Papers of Charles Sanders Peirce, reproducing Vols. I-VI, Charles Hartshorne and Paul Weiss, eds., 1931-1935, Harvard University Press, Cambridge, Mass., and Arthur W. Burks, ed., 1958, Vols. VII-VIII, Harvard University Press, Cambridge, Mass.
[2] M.K. Bergman, 2016. “A Foundational Mindset: Firstness, Secondness, Thirdness,” AI3:::Adaptive Information blog, March 21, 2016.
[3] “I confine the word representation to the operation of a sign or its relation to the object for the interpreter of the representation.” (CP 1.540)
“A very broad and important class of triadic characters [consists of] representations. A representation is that character of a thing by virtue of which, for the production of a certain mental effect, it may stand in place of another thing.” (CP 1.564)
“as my analysis makes it to be, that a percept contains only two kinds of elements, those of firstness and those of secondness, then the great overshadowing point of difference is that the perceptual judgment professes to represent something, and thereby does represent something, whether truly or falsely. This is a very important difference, since the idea of representation is essentially what may be termed an element of “Thirdness,” that is, involves the idea of determining one thing to refer to another.” (CP 7.630)
[4] “But it would be difficult if not impossible, to instance an absolutely pure index, or to find any sign absolutely devoid of the indexical quality. Psychologically, the action of indices depends upon association by contiguity, and not upon association by resemblance or upon intellectual operations.” (CP 2.306)
“Indices may be distinguished from other signs, or representations, by three characteristic marks: first, that they have no significant resemblance to their objects; second, that they refer to individuals, single units, single collections of units, or single continua; third, that they direct the attention to their objects by blind compulsion. But it would be difficult if not impossible, to instance an absolutely pure index, or to find any sign absolutely devoid of the indexical quality. Psychologically, the action of indices depends upon association by contiguity, and not upon association by resemblance or upon intellectual operations.” (CP 2.306)
[5] “Association is the only force which exists within the intellect, and whatever power of controlling the thoughts there may be can be exercised only by utilizing these forces; indeed, the power, and even the wish, to control ourselves can come about only by the action of the same principles. Still, the force of association in its native strength and wildness is seen best in persons whose understandings are so little developed that they can hardly be said to reason at all. Believing one thing puts it into their heads to believe in another thing; but they know not how they come by their beliefs, and can exercise no control over the inferential process. These unconscious and uncontrolled reasonings hardly merit that name; although they are very often truer than if they were regulated by an imperfect logic, showing in this the usual superiority of instinct over reason, and of practice over theory.” (CP 7.453)
[6] “The only way of directly communicating an idea is by means of an icon; and every indirect method of communicating an idea must depend for its establishment upon the use of an icon. Hence, every assertion must contain an icon or set of icons, or else must contain signs whose meaning is only explicable by icons. The idea which the set of icons (or the equivalent of a set of icons)contained in an assertion signifies may be termed the predicate of the assertion.” (CP 2.278)
[7] C.S. Peirce, 1897. “Logic of Relatives,” The Monist Vol VII, No 2, pp. 161-217. See https://ia801703.us.archive.org/12/items/jstor-27897407/27897407.pdf
Posted:May 24, 2017

KBpediaAttributes, External Relations and Representations Form the Trichotomy

The forthcoming release of KBpedia version 1.50 deals primarily with the addition of a relations schema to the knowledge structure. In the previous part of this series, I discussed the event-action model at the heart of the schema. Actions are external relations between two objects, including parts, for which we use the variable shorthand of A and B. In terms of the universal categories of Charles Sanders Peirce [1], these dyadic relations are a Secondness that we formally term External Relations, or A:B.

But external relations do not constitute the complete family of relations. External relations are but one form of predicate we require. We need relations that cover the full range of language usage, as well as the entire scope of OWL properties. KBpedia is written in OWL2, which is the semantic ontology language extension of RDF. We need to capture the three types of properties in OWL2, namely object properties, datatype properties, and annotation properties.

In Peircean terms, we thus need relations that characterize the subject itself (A:A), which are mostly datatype properties, as well as statements about the subject (re:A), which are annotation properties. Most external relations are represented by object properties in OWL2, but sometimes datatype properties are also used [2].

We call relations of subject characterizations Attributes, and these are a Firstness within Peirce’s universal categories. We call relations about the subject Representations, and these are a Thirdness within the universal categories. The purpose of this part in our series is to introduce and define these three main branches of relations — Attributes, External Relations, and Representations — within the KBpedia schema.

Nature of the Trichotomy

In accordance with the design basis of KBpedia, we use Peirce’s universal categories and his writings on logic and semiosis (signs) to provide the intellectual coherence for the design. For the analysis of relations, two additional Peirce manuscripts were closely studied. The first manuscript is the first one on logic relations by Peirce in 1870, which goes by the shorthand of DNLR [3]. The second manuscript was from nearly 20 years later, known simply as the “Logic of Relatives” [4]. These two manuscripts deal with the ideas of internal and external relations, or the Attributes and External Relations, respectively, discussed above, and relate more to the predicate side of propositions. However, we also need to reference or point to the subjects of the proposition and to predicates, for which the Representations portion applies. Our need here was to organize a full breadth of relations in context with the universal categories, the needs of knowledge representation, and the structure of properties within OWL2 [5]. The result, we think, is consistent with the Peircean architectonic, but modernized for KR purposes.

For example, Peirce notes “the law of logic governs the relations of different predicates of one subject” (CP 1.485). In expanding on this law he states:

“Now logical terms are of three grand classes.

“The first embraces those whose logical form involves only the conception of quality, and which therefore represent a thing simply as “a ──”. These discriminate objects in the most rudimentary way, which does not involve any consciousness of discrimination. They regard an object as it is in itself as such (quale); for example, as horse, tree, or man. These are absolute terms.

“The second class embraces terms whose logical form involves the conception of relation, and which require the addition of another term to complete the denotation. These discriminate objects with a distinct consciousness of discrimination. They regard an object as over against another, that is as relative; as father of, lover of, or servant of. These are simple relative terms.

“The third class embraces terms whose logical form involves the conception of bringing things into relation, and which require the addition of more than one term to complete the denotation. They discriminate not only with consciousness of discrimination, but with consciousness of its origin. They regard an object as medium or third between two others, that is as conjugative; as giver of ── to ──, or buyer of ── for ── from ──. These may be termed conjugative terms.

“The conjugative term involves the conception of third, the relative that of second or other, the absolute term simply considers an object. No fourth class of terms exists involving the conception of fourth, because when that of third is introduced, since it involves the conception of bringing objects into relation, all higher numbers are given at once, inasmuch as the conception of bringing objects into relation is independent of the number of members of the relationship. Whether this reason for the fact that there is no fourth class of terms fundamentally different from the third is satisfactory of not, the fact itself is made perfectly evident by the study of the logic of relatives.” (CP 3.63) [1]

We take the first “class” above as largely relating to the Attributes. The next two classes, including the conjugative terms, we relate to External Relations. To this we add the Representations as the Thirdness within our revised relations category. Each of these three categories is described more fully below with further discussion as to the rationale for these splits.

We think this organization of relational categories is consistent with Peirce’s thinking, even though he never had today’s concepts of computerized knowledge representation as an objective for his analysis. For example, he labeled one of his major sections “The Conceptions of Quality, Relation and Representation, Applied to this Subject.” (1867, “Upon Logical Comprehension and Extension”; CP 2.418) Thirty five years later, Peirce still held to this split,”. . . there are but three elementary forms of predication or signification, which as I originally named them (but with bracketed additions now made to render the terms more intelligible) were qualities (of feeling), (dyadic) relations, and (predications of) representations.” (1903, EP 424; CP 1.561)

And, of course, human intelligence and communication is a symbolic world. So, our computer-reasoning basis should also be geared to the manipulation of ideas, which in a knowledge context is the accumulation of (approximately) known false and known true assertions about the world. These are our statements or propositions or assertions. Peirce elaborates:

“Now every simple idea is composed of one of three classes; and a compound idea is in most cases predominantly of one of those classes. Namely, it may, in the first place, be a quality of feeling, which is positively such as it is, and is indescribable; which attaches to one object regardless of every other; and which is sui generis and incapable, in its own being, of comparison with any other feeling, because in comparisons it is representations of feelings and not the very feelings themselves that are compared [Attributes]. Or, in the second place, the idea may be that of a single happening or fact, which is attached at once to two objects, as an experience, for example, is attached to the experiencer and to the object experienced [External Relations]. Or, in the third place, it is the idea of a sign or communication conveyed by one person to another (or to himself at a later time) in regard to a certain object well known to both [Representations].” (CP 5.7) (Emphasis brackets added.)

Peirce’s recommendations as to how to analyze a question proceed from defining the domain and its relations (the speculative grammar) to the logical analysis of it, including hypotheses about still questionable areas or emerging from new insights or combinations. The methods of this progression should be purposeful and targeted to produce a better likelihood of economic results or outcomes. This overall process he called the pragmatic maxim, and is a key insight into Peirce’s reputation at the father of pragmatism.

The concepts above, then, represent our starting speculative grammar for how to organize the relations, including the choice of the three adics, or branches of the trichotomy [6]. We also set the guidance of how each adic branch may be analyzed and split according to the universal categories (which is the subject of the next Part IV in this series.)

Nature of Propositions and Predicates

In terms of Peirce’s more formal definition of signs, a proposition is a dicisign, and it consists of a subject + predicate. (CP 2.316) A predicate is a rhema (CP 2.95). In terms of OWL2 with its RDF triples (subject – property – object), the predicate in this model is property + object, or a multitude of annotations that are representations of the subject. Further, “Every proposition refers to some index” (CP 2.369), that is its subject (also referred to as object). “Thus every kind of proposition is either meaningless or has a real Secondness as its object.” (CP 2.315) The idea of an individual type (or general) is also a Secondness [7]. “I term those occasions or objects which are denoted by the indices the subjects of the assertion.” (CP 2.238) The assertion give us the basic OWL statement, also known as a triple.

A proposition captures a relation, which is the basis for the assertion about the ‘subjects’. “Any portion of a proposition expressing ideas but requiring something to be attached to it in order to complete the sense, is in a general way relational. But it is only a relative in case the attachment of indexical signs will suffice to make it a proposition, or, at least, a complete general name.” (CP 3.463)  “But the Logic of Relations has now reduced logic to order, and it is seen that a proposition may have any number of subjects but can have but one predicate which is invariably general.” (CP 5.151)

We now have the building blocks to represent the nature of the proposition:

Propositions, the Subject-Predicate Model

The Proposition Subject-Predicate Model

Subject(s) and a general predicate make up the proposition (statement, assertion). Subjects need to be individual things (including generals) and are defined, denoted and indicated by various indexical representations, including icons and images. Active predicates that may be reasoned over include the attributes (characteristics) of individual subjects or the relations between objects.

This basic structure also lends itself to information theoretics. “Every term has two powers or significations, according as it is subject or predicate. The former, which will here be termed its breadth, comprises the objects to which it is applied; while the latter, which will here be termed its depth, comprises the characters which are attributed to every one of the objects to which it can be applied.” (CP 2.473) Peirce importantly defines the total information regarding a subject to consist of the “sum of synthetical propositions in which the symbol is subject or predicate, or the information concerning the symbol.” (CP 2.418) In other words, information = breadth x depth. We can reason over attributes and external relations, but our total information also consists in our representations.

These insights give us some powerful bases for defining and categorizing the terms or tokens within our knowledge space. By following these constructs, I believe we can extract the maximum information from our input content.

Definitions of the Relations

Best practice using semantic technologies includes providing precise, actionable definitions to key concepts and constructs. Here are the official statements regarding this trichotomy of relations.

Attribute Relations

Attributes are the intensional characteristics of an object, event, entity, type (when viewed as an instance), or concept. The relationship is between the individual instance (or Particular) and its own attributes and characteristics, in the form of A:A. Attributes may be intrinsic characteristics or essences of single particulars, such as colors, shapes, sizes, or other descriptive characteristics. Attributes may be adjunctual or accidental happenings to the particular, such as birth or death. Or, attributes may be contextual in terms of placing the particular within time or space or in relation to external circumstances.

Attributes are specific to the individual, and only include events that are notable for the individual. They are a Firstness, and in totality try to capture the complete characteristics of the individual particular, which is a Secondness.

These attributes are categorized according to these distinctions and grouped and organized into types, which will be presented in the next part.

External Relations

External relations are assertions between an object, event, entity, type, or concept and another particular or general. An external relationship has the form of A:B. External relations may be simple ones of a direct relationship between two different instances. External relations may be copulative by combining objects or asserting membership, quantity, action or circumstance. Or, external relations may be mediative to provide meaning, context, relevance, generalizations, or other explanations of the subject with respect to the external world. External relations are extensional.

External relations are by definition a Secondness. These external relations are categorized according to these distinctions and grouped and organized into types, which will be presented in the next part.

The events discussion in the previous Part II pertained mostly to external relations.

Representational Relations

Representations are signs (CP 8.191), and the means by which we point to, draw or direct attention to, or designate, denote or describe a particular object, entity, event, type or general.  A representational relationship has the form of re:A. Representations can be designative of the subject, that is, be icons or symbols (including labels, definitions, and descriptions). Representations may be indexes that more-or-less help situate or provide traceable reference to the subject. Or, representations may be associations, resemblances and likelihoods in relation to the subject, more often of indeterminate character.

The representational relation includes what is known as annotations or metadata in other contexts, such as images, links, labels, descriptions, pointers, references, or indexes. Representations can not be reasoned over, except abductive reasoning, but some characteristics may be derived or analyzed through non-inferential means.

These representations are categorized according to these distinctions and grouped and organized into types, which will be presented in the next part.

Summary of the Three Relations

We can now pull these threads together to present a summary chart of these three main relational branches:

KBpedia Relations

KBpedia Three Relations Model

This trichotomy sets the boundaries and affirms the method by which further sub-divisions will be presented in the next installment in this series.

A Strong Relation Schema

We now have a much clearer way for how to build up the assertions in our knowledge representations, according to linguistic predicate construction and predicate calculus. We can now explicitly state a premise underlying our choice of Peirce and his architectonic for the design of KBpedia: it is the most accurate, expressive basis for capturing human language and logical reasoning, both individually and together. Our ability to create new symbolic intelligence from human knowledge requires that we be able to compute and reason over human language.

In the next part we will establish sub-categories for each of these three branches according to the universal categories [8].

This series on KBpedia relations covers topics from background, to grammar, to design, and then to implications from explicitly representing relations in accordance to the principals put forth through the universal categories by Charles Sanders Peirce. Relations are an essential complement to entities and concepts in order to extract the maximum information from knowledge bases. This series accompanies the next release of KBpedia (v 150), which includes the relations enhancements discussed.

[1] Peirce citation schemes tend to use an abbreviation for source, followed by volume number using Arabic numerals followed by section number (such as CP 1.208) or page, depending on the source. For CP, see the electronic edition of The Collected Papers of Charles Sanders Peirce, reproducing Vols. I-VI, Charles Hartshorne and Paul Weiss, eds., 1931-1935, Harvard University Press, Cambridge, Mass., and Arthur W. Burks, ed., 1958, Vols. VII-VIII, Harvard University Press, Cambridge, Mass. For EP, see Nathan Houser and Christian Kloesel, eds., 1992. The Essential Peirce – Volume 1, Selected Philosophical Writings‚ (1867–1893), Indiana University Press, 428 pp. For EP2, see The Peirce Edition Project, 1998. The Essential Peirce – Volume 2, Selected Philosophical Writings‚ (1893-1913), Indiana University Press, 624 pp.
[2] Attributes, External Relations and Representations comprise OWL properties. In general, Attributes correspond to the OWL datatypes property; External Relations to the OWL object property; and Representations to the OWL annotation properties. These specific OWL terms are not used in our speculative grammar, however, because some attributes may be drawn from controlled vocabularies, such as colors or shapes, that can be represented as one of a list of attribute choices. In these cases, such attributes are defined as object properties. Nonetheless, the mappings of our speculative grammar to existing OWL properties is quite close. In the actual KKO, these labels are replaced with AttributeTypes, RelationTypes, and RepresentationTypes, respectively, when talking about Generals, to conform to the typing terminology of the ontology.
[3] C.S. Peirce, 1870. “Description of a Notation for the Logic of Relatives, Resulting from an Amplification of the Conceptions of Boole’s Calculus of Logic”, Memoirs of the American Academy of Arts and Sciences 9, 317–378, 26 January 1870. Reprinted, Collected Papers (CP3.45–149), Chronological Edition (CE2, 359–429).
[4] C.S. Peirce, 1897. “Logic of Relatives,” The Monist Vol VII, No 2, pp. 161-217. See https://ia801703.us.archive.org/12/items/jstor-27897407/27897407.pdf
[5] The attributes-relation split has been a not uncommon one in the KB literature, insofar as such matters are discussed. For example, see Nicola Guarino, 1997. “Some Organizing Principles for a Unified Top-level Ontology,” in AAAI Spring Symposium on Ontological Engineering, pp. 57-63. 1997. Also, see Yankai Lin, Zhiyuan Liu, and Maosong Sun, 2016. “Knowledge Representation Learning with Entities, Attributes and Relations.” ethnicity 1 (2016): 41-52; the authors propose splitting existing KG-relations into attributes and relations, and propose a KR model with entities, attributes and relations (KR-EAR).
[6] See further M.K. Bergman, 2016. “A Speculative Grammar for Knowledge Bases“, AI3:::Adaptive Information blog, June 20, 2016.
[7] Peirce recognized the importance of being able to talk of the individual type or general as an object in itself. It was only until the revision of OWL2 that such punning was added to the OWL language.
[8] Some additional useful quotes from Peirce related to this topic of relations and these splits are (with emphases per the originals):

  • “Whether or not every proposition has a principal subject, and, if so, whether it can or cannot have more than one, will be considered below. A proposition may be defined as a sign which separately indicates its object. For example, a portrait with the proper name of the original written below it is a proposition asserting that so that original looked. If this broad definition of a proposition be accepted, a proposition need not be a symbol. Thus a weathercock “tells” from which direction the wind blows by virtue of a real relation which it would still have to the wind, even if it were never intended or understood to indicate the wind. It separately indicates the wind because its construction is such that it must point to the quarter from which the wind blows; and this construction is distinct from its position at any particular time. But what we usually mean by a proposition or judgment is a symbolic proposition, or symbol, separately indicating its object. Every subject partakes of the nature of an index, in that its function is the characteristic function of an index, that of forcing the attention upon its object. Yet the subject of a symbolic proposition cannot strictly be an index. When a baby points at a flower and says, “Pretty,” that is a symbolic proposition; for the word “pretty” being used, it represents its object only by virtue of a relation to it which it could not have if it were not intended and understood as a sign. The pointing arm, however, which is the subject of this proposition, usually indicates its object only by virtue of a relation to this object, which would still exist, though it were not intended or understood as a sign. But when it enters into the proposition as its subject, it indicates its object in another way. For it cannot be the subject of that symbolic proposition unless it is intended and understood to be so. Its merely being an index of the flower is not enough. It only becomes the subject of the proposition, because its being an index of the flower is evidence that it was intended to be. In like manner, all ordinary propositions refer to the real universe, and usually to the nearer environment. Thus, if somebody rushes into the room and says, “There is a great fire!” we know he is talking about the neighbourhood and not about the world of the Arabian Nights’ Entertainments. It is the circumstances under which the proposition is uttered or written which indicate that environment as that which is referred to. But they do so not simply as index of the environment, but as evidence of an intentional relation of the speech to its object, which relation it could not have if it were not intended for a sign. The expressed subject of an ordinary proposition approaches most nearly to the nature of an index when it is a proper name which, although its connection with its object is purely intentional, yet has no reason (or, at least, none is thought of in using it) except the mere desirability of giving the familiar object a designation.” (CP 2.357)
  • “But it remains to point out that there are usually two Objects, and more than two Interpretants. Namely, we have to distinguish the Immediate Object, which is the Object as the Sign itself represents it, and whose Being is thus dependent upon the Representation of it in the Sign, from the Dynamical Object, which is the Reality which by some means contrives to determine the Sign to its Representation. In regard to the Interpretant we have equally to distinguish, in the first place, the Immediate Interpretant, which is the interpretant as it is revealed in the right understanding of the Sign itself, and is ordinarily called the meaning of the sign; while in the second place, we have to take note of the Dynamical Interpretant which is the actual effect which the Sign, as a Sign, really determines. Finally there is what I provisionally term the Final Interpretant, which refers to the manner in which the Sign tends to represent itself to be related to its Object. I confess that my own conception of this third interpretant is not yet quite free from mist.” (CP 4.536)
  • “A rhema which has one blank is called a monad; a rhema of two blanks, a dyad; a rhema of three blanks, a triad; etc. A rhema with no blank is called a medad, and is a complete proposition. A rhema of more than two blanks is a polyad. A rhema of more than one blank is a relative. Every proposition has an ultimate predicate, produced by putting a blank in every place where a blank can be placed, without substituting for some word its definition.” [CP 4.438]
  • “Hence, as soon as we admit the idea of absurdity, we are bound to class the rejection of an argumentation among argumentations. Thus, as was said, a proposition is nothing more nor less than an argumentation whose propositions have had their assertiveness removed, just as a term is a proposition whose subjects have had their denotative force removed.” (CP 2.356)
  • “The only way of directly communicating an idea is by means of an icon; and every indirect method of communicating an idea must depend for its establishment upon the use of an icon. Hence, every assertion must contain an icon or set of icons, or else must contain signs whose meaning is only explicable by icons. The idea which the set of icons (or the equivalent of a set of icons) contained in an assertion signifies may be termed the predicate of the assertion.” (CP 2.278)
  • “Thus, we have in thought three elements: first, the representative function which makes it a representation; second, the pure denotative application, or real connection, which brings one thought into relation with another; and third, the material quality, or how it feels, which gives thought its quality.†” (CP 5.290)
  • “Every informational sign thus involves a Fact, which is its Syntax. It is quite evident, then, that Indexical Dicisigns equally accord with the definition and the corollaries.” (CP2.320)
  • “The monad has no features but its suchness, which in logic is embodied in the signification of the verb. As such it is developed in the lowest of the three chief forms of which logic treats, the term, the proposition, and the syllogism.” (CP 1.471)
  • “The unity to which the understanding reduces impressions is the unity of a proposition. This unity consists in the connection of the predicate with the subject; and, therefore, that which is implied in the copula, or the conception of being, is that which completes the work of conceptions of reducing the manifold to unity.” [CP 1.548]
  • “This search resulted in what I call my categories. I then named them Quality, Relation, and Representation. But I was not then aware that undecomposable relations may necessarily require more subjects than two; for this reason Reaction is a better term. Moreover, I did not then know enough about language to see that to attempt to make the word representation serve for an idea so much more general than any it habitually carried, was injudicious. The word mediation would be better.” (CP 4.3)
  • “Every thought, or cognitive representation, is of the nature of a sign. “Representation” and “sign” are synonyms. The whole purpose of a sign is that it shall be interpreted in another sign; and its whole purport lies in the special character which it imparts to that interpretation.” (CP 8.191)
Posted:May 15, 2017

KBpediaEvents are a Quasi-Entity and Cross All of Peirce’s Universal Categories

Most knowledge graphs have an orientation to things and concepts, what might be called the nouns of the knowledge space. Entities and concepts have certainly occupied my own attention in my work on the UMBEL and KBpedia ontologies over the past decade. In Part I of this series, I discussed how knowledge graphs, or ontologies, needed to move beyond simple representations of things to embrace how those things actually interact in the world, which is the understanding of context. What I discuss in this part is how one might see actions and events in a way that has logic and coherency. What I discuss in this part is an event-action model.

As we all know, knowledge statements or assertions are propositions that combine a subject with a predicate. Those predicates might describe the nature or character of the subject or might relate the subject to other objects or situations. For covering this aspect we need to pay close attention to the verbs or relations that connect these things.

How we model or represent these things is one of the critical design choices in a knowledge graph. But these choices go beyond simply using RDF or OWL properties (or whatever predicate basis your modeling language may provide). Modeling relations and predicates needs to capture a worldview of how things are connected, preferably based on some coherent, underlying rationale. Similar to how we categorize the things and entities in our world, we also need to make ontological choices (in the classic sense of the Greek ontos, or the nature of being) as to what a predicate is and how predicates may be classified and organized. As I noted in Part I, much less is discussed about this topic in the literature.

Some Things Just Take Time

Information and information theory have been my passion my entire professional career. In that period, two questions stand out as the most perplexing to me, each of which took some years to resolve to some level of personal intellectual satisfaction. My first perplexing question was how to place information in text on to a common, equal basis to the information in a database, such as a structured record. (Yeah, I know, kind of a weird question.) These ruminations, now what we call being able to place unstructured, semi-structured and structured information on to a common footing, was finally solved for me by the RDF (Resource Description Framework) data model. But, prior to RDF, and for perhaps a decade or more, I thought long and hard and read much on this question. I’m sure there were other data models out there at the time that could have perhaps given me the way forward, but I did not discover them. It took RDF and its basic subject-predicate-object (s-p-o) triple assertion to show me the way forward. It was not only a light going on once I understood, but the opening of a door to a whole new world of thinking about knowledge representation.

My second question is one that has been gnawing at me for at least five or six years. The question is, What is an event? (Yeah, I know, another kind of weird question.) When one starts representing information in a knowledge graph, we model things. My early ideas of what is an entity is that it was some form of nameable thing. By that light, the War of 1812, or a heartbeat, or an Industrial Age are all entities. But these things are events, granted of greatly varying length, that are somehow different from tangible objects that we can see or sense in the world, and different from ideas or thoughts. These things all differ, but how and why?

Actually, the splits noted in the prior paragraph give us this clue. Events are part of time, occupy some length of time, and sometimes are so notable as to get their own names, either as types or named events. They have no substance or tangibility. These characteristics are surely different than tangible objects which occupy some space, have physicality, exist over some length of time, and also get their own names as types or named instances. And both of these are different still than concepts or ideas that are creatures of thought.

These distinctions, mostly first sensed or intuited, are hard to think about because we need a vocabulary and mindset (context) by which to evaluate and discern believable differences. For me, the idea and definition of What is an event? was my focus and entry point to try to probe this question. Somehow, I felt events to be a key to the very structures used for knowledge representation (KR) or knowledge-based artificial intelligence (KBAI), which need to be governed by some form of conceptual schema. In the semantic Web space, such schema are known as “ontologies”, since they attempt to capture the nature or being (Greek ὄντως, or ontós) of the knowledge domain at hand. Because the word ‘ontology’ is a bit intimidating, a better variant has proven to be the knowledge graph (because all semantic ontologies take the structural form of a graph). In Cognonto‘s KBAI efforts, we tend to use the terms ontology and knowledge graph interchangeably.

A key guide to this question of What is an event? are the views of Charles Sanders Peirce, the great 19th century American logician, polymath and philosopher. His theory of the universal categories — what he termed Firstness, Secondness and Thirdness — provides the groundings for his views on logic and sign-making. As we’ve noted before about KBpedia, Peirce’s theory of universal categories greatly informs how we have constructed it [1]. Peirce’s categories, while unique, are an organizational framework not unlike categories of being put forward by many philosophers; see [1] for more background on this topic.

Thus, with liberal quotes from Peirce himself [2], I work through below some of the background context for how we treat events — and related topics such as actions, relations, situations and predicates — in our pending KBpedia v 150 release.

What is an Event?

Of course, I am hardly raising new questions. The philosophical question of What is an event? is readily traced back to Plato and Aristotle. The fact we have no real intellectual consensus as to What is an event? after 2500 years suggests both that it is a good question, but also that any “answer” is unlikely to find consensus. Nonetheless, I think through the application of Peircean principles we can still find a formulation that is coherent and logically consistent (and, thus, computable).

To begin this evaluation, let’s first summarize the diversity of views of What is an event? Given the long history of this question, and the voluminous writings and diversity of opinion on the matter, a good place to start is the Stanford Encyclopedia of Philosophy, which offers a kind of Cliff Notes version overviewing various views on events [3], among many other articles in philosophy. I encourage interested students of this question to study that entry in detail. However, we can summarize the various views as fitting into one or more of these definitions:

  • Events are objects (also potentially referred to as entities)
  • Events are facts
  • Events are actions
  • Events are properties
  • Events are times
  • Events are situations.

Within the context of current knowledge bases, the Cyc knowledge base, for example, asserts situations are a generalization of events, and actions are a specialization of events. Most current semantic Web ontologies place events in the same category or class as entities. Some upper ontologies model time in a different way by viewing objects as temporal parts that change over time, or other dichotomous splits around the questions of events and actions. Like I said, there is really no consensus model for events and actually little discussion of them.

From our use in general language, I think we can fairly define events as having some of these characteristics:

  • Events occur in time; “For example, everyday experience is that events occur in time, and that time has but one dimension.” (CP 1.273)
  • Events have a beginning, duration and end
  • Events may be of nearly instantaneous duration (beta decay) to periods spanning centuries or millenia (the Industrial Age, the Cenozoic Era)
  • Events can refer to individual instances (tokens in Peircean terms) or general types
  • Events may be single or recurring (birthdays)
  • Events occur, or “take place”
  • Events, if sufficiently notable, can be properly named (World War II).

For Peirce, “We perceive objects brought before us; but that which we especially experience — the kind of thing to which the word ‘experience’ is more particularly applied — is an event. We cannot accurately be said to perceive events;” (CP 1.336). He further states that “If I ask you what the actuality of an event consists in, you will tell me that it consists in its happening then and there. The specifications then and there involve all its relations to other existents. The actuality of the event seems to lie in its relations to the universe of existents.” (CP 1.24)

We often look for causes for events, but Peirce cautions us that, “Men’s minds are confused by a looseness of language and of thought which leads them to talk of the causes of single events. They ought to consider that it is not the single actuality, in its identity, which is the subject of a law, but an ingredient of it, an indeterminate predicate. Consequently, the question is, not whether each and every event is precisely caused, in one respect or another, but whether every predicate of that event is caused.” (EP p 396) Peirce notes that the chance flash or shock, say a natural phenomenon like a lightning strike or an accident, which by definition is not predictable, can itself through perception of or reaction to the shock “cause” an event. Chance occurrences are a central feature in Peirce’s doctrine of tychism.

Though events are said to occur, to happen or to take place, entities are said to exist. From Peirce again:

“The event is the existential junction of states (that is, of that which in existence corresponds to a statement about a given subject in representation) whose combination in one subject would violate the logical law of contradiction. The event, therefore, considered as a junction, is not a subject and does not inhere in a subject. What is it, then? Its mode of being is existential quasi-existence, or that approach to existence where contraries can be united in one subject. Time is that diversity of existence whereby that which is existentially a subject is enabled to receive contrary determinations in existence.” (CP 1.494).”

Nonetheless, “Individual objects and single events cover all reality . . . .” (CP 5.429). Other possibly useful statements by Peirce regarding events may be found under [4].

In these regards, we can see both entities and single events as individual instances within our KBpedia Knowledge Ontology, what we call Particulars, which represent the second (or Secondness) of the three main branches in KKO. Per our use of the universal categories to evaluate our subsequent category structures (see [1]) within Particulars, events are treated as a Secondness due to their triggering and “quasi-existence” nature, with entities treated as a Thirdness [5]. Like entities, we can also generalize events into types, which are placed under the third main branch of KKO, the Generals. Event types can be defined and are real in a similar way to entity types.

Within events, we can also categorize according to the three universal categories. What I present below are comments with respect to the event examples first mentioned in the introductory material above.

Events Resulting from Chance or Flash

As noted, the chance event, the unexpected flash or shock, is a Firstness within events. Peirce’s doctrine of tychism places a central emphasis on chance, being viewed as the source of processes in nature such as evolution and the “surprising fact” that causes us to re-investigate our assumptions leading to new knowledge.”Chance is any event not especially intended, either not calculated, or, with a given and limited stock of knowledge, incalculable.” (CP 6.602 ref). The surprising fact is the spark that causes us to continually reassess the nature of the world, to assess and categorize anew [6].

“Anything which startles us is an indication, in so far as it marks the junction between two portions of experience. Thus a tremendous thunderbolt indicates that something considerable happened, though we may not know precisely what the event was.” (EP What is a Sign, Sec 5) The chance event or shock joins energetic effort or perception as the stimulants of action. “Effort and surprise are the only experiences from which we can derive the concept of action.” (EP p 385)

The chance shock produces sensation, which itself may be a stimulant (reaction) to produce further action. “This is present in all sensation, meaning by sensation the initiation of a state of feeling; — for by feeling I mean nothing but sensation minus the attribution of it to any particular subject. In my use of words, when an ear-splitting, soul-bursting locomotive whistle starts, there is a sensation, which ceases when the screech has been going on for any considerable fraction of a minute; and at the instant it stops there is a second sensation. Between them there is a state of feeling.” (CP 1.332)

Still, the chance shock is the one form of event for which there is not a discernible cause-and-effect. It remains inexplicable because the triggering event remains unpredictable. “In order to explain what I mean, let us take one of the most familiar, although not one of the most scientifically accurate statements of the axiom viz.: that every event has a cause. I question whether this is exactly true . . . . may it be that chance, in the Aristotelian sense, mere absence of cause, has to be admitted as having some slight place in the universe . . . . ” (W4:546)

Events Resulting from Actions

We more commonly associate an event with action, and that is indeed a major cause of events. (Though, as we saw, chance events or accidents, as an indeterminate group, may trigger events.) An action is a Secondness, however, because it is always paired with a reaction. Reactions may then cause new actions, itself a new event. In this manner activities and processes can come into being, which while combinatorial and compound, can also be called events, including those of longer duration. That entire progression of multiple actions represents increasing order, and thus the transition to Thirdness.

One of Peirce’s more famous quotes deals with the question of action and reaction, even with respect to our cognition, and their necessary pairing:

“We are continually bumping up against hard fact. We expected one thing, or passively took it for granted, and had the image of it in our minds, but experience forces that idea into the background, and compels us to think quite differently. You get this kind of consciousness in some approach to purity when you put your shoulder against a door and try to force it open. You have a sense of resistance and at the same time a sense of effort. There can be no resistance without effort; there can be no effort without resistance. They are only two ways of describing the same experience. It is a double consciousness. We become aware of ourself in becoming aware of the not self. The waking state is a consciousness of reaction; and as the consciousness itself is two sided, so it has also two varieties; namely, action, where our modification of other things is more prominent than their reaction on us, and perception, where their effect on us is overwhelmingly greater than our effect on them.” (CP 1.324)

So, we see that actions can be triggered by chance, energetic effort, perceptions, and reactions to prior actions, sometimes cascading into processes involving a chain of actions, reactions and events.

Thoughts as Events

Peirce makes the interesting insight that thoughts are events, too. “Now the logical comprehension of a thought is usually said to consist of the thoughts contained in it; but thoughts are events, acts of the mind. Two thoughts are two events separated in time, and one cannot literally be contained in the other.” (CP 5.288) Similarly, in “Law of the Mind” Peirce calls an idea “an event in an individual consciousness” (CP 6.105) Through these assertions, the sticky question of thinking and cognition (always placed as a Thirdness by Peirce) is clearly put into the event category.

Events Resulting from Continuity

Of course, the essence of Thirdness is continuity, what Peirce called synechism. The very nature of continuity in a temporal sense are events, some infinitesimal, transitioning from one to another, with breaks, if we are to become aware of them, merely breaks in the continuity of time. Entities, at least as we define them, provide a similar function, but now over the continuity of space. All objects are deformations of continuous space. By this neat trick of relating events to time and entities to space, all of which is (yes, singular tense) continuous, Peirce nailed one of the hard metaphysical nuts to crack. Some claim that Peirce was the first philosopher to anticipate the space-time continuum [7].

In addition to its embeddedness and embedding into continuity, there are also actions which themselves are expressions of triadic relations. Peirce first says:

“Let me remind you of the distinction … between dynamical, or dyadic, action; and intelligent, or triadic action. An event, A, may, by brute force, produce an event, B; and then the event, B, may in its turn produce a third event, C. The fact that the event, C, is about to be produced by B has no influence at all upon the production of B by A. It is impossible that it should, since the action of B in producing C is a contingent future event at the time B is produced. Such is dyadic action, which is so called because each step of it concerns a pair of objects.” (CP 5.472)

In triadic action, the classic example is ‘A gives B to C’ (EP 2 170-171). The other classic triadic example is Peirce’s sign relation between object, sign and interpretant. Peirce adopted the term semiosis for this triadic relation and defined it to mean an “action, or influence, which is, or involves, a coöperation of three subjects, such as a sign, its object, and its interpretant, this tri-relative influence not being in any way resolvable into actions between pairs” (EP 2 411). Peirce’s reduction thesis also maintains that all higher order relationships (polyadic with more than three terms) can be decomposed to monadic, dyadic or triadic relations. All three are required to capture the universe or potential relations, but any relation can be reduced to one of those three [1]. Further, Peirce also maintained that the triadic relation is primary, with monadic and dyadic relations being degenerate forms of it.

Now, all symbols (therefore, also the basis for human language) are also a Thirdness. The symbol (sign) stands for an object which the interpretant understands to have a meaning relationship to the object. Symbols, too, may be causes of events. As Peirce states,”Thus a symbol may be the cause of real individual events and things. It is easy to see that nothing but a symbol can be such a cause, since a cause is by its definition the premiss of an argument; and a symbol alone can be an argument.” (EP, p 317).

It is not always easy to interpret Peirce. The ideas of creation and destruction, for example, would seem to be elements of Firstness, being closely allied to the idea of potentiality. Yet, as Peirce states, “But the event may, on the other hand, consist in the coming into existence of something that did not exist, or the reverse. There is still a contradiction here; but instead of consisting in the material, or purely monadic, repugnance of two qualities, it is an incompatibility between two forms of triadic relation . . . .” (CP 1.493) This statement seems to suggest that creation and destruction are somehow related to Thirdness. It is not always easy to evaluate where certain concepts fit within Peirce’s universal categories.

While I am unsure of some aspects of my Peircean analysis as to exact placements into Firstness, Secondness and Thirdness, what is also true is that Peirce sets conditions and mindsets for looking at these very questions. Ultimately open questions such as I mention are amenable to analysis and argumentation according to the principles underlying Peirce’s universal categories of Firstness, Secondness and Thirdness. The challenge is not due to the criteria for evaluation; rather, it comes from probing what is truly meant and implied within any question.

The Event-Action Model

What does not seem complicated or confusing is Peirce’s basic model for actions. Actions are grounded in events. As discussed above, Peirce provides for us broad and comprehensive examples of events — chance, actions, thoughts and continuity. Events are always things that occur to an individual (including the idea of an individual type). Peirce categorically states that “. . . the two chief parts of the event itself are the action and the reaction . . . .” (CP 5.424)

We now have the building blocks to enable us to diagram the event-action model embodied in KBpedia:

Events model

The KBpedia Event-Action Model

The single event may arise from any of the bulleted items shown on this diagram. Though every action is paired with a reaction, one or the other might be more primary for different kinds of events. As Peirce notes, the event represents a juxtaposition of states, the comparison of the subject prior and after the event providing the basis for the nature of the event. Each change in state represents a new event, which can trigger new actions and reactions leading to still further events. Simple events represent relatively single changes in state, such as turning off a light switch or a bolt of lightning. More complicated events are the topic of the next section.

Relating Events More Broadly to KBpedia

The next diagram relates events more broadly to KBpedia particulars and generals. Events may be the triggers for actions, both embedded within situations that provide their context (and often influence the exact nature and course of the event). Events also precede the creation or destruction of entities, which are manifestations, and events do occur over time to affect those very same entities. These events and these entities are singular, and if notable, we may give the individual instances of them names. These items are shown at the top of the diagram:

Events Cascade

KBpedia Particulars and Types

On the left-hand portion of the diagram we have the cascade from events to actions to activities and then processes. The progression down the cascade requires the chaining together of events and actions and paired reactions, getting more ordered and purposeful as we move down the cascade. Of course, single events may trigger single actions and reactions, and events express themselves at every level of the cascade. Events and actions always occur in a situation, the context of which may have influence on the resulting nature of the event and its actions and reactions. This mediation is the exact reason that Thirdness is a logical foundation.

Parallel with events are entities, which themselves result from events. Entities, too, may be named. This side of the diagram cascade leads to classes or types of individual entities, which now become generals, and that may be classified or organized by types. A similar type aggregation may be applied to individual events, actions, activities and processes. At this point, we are moving into the territory of what is known as Peirce’s token-type distinction (particulars v generals). With generals, we now move beyond the focus of events; see further my own typology piece for this transition [8].

Events Provide the Entree to Understand Relations

Events are like the spark that leads us to better understand actions and what emerges from them, which in turn helps us better understand predicates and relations. These are topics for next parts in this series.

What we learn from Peirce is that events are quasi-entities, based on time rather than space, and, like entities, are a Secondness. Like entities, we can name events and intrinsically inspect their attributes. Events may also range from the simple to the triadic and durative. Events are the fundamental portions of activity and process cascades, and also capture such seemingly non-energetic actions like thought. Thought, itself, may be a source of further events and action, as may be the expressions of our thought, symbols. And actions always carry with them a reaction, which can itself be the impetus for the next action in the event cascade.

What this investigation shows us is that events are the real triggering and causative factors in reality. Entities are a result and manifestation of events, but less central to the notion of relations. Events, like entities, can be understood through Peirce’s universal categories of Firstness, Secondness and Thirdness.

Events help give us a key to understand the dynamic nature of Charles Peirce’s worldview. I hope in subsequent parts of this series to help elucidate further how an understanding of events helps to unmask the role and purpose of relations. Though entities, events and generals may all be suitable subjects for our assertions within knowledge bases and knowledge graphs, it is really through the relations of our system, in this case KBpedia, where we begin to understand the predicates and actions of our chosen domain.

This series on KBpedia relations covers topics from background, to grammar, to design, and then to implications from explicitly representing relations in accordance to the principals put forth through the universal categories by Charles Sanders Peirce. Relations are an essential complement to entities and concepts in order to extract the maximum information from knowledge bases. This series accompanies the next release of KBpedia (v 150), which includes the relations enhancements discussed.

[1] M.K. Bergman, 2016. “The Irreducible Truth of Threes,” AI3:::Adaptive Information blog, September 27, 2016.
[2] Peirce citation schemes tend to use an abbreviation for source, followed by volume number using Arabic numerals followed by section number (such as CP 1.208) or page, depending on the source. For CP, see the electronic edition of The Collected Papers of Charles Sanders Peirce, reproducing Vols. I-VI, Charles Hartshorne and Paul Weiss, eds., 1931-1935, Harvard University Press, Cambridge, Mass., and Arthur W. Burks, ed., 1958, Vols. VII-VIII, Harvard University Press, Cambridge, Mass. For EP, see Nathan Houser and Christian Kloesel, eds., 1992. The Essential Peirce – Volume 1, Selected Philosophical Writings‚ (1867–1893), Indiana University Press, 428 pp. For EP2, see The Peirce Edition Project, 1998. The Essential Peirce – Volume 2, Selected Philosophical Writings‚ (1893-1913), Indiana University Press, 624 pp.
[3] Roberto Casati and Achille Varzi, 2014. “Events“, Stanford Encyclopedia of Philosophy, Aug 27, 2014. Retrieved May 2, 2017.
[4] What constitutes the potentials, realized particulars, and generalizations that may be drawn from a query or investigation is contextual in nature. That is why the mindset of Peirce’s triadic logic is a powerful guide to how to think about and organize the things and ideas in our world. We can apply this triadic logic to any level of information granularity. Here are some further Peirce statements about events:

  • “An event always involves a junction of contradictory inherences in the subjects existentially the same, whether there is a simple monadic quality inhering in a single subject, or whether they be inherences of contradictory monadic elements of dyads or polyads, in single sets of subjects. But there is a more important possible variation in the nature of events. In the kind of events so far considered, while it is not necessary that the subjects should be existentially of the nature of subjects — that is, that they should be substantial things — since it may be a mere wave, or an optical focus, or something else of like nature which is the subject of change, yet it is necessary that these subjects should be in some measure permanent, that is, should be capable of accidental determinations, and therefore should have dyadic existence. But the event may, on the other hand, consist in the coming into existence of something that did not exist, or the reverse. There is still a contradiction here; but instead of consisting in the material, or purely monadic, repugnance of two qualities, it is an incompatibility between two forms of triadic relation, as we shall better understand later. In general, however, we may say that for an event there is requisite: first, a contradiction; second, existential embodiments of these contradictory states; [third,] an immediate existential junction of these two contradictory existential embodiments or facts, so that the subjects are existentially identical; and fourth, in this existential junction a definite one of the two facts must be existentially first in the order of evolution and existentially second in the order of involution.”(CP 1.493)
  • “A Sinsign (where the syllable sin is taken as meaning “being only once,” as in single, simple, Latin semel, etc.) is an actual existent thing or event which is a sign. It can only be so through its qualities; so that it involves a qualisign, or rather, several qualisigns. But these qualisigns are of a peculiar kind and only form a sign through being actually embodied.” (EP Nomenclature of Triadic; p 291) That is, a sinsign is either an existing thing or event; further, events have attributes.
  • “Another Universe [Secondness] is that of, first, Objects whose Being consists in their Brute reactions, and of, second, the facts (reactions, events, qualities, etc.) concerning those Objects, all of which facts, in the last analysis, consist in their reactions. I call the Objects, Things, or more unambiguously, Existents, and the facts about them I call Facts. Every member of this Universe is either a Single Object subject, alike to the Principles of Contradiction and to that of Excluded Middle, or it is expressible by a proposition having such a singular subject.” (EP p 479) The latter is an event.
[5] Under Particulars, the instantiation of qualities (making them subjects as opposed to unformed potentiality) is the Firstness.
[6] M.K. Bergman, 2016. “A Foundational Mindset: Firstness, Secondness, Thirdness,” AI3:::Adaptive Information blog, March 21, 2016.
[7] See, for example, A. Nicolaidis, 2008. “Categorical Foundation of Quantum Mechanics and String Theory,” arXiv:0812.1946, 10 Dec 2008. It is not unusual to see grand claims for the foresight exhibited by Peirce in his writings, which sometimes have an inadequate basis for the claims of prescience. However, Peirce’s general observations often pre-date current, modern interpretations, even if not fully articulated.
[8] M.K. Bergman, 2016. “Threes All the Way Down to Typologies,” AI3:::Adaptive Information blog, October 13, 2016.
Posted:May 10, 2017

AI3 PulseThis is the Place to Start with the Academic Literature

Dan Roth and his former post-doc, Yangqiu Song, yesterday released a major paper on machine learning with knowledge bases, “Machine Learning with World Knowledge: The Position and Survey[1]. This 20-page paper with 250 references is a goldmine of starting points and a useful organizational schema for how to look at various machine learning applications and methods based on large-scale knowledge bases. The paper covers the exact territory that we refer to as knowledge-based artificial intelligence, or KBAI.

I always have my eye out for papers by Roth. Both he and his colleague at the University of Illinois Urbana-Champaign, Jiawei Han, publish well-thought and practical papers in the areas of knowledge representation and data mining. Various groups at Illinois also offer open-source software resources useful to these tasks. These efforts, in my view, are some of the best available worldwide.

The authors are proponents for the use of knowledge bases in machine learning for the same reasons we are: “Two essential problems of machine learning are how to generate features and how to acquire labels for machines to learn. Particularly, labeling large amount of data for each domain-specific problem can be very time consuming and costly. It has become a key obstacle in making learning protocols realistic in applications.” The authors then go on to address in specifics how knowledge bases can overcome these problems.

Besides the valuable reference listing, the other real contribution of the paper is how it frames and organizes what roles knowledge bases can play in AI. Like the paper’s problem statement, the authors organize their presentation around features and labels. They discuss the use and role of various techniques in relation to machine learning applications. The way they structure their presentation should be a help to those new to KBAI and the variety of terminology inherent to the field. Based on my own experience, I find their characterizations and guidance to be spot on.

I really have only one bone to pick with this otherwise excellent paper. No where do the authors discuss the quality, coherence or accuracy of the underlying knowledge bases used to perform these tasks. Many of the cited KB sources have known quality problems, some of which we have discussed before, such as Wikipedia (coverage and category structure), YAGO (reliance on WordNet), Freebase (highly variable quality and no longer maintained), Cyc (questionable upper ontology and mis-assignments), DBpedia (simplistic schema), etc. One of the major reasons for our efforts with KBpedia is to continue to work to create cleaner training environments suitable to machine learning so as to reduce the GIGO problem.

Still, quibbles aside, this paper will prove highly useful to anyone interested in distant supervised machine learning and knowledge-based artificial intelligence. For the foreseeable future, this paper should be a standard reference in your KBAI library.


[1] Yangqiu Song and Dan Roth, 2017. “Machine Learning with World Knowledge: The Position and Survey,” arXiv:1705.02908, 8 May 2017.