Herding, Culling, and Caging Predicates for Knowledge Graph Relations
Lists and bags and sets are jumbles of items that I find aberrant and abhorrent. So when I see people blithely invent and bandy about this knowledge graph predicate and that ontological predicate, I instinctively want to herd the predicates together, cage them in groups, cull the weakest among them, and try to tame the best so they make some sort of sense – and eventually do my bidding. Words have work to do and predicates are the strongest among them.
This urge to organize and quest for coherence is nothing new. Centuries of experience show very clearly that the key to progress in understanding is to continuously and recursively “unpack” the things we study and not treat them only as unanalyzed wholes. It's already perfectly clear that when we want to compare, group, or organize things of any kind, we need to identify and leverage their features, their attributes, their relations with others. Analysis (or the more trendy “featurization”) is essential.
Let's look at knowledge graph predicates and relations in this light.
More often than not we rely on strings – database column labels, sentences, keywords, definitions, etc. – as a starting point for creating structured knowledge. Some of those strings label entities; others represent relations between those entities. For today, I want to call “predicates” those strings (like verbs, adjectives, conjunctions) that represent some kind of conceptual relation or attribute. They are particularly important because we use predicates to describe and define entities.
Predicates are the language of description, labels on the bedrock of knowledge, pointers to how to ground our concepts in perception.
But like other strings, predicates are inherently vague and ambiguous: labels that are often clear to their creators but not to their consumers.
We label things based on their features. We group things based on their features. We organize things based on their features. And though we're more used to analyzing and featurizing concrete physical things, abstract conceptual relations (and the predicates that convey them) are no different. However unfashionable it might be at the moment, doing our own analysis – featurizing the items that we want to understand and model – is crucial for deep understanding, for robust matching, and for reliable inference.
When we analyze and featurize entities like concrete physical things, we use several kinds of features (which are not mutually exclusive):
We can do the same thing to analyze and featurize knowledge graph relations.
Once our relations are featurized more explicitly, we can use these definitions to group and organize the predicates that convey them. In so doing, we adopt a more reliable, systematic approach to relation resolution – just as biologists have done for centuries to “resolve” the wildly variable names that people give to living beings.
If for simplicity we focus initially on the components of conceptual relations, we can attend to the signature of each relation (the types of nodes or entities it connects by definition) and identify an initial array of fairly common families of relations:
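To make the idea of a signature concrete, here is a minimal sketch in Python of how such families might be documented; the family names and entity types below are illustrative assumptions, not a fixed inventory:

```python
# A minimal sketch of relation "signatures": for each relation family, the
# entity types it is allowed to connect by definition. The families and type
# names below are hypothetical illustrations, not a definitive list.

from dataclasses import dataclass

@dataclass(frozen=True)
class Signature:
    subject_type: str   # type of the node the relation points from
    object_type: str    # type of the node the relation points to

# Hypothetical families of relations, each documented with its signature.
RELATION_FAMILIES = {
    "part_of":       Signature("PhysicalObject", "PhysicalObject"),
    "member_of":     Signature("Person",         "Organization"),
    "located_in":    Signature("Entity",         "Place"),
    "causes":        Signature("Event",          "Event"),
    "has_attribute": Signature("Entity",         "Attribute"),
    "occurs_during": Signature("Event",          "TimeInterval"),
}

for name, sig in RELATION_FAMILIES.items():
    print(f"{name}: {sig.subject_type} -> {sig.object_type}")
```

Writing the signature down explicitly means it can be read by people and checked by machines, rather than living only in the head of whoever coined the predicate.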
Why bother?
When we extract relations from text, based on highly variable predicates, we can use the families above as initial criteria for identifying candidates and for validating the relations we extract. Predicates from different families will be less likely candidates for a particular relation. And we expect the signature of a predicate to closely match the signature of the relation it best resolves to. In addition, the more features of relations (like their signature) that we can reliably identify and document, the richer and more robust the kinds of reasoning we can simulate in algorithms.
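As a rough illustration, signature matching can be turned into a simple validation step for extracted candidates; the Candidate fields, type labels, and relation names below are assumptions made for the sketch:

```python
# A minimal sketch of signature-based validation for relations extracted from
# text. The extractor output format, type labels, and relation names are
# illustrative assumptions.

from dataclasses import dataclass

@dataclass(frozen=True)
class Signature:
    subject_type: str
    object_type: str

# Hypothetical relation families with their signatures.
RELATION_FAMILIES = {
    "member_of":  Signature("Person", "Organization"),
    "located_in": Signature("Entity", "Place"),
}

@dataclass
class Candidate:
    subject: str
    subject_type: str
    predicate: str           # raw predicate string found in the text
    resolved_relation: str   # relation the predicate was resolved to
    obj: str
    object_type: str

def validate_candidate(c: Candidate) -> bool:
    """Accept a candidate only if its node types match the signature
    of the relation its predicate resolves to."""
    sig = RELATION_FAMILIES.get(c.resolved_relation)
    if sig is None:
        return False  # unknown relation: flag for human review
    return (c.subject_type, c.object_type) == (sig.subject_type, sig.object_type)

good = Candidate("Alice", "Person", "works at", "member_of", "Acme Corp", "Organization")
bad  = Candidate("Alice", "Person", "born in",  "member_of", "Paris",     "Place")

print(validate_candidate(good))  # True: (Person, Organization) fits member_of
print(validate_candidate(bad))   # False: (Person, Place) violates member_of
```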
Some technologists see little use in distinguishing between strings like verbs or adjectives (predicates) and abstract conceptual relations, so they use many different predicates to represent the same relation. But multiple representations for the same relation defeat the purpose of representation in the first place – they simply create more technical debt. And use cases that require better accuracy and higher reliability demand a way to decide which predicates are similar or the same – they can't avoid relation resolution. Defining relation types as above helps to guide and validate this process.
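A minimal sketch of what relation resolution might look like in code, assuming a hand-curated table of predicate variants; the specific predicates and canonical relation names are illustrative:

```python
# A minimal sketch of relation resolution: many surface predicates map to one
# canonical relation, so downstream consumers see a single representation.
# The predicate variants and relation names below are illustrative assumptions.

from typing import Optional

CANONICAL_RELATION = {
    # normalized surface predicate -> canonical relation
    "works for":      "member_of",
    "employed by":    "member_of",
    "is employee of": "member_of",
    "part of":        "part_of",
    "component of":   "part_of",
}

def resolve_predicate(predicate: str) -> Optional[str]:
    """Normalize a predicate string and map it to a canonical relation,
    or return None so an unknown predicate can be reviewed and documented."""
    key = " ".join(predicate.lower().split())
    return CANONICAL_RELATION.get(key)

print(resolve_predicate("Employed   by"))  # member_of
print(resolve_predicate("painted by"))     # None: needs curation
```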
Different kinds of structured knowledge (taxonomies, ontologies, knowledge graphs, etc.) include some but often not all of these relation types – they vary in expressivity. So the relation types above can help us choose which techniques to use for each use case.
AI Safety requires us to convey principles and guidelines to algorithms – and to verify how they have been “understood”. Kinds of structured knowledge that include more of these families of relations allow us to express and verify a wider range of concepts.
This is only the beginning. As we document more deeply and more precisely the conceptual relations that we need to represent, we can make simulated reasoning more scalable, more effective, and more impactful.
Reliable Artificial Intelligence requires reliable Artificial Knowledge.
It has to be built on a foundation of clearly featurized and structured conceptual relations.
Semantic AI @AICYC | Executive Chairman @ IKNOWit.WORLD | CEO at INTELLISOPHIC.
1 month ago
Hi Mike, The relationship categories are critical to unification of triples in reasoning logic. Great to see you are on it. A piece of cake (all are the same), an engine part (most are different), paint on a surface (differ in color) illustrate the subtle issues you raise. The purpose is to accurately automate reasoning systems. Thanks for the like. https://aicyc.org/2024/10/05/how-sam-thinks/
TL;DR: Semantic AI Models (SAM) and Large Language Models (LLM) work together to create a distributed inference system that mimics human cognition. SAM extracts facts and concepts from text to build a knowledge graph. It then directs the LLM to represent this knowledge in second-order logic (SOL) expressions. These SOL expressions can be computed efficiently using fuzzy logic operations like MIN and MAX in a cloud environment like AWS S3 with Hadoop MapReduce. This allows the knowledge graph to reason and infer new knowledge similar to how humans think, bringing us closer to artificial general intelligence (AGI). The approach is contrasted with neuro-symbolic AI, which is less transparent and harder to guarantee correctness compared to the explicit logic used by SAM and LLMs.
Founder @ The Cyber Boardroom, Chief Scientist @ Glasswall, vCISO, vCTO and GenAI expert
1 month ago
Hi, great article, I really like the richness of those predicates, and I completely agree that they are critical. What I found is that:
a) it is very important to have the reverse path for each predicate ('is parent of' and 'is child of')
b) this needs to be created bottom up (organically by the consumers of the graphs), not top down
c) it's ok to have lots of redundancy and very similar predicates (reflecting the reality that different teams, environments, cultures, areas of expertise and roles have different names or verbs (aka predicates) for the same thing)
d) given good feedback loops and REPLs for those graph curators, this will actually create really good, natural, and easy-to-understand bottom-up ontologies and predicate lists (better than anything that would be created top-down)
Technical Content Developer at US Pharmacopeia
1 month ago
So much relies on untrendy concepts like "standardization" and "requirements". A knowledge graph isn't a work of art, it's a tool serving a purpose. Fail to observe that now, and you'll soon find yourself in a working group developing a "thesaurus of predicates".
Disambiguation Specialist
1 month ago
Mike Dillinger, PhD - "Predicates are the strings that represent conceptual relations between entities. But (un)like other strings, predicates are inherently vague and ambiguous. So we need to featurize carefully what we think those predicates mean..." Yep!
Head of Data Science, AI & Ethics lead
1 month ago
Aberrant and abhorrent! :D Dropping KG hot takes and taking names Mike Dillinger, PhD