Knowledge isn't communication

Our faith in the power of words and data to convey knowledge can lead us to believe that AI promotes understanding when it doesn’t.

Knowledge has always depended on words and data. But it’s easy to assume that words or data themselves encapsulate and transmit knowledge. That’s especially true as generative AI encourages users to approach knowledge evaluation and acquisition through the medium of natural language sentences.

Expertise, knowledge, and understanding are specific to the individual. Users want advice from credible experts. Yet such expertise can be a barrier to clear communication and interfere with user understanding.

Writers have long faced the challenge of bridging what they know with how they communicate that knowledge. When done poorly, it results in a chasm between the writer’s and the reader’s understanding of a topic. Experts are often lousy communicators.

Readers judge a source’s expertise by its mastery of factual details. If a person or system can answer detailed factual questions, then it has expert-level knowledge. They are credible.

But knowledge isn’t a universal standard. The notion of a knowledge base can be deceptive because it implies a scope that is relevant to everyone. Knowledge is simply externalized understanding. We can talk about collective knowledge — the sum of available information contributed by various experts. But the understanding attained from this body of knowledge will depend on the individual.

Clear writing does not necessarily create consumable knowledge. For knowledge to be usable, it must be readable. But readability does not guarantee the information is useful to an individual.

Matthew Crawford, the author of Shop Class as Soulcraft, draws a distinction between thinking-as-writing (conveying what) and thinking-as-doing (knowing how). What’s written too often is divorced from what the user needs to do. He argues:

The writers of modern technical manuals are neither mechanics nor engineers but rather technical writers. This is a profession that is institutionalized on the assumption that it has its own principles that can be mastered without the writer being immersed in any particular problem; it is universal rather than situated. Technical writers know that, but they don’t know how.

The problem, as Crawford sees it, is that the writer does not necessarily have a firsthand understanding of the topic they write about. The writer has thought about the topic abstractly, but without direct experience — what’s known as the knowing-doing gap.

When confronted with explanations that don’t make sense, the user must:

render the gibberish coherent, and he can only do this by referring it to a model he had in his own mind of how the thing works.

The key to promoting understanding is the mental model framing of the discussion. It must be accurate and intelligible to the user.

Writing generates qualified knowledge. The writing process can help writers understand a topic. It can help them clarify the relationships between issues and encourage them to develop precise terminology to describe concepts.

Writing is, without question, valuable for developing an understanding of complex issues. But the writing will only express the knowledge of the writer. It won’t necessarily convey understanding that another reader can acquire. The knowledge articulated must be further revised to match the expectations and understanding of external readers.

The writer must decouple how they arrived at a conclusion from how they communicate that conclusion to others. Their goal should not be to show their own thinking process, but to address how the reader is likely to evaluate the topic.

The writer will normally have different background knowledge and motivations than the readers seeking to understand a topic. While in some fields (scientific research, for example), both the writer and the readers will follow a common, standard process for arriving at conclusions, this pattern is not the norm in most cases. Instead, the writer is presumed to have more expertise than the reader and will be concerned with factual and logical details that won’t be relevant to the non-expert.

With LLMs, we find that responses to user prompts often replicate the expert’s explanatory logic and terminology that was expressed in the source content, even though these are inappropriate for the user.

Explanations or conclusions without rationales sound authoritarian. Users are exacting: they want expert opinions, yet don’t want to put up with expert-level explanations. At the same time, many users find explanations lacking rationales less credible. They question why they should believe what’s being said.

Public trust in professional experts has been eroding with the “democraticization” of information. People don’t like being told what to do when they feel they aren’t permitted to make the decision themselves. Users feel empowered and entitled to form their own conclusions. Yet LLMs don’t always provide an extensive rationale for their responses. The tendency of chatbots to provide “instant answers” can undercut their credibility.

Knowledge sources (justifications) must be transparent and easy to trace. Users want to know where information comes from. AI solutions make tracing sources difficult.

The major generative AI platforms provide limited links to sources in their responses. A response might contain 5-8 links, often to obvious sources such as Wikipedia. At best, these links provide a shallow overview of available knowledge.

Most chatbot responses have limited source citations. Source: Generative Pulse, survey of 25 million bot links

Dense knowledge is not necessarily clear information. Facts, rather than understanding. becomes the end goal.

Knowledge graphs represent an alternative paradigm for answering user questions. They can provide better traceability, but at the expense of poorer usability. They don’t facilitate deeper understanding.

Because of their complexity, knowledge graph implementations increasingly rely on natural language chatbot user interfaces. When using a chatbot to interact with knowledge graph data, information traceability is lost. Google’s Knowledge Panels, which aggregate information from multiple sources, require users to click links to perform new searches to trace the origins of the data shown.

Google Knowledge Panel: most links send users to another search, which might yield another knowledge panel. The answers and links raise more questions than they answer. To improve ease of use, traceability is sacrificed.

Just because the writer can explain something doesn’t mean they understand it. Scripts become the end goal, rather than understanding.

It’s easy to make statements and to come up with reasons why readers should believe what you’ve said. But harder to justify statements with a sound rationale. Many assertions fail to withstand scrutiny.

Writers and AI platforms are both aware that credibility depends on providing a rationale. Writers have long used formulas such as “six reasons why….” to buttress their assertions. Professional writers might follow a template that outlines methods, procedures, and evidence before presenting conclusions. AI platforms reproduce these formulas in their responses.

It’s tempting for users to see formulaic explanations as valid because they match expectations. But in many cases, especially with LLM responses, the justification is developed after the conclusion is generated. Chatbots explain things without understanding them.

A widespread myth about LLMs is that they can reason. Vendors promote the “Thought-Action-Observation Loop” technique and ReAct (Reasoning/Acting) prompting, suggesting that LLMs are reasoning. These techniques build on a prompt-engineering approach called Chain-of-Thought (CoT) reasoning, which supposedly shows the thought process of the LLM. However, more detailed research indicates that these measures are largely proformative. They slow down responses without yielding appreciable improvements in answer quality. On the contrary, the techniques can make answers less consistent.

The core problem is misplaced trust: CoTs can appear persuasive even when they do not faithfully reflect a model’s actual decision process

— Chain-of-Thought Is Not Explainability

Explanations can be myths. Plausibility is the end goal, not truth.

To enhance perceived credibility, communications aim to sound expert and appear to be based on a comprehensive review of available information.

But mimicking common arguments does not make the arguments valid. Few people apply critical reading online, parsing claims, reasons, and evidence.

One of the most reliable and relatable forms of communication is the story. Stories often rely on reasoning by analogy. Chatbots have access to countless examples of material that are vaguely similar to the user’s prompt. We shouldn’t be surprised if we start seeing chatbot explanations including sports or TV show analogies.

— Michael Andrews