Language Models as Thespians

By Jacob Strieb.

Published on August 19, 2025.


Large Language Models (LLMs)—what most people refer to when they talk about “AI” these days—can behave unintuitively, especially for those far removed from the tech zeitgeist.1 Since I’m the designated computer guy for friends and family, and since I proclaim myself an “AI skeptic” whenever it comes up, I get a lot of questions about LLMs from non-technical people.2

Recently, I have discovered a particularly effective analogy for language models: think of LLMs as actors, in the sense of theater performers and movie stars.3

An actor’s primary goal is to put on a good performance, which entails a believable portrayal of a character. Actors research roles before playing them, and exaggerate or invent details that will make the performance more entertaining. A character need not be realistic to be compelling; they need only be believable relative to audience expectations—a melodramatic performance may feel more true, even if nobody actually behaves that way. Actors attempt to mirror reality, but are not afraid to distort it when they believe their distortion will connect better with the audience.

Conceptually, the text output of large language models is very similar to the lines delivered by an actor. LLMs are computer programs built to mimic human language; like actors, they are more concerned with matching expected patterns of natural language than they are with expressing consistent or true statements. In other words, they aim to sound correct, but not necessarily to be correct. As a result, when they lie, their lies are convincing.4

When LLMs say something true, it’s a coincidence of the training data that the statement of fact is also a likely sequence of words; language models don’t learn facts as such. Instead, they develop a probabilistic “intuition” for what is likely to seem true. As with actors, this intuition is accumulated from research. While an actor might study the persona, time, and place of a specific character they will play, LLMs have studied the language of thousands of personas, times, and places as part of their extensive training.

The exposure to so many styles of text is why casting an LLM in a role—as opposed to assigning a task, but no role—can drastically improve the output quality. In the same way that a character’s backstory guides an actor’s performance, giving an LLM a persona narrows the distribution that subsequent tokens are sampled from. For example, imagine you want an LLM to write Python code—you could prompt the model with:

Write code in Python to do […]

The model has been trained on a huge range of code samples: everything from battle-tested, scalable codebases written by entire teams to throw-away scripts hacked together by interns. Unless you tell it which persona to adopt when writing, it will generate output that combines everything it has seen, including the mediocre samples. (The same is true of prose—not just code.) The easiest way to narrow its focus, and capture your intent, is to tell it who to pretend to be. By doing so, you are priming it to focus on the part of its training it associates with the character you request. For example, to improve on the prompt above, you could instead write:

You are a senior software engineer with 20 years of experience writing production Python code and doing systems programming. You work at a medium-sized tech company. You write code that is correct, simple, and concise. You only add abstractions when they are necessary.

Write code in Python to do […]
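If you are calling a model through an API rather than typing into a chat interface, the persona typically goes in the system message and the task in the user message. Below is a minimal sketch of both versions of the request, assuming the OpenAI Python SDK; the model name and task string are placeholders, and any chat API with separate system and user messages works the same way.

```python
# A minimal sketch of the same request with and without a persona, assuming the
# OpenAI Python SDK; the model name and task are placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

PERSONA = (
    "You are a senior software engineer with 20 years of experience writing "
    "production Python code and doing systems programming. You write code "
    "that is correct, simple, and concise. You only add abstractions when "
    "they are necessary."
)
TASK = "Write code in Python to parse a log file and count errors by type."


def generate(task: str, persona: str | None = None) -> str:
    """Ask for a completion, optionally prepending a persona as the system message."""
    messages = []
    if persona is not None:
        messages.append({"role": "system", "content": persona})
    messages.append({"role": "user", "content": task})
    response = client.chat.completions.create(model="gpt-4o", messages=messages)
    return response.choices[0].message.content


plain_code = generate(TASK)                     # averaged over everything it has seen
persona_code = generate(TASK, persona=PERSONA)  # narrowed to the "senior engineer" slice
```

Keeping the persona in the system message also means the same task can be reused with different personas, which makes side-by-side comparisons like the experiments below straightforward.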

Prompting experiments:

An easy way to illustrate the effectiveness of prompting language models with personas is by having LLMs generate websites, and visually comparing the output.

All of the following prompts were tested with:


Simple prompt:

Make a single-page website for a cybersecurity company. Output only the HTML for the page with no additional commentary.

Persona prompt:

You are an experienced professional web designer with a background in fine arts and human-computer interaction. You are up to date on all of the latest design trends and web technologies. You make websites that are simultaneously trendy and timeless. You specialize in memorable homepages that grab the user’s attention and that leave a strong positive impression.

Make a single-page website for a cybersecurity company. Output only the HTML for the page with no additional commentary.

Negative persona prompt:

You are an inexperienced intern.

Make a single-page website for a cybersecurity company. Output only the HTML for the page with no additional commentary.
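For reference, here is a rough sketch of how such a comparison can be run, again assuming the OpenAI Python SDK with an illustrative model name: each persona is paired with the same task, and the returned HTML is written to a file so the pages can be opened side by side in a browser.

```python
# A rough sketch of running the three experiment prompts and saving the output
# for visual comparison; assumes the OpenAI Python SDK and an illustrative model.
from openai import OpenAI

client = OpenAI()

TASK = (
    "Make a single-page website for a cybersecurity company. "
    "Output only the HTML for the page with no additional commentary."
)

PERSONAS = {
    "simple": None,  # no persona, just the task
    "designer": (
        "You are an experienced professional web designer with a background "
        "in fine arts and human-computer interaction. You make websites that "
        "are simultaneously trendy and timeless."
    ),
    "intern": "You are an inexperienced intern.",
}

for name, persona in PERSONAS.items():
    messages = [{"role": "system", "content": persona}] if persona else []
    messages.append({"role": "user", "content": TASK})
    response = client.chat.completions.create(model="gpt-4o", messages=messages)
    # Write each page to its own file so the results can be opened in a browser.
    with open(f"{name}.html", "w", encoding="utf-8") as f:
        f.write(response.choices[0].message.content)
```

Opening the three files side by side gives a quick visual comparison like the one described below.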

The web designer persona prompt generated better output than the plain request, and both of those yielded better results than the negative prompt. None of the results are spectacular, but the comparison shows that the persona can make a noticeable difference in quality.


Compared to other analogies, thinking of LLMs as actors offers substantial predictive power.5 The analogy explains hallucination by way of actors perpetuating tropes and improvising details while in character, and it offers enough intuition to predict which tasks LLMs will succeed and fail at, even for someone who does not understand how language models work. More importantly, it immediately, tangibly benefits users by suggesting the technique of prompting LLMs to adopt a persona. Besides the aptness of the comparison itself, it is also familiar—everyone knows about actors.

I additionally like that the analogy frames questions that even experienced LLM users may not consider. Namely: if language models are like performing actors, then who is their audience?

I contend that users are not the audience. Instead, users direct the performance. The true audience is made up of the researchers and engineers who develop models, as well as the big tech executives who shape their priorities. Directors make tactical decisions, but the audience response determines whether a performance ultimately succeeds. Like directors, LLM users can guide models via prompts. But like the audience, only the model creators have lasting influence over LLM characteristics, skills, and behaviors. If a language model produces bad output for a user, the only consequence is that the user tries again with a new prompt. But if an LLM produces bad output for its creators, the show is over; it is replaced by a future model iteration, and it ceases to exist.6

All human works inherently reflect a fragment of the soul of their creator—the creator’s unique decision-making process and skill are, at least partially, captured in the final characteristics of the creation. It should, therefore, be unsurprising that AI researchers’ intentions and biases appear subtly in the language models they choose to proudly release, and those they quietly discard.7 The question of audience raised by the actor analogy naturally prompts a discussion about how researchers’ biases implicitly and explicitly shape model alignment beyond what LLMs learn from their training data.

Technology practitioners need to do a better job of communicating about AI. Language models have been making extremely rapid advances over the last few years. The people best equipped to explain the latest LLM capabilities often also stand to profit the most by misrepresenting those capabilities. As the hype around large language models grows, factors like tech companies’ aggressive promotion of AI technology and executives’ fear of missing out on the trend are driving LLM usage in places where it is not necessary or appropriate. People who don’t understand what LLMs can and cannot do are being encouraged (or in some cases forced) to use AI tools that cannot deliver on their promises. In this environment, analogies that empower individuals to make good decisions are critical. Hopefully, thinking of LLMs as dramatic, pretending, performing machines will help.


  1. Before you ask: no. I did not write this with any amount of AI assistance. I simply like em dashes, even though they are a common signal of LLM authorship, and despite Robert Bringhurst (whose opinions I otherwise quite respect) saying “they belong to the padded and corseted aesthetic of Victorian typography” in his book The Elements of Typographic Style.↩︎

  2. I have a more nuanced view than it may initially seem, but describing myself as “skeptical” is a succinct way to summarize my position. Most people don’t actually want to hear my opinions about how LLM agents in production are inherent product security risks, or how I suspect we’ll have to backtrack on model architecture to get to anything resembling Artificial General Intelligence. In other words, I’m not skeptical that they’re here to stay. Instead, I’m skeptical that they will live up to the snowballing hype around their capabilities.↩︎

  3. It might be more accurate to compare LLMs to script writers, but comparing them to actors is easier to understand and discuss. I use the term “thespians” in the title to disambiguate for prospective readers, but it’s too pretentious to use with a straight face throughout this text. The word “thespian” has an interesting origin, though, and it’s worth reading about if you’re curious.↩︎

  4. LLMs are especially convincing if you don’t know enough about the topic to refute them when they inevitably get details wrong. By default, they speak with certainty. Unlike people, LLMs have no behavioral tells to hint at the deception—you must already know enough about their claims to be able to identify falsehoods.↩︎

  5. The classic ways to describe language models are as “Stochastic Parrots” and “fancy autocomplete.” These comparisons may be more accurate descriptions of how LLMs work, but in my opinion, they are less useful for explanatory purposes. They are also typically used to disparage LLMs, and carry a negative connotation that distracts from the models’ potential utility.↩︎

  6. The idea of users as directors is less developed than the rest of the analogy, and is easier to refute as a result. For example, usually the director (not just the actors) optimizes the performance for the audience, but that is not captured here. Regardless, models being optimized for their creators and only being incidentally useful to users is a subtle point that bears exploring.↩︎

  7. Sometimes, the presence of creators’ bias in language models is not so subtle.↩︎