Computer gaming is about to experience one heck of an upgrade.
The advent of Large Language Models (LLMs) and Context Engineering has made coding far faster and easier, allowing anyone to create at least basic apps simply by asking for it in sufficient detail, given a sufficiently advanced model and enough tokens.
This is already profound magic in its own right, but LLMs can do much more than just code. With the right prompts, they can perform countless language-based tasks. Including storytelling. And this opens the door to the ultimate kind of Choose-Your-Own-Adventure stories: ones where your choices influence the world of the story, and the storyteller changes the tale accordingly.
I’m pretty sure that ChatGPT and other frontier models could tell a good interactive story. But can locally-hosted language models that can run on a reasonably-priced modern PC?
To find out, I coded got Codex to code a Python script to set up a LLM with the tools needed to be a “Storyrunner”. Given basic instructions on how I wanted the app to work, GPT5.5 (as the Codex agent) put together an effective sketch, saving the game state in various files. On each storytelling turn, the model is given the initial prompt plus a game journal which details everything about the current game that is known to the storyteller / Game Master. In theory, a different model (certainly a different instance of the same model) could handle each turn, reading all of the information about the current state from scratch and then deciding how to direct the next step of the story, what things and events to describe, and so on. There is a Player Journal which details everything known to the player, and a Game Journal (full of spoilers if the player were to read it.)
Model choice seems to matter a lot, and models which excel at coding don’t necessarily always make good storytellers. OpenAI’s gpt-oss-20b is a good all-around local model (and happens to fit nicely in a 12GB RTX4070 GPU). But its adventures are all AI-slop knockoffs of some generic Thomas Kinkade-meets-Tolkien fantasy world. Put lighthouses, globes, clocks, windmills, and foggy beaches on your Bingo card and you’ll win every time.
Google’s Gemma-4-31b, however, seems to be pretty capable. I’m already several collaborative pages into a “strange adventure” (as the initial prompt requested) in a foggy land (okay, first mark on the Bingo card) where Time seems to ebb and flow erratically, following something called the Drift. The feeling is one of conversing with a skilled storyteller, but communicating via carrier pigeon: the local Gemma4 model takes about ten minutes per response.
This is only going to get better, faster, and more immersive. Expect the next generation of video games to have NPCs, companions, and even simple mobs which can react intelligently (or authentically unintelligently), remember players and their actions taken in context, and develop opinions, form friendships, hold grudges, and all the rest. Gameplay will be far more immersive and dynamic than ever. Rather than having to hand-code scripts for characters to recite based on a few player choice possibilities, players will simply talk to characters, who will respond based on that character’s worldview and memories.
Imagine going on a riverboat trip with Mark Twain.
I love living in the Future.







