Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Creative Work

Let me be clear about what this chapter is not. It is not about using AI to generate your novel, compose your symphony, or design your poster. If that is what you want, there are plenty of tutorials and they will serve you well until the output starts to feel like everything else that was generated the same way — which, in my experience, takes about three weeks.

This chapter is about something harder and more interesting: using AI to reach creative ideas you genuinely could not have reached alone, and then doing the creative work yourself.

The distinction matters. When you ask an AI to write your story, you get a story. When you use an AI to break your own creative fixation, you get a you who can write a better story. One of these is a shortcut. The other is a cognitive tool. We are interested in the tool.

The Shape of Creative Stuckness

Every experienced creative knows the particular texture of being stuck. It is not the same as not having ideas. Often it is the opposite — you have too many ideas, all of them variants of things you have done before, and somewhere in the back of your mind you know they are not quite right but you cannot articulate why they are not right or what right would even look like.

Psychologists call this functional fixedness when it applies to objects and design fixation when it applies to creative work. We covered the general mechanism in Chapter 3. But creative fixation has a specific cruelty to it: your taste exceeds your reach. You can feel that your current direction is stale, but the alternatives your mind generates are all drawn from the same well. You are trying to escape a room using a map of the room.

This is precisely the situation where the techniques from Part III earn their keep. Not because AI has better taste than you — it does not — but because AI can generate perturbations from outside your habitual pattern space. It can suggest things that are wrong in ways you would never be wrong, and occasionally, in being wrong in a new direction, it points you toward something genuinely right.

Adversarial Brainstorming for Narrative

Consider a novelist stuck on a plot. She is writing a thriller, and her protagonist needs to discover the conspiracy through a series of revelations. She has outlined the beats, and they are competent. They are also, she suspects, exactly what any reader of the genre would predict. The structural skeleton feels like it was assembled from parts of other thrillers she has read.

The standard brainstorming approach — “give me ten alternative plot structures” — will produce ten variations on the same theme, because the AI is also drawing from the corpus of thrillers. What she needs is not more ideas from within the genre. She needs a hostile reader who will articulate precisely why her current structure feels predictable, and then she needs that hostile reader to suggest structural alternatives drawn from outside the genre entirely.

Here is a prompt that actually works:

You are a literary critic who is deeply skeptical of genre fiction and believes that thriller conventions actively prevent interesting storytelling. Read this plot outline and:

  1. Identify every beat that follows a predictable genre pattern. Be specific and merciless about which convention each beat is following.
  2. For each predictable beat, suggest an alternative approach drawn from a completely different narrative tradition — literary fiction, folklore, memoir, documentary, theater, or any tradition that is NOT thriller/mystery/suspense.
  3. Explain why the alternative might actually serve the story’s themes better than the genre-conventional approach.

Here is the outline: [outline]

The results are not publishable plot points. They are perturbations. When one of my collaborators tried this, the critic identified that her “protagonist discovers a hidden document” beat was the single most overused revelation mechanism in the genre. The suggested alternative, drawn from oral history traditions, was to have the conspiracy revealed through contradictions in different characters’ casual memories of the same event — not a dramatic discovery, but a slow accumulation of inconsistencies that the reader notices before the protagonist does.

She did not use that suggestion directly. But it broke her fixation on the “discovery” model of revelation and led her to a structure where the protagonist’s understanding shifts not through finding things but through reinterpreting things she already knew. That was the novel she actually wanted to write. She just could not see it from inside the genre’s conventions.

The Before and After

Before (genre-conventional): Protagonist finds a classified file in a dead informant’s apartment. The file reveals the scope of the conspiracy. She realizes she is in danger.

After (post-perturbation): Protagonist attends the dead informant’s funeral and hears three people tell stories about him that cannot all be true. She does not find a file. She finds a pattern — and the reader finds it two chapters before she does, creating a different and more unsettling kind of tension.

The AI did not write the second version. The AI made the first version feel obviously insufficient by articulating, from an alien critical perspective, what was wrong with it. The novelist’s own judgment and craft did the rest.

Conceptual Blending for Visual Language

Design fixation is, if anything, more pernicious than narrative fixation because visual languages are deeply habitual. A designer who has spent years working in a particular aesthetic develops what amounts to a visual accent — characteristic ways of handling space, color relationships, typographic hierarchies. This accent is their strength until it becomes their cage.

Conceptual blending, as we discussed in Chapter 13, works by forcing connections between domains that do not normally touch. For visual work, this means asking the AI to describe visual principles from domains the designer has never considered as sources of visual language.

A concrete example. A brand designer was developing the visual identity for a marine biology research institute. Her initial concepts were, predictably, blue. Oceanic. Clean sans-serif type. Tasteful photography of sea creatures. Perfectly competent. Also indistinguishable from every other marine science brand she had ever seen.

The prompt that broke her out:

I’m designing a visual identity for a marine biology research institute. I’ve fallen into the obvious oceanic visual language — blue palette, clean modernism, nature photography. I need to find a completely different visual approach that still communicates “serious marine science.”

Describe the visual principles of each of the following domains, then explain how those principles could be applied to this brand identity:

  1. Soviet-era scientific illustration
  2. Traditional Japanese fish market signage
  3. Deep-sea bioluminescence (as a color system, not as imagery)
  4. Victorian-era naturalist field notebooks
  5. Weather radar data visualization

She did not use any of these wholesale. But the description of bioluminescence as a color system — light emerging from darkness, a palette built on black with specific luminous accents rather than the conventional white-with-blue — gave her a fundamentally different starting point. The final identity used a dark palette with precise, luminous specimen illustrations that referenced both bioluminescence and the tradition of scientific illustration against dark backgrounds. It looked like nothing else in the marine science space, and yet it communicated exactly what it needed to communicate.

The key insight: she did not ask the AI to design anything. She asked it to describe visual principles from unfamiliar domains. The blending — the creative act — happened in her own mind when she read those descriptions and felt one of them resonate.

When Blending Fails

I should be honest about the failure mode. About half the time, conceptual blending produces connections that are merely weird. “Apply the visual principles of competitive barbecue to your marine biology brand” is a perturbation, but it is not a useful one. The designer’s judgment is what separates a productive collision from an arbitrary one. The technique works not because every blend is good, but because you only need one to break your fixation, and generating five candidate blends takes three minutes.

The failure mode to watch for is not bad blends — you will recognize those immediately. It is plausible blends that feel novel but are actually just unfamiliar cliches from the source domain. “Apply Japanese aesthetics to your brand” sounds fresh until you realize you have just reinvented the minimalist-zen visual language that has been a design cliche since 2010. The remedy is specificity: not “Japanese aesthetics” but “the specific visual conventions of Tsukiji fish market signage in the 1970s.” The more specific the source domain, the less likely you are to land in someone else’s well-worn territory.

Constraint Injection for Style Breaking

Every creative practitioner develops default moves. A musician reaches for the same chord voicings. A writer deploys the same sentence rhythms. A photographer frames shots from the same angles. These defaults are not weaknesses — they are the foundation of a personal style. But there comes a point where a style becomes a rut, and the difference between the two is whether you are choosing your defaults or merely repeating them.

Constraint injection, which we explored in Chapter 12, is brutally effective for breaking stylistic defaults because it makes them impossible rather than merely inadvisable. Willpower is not enough to break a well-practiced habit. Structural impossibility is.

A songwriter who always writes in 4/4 time, in major keys, with verse-chorus-verse structures does not need someone to tell her to try something different. She knows. She has tried. She ends up back in 4/4 every time because that is where her musical instincts live. What she needs is a set of constraints that make her defaults structurally unavailable:

I’m a songwriter stuck in my own conventions: 4/4 time, major keys, verse-chorus-verse structure, piano-driven arrangements. Generate a set of five creative constraints for my next song that make my defaults impossible while still allowing for something musical and emotionally compelling. Each constraint should:

  1. Explicitly forbid one of my defaults
  2. Suggest a specific alternative (not just “do something different”)
  3. Include a reference to an existing song or artist that successfully uses this alternative, so I can hear what it sounds like

A set of constraints that emerged from this approach:

  • Time signature: 7/8 (reference: Radiohead’s “2+2=5” for how 7/8 can feel urgent rather than academic)
  • Tonality: Dorian mode instead of major (reference: “Eleanor Rigby” for how Dorian creates melancholy without the heaviness of minor)
  • Structure: Through-composed, no repeated sections (reference: Joanna Newsom’s “Emily” for how narrative drive can replace structural repetition)
  • Lead instrument: Voice and a single sustained-tone instrument (cello, organ, or harmonium) — no piano, no guitar
  • Lyrics: No first person. Every line describes an observed scene, not an internal state

The songwriter does not have to follow all five. Even following two of them puts her in unfamiliar enough territory that her habitual moves stop working. She has to think about every decision instead of executing on instinct. That is the point. The constraints do not produce the song — they produce the cognitive state in which a different kind of song becomes possible.

The Prompt as Creative Catalyst, Not Creative Agent

There is a pattern in all these examples that I want to make explicit because it is the central thesis of this chapter.

In every case, the prompt is designed to produce raw material for the creator’s judgment, not finished creative output. The novelist gets a critique and a set of structural alternatives, not a rewritten plot. The designer gets descriptions of visual principles from unfamiliar domains, not a mood board. The songwriter gets constraints, not a melody.

This is not modesty about AI’s creative capabilities. It is a practical observation about where the value lies. The creative act is not generating possibilities — humans and machines can both do that tolerably well. The creative act is recognizing which possibility is the right one, and that recognition depends on everything you are: your taste, your experience, your emotional response, your understanding of your audience, your sense of what has been done before and what has not. That recognition is yours. It is the part that cannot be automated, and it is the part that matters.

What AI does, in the framework of this book, is expand the space within which your recognition operates. If you can only see possibilities A through E, your judgment can only choose among A through E. If a well-crafted prompt surfaces possibilities F through Z — including many that are terrible — your judgment now has more to work with. The quality of the judgment does not change. The range of options it operates on does.

Working with AI on Long-Form Creative Projects

Short perturbations are one thing. What about sustained creative projects — a novel, a film, a design system — where you need to maintain coherence over weeks or months while still using AI to push your thinking?

The practical answer is to use AI at specific decision points rather than continuously. The moments where AI-augmented thinking is most valuable in long-form creative work are:

1. The initial concept phase, where you are choosing among directions and the risk of fixation is highest because you have not yet committed to anything. This is where adversarial brainstorming and conceptual blending earn their highest returns.

2. Structural inflection points, where you have been executing on a direction and need to make a major decision — a plot turn, a design system extension, an architectural choice in a composition. These are moments where your accumulated momentum creates fixation risk.

3. The “something is wrong” moments, where you can feel that a piece is not working but cannot articulate why. Here, the Socratic interrogation techniques from Chapter 14 are invaluable — not asking the AI what is wrong, but using the AI to ask you what is wrong in ways that surface your own tacit knowledge.

4. The revision phase, where you need to see your own work from outside. The alien perspectives from Chapter 11 are powerful here: ask the AI to read your work as a specific kind of critic, not to fix it but to reveal what someone with a fundamentally different aesthetic framework would see in it.

Between these decision points, you work. You write, you design, you compose. The AI is not a collaborator in the romantic sense. It is a cognitive tool that you pick up when you need to see around a corner that your own mind cannot see around, and you set down when the work requires execution, craft, and sustained artistic judgment.

A Worked Example: Escaping a Visual Rut

Let me walk through a complete example at enough length to show the full cycle of perturbation, recognition, and creative development.

A photographer specializing in architectural photography had been shooting the same way for years: dramatic angles, high contrast, strong geometric composition, monochrome or desaturated color. Her portfolio was striking. It was also, she realized, completely predictable to anyone who had seen more than three of her images. Every building looked like it was posing for the same photograph.

Step 1: Articulate the rut. Before involving any AI, she spent thirty minutes writing down what she always did. This self-diagnosis is essential — you cannot break a pattern you have not identified. Her list included: low angles looking up, strong vanishing points, removal of human presence, high contrast, desaturation, emphasis on geometric patterns, tight framing that isolates architectural details from context.

Step 2: Adversarial critique. She fed this list to an AI with the following prompt:

Here are the consistent characteristics of my architectural photography. I suspect they have become a rut rather than a style. For each characteristic, explain: (a) What cognitive or aesthetic habit it likely represents (b) What it systematically excludes or makes invisible (c) A specific counter-approach used by a photographer or visual artist known for the opposite tendency

The response was illuminating. The AI noted that her systematic removal of human presence was likely rooted in a modernist conception of architecture as pure form — but it made her photographs unable to communicate how buildings are actually experienced. It suggested looking at the work of photographers who treated architecture as a social medium — the building as a container for human life rather than a sculptural object.

Step 3: Constraint generation. She then asked for constraints:

Based on this analysis, give me a set of rules for my next shoot that make my current approach impossible. I want to be forced into a completely different way of seeing buildings.

The constraints included: every frame must contain at least one human figure; no angle may be more than 15 degrees from eye level; color must be the primary compositional element (not geometry); every image must include the building’s immediate context (street, sky, neighboring structures); nothing may be cropped tighter than a full facade.

Step 4: Selective adoption. She chose three of the five constraints for her next project: human presence required, eye-level angles, and full-context framing. She dropped the color constraint (she was not ready for that) and the facade constraint (too limiting for the buildings she was shooting).

Step 5: The shoot and the discovery. Working under these constraints, she found herself making images she had never made before. The eye-level constraint, in particular, transformed her relationship with buildings — instead of looking up at them as monumental objects, she was looking at them as a pedestrian does, which completely changed what she noticed. She started seeing the weathering at street level, the way entrances frame the people passing through them, the relationship between a building and the sidewalk cafe next to it. The human presence constraint forced her to wait for the right moment rather than the right light, which introduced a temporal dimension her work had never had.

Step 6: Integration. She did not abandon her previous style. She developed a second mode — warmer, more human, more contextual — that she could deploy when it suited the subject. The AI had not taught her this mode. It had made her own habitual mode temporarily impossible, which created the cognitive space for a different way of seeing to emerge.

The total AI interaction time was about forty-five minutes. The creative development it catalyzed took months. That ratio — brief perturbation, extended creative development — is typical. If you find yourself spending more time talking to the AI than doing creative work, you are probably using it as a collaborator rather than a catalyst, and you should revisit Chapter 18.

Limitations, Honestly Stated

AI cannot give you taste. If you do not already have a well-developed sense of what is good in your domain — a sense earned through years of practice, study, and exposure — then AI perturbations will not help you. You will not be able to distinguish the productive suggestions from the merely novel ones. This is the creative equivalent of the epistemic hygiene problem we discussed in Chapter 19: the tool is only as good as the judgment applied to its output.

AI cannot replace craft. The photographer still needed to know how to expose a frame, compose a shot, and work in post-production. The songwriter still needed to know how to write a melody in 7/8 time, which is not trivial. The novelist still needed to be able to execute a complex non-linear narrative structure. AI expanded their creative possibilities, but craft is what allowed them to realize those possibilities.

AI’s creative suggestions carry a homogeneity risk. Because large language models are trained on broadly the same corpus, their “surprising” suggestions tend to converge. If every designer uses conceptual blending prompts, they will tend to get similar blends. The remedy is specificity and idiosyncrasy in your prompts — the more your prompt reflects your particular situation, knowledge, and obsessions, the more particular the perturbation will be.

Finally, AI cannot tell you when you are done. The most important creative judgment is knowing when a piece is finished, and that requires a kind of holistic aesthetic assessment that no current AI can perform. You will know. Or you won’t, and you will keep working. That, at least, has not changed.

The Takeaway

Use AI to expand the space of creative possibilities you can perceive. Use your own judgment to navigate that expanded space. Use your own craft to realize what you find there.

The sequence is always: diagnose your fixation, perturb it with AI-generated alternatives from outside your habitual space, recognize which perturbation points toward something real, and do the creative work yourself.

The AI is the pebble thrown into the pond. The ripples are yours.