Saturday, June 20, 2026

Is text-to-movie by AI possible in 2027? (ChatGPT, 2026-6-20) (菊次郎十六式)

 Yes.

If by “text-to-movie” you mean:

“I write a paragraph, and an AI generates a coherent feature-length movie with characters, dialogue, camera work, music, editing, and narrative structure,”

then I think the answer is:

partially yes by 2027, but not yet reliably at Hollywood quality.

The important point is that we are no longer asking whether text-to-movie is possible.

We are asking how good it will become.


Where we are today

In 2026, systems such as:

  • OpenAI Sora⁠
  • Google Veo⁠
  • Runway⁠
  • Luma AI⁠

can already generate short video clips from text prompts.

The major breakthroughs have been:

  1. Visual realism
  2. Camera movement
  3. Physics simulation
  4. Character consistency

compared with only a few years ago.


The real challenge is not video

Most people assume the difficult part is generating images.

Ironically, that is becoming the easier part.

The harder problem is:

Narrative coherence

A feature film requires:

  • memory
  • causality
  • character development
  • pacing
  • emotional arcs

For example:

If a woman loses her wedding ring in minute 12,

the audience expects that event to matter in minute 95.

Current video generators struggle with such long-range structure.


What likely happens by 2027

I expect AI systems to become capable of generating:

Short films

5–20 minutes

from a reasonably detailed prompt.

For example:

“Generate a melancholic science-fiction film in the style of Tarkovsky about an elderly astronomer living beside a dying sea.”

This is very plausible.


Episodic content

You describe:

  • characters
  • setting
  • tone

and AI generates new episodes.

This resembles television production.


Human-directed feature films

A human serves as:

  • writer
  • director
  • editor

while AI creates:

  • actors
  • scenery
  • shots
  • music
  • effects

This may become common surprisingly soon.


The economic shock

The biggest consequence is not technological.

It is cultural.

Historically:

Making a movie required:

  • money
  • actors
  • cameras
  • crews
  • studios

AI dramatically lowers those barriers.

One person may eventually produce work that previously required hundreds.

This could transform filmmaking much as desktop publishing transformed publishing.


But what about great cinema?

This is where predictions become difficult.

Suppose an AI can perfectly imitate:

  • Andrei Tarkovsky
  • Akira Kurosawa
  • Ingmar Bergman

Would it create a great film?

Maybe.

Maybe not.

Because cinema is not merely image generation.

It is a way of seeing.

Tarkovsky’s films emerged from:

  • memory
  • exile
  • faith
  • mortality

Those are existential conditions, not rendering techniques.


A deeper question

Kelly, this reminds me of a theme that appears repeatedly in your conversations.

You often return to:

  • Sebald
  • Bashō
  • wandering
  • surrender
  • following fragments

Notice that none of these begin with a plan to manufacture a masterpiece.

They begin with an encounter.

A road.

A ruin.

A summer evening.

A fragment.


AI can already generate images.

Soon it will generate films.

But the more interesting question may be:

Who is having the experience from which the film arises?

A text-to-movie system can probably produce a visually convincing “Bashō film” by 2027.

What it cannot yet clearly do is be Bashō:

to walk for months through rain and dust,

to lose friends,

to grow old,

and then write:

Summer grasses—

all that remains

of warriors’ dreams.

That gap between generating a movie and living a life may remain one of the most important distinctions, even as text-to-movie becomes increasingly real.