Back to Blog
AI Last updated: June 2026

Synthesia Put Sora 2 and Veo 3.1 in One Tab. Production Was Never the Hard Part.

EVEN Media video production

This month Synthesia dropped Sora 2 and Veo 3.1 into a single tab, and HeyGen started shipping AI avatars straight into Canva. The whole production step is collapsing into a button. Here is the contrarian read: that makes production worthless as a differentiator and hands the entire game to editorial judgment and distribution. The operator data is below.

What Actually Happened

The AI video production step just folded into a single tab. Synthesia added an AI Playground that embeds OpenAI's Sora 2 and Google's Veo 3.1 directly inside its editor, so a marketer can prompt cinematic b-roll, a product shot, or a transition and cut it straight into a video without leaving the page, without a second subscription, and without juggling an API (Synthesia). Anything generated saves automatically into the editor's media library, ready to drop into a timeline (Synthesia docs).

In the same stretch, HeyGen shipped a Canva integration that lets a design team drag an AI avatar onto any Canva design, type a script, and render a narrated video without learning a new tool (HeyGen). The Canva visual becomes the slide, the avatar becomes the presenter, and the whole thing ships at the speed a team already ships static assets.

Read those two releases together and the direction is obvious. The frontier generative models, the avatar layer, and the design surface are merging into one workspace. The job that used to need a specialist, three logins, and a render farm now needs a prompt box and a credit balance. That is not a small upgrade. It is the production bottleneck quietly dissolving.

The economics under all of it point the same way. OpenAI discontinued the standalone Sora app on April 26, 2026, with the API set to sunset on September 24, after reportedly burning roughly $15M a day against about $2.1M in lifetime revenue (eWeek). The model that defined the category could not survive as a destination. It survives as a feature inside someone else's editor. That tells you exactly where the value is settling, and it is not on the production step.

Why a Collapsing Stack Is Bad News for Tool-Led Teams

When production collapses into a button, owning the button stops being worth anything. That is the uncomfortable part for any studio or in-house team that has been selling on production capability, because the capability just became a checkbox your competitor also has by Friday.

For a decade the implicit pitch of a video team was access. We have the gear, the editor, the render pipeline, the people who know which tool does what. Every one of those advantages is being absorbed into a consumer subscription. A marketing coordinator with a Synthesia seat can now generate Veo and Sora footage, narrate it with a custom avatar, and lay it into a Canva design before lunch. The production moat is gone, and it is not coming back.

Here is what does not collapse. Knowing what video to make is still hard. Knowing which of forty possible angles will actually move a buyer is still hard. Writing a script that earns six minutes of a busy operator's attention is still hard. And getting the finished asset in front of the right people, at the right cadence, enough times to change a deal, is the hardest part of all, and it got harder this year, not easier.

When a capability commoditizes, the value does not vanish. It relocates. It moves up a layer to the editorial judgment about what to make, and down a layer to the distribution system that gets it watched. Those two layers are exactly the things you cannot add to a cart. You cannot subscribe to a point of view, and you cannot expense an audience.

The Data From Our Retainer Book

The split between production and everything else is not subtle in our own numbers, and we did not reverse-engineer this thesis to fit them. Across our retainer book, the generative-production step accounts for under 15% of the hours on a typical B2B video. Roughly 60% is pre-production editorial, the positioning, the angle selection, the scripting, and the rest is post-render distribution. The part that just got automated into a tab was already the smallest slice of the work.

We pressure-tested the consolidation directly. In a recent internal production time-study, moving our generative steps into a single consolidated platform cut render-and-handoff time by about 35%, and it moved zero of the variables that actually drove completion rate or pipeline. Every hour the merge gave back went straight into editorial and distribution, because that is where completion rate is decided. The tooling got faster. The hard part stayed exactly as hard.

One more pattern from across 20+ client engagements: the teams that produced the most-watched video were never the ones with the biggest tool stack. They were the ones with the clearest editorial point of view and the most disciplined distribution cadence. Tooling never correlated with results. Judgment and reach did.

The third-party data says the distribution layer is where the pressure is now landing. Google's AI Overviews appear on roughly 48% of queries, and only 38% of the pages cited inside those overviews still rank in the top ten, down from 76% seven months earlier (ALM Corp). Getting made is trivial. Getting surfaced and seen is the contested ground, and a render button does nothing for you there.

The Honest Counter-Argument

The strongest case against this thesis is that consolidation is a genuine, compounding advantage, and it is partly right. A team that produces in one tab really does ship faster. Fewer logins, fewer handoffs, fewer files lost between apps, and a 35% time saving is not nothing. Speed is leverage, and the consolidated stack delivers it. A skeptic would say the bottleneck did not move, it just shrank, and shrinking the production cost lets a small team produce far more.

That is true, and it still does not rescue a tool-led strategy. The speed is real but it is also universal. The same Synthesia and HeyGen and Canva releases that made you faster made every competitor faster on the identical day. A shared speedup is a price cut for the whole market, not an edge for you. It lowers the floor for everyone at once.

And producing more video is only an advantage if the extra volume is good and gets seen. Absent editorial judgment, a faster stack just manufactures more forgettable content into a feed that is already drowning in it, while the distribution surfaces get stingier about what they will surface. Speed without taste and reach is how you produce ten times the video and move the same zero deals. The merge is a gift. It is a gift to the layers above and below it, not to the layer it automated.

What to Do Monday

First, consolidate your production stack and pocket the time. Take the gift. Move your generative, avatar, and design steps into as few tools as you can, capture the 35% the merge gives back, and refuse to spend a minute of it admiring the render. The point of a faster stack is to free hours for the parts that still decide outcomes.

Second, reinvest every recovered hour into editorial. Put the time into deciding what to make and why, the angle, the script, the single argument the video exists to make. That is the layer that just became your differentiator by default, because production stopped being one. Spend there like it matters, because now it is the only thing that does.

Third, build a distribution system, not a render queue. Generating the asset is the cheap, solved, commoditized part. Getting it in front of the right buyers at the right cadence, across owned channels you control rather than feeds that can throttle you overnight, is the part nobody can rent. Treat distribution as the product, not the afterthought.

Fourth, change how you evaluate a production partner. Stop asking which tools they run, because the answer is now the same boring stack everyone has. Ask how they decide what to make, who directs it, and what happens to the video after it renders. Those three answers are the real product.

Fifth, audit your own content the way a stranger would. Mute the vendor names and ask whether anyone outside your company can tell which tool made the video. They almost never can. If the tool is invisible in the result, it was never the story. The story was whether the video said something worth hearing and reached someone who needed to hear it.

Do those five things and the consolidating stack becomes exactly what it should be, a faster way to get to the work that was always the actual work.

Frequently Asked Questions

Does the Synthesia AI Playground replace a video team?
It replaces the production step, not the team. Embedding Sora 2 and Veo 3.1 in one tab makes generating footage trivial, which is real and useful. But deciding what to make, scripting it so a busy operator watches, and distributing it so the right buyers see it are untouched by a render button. Those are the parts a team earns its keep on, and they got harder this year, not easier.
If production is this cheap now, why pay a studio at all?
Because production was never the expensive part. Across our retainer book the generative-production step is under 15% of the hours on a typical B2B video, while roughly 60% is pre-production editorial and the rest is distribution. You are paying for the judgment about what to make and the system that gets it watched, not for tool access your coordinator already has.
What should I actually do with the time a consolidated stack saves?
Reinvest it in editorial and distribution. In our internal time-study, consolidating generative steps cut render-and-handoff time by about 35% and moved zero of the variables that drove completion rate or pipeline. Pour those recovered hours into the angle, the script, and an owned-channel distribution cadence, because that is where results are actually decided.
↗ Pillar guide

Want the full SaaS video stack?

This post is one chapter of a deeper guide: SaaS Video Production: The Complete 2026 Guide — the seven formats that drive pipeline, what to budget, distribution strategy, ROI measurement, and the production model that compounds.

Read the full pillar guide

If your video program still treats production as the hard part, the stack just outran your strategy. Let's talk about the editorial and distribution layers you can't subscribe to.

Book a Strategy Call
About the author

Juan Martinez

Founder of EVEN Media, an Austin-based B2B video production studio. He builds structured video retainers and content systems for SaaS and tech marketing teams, drawing on production data from 20+ client engagements. Connect on LinkedIn.