I was intrigued by Google’s new video-cloning Omni AI – then I considered the implications

ZDNET’s key takeaways

Google Omni aims to do for video what Nano Banana did for images.
Creators can build videos from text, images, audio, or video.
AI avatars could help creators, but raise trust concerns.

Last week, Google announced a new AI video capability that will either help creatives produce higher-quality videos more easily, or vastly increase the amount of AI slop on YouTube. I’m betting it will be a mix of both.

Google announced Gemini Omni, a tool that raises the ability to create video via AI to an entirely new level. The company compared this announcement to the level of improvement in AI image generation achieved when it launched Nano Banana.

Nano Banana raised the bar considerably on what was possible with image generation. Omni purports to do the same with video. Omni has begun to roll out, but I haven’t had a chance to play with it.

Google described Omni as “where Gemini’s ability to reason meets the ability to create.” Interestingly, according to the company, “With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini’s real-world knowledge.”

Although Omni is “starting with video,” Google said the new model can “create anything from any input,” so presumably we’ll see other media types generated by the tool in due time.

Omni will also be available in model tiers, starting now with Gemini Omni Flash. The capability is coming to the Gemini app, Google Flow, and YouTube Shorts. It isn’t clear whether the web version of Gemini will support Omni, or whether you’ll need to use the Flow interface via your browser.

There are some standout features that make this a very interesting offering.

Clone yourself

I really can’t decide if this is going to be a standout feature, a very big concern for privacy, or an untethered slop generator. The company said you can create videos “with your own voice by using Avatars, which create a digital version of yourself so you can generate videos that look and sound like you.”

As a regular producer of YouTube videos for my channel, I’m intrigued. There have been times when I wanted to put out a video, but was having a bad hair day, a bad voice day, or a bad attitude day, and I just didn’t want that to come across on video.

Could I just feed a script into my digital twin avatar and have RoboDave do the talking? Would my viewers notice? Would they care? Would they hate it? Would I? Clearly this is an area worthy of experimentation, but it’s probably not something I’ll use often.

I do my YouTube channel, in part, to keep my speaking and presentation chops up. Foisting that work on a digital avatar might reduce my workload, but it would also reduce my training and practice.

Google is very careful to say that it’s incorporating its SynthID digital fingerprinting technology in these videos, so they can be verified as having been produced with Omni. Google also said, “Beyond the avatar feature, in terms of editing videos to change audio and speech, we’re still working to test this and better understand how we can bring this capability to users responsibly.”

Physics model

Some of you may remember the early days of video games, when characters behaved more like ragdolls than objects in the physical world. As games got better, they began to incorporate physics models, so if something got shot, knocked back, or dropped, it did so in a manner consistent with the physics of the object.

Omni now incorporates physics into the videos it creates. Google said it has “an improved intuitive understanding of forces like gravity, kinetic energy, and fluid dynamics.” It also uses Gemini’s knowledge to “connect language, imagery, and meaning in ways that go far beyond pattern matching.”

The company said Omni can build detailed videos from short prompts and can generate videos for things like explainers that break down fairly complex ideas. I don’t doubt this. The analytical ability that NotebookLM’s audio overview and video overview features bring to creating explainers is astonishing. If some of that technology found its way into Omni, things could get interesting quickly.

I actually fed marketing documents and spec sheets into NotebookLM and it produced explainer videos for various features of my security product that were better than anything I could have done by hand, especially in the time it took. The visuals at the time weren’t great, but having complex features explained in a clear video in under half an hour was a force multiplier for my product launch schedule.

Input variety

One of Nano Banana’s early standout features was its ability to recontextualize an image. For example, I had it take a picture of me walking in a park and change it so I was wearing something close to an admiral’s uniform on the bridge of an aircraft carrier. While it didn’t get the uniform fruit salad and brass quite right, it did manage to accurately reproduce my body and face.

Omni proposes to take that to video, turning image, text, video, or audio into a “cohesive output.” Right now, the only audio it will accept is voice recordings, but the company said it will “roll out other types of audio inputs soon.”

The company also said you can create scenes, match styles, describe what you want in natural language, and get character consistency throughout the video.

Conversational editing

One aspect of producing videos I don’t enjoy is the editing process. It’s often enormously tedious. But, according to Google, “Gemini Omni gives you a better way to edit video – with natural language. Each instruction builds on the last. Your characters stay consistent, the physics hold up and the scene remembers what came before.”

Google also said you can change elements in the video. I can see a big benefit if it’s possible to import a video and have the editor remove obstructions or change objects and backgrounds. It isn’t clear how long a clip can be, or exactly how much editing you can do with Omni on a given plan, but these possibilities are exciting.

Two other transformations the company said the new Omni can do are:

Change specific things, or change everything. Your video becomes the starting point for something you never could have filmed yourself.
Take a video you shot and just ask Omni to change what’s happening. Edit the action, add in new characters or objects, or transform a moment into something unexpected.

Also, Google hasn’t yet specified video format or resolution. Will this be a professional tool that can handle 16:9 videos in 4K or 8K resolution, or is it meant to be a tool for the YouTube Shorts generation?

When OpenAI launched Sora, it was a novelty. While users abused it (we gave Sam Altman blue hair and made him sing ZDNET’s praise), it never managed to be a tool that helped a professional’s workflow.

While generating AI avatar clones and replacing objects might be fun, I’m hoping this capability is extended so that it’s usable either within Final Cut, Premiere Pro, and DaVinci Resolve, or at least integrated enough that those tools can use edits created by Omni.

It’s possible. Omni’s features will be rolling out to enterprise customers and developers via a Google API.

I’m also curious if Omni will embed the little diamond watermark in the corner of its videos, like it does with Nano Banana’s generated images. While it’s good to know a clip was generated by AI, such watermarking gets in the way of using the AI as a professional tool.

Will we see licensing tiers where the watermark can be removed? Or will we see third-party tools crop up that remove the watermark, whether Google wants you to or not? Time will tell.

Would you use Google Omni to create a digital avatar of yourself for videos you didn’t want to record in person? Let us know in the comments below.

You can follow my day-to-day project updates on social media. Be sure to subscribe to my weekly update newsletter, and follow me on Twitter/X at @DavidGewirtz, on Facebook at Facebook.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.
