Launched this week
Gemini Omni
Create anything from any input – starting with video
657 followers
Create anything from anything, starting with video. Gemini Omni is where Gemini’s ability to reason meets the ability to create. It delivers a leap in world understanding, multimodality, and editing.

Hey Hunters, I am excited to hunt Gemini Omni 🔮
Turn any idea into stunning videos with a single prompt, image, sketch, or reference. Think of it like Nano Banana for video creation — fast, creative, and insanely flexible.
Now available across the Gemini App, Flow, and YouTube, with API access coming soon 🚀
@saaswarrior The video-first direction is interesting for launch and campaign assets. The hard part for marketing teams is usually consistency across variants: same product, same message, different channels. Curious whether Omni can preserve a campaign's style and visual rules across several generated clips or if each prompt needs to restate the constraints.
Video editing through natural language is something every YouTube creator I talk to asks for. The gap today isn't generating video — it's editing existing footage without spending 3 hours in Premiere Pro. If I could tell an AI "cut this 20-minute video to the best 8 minutes and add captions," that alone would save creators 5+ hours per week.
The object tracking across frames and physics-aware generation sound promising for B-roll generation too. Right now most creators use generic stock footage because creating custom visuals is too expensive. How does Omni handle consistency when you're making multiple edits to the same project — does it maintain a "memory" of previous changes or does each prompt start fresh?
The multimodal video-first direction feels much more practical than most AI demos lately. Combining reasoning with generation opens interesting possibilities for editing and accessibility workflows. How do you maintain temporal consistency across longer video generations?
Honestly, I've been waiting for something like this for a while. Pairing reasoning with generation is exactly where most tools fall apart right now: videos look nice but lose all coherence after a few seconds. If Gemini Omni can actually keep the logic consistent across shots, that alone sells it for me. Congrats to the team 🙌
Love the focus on editing and world understanding instead of text-to-video generation. That’s where practical creative workflows start to emerge.
Interesting! What matters here isn't just the video generation; it's the combination of world understanding and editing in one unified model. That's the hard part everyone's been quietly working on.