Quick Brief
- The Launch: Google released enhanced Veo 3.1 “Ingredients to Video” with native 9:16 aspect ratio support, 1080p/4K upscaling, and improved character consistency across Gemini app, YouTube Shorts, Flow, and Vertex AI
- The Impact: Professional filmmakers and mobile creators gain production-ready vertical video output at $0.40/second (Standard) or $0.15/second (Fast) with no price increase from Veo 3
- The Context: Google advances its position against OpenAI’s Sora 2 by targeting mobile-first workflows and enterprise API integrations amid the generative video AI race
Google announced major upgrades to Veo 3.1 Ingredients to Video on January 13, 2026, introducing native vertical format generation, state-of-the-art 4K upscaling, and enhanced visual consistency controls. The update targets both consumer creators on YouTube Shorts and enterprise workflows via the Gemini API and Vertex AI, marking Google’s strategic push into mobile-optimized AI video generation.
What’s New in Veo 3.1
The Ingredients to Video capability now generates videos from up to three reference images with significantly improved character identity consistency, background preservation, and object continuity across scenes. Google reports that the updated model produces more expressive outputs even with shorter text prompts, enabling faster iteration for content creators.
The platform introduces native 9:16 vertical video generation the first time Ingredients to Video supports portrait mode without cropping or quality degradation. Creators can output directly to YouTube Shorts format, which YouTube deployed to compete with TikTok and Instagram Reels. Professional users gain access to 1080p and 4K resolution upscaling through Flow, Gemini API, and Vertex AI, designed for broadcast-ready production environments.
Google maintains Veo 3.1 pricing at $0.40 per second for Standard quality and $0.15 per second for Fast generation, unchanged from the October 2025 Veo 3.1 release. Enterprise subscribers using Google AI Ultra ($249.99/month) receive approximately 12,500+ credits monthly with 30TB storage for generated assets.
Why This Matters
Google’s vertical video integration addresses the mobile-first content consumption shift, where 9:16 format dominates platforms including YouTube Shorts, TikTok, Instagram Reels, and Snapchat Spotlight. The native aspect ratio support eliminates post-production cropping workflows that typically reduce resolution quality and increase rendering time.
The 4K upscaling capability positions Veo 3.1 for commercial production pipelines where OpenAI’s Sora 2 offers 1080p as maximum resolution. Industry analysts note that Google’s “Ingredients to Video” feature using reference images to maintain visual consistency provides more control over multi-scene narratives compared to Sora 2’s prompt-only approach.
Google embeds SynthID digital watermarks in all Veo-generated videos, maintaining AI content transparency standards as regulatory frameworks evolve globally. The Gemini app now includes video verification tools, allowing users to upload content and confirm whether Google AI generated it.
Technical Specifications
| Feature | Veo 3.1 Standard | Veo 3.1 Fast |
|---|---|---|
| Pricing | $0.40/second | $0.15/second |
| Resolution | 720p, 1080p, 4K | 720p, 1080p |
| Aspect Ratios | 16:9, 9:16 | 16:9, 9:16 |
| Reference Images | Up to 3 images | Up to 3 images |
| Audio | Native audio with dialogue sync | Native audio |
| Availability | Gemini API, Vertex AI, Flow, Gemini app, YouTube | Gemini API, Vertex AI |
Platform Availability
Consumer Access: Veo 3.1 Ingredients to Video deploys to YouTube Shorts and YouTube Create app for the first time, expanding beyond professional tools. The Gemini app provides immediate access to enhanced Ingredients to Video and portrait mode generation for consumer-tier subscribers.
Enterprise Integration: Flow, Gemini API, and Vertex AI receive the enhanced Ingredients to Video capabilities with native vertical format support. The 1080p and 4K resolution options roll out exclusively to Flow, Gemini API, and Vertex AI users, excluding consumer-tier Gemini app access to ultra-high-resolution outputs.
Developer Tools: The Gemini API provides programmatic access to both Veo 3.1 Standard and Fast models through Google AI Studio and Vertex AI endpoints. Developers can implement video generation workflows with start/end frame control and multi-image ingredient inputs via API calls.
What’s Next
Google plans to expand Flow’s editing capabilities with an “erase” tool that removes unwanted objects or characters while automatically reconstructing backgrounds. The feature targets iterative creative workflows where creators refine AI-generated outputs through selective element removal.
The company positions Veo 3.1 against OpenAI’s Sora 2 in the enterprise video generation market, where temporal consistency and audio synchronization determine production viability. Analysts expect Google to leverage its YouTube distribution advantage of 2.7 billion monthly active users to accelerate Veo adoption among content creators.
Regulatory compliance remains critical as governments implement AI labeling requirements. Google’s SynthID watermarking system provides infrastructure for content provenance tracking, addressing European Union AI Act transparency mandates and similar frameworks in development globally.
Frequently Asked Questions (FAQs)
What is Google Veo 3.1 Ingredients to Video?
Veo 3.1 Ingredients to Video generates videos from up to three reference images with text prompts, maintaining character and background consistency across scenes at 720p to 4K resolution.
How much does Veo 3.1 cost?
Veo 3.1 costs $0.40 per second for Standard quality and $0.15 per second for Fast generation via Gemini API, with Google AI Ultra subscribers receiving 12,500+ monthly credits at $249.99/month.
What resolution does Veo 3.1 support?
Veo 3.1 outputs 720p, 1080p, and 4K resolution with state-of-the-art upscaling available through Flow, Gemini API, and Vertex AI in both 16:9 and native 9:16 aspect ratios.
How does Veo 3.1 compare to OpenAI Sora?
Veo 3.1 offers 4K resolution versus Sora 2’s 1080p maximum, supports up to three reference images for consistency, and provides native audio generation with dialogue synchronization.

