Veo 3.1 Lite × BibiGPT
Google released Veo 3.1 Lite on 2026-03-31 — an economy-tier video generation model on the Gemini API priced at $0.05/s for 720p and $0.08/s for 1080p, supporting 4 / 6 / 8s clips, 9:16 vertical output, and native audio. BibiGPT pairs Veo 3.1 Lite with our video summarization and visual analysis pipeline so creators can spin up short, vertical, AI-generated video assets without leaving the workflow.
Key facts (90-second read)
Google released Veo 3.1 Lite on 2026-03-31 — an economy-tier video generation model on the Gemini API. Pricing is $0.05/s at 720p and $0.08/s at 1080p, durations are fixed at 4 / 6 / 8 seconds, output is 9:16 vertical native, and native audio is generated in the same pass. For BibiGPT users, Veo 3.1 Lite is the short-form vertical output complement to long-video summarization — turn a 90-minute lecture into 8-second TikTok / Reels / 小红书 hooks at a few cents per clip.
Features
What is Veo 3.1 Lite?
Google's 2026-03-31 economy-tier video generation model, available on the Gemini API. Designed for short-form, social-first vertical output at a fraction of the flagship Veo cost — $0.05/s at 720p, $0.08/s at 1080p, with 4/6/8 second durations and native audio.
Economy tier price anchor
$0.05/s for 720p and $0.08/s for 1080p — roughly an order of magnitude cheaper than the flagship Veo tier. An 8-second 1080p clip costs about $0.64, finally making programmatic short-video generation affordable for content creators.
9:16 vertical with native audio
Vertical 9:16 output is the default — built for TikTok, Reels, Shorts, and 小红书 视频号. Native audio means voice and background sound are generated together with the visual, no separate TTS or sound-design pass needed.
Short-form 4 / 6 / 8 second clips
Three preset durations cover the typical short-video grammar — a hook (4s), a beat (6s), or a full micro-narrative (8s). Constraint-driven design that keeps cost predictable and turnaround fast.
Why this matters for BibiGPT users
BibiGPT is the AI audio/video assistant for creators — long-video summarization, visual analysis, and knowledge-product generation (公众号 articles, 小红书 graphics, short videos, slides). Veo 3.1 Lite plugs into the short-video output side of that creator workflow.
Turn long videos into short vertical clips
BibiGPT summarizes a 90-minute lecture or podcast and surfaces the highest-leverage moments. With Veo 3.1 Lite, those moments can be re-rendered as 8-second vertical hooks for TikTok / Reels / 小红书 — the full pipeline from long-form input to short-form output.
Affordable A/B variant generation
At $0.05–$0.08/s, generating 5–10 variant hooks of a single concept costs a few dollars total. Creators can iterate on opening shots, voiceover styles, and visual hooks without burning the flagship-tier budget.
Vertical-first social asset pipeline
BibiGPT already produces 公众号 / 小红书 / PPT knowledge artifacts from a single video input. Veo 3.1 Lite extends the same artifact set to short vertical video — finishing the multi-format coverage creators expect from the AI 知行助理.
5 key changes (90-second read)
Headline shifts from the Veo 3.1 Lite release on 2026-03-31.
- 1
Economy tier joins the Veo lineup
Google adds an explicit economy tier below the flagship Veo. Positioned for high-volume, short-form, social-first generation rather than cinematic production — a different price point and a different scope.
- 2
$0.05/s 720p · $0.08/s 1080p pricing
An 8-second 1080p clip costs about $0.64; an 8-second 720p clip about $0.40. Roughly an order of magnitude cheaper than flagship Veo, finally putting programmatic short-video generation in reach of individual creators.
- 3
9:16 vertical with native audio
Output is 9:16 vertical by default, built for TikTok / Reels / Shorts / 小红书 / 视频号. Native audio (voice + background sound) is generated together with the visual — no separate TTS or sound-design pass.
- 4
4 / 6 / 8 second duration presets
Three fixed durations match the short-form grammar: 4s for a hook, 6s for a beat, 8s for a full micro-narrative. Constraint-driven design that keeps cost predictable and turnaround fast.
- 5
Available on the Gemini API
Veo 3.1 Lite ships on the same Gemini API surface as the rest of Google's multimodal stack. Integration with Gemini text, image, and embedding models is direct — useful for pipelines that already route through Gemini.
3 typical scenarios for BibiGPT users
Where Veo 3.1 Lite pays off most for BibiGPT's creator audience.
Long video → short vertical hooks
BibiGPT summarizes a 90-minute lecture, podcast, or conference and surfaces the highest-leverage moments. Veo 3.1 Lite re-renders those moments as 8-second 9:16 clips for TikTok, Reels, 小红书, and 视频号 — the full creator workflow from long-form input to short-form distribution.
Affordable A/B variant generation
At $0.05–$0.08 per second, generating 5–10 variant hooks of a single concept costs a few dollars total. Iterate on opening shots, voiceover styles, and visual hooks without the flagship-tier budget — useful for finding the variant that actually converts.
Vertical social asset pipeline for creators
BibiGPT already produces 公众号 articles, 小红书 graphics, and PPT decks from a single video input. Veo 3.1 Lite extends the same artifact set to short vertical video — finishing the multi-format coverage creators expect from the AI 知行助理.
FAQ'S
Frequently Asked Questions
Ask us anything!
Use BibiGPT to turn long videos into short vertical hooks — backed by Veo-tier models
BibiGPT summarizes long-form video and surfaces the highest-leverage moments; pair that with Veo 3.1 Lite's economy-tier vertical 9:16 generation to spin those moments into TikTok / Reels / 小红书 video assets without leaving the workflow.