deepmind.googleAI tool

deepmind-google

Site: https://deepmind.google/models/veo/

Visiter le site

deepmind.google

Assistant de programmation de l'IA Formation en modèles d'IA

Visiter le site

Plans tarifaires

Aucun plan tarifaire detaille n'est encore disponible pour cet outil.

Presentation detaillee

deepmind.google uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. Learn moreUnderstoodSkip to main content Slide 1 of 3VeoOur state-of-the-art video generation model Try in Gemini Try in Flow Build with Veo Your browser does not support the video tag.Veo 3.1Video, meet audio. Our latest video generation model, designed to empower filmmakers and storytellers. Try in Gemini Try in Flow Build with Veo Your browser does not support the video tag.New capabilitiesGreater control, consistency, and creativity than ever before. Try in Flow Your browser does not support the video tag.Explore the latest Introducing Veo 3, our video generation model with expanded creative controls – including native audio and extended videos. Learn how to prompt Capabilities Performance Safety Showcase Try Veo What’s new Re-designed for greater realismGreater realism and fidelity, made possible by Veo 3’s real world physics and audio. Follows prompts like never beforeImproved prompt adherence, meaning more accurate responses to your instructions. Improved creative controlOffers new levels of control, consistency, and creativity – now across audio.Introducing Veo 3.1Video, meet audio. Our latest video generation model, designed to empower filmmakers and storytellers. Learn how to prompt Your browser does not support the video tag. Prompt: A medium shot opens on a seasoned, grey-bearded man in sunglasses and a paisley shirt, his gaze fixed off-camera with a contemplative expression. His gold chain glints subtly. Beside him, a younger man in a tank top, also looking forward, suggests a shared moment of observation or reflection. The camera slowly pushes in, subtly emphasizing their quiet focus. In the background, a vibrant mural splashes across a wall, hinting at an urban setting. Faint city murmurs and distant chatter drift in, accompanied by a mellow, soulful hip-hop beat that adds a contemplative yet grounded atmosphere. "The city always got a story," the older man murmurs, a slight nod of his head. "Just gotta listen." Prompt: A medium shot opens on a seasoned, grey-bearded man in sunglasses and a paisley shirt, his gaze fixed off-camera with a contemplative expression. His gold chain glints subtly. Beside him, a younger man in a tank top, also looking forward, suggests a shared moment of observation or reflection. The camera slowly pushes in, subtly emphasizing their quiet focus. In the background, a vibrant mural splashes across a wall, hinting at an urban setting. Faint city murmurs and distant chatter drift in, accompanied by a mellow, soulful hip-hop beat that adds a contemplative yet grounded atmosphere. "The city always got a story," the older man murmurs, a slight nod of his head. "Just gotta listen."Prompt: A medium shot opens on a seasoned, grey-bearded man in sunglasses and a paisley shirt, his gaze fixed off-camera with a contemplative expression. His gold chain glints subtly. Beside him, a younger man in a tank top, also looking forward, suggests a shared moment of observation or reflection. The camera slowly pushes in, subtly emphasizing their quiet focus. In the background, a vibrant mural splashes across a wall, hinting at an urban setting. Faint city murmurs and distant chatter drift in, accompanied by a mellow, soulful hip-hop beat that adds a contemplative yet grounded atmosphere. "The city always got a story," the older man murmurs, a slight nod of his head. "Just gotta listen."Prompt: A medium shot opens on a seasoned, grey-bearded man in sunglasses and a paisley shirt, his gaze fixed off-camera with a contemplative expression. His gold chain glints subtly. Beside him, a younger man in a tank top, also looking forward, suggests a shared moment of observation or reflection. The camera slowly pushes in, subtly emphasizing their quiet focus. In the background, a vibrant mural splashes across a wall, hinting at an urban setting. Faint city murmurs and distant chatter drift in, accompanied by a mellow, soulful hip-hop beat that adds a contemplative yet grounded atmosphere. "The city always got a story," the older man murmurs, a slight nod of his head. "Just gotta listen." Veo 3 lets you add sound effects, ambient noise, and even dialogue to your creations – generating all audio natively. It also delivers best in class quality, excelling in physics, realism and prompt adherence.Slide 1 of 5 Your browser does not support the video tag. Prompt: A medium shot frames an old sailor, his knitted blue sailor hat casting a shadow over his eyes, a thick grey beard obscuring his chin. He holds his pipe in one hand, gesturing with it towards the churning, grey sea beyond the ship's railing. "This ocean, it's a force, a wild, untamed might. And she commands your awe, with every breaking light" Prompt: A medium shot frames an old sailor, his knitted blue sailor hat casting a shadow over his eyes, a thick grey beard obscuring his chin. He holds his pipe in one hand, gesturing with it towards the churning, grey sea beyond the ship's railing. "This ocean, it's a force, a wild, untamed might. And she commands your awe, with every breaking light"Prompt: A medium shot frames an old sailor, his knitted blue sailor hat casting a shadow over his eyes, a thick grey beard obscuring his chin. He holds his pipe in one hand, gesturing with it towards the churning, grey sea beyond the ship's railing. "This ocean, it's a force, a wild, untamed might. And she commands your awe, with every breaking light" Your browser does not support the video tag. Prompt: A follow shot of a wise old owl high in the air, peeking through the clouds in a moonlit sky above a forest. The wise old owl carefully circles a clearing looking around to the forest floor. After a few moments, it dives down to a moonlit path and sits next to a badger. Audio: wings flapping, birdsong, loud and pleasant wind rustling and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, croaking. A light orchestral score with woodwinds throughout with a cheerful, optimistic rhythm, full of innocent curiosity.A wise old owl and a nervous badger sit on a moonlit forest path. "They left behind a...a 'ball' today. It bounced higher than I can jump.” the badger stammered, trying to comprehend it. “What manner of magic is that?" the owl hooted thoughtfully. Audio: Owl hooting, badger's nervous chitters, rustling leaves, crickets.A wise old owl flies away out of the frame and a nervous young badger runs in a different direction out of the frame. In the background, you can see a squirrel hurrying past making noise of rustling dried autumn leaves as it goes. Audio: birdsong, loud and leaves rustling, and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, and the sounds of squirrels scurrying through the dried autumn leaves. The sound of an owl hooting in the distance, badger's nervous chitters, rustling leaves, crickets, sounds that are full of innocent curiosity. Prompt: A follow shot of a wise old owl high in the air, peeking through the clouds in a moonlit sky above a forest. The wise old owl carefully circles a clearing looking around to the forest floor. After a few moments, it dives down to a moonlit path and sits next to a badger. Audio: wings flapping, birdsong, loud and pleasant wind rustling and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, croaking. A light orchestral score with woodwinds throughout with a cheerful, optimistic rhythm, full of innocent curiosity.A wise old owl and a nervous badger sit on a moonlit forest path. "They left behind a...a 'ball' today. It bounced higher than I can jump.” the badger stammered, trying to comprehend it. “What manner of magic is that?" the owl hooted thoughtfully. Audio: Owl hooting, badger's nervous chitters, rustling leaves, crickets.A wise old owl flies away out of the frame and a nervous young badger runs in a different direction out of the frame. In the background, you can see a squirrel hurrying past making noise of rustling dried autumn leaves as it goes. Audio: birdsong, loud and leaves rustling, and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, and the sounds of squirrels scurrying through the dried autumn leaves. The sound of an owl hooting in the distance, badger's nervous chitters, rustling leaves, crickets, sounds that are full of innocent curiosity.Prompt: A follow shot of a wise old owl high in the air, peeking through the clouds in a moonlit sky above a forest. The wise old owl carefully circles a clearing looking around to the forest floor. After a few moments, it dives down to a moonlit path and sits next to a badger. Audio: wings flapping, birdsong, loud and pleasant wind rustling and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, croaking. A light orchestral score with woodwinds throughout with a cheerful, optimistic rhythm, full of innocent curiosity.A wise old owl and a nervous badger sit on a moonlit forest path. "They left behind a...a 'ball' today. It bounced higher than I can jump.” the badger stammered, trying to comprehend it. “What manner of magic is that?" the owl hooted thoughtfully. Audio: Owl hooting, badger's nervous chitters, rustling leaves, crickets.A wise old owl flies away out of the frame and a nervous young badger runs in a different direction out of the frame. In the background, you can see a squirrel hurrying past making noise of rustling dried autumn leaves as it goes. Audio: birdsong, loud and leaves rustling, and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, and the sounds of squirrels scurrying through the dried autumn leaves. The sound of an owl hooting in the distance, badger's nervous chitters, rustling leaves, crickets, sounds that are full of innocent curiosity. A wise old owl and a nervous badger sit on a moonlit forest path. "They left behind a...a 'ball' today. It bounced higher than I can jump.” the badger stammered, trying to comprehend it. “What manner of magic is that?" the owl hooted thoughtfully. Audio: Owl hooting, badger's nervous chitters, rustling leaves, crickets.A wise old owl flies away out of the frame and a nervous young badger runs in a different direction out of the frame. In the background, you can see a squirrel hurrying past making noise of rustling dried autumn leaves as it goes. Audio: birdsong, loud and leaves rustling, and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, and the sounds of squirrels scurrying through the dried autumn leaves. The sound of an owl hooting in the distance, badger's nervous chitters, rustling leaves, crickets, sounds that are full of innocent curiosity. Your browser does not support the video tag. Prompt: A medium shot, historical adventure setting: Warm lamplight illuminates a cartographer in a cluttered study, poring over an ancient, sprawling map spread across a large table. Cartographer: "According to this old sea chart, the lost island isn't myth! We must prepare an expedition immediately!" Prompt: A medium shot, historical adventure setting: Warm lamplight illuminates a cartographer in a cluttered study, poring over an ancient, sprawling map spread across a large table. Cartographer: "According to this old sea chart, the lost island isn't myth! We must prepare an expedition immediately!"Prompt: A medium shot, historical adventure setting: Warm lamplight illuminates a cartographer in a cluttered study, poring over an ancient, sprawling map spread across a large table. Cartographer: "According to this old sea chart, the lost island isn't myth! We must prepare an expedition immediately!" Your browser does not support the video tag. Prompt: A detective interrogates a nervous-looking rubber duck. "Where were you on the night of the bubble bath?!" he quacks. Audio: Detective's stern quack, nervous squeaks from rubber duck. Prompt: A detective interrogates a nervous-looking rubber duck. "Where were you on the night of the bubble bath?!" he quacks. Audio: Detective's stern quack, nervous squeaks from rubber duck.Prompt: A detective interrogates a nervous-looking rubber duck. "Where were you on the night of the bubble bath?!" he quacks. Audio: Detective's stern quack, nervous squeaks from rubber duck. Your browser does not support the video tag. Prompt: A close up of spies exchanging information in a crowded train station with uniformed guards patrolling nearby "The microfilm is in your ticket" he murmured pretending to check his watch "They're watching the north exit" she warned casually adjusting her scarf "Use the service tunnel" Commuters rush past oblivious to the covert exchange happening amid announcements of arrivals and departures Prompt: A close up of spies exchanging information in a crowded train station with uniformed guards patrolling nearby "The microfilm is in your ticket" he murmured pretending to check his watch "They're watching the north exit" she warned casually adjusting her scarf "Use the service tunnel" Commuters rush past oblivious to the covert exchange happening amid announcements of arrivals and departuresPrompt: A close up of spies exchanging information in a crowded train station with uniformed guards patrolling nearby "The microfilm is in your ticket" he murmured pretending to check his watch "They're watching the north exit" she warned casually adjusting her scarf "Use the service tunnel" Commuters rush past oblivious to the covert exchange happening amid announcements of arrivals and departures Your browser does not support the video tag. Prompt: The scene explodes with the raw, visceral, and unpredictable energy of a hardcore off-road rally, captured with a dynamic, almost found-footage or embedded sports documentary aesthetic. The camera is often shaky, seemingly mounted inside one of the vehicles or held by a daring spectator very close to the action, frequently splattered with mud or water, catching unintentional lens flares from the natural, often harsh, sunlight filtering through trees or reflecting off wet surfaces. We are immersed in a challenging, untamed natural environment – perhaps a dense, muddy forest trail, a treacherous rocky incline littered with loose scree, or a series_of shallow, fast-flowing river crossings. Several heavily modified, entirely unidentifiable, and unbranded off-road vehicles are engaged in a frenetic, no-holds-barred race. These are not showroom models; they are custom-built, rugged machines – open-wheeled buggies with exposed engines and prominent roll cages, heavily armored pickup trucks with oversized, knobby tires and snorkel exhausts, their original forms and manufacturers completely obscured by extreme modifications, layers of caked-on mud, and a general air of brutal functionality. The dominant sounds are the deafening, guttural roar of powerful, untamed engines, the whine of transmissions, the percussive impact of suspension bottoming out, and the constant spray of mud and water. Within an 8-second sequence, one of the lead vehicles, a low-slung, open-cockpit buggy so caked in thick, brown mud that its original color is a mystery, approaches a wide, shallow river crossing at incredible speed. Without the slightest hesitation, its unseen driver powers straight into the water. The impact sends an enormous, almost solid, opaque sheet of muddy water, mixed with stones and debris from the riverbed, spectacularly high into the air, completely engulfing the small buggy for a terrifying moment, obscuring it from view as if it has been swallowed by the river itself. Right on its tail, a pursuing, equally mud-encrusted, custom-built truck – a hulking, high-clearance beast with a heavily reinforced external roll cage and no discernible badging – arrives at the river crossing just as this massive wall of airborne water reaches its peak. Instead of slowing or attempting to find a clearer path, the truck's driver, with unwavering aggression, plunges directly into and through this opaque, turbulent curtain of muddy spray at full throttle. A split second later, the truck bursts out from the other side of the deluge, water cascading from its roof and chassis, its oversized windshield wipers struggling frantically to clear the torrent of muddy water obscuring the driver's vision. It lands heavily on the far bank, suspension groaning, but still in hot pursuit of the now-reappearing buggy. This thrilling, messy, and visually spectacular sequence of one vehicle creating a massive environmental obstacle and the next immediately conquering it through sheer force, forms the core, immersive, attention-grabbing event of the 8-second sequence. The race continues with undiminished ferocity, the natural terrain itself an active participant in the conflict. Prompt: The scene explodes with the raw, visceral, and unpredictable energy of a hardcore off-road rally, captured with a dynamic, almost found-footage or embedded sports documentary aesthetic. The camera is often shaky, seemingly mounted inside one of the vehicles or held by a daring spectator very close to the action, frequently splattered with mud or water, catching unintentional lens flares from the natural, often harsh, sunlight filtering through trees or reflecting off wet surfaces. We are immersed in a challenging, untamed natural environment – perhaps a dense, muddy forest trail, a treacherous rocky incline littered with loose scree, or a series_of shallow, fast-flowing river crossings. Several heavily modified, entirely unidentifiable, and unbranded off-road vehicles are engaged in a frenetic, no-holds-barred race. These are not showroom models; they are custom-built, rugged machines – open-wheeled buggies with exposed engines and prominent roll cages, heavily armored pickup trucks with oversized, knobby tires and snorkel exhausts, their original forms and manufacturers completely obscured by extreme modifications, layers of caked-on mud, and a general air of brutal functionality. The dominant sounds are the deafening, guttural roar of powerful, untamed engines, the whine of transmissions, the percussive impact of suspension bottoming out, and the constant spray of mud and water. Within an 8-second sequence, one of the lead vehicles, a low-slung, open-cockpit buggy so caked in thick, brown mud that its original color is a mystery, approaches a wide, shallow river crossing at incredible speed. Without the slightest hesitation, its unseen driver powers straight into the water. The impact sends an enormous, almost solid, opaque sheet of muddy water, mixed with stones and debris from the riverbed, spectacularly high into the air, completely engulfing the small buggy for a terrifying moment, obscuring it from view as if it has been swallowed by the river itself. Right on its tail, a pursuing, equally mud-encrusted, custom-built truck – a hulking, high-clearance beast with a heavily reinforced external roll cage and no discernible badging – arrives at the river crossing just as this massive wall of airborne water reaches its peak. Instead of slowing or attempting to find a clearer path, the truck's driver, with unwavering aggression, plunges directly into and through this opaque, turbulent curtain of muddy spray at full throttle. A split second later, the truck bursts out from the other side of the deluge, water cascading from its roof and chassis, its oversized windshield wipers struggling frantically to clear the torrent of muddy water obscuring the driver's vision. It lands heavily on the far bank, suspension groaning, but still in hot pursuit of the now-reappearing buggy. This thrilling, messy, and visually spectacular sequence of one vehicle creating a massive environmental obstacle and the next immediately conquering it through sheer force, forms the core, immersive, attention-grabbing event of the 8-second sequence. The race continues with undiminished ferocity, the natural terrain itself an active participant in the conflict.Prompt: The scene explodes with the raw, visceral, and unpredictable energy of a hardcore off-road rally, captured with a dynamic, almost found-footage or embedded sports documentary aesthetic. The camera is often shaky, seemingly mounted inside one of the vehicles or held by a daring spectator very close to the action, frequently splattered with mud or water, catching unintentional lens flares from the natural, often harsh, sunlight filtering through trees or reflecting off wet surfaces. We are immersed in a challenging, untamed natural environment – perhaps a dense, muddy forest trail, a treacherous rocky incline littered with loose scree, or a series_of shallow, fast-flowing river crossings. Several heavily modified, entirely unidentifiable, and unbranded off-road vehicles are engaged in a frenetic, no-holds-barred race. These are not showroom models; they are custom-built, rugged machines – open-wheeled buggies with exposed engines and prominent roll cages, heavily armored pickup trucks with oversized, knobby tires and snorkel exhausts, their original forms and manufacturers completely obscured by extreme modifications, layers of caked-on mud, and a general air of brutal functionality. The dominant sounds are the deafening, guttural roar of powerful, untamed engines, the whine of transmissions, the percussive impact of suspension bottoming out, and the constant spray of mud and water. Within an 8-second sequence, one of the lead vehicles, a low-slung, open-cockpit buggy so caked in thick, brown mud that its original color is a mystery, approaches a wide, shallow river crossing at incredible speed. Without the slightest hesitation, its unseen driver powers straight into the water. The impact sends an enormous, almost solid, opaque sheet of muddy water, mixed with stones and debris from the riverbed, spectacularly high into the air, completely engulfing the small buggy for a terrifying moment, obscuring it from view as if it has been swallowed by the river itself. Right on its tail, a pursuing, equally mud-encrusted, custom-built truck – a hulking, high-clearance beast with a heavily reinforced external roll cage and no discernible badging – arrives at the river crossing just as this massive wall of airborne water reaches its peak. Instead of slowing or attempting to find a clearer path, the truck's driver, with unwavering aggression, plunges directly into and through this opaque, turbulent curtain of muddy spray at full throttle. A split second later, the truck bursts out from the other side of the deluge, water cascading from its roof and chassis, its oversized windshield wipers struggling frantically to clear the torrent of muddy water obscuring the driver's vision. It lands heavily on the far bank, suspension groaning, but still in hot pursuit of the now-reappearing buggy. This thrilling, messy, and visually spectacular sequence of one vehicle creating a massive environmental obstacle and the next immediately conquering it through sheer force, forms the core, immersive, attention-grabbing event of the 8-second sequence. The race continues with undiminished ferocity, the natural terrain itself an active participant in the conflict.Prompt: The scene explodes with the raw, visceral, and unpredictable energy of a hardcore off-road rally, captured with a dynamic, almost found-footage or embedded sports documentary aesthetic. The camera is often shaky, seemingly mounted inside one of the vehicles or held by a daring spectator very close to the action, frequently splattered with mud or water, catching unintentional lens flares from the natural, often harsh, sunlight filtering through trees or reflecting off wet surfaces. We are immersed in a challenging, untamed natural environment – perhaps a dense, muddy forest trail, a treacherous rocky incline littered with loose scree, or a series_of shallow, fast-flowing river crossings. Several heavily modified, entirely unidentifiable, and unbranded off-road vehicles are engaged in a frenetic, no-holds-barred race. These are not showroom models; they are custom-built, rugged machines – open-wheeled buggies with exposed engines and prominent roll cages, heavily armored pickup trucks with oversized, knobby tires and snorkel exhausts, their original forms and manufacturers completely obscured by extreme modifications, layers of caked-on mud, and a general air of brutal functionality. The dominant sounds are the deafening, guttural roar of powerful, untamed engines, the whine of transmissions, the percussive impact of suspension bottoming out, and the constant spray of mud and water. Within an 8-second sequence, one of the lead vehicles, a low-slung, open-cockpit buggy so caked in thick, brown mud that its original color is a mystery, approaches a wide, shallow river crossing at incredible speed. Without the slightest hesitation, its unseen driver powers straight into the water. The impact sends an enormous, almost solid, opaque sheet of muddy water, mixed with stones and debris from the riverbed, spectacularly high into the air, completely engulfing the small buggy for a terrifying moment, obscuring it from view as if it has been swallowed by the river itself. Right on its tail, a pursuing, equally mud-encrusted, custom-built truck – a hulking, high-clearance beast with a heavily reinforced external roll cage and no discernible badging – arrives at the river crossing just as this massive wall of airborne water reaches its peak. Instead of slowing or attempting to find a clearer path, the truck's driver, with unwavering aggression, plunges directly into and through this opaque, turbulent curtain of muddy spray at full throttle. A split second later, the truck bursts out from the other side of the deluge, water cascading from its roof and chassis, its oversized windshield wipers struggling frantically to clear the torrent of muddy water obscuring the driver's vision. It lands heavily on the far bank, suspension groaning, but still in hot pursuit of the now-reappearing buggy. This thrilling, messy, and visually spectacular sequence of one vehicle creating a massive environmental obstacle and the next immediately conquering it through sheer force, forms the core, immersive, attention-grabbing event of the 8-second sequence. The race continues with undiminished ferocity, the natural terrain itself an active participant in the conflict. Your browser does not support the video tag. Prompt: A meticulously detailed scene opens, displaying a small, pale yellow, humanoid figure crafted from wax. This figure stands centered in a warm, ethereal landscape composed entirely of molten wax, which forms gently undulating hills and reflective pools. In its raised hand, a delicate, bright flame flickers with a vibrant glow, casting soft, warm light on the figure's smooth, slightly reflective surface. To the left, a larger, partially melted candle drips viscous wax onto a nearby mound, its own blue-tinged flame barely visible. The atmosphere is serene, illuminated by the golden light of the small figure's flame, highlighting the glossy textures and subtle translucence of the wax environment. (0-1 seconds) The camera initiates a smooth, tracking shot, maintaining an eye-level perspective with the small wax person. As the figure begins to gently walk forward, its small feet creating subtle ripples in the viscous, pale yellow wax terrain, the camera gracefully follows its movement. The figure takes slow, deliberate steps across the shimmering, honey-colored landscape, its arm steadily raised to protect the precious, unwavering flame. Each step is deliberate, conveying a sense of purpose. The soft glow of the flame remains the primary light source, illuminating the path ahead and emphasizing the intricate, dripping textures of the surrounding wax formations. (1-7 seconds) The wax person continues its quiet journey, steadily progressing across the glowing, soft landscape. The camera holds its smooth, tracking motion, subtly receding slightly to reveal a broader view of the wax world, emphasizing the figure's determined, solitary walk through its unique environment. The flame continues to burn brightly, a beacon in the warm, diffused light. (7-8 seconds) Prompt: A meticulously detailed scene opens, displaying a small, pale yellow, humanoid figure crafted from wax. This figure stands centered in a warm, ethereal landscape composed entirely of molten wax, which forms gently undulating hills and reflective pools. In its raised hand, a delicate, bright flame flickers with a vibrant glow, casting soft, warm light on the figure's smooth, slightly reflective surface. To the left, a larger, partially melted candle drips viscous wax onto a nearby mound, its own blue-tinged flame barely visible. The atmosphere is serene, illuminated by the golden light of the small figure's flame, highlighting the glossy textures and subtle translucence of the wax environment. (0-1 seconds) The camera initiates a smooth, tracking shot, maintaining an eye-level perspective with the small wax person. As the figure begins to gently walk forward, its small feet creating subtle ripples in the viscous, pale yellow wax terrain, the camera gracefully follows its movement. The figure takes slow, deliberate steps across the shimmering, honey-colored landscape, its arm steadily raised to protect the precious, unwavering flame. Each step is deliberate, conveying a sense of purpose. The soft glow of the flame remains the primary light source, illuminating the path ahead and emphasizing the intricate, dripping textures of the surrounding wax formations. (1-7 seconds) The wax person continues its quiet journey, steadily progressing across the glowing, soft landscape. The camera holds its smooth, tracking motion, subtly receding slightly to reveal a broader view of the wax world, emphasizing the figure's determined, solitary walk through its unique environment. The flame continues to burn brightly, a beacon in the warm, diffused light. (7-8 seconds)Prompt: A meticulously detailed scene opens, displaying a small, pale yellow, humanoid figure crafted from wax. This figure stands centered in a warm, ethereal landscape composed entirely of molten wax, which forms gently undulating hills and reflective pools. In its raised hand, a delicate, bright flame flickers with a vibrant glow, casting soft, warm light on the figure's smooth, slightly reflective surface. To the left, a larger, partially melted candle drips viscous wax onto a nearby mound, its own blue-tinged flame barely visible. The atmosphere is serene, illuminated by the golden light of the small figure's flame, highlighting the glossy textures and subtle translucence of the wax environment. (0-1 seconds) The camera initiates a smooth, tracking shot, maintaining an eye-level perspective with the small wax person. As the figure begins to gently walk forward, its small feet creating subtle ripples in the viscous, pale yellow wax terrain, the camera gracefully follows its movement. The figure takes slow, deliberate steps across the shimmering, honey-colored landscape, its arm steadily raised to protect the precious, unwavering flame. Each step is deliberate, conveying a sense of purpose. The soft glow of the flame remains the primary light source, illuminating the path ahead and emphasizing the intricate, dripping textures of the surrounding wax formations. (1-7 seconds) The wax person continues its quiet journey, steadily progressing across the glowing, soft landscape. The camera holds its smooth, tracking motion, subtly receding slightly to reveal a broader view of the wax world, emphasizing the figure's determined, solitary walk through its unique environment. The flame continues to burn brightly, a beacon in the warm, diffused light. (7-8 seconds)Prompt: A meticulously detailed scene opens, displaying a small, pale yellow, humanoid figure crafted from wax. This figure stands centered in a warm, ethereal landscape composed entirely of molten wax, which forms gently undulating hills and reflective pools. In its raised hand, a delicate, bright flame flickers with a vibrant glow, casting soft, warm light on the figure's smooth, slightly reflective surface. To the left, a larger, partially melted candle drips viscous wax onto a nearby mound, its own blue-tinged flame barely visible. The atmosphere is serene, illuminated by the golden light of the small figure's flame, highlighting the glossy textures and subtle translucence of the wax environment. (0-1 seconds) The camera initiates a smooth, tracking shot, maintaining an eye-level perspective with the small wax person. As the figure begins to gently walk forward, its small feet creating subtle ripples in the viscous, pale yellow wax terrain, the camera gracefully follows its movement. The figure takes slow, deliberate steps across the shimmering, honey-colored landscape, its arm steadily raised to protect the precious, unwavering flame. Each step is deliberate, conveying a sense of purpose. The soft glow of the flame remains the primary light source, illuminating the path ahead and emphasizing the intricate, dripping textures of the surrounding wax formations. (1-7 seconds) The wax person continues its quiet journey, steadily progressing across the glowing, soft landscape. The camera holds its smooth, tracking motion, subtly receding slightly to reveal a broader view of the wax world, emphasizing the figure's determined, solitary walk through its unique environment. The flame continues to burn brightly, a beacon in the warm, diffused light. (7-8 seconds) Your browser does not support the video tag. Prompt: The scene opens with a top-down or wide-angle shot showcasing a vast, perfectly flat, neutral-colored surface – perhaps the polished concrete floor of an enormous, empty aircraft hangar, or a giant, minimalist tabletop stretching beyond the frame, under bright, even, shadowless studio lighting. This surface is meticulously covered with thousands upon thousands of small, identical, brightly colored paper squares, arranged in a simple, orderly grid. Each square is a single, vibrant, uncreased sheet – a sea of reds, blues, yellows, greens, oranges, creating a stunning, static mosaic of pure potential. The atmosphere is one of quiet anticipation, a sense of immense latent energy waiting to be unleashed. There is no visible mechanism, no hint of how these papers might be manipulated. Within an 8-second sequence, initiated by an unseen cue – perhaps a subtle, almost inaudible, low-frequency hum that ripples almost invisibly across the surface, or a sudden, soft flash of diffused light – all the thousands of paper squares simultaneously, and with breathtaking precision, leap a few inches into the air as if startled into life. Then, in a mesmerizing, perfectly synchronized, and incredibly high-speed aerial ballet, they begin to fold themselves in mid-air. With impossible, almost magical celerity and accuracy, unseen forces guide each individual square through a complex series of sharp creases, neat tucks, and intricate folds. The swarm of fluttering, self-constructing papers is a blur of color and motion, a chaotic yet utterly controlled explosion of activity. Within a mere five to six seconds, this frenetic process of airborne origami completes. Each of the thousands of squares has transformed into an identical, perfectly formed, complex origami figure – perhaps graceful cranes with outstretched wings, delicate multi-petaled lotus flowers, or miniature, intricately detailed dragons. In the final two to three seconds of the sequence, these newly formed origami figures, still hovering in mid-air, then smoothly and rapidly arrange themselves, like a flock of perfectly trained birds or a sophisticated, self-organizing swarm of nanobots, into a stunning, larger, three-dimensional collective pattern or a recognizable mosaic image – perhaps a giant, hovering sphere composed of countless tiny birds, or a complex, flowing wave of flowers, or even a pixel-perfect, three-dimensional representation of a face or symbol. This collective sculpture holds its form for a beat before the individual origami figures begin to gently, gracefully, and silently settle back down onto the surface, now arranged in their magnificent new configuration. This entire rapid, impossible, and beautiful transformation – from simple squares to a synchronized swarm of self-folding forms creating a complex collective artwork – is the core, eye-popping, and meticulously detailed VFX spectacle of the 8-second sequence. The visual is one of magical precision, emergent complexity, and the beauty of mass synchronized action. Prompt: The scene opens with a top-down or wide-angle shot showcasing a vast, perfectly flat, neutral-colored surface – perhaps the polished concrete floor of an enormous, empty aircraft hangar, or a giant, minimalist tabletop stretching beyond the frame, under bright, even, shadowless studio lighting. This surface is meticulously covered with thousands upon thousands of small, identical, brightly colored paper squares, arranged in a simple, orderly grid. Each square is a single, vibrant, uncreased sheet – a sea of reds, blues, yellows, greens, oranges, creating a stunning, static mosaic of pure potential. The atmosphere is one of quiet anticipation, a sense of immense latent energy waiting to be unleashed. There is no visible mechanism, no hint of how these papers might be manipulated. Within an 8-second sequence, initiated by an unseen cue – perhaps a subtle, almost inaudible, low-frequency hum that ripples almost invisibly across the surface, or a sudden, soft flash of diffused light – all the thousands of paper squares simultaneously, and with breathtaking precision, leap a few inches into the air as if startled into life. Then, in a mesmerizing, perfectly synchronized, and incredibly high-speed aerial ballet, they begin to fold themselves in mid-air. With impossible, almost magical celerity and accuracy, unseen forces guide each individual square through a complex series of sharp creases, neat tucks, and intricate folds. The swarm of fluttering, self-constructing papers is a blur of color and motion, a chaotic yet utterly controlled explosion of activity. Within a mere five to six seconds, this frenetic process of airborne origami completes. Each of the thousands of squares has transformed into an identical, perfectly formed, complex origami figure – perhaps graceful cranes with outstretched wings, delicate multi-petaled lotus flowers, or miniature, intricately detailed dragons. In the final two to three seconds of the sequence, these newly formed origami figures, still hovering in mid-air, then smoothly and rapidly arrange themselves, like a flock of perfectly trained birds or a sophisticated, self-organizing swarm of nanobots, into a stunning, larger, three-dimensional collective pattern or a recognizable mosaic image – perhaps a giant, hovering sphere composed of countless tiny birds, or a complex, flowing wave of flowers, or even a pixel-perfect, three-dimensional representation of a face or symbol. This collective sculpture holds its form for a beat before the individual origami figures begin to gently, gracefully, and silently settle back down onto the surface, now arranged in their magnificent new configuration. This entire rapid, impossible, and beautiful transformation – from simple squares to a synchronized swarm of self-folding forms creating a complex collective artwork – is the core, eye-popping, and meticulously detailed VFX spectacle of the 8-second sequence. The visual is one of magical precision, emergent complexity, and the beauty of mass synchronized action.Prompt: The scene opens with a top-down or wide-angle shot showcasing a vast, perfectly flat, neutral-colored surface – perhaps the polished concrete floor of an enormous, empty aircraft hangar, or a giant, minimalist tabletop stretching beyond the frame, under bright, even, shadowless studio lighting. This surface is meticulously covered with thousands upon thousands of small, identical, brightly colored paper squares, arranged in a simple, orderly grid. Each square is a single, vibrant, uncreased sheet – a sea of reds, blues, yellows, greens, oranges, creating a stunning, static mosaic of pure potential. The atmosphere is one of quiet anticipation, a sense of immense latent energy waiting to be unleashed. There is no visible mechanism, no hint of how these papers might be manipulated. Within an 8-second sequence, initiated by an unseen cue – perhaps a subtle, almost inaudible, low-frequency hum that ripples almost invisibly across the surface, or a sudden, soft flash of diffused light – all the thousands of paper squares simultaneously, and with breathtaking precision, leap a few inches into the air as if startled into life. Then, in a mesmerizing, perfectly synchronized, and incredibly high-speed aerial ballet, they begin to fold themselves in mid-air. With impossible, almost magical celerity and accuracy, unseen forces guide each individual square through a complex series of sharp creases, neat tucks, and intricate folds. The swarm of fluttering, self-constructing papers is a blur of color and motion, a chaotic yet utterly controlled explosion of activity. Within a mere five to six seconds, this frenetic process of airborne origami completes. Each of the thousands of squares has transformed into an identical, perfectly formed, complex origami figure – perhaps graceful cranes with outstretched wings, delicate multi-petaled lotus flowers, or miniature, intricately detailed dragons. In the final two to three seconds of the sequence, these newly formed origami figures, still hovering in mid-air, then smoothly and rapidly arrange themselves, like a flock of perfectly trained birds or a sophisticated, self-organizing swarm of nanobots, into a stunning, larger, three-dimensional collective pattern or a recognizable mosaic image – perhaps a giant, hovering sphere composed of countless tiny birds, or a complex, flowing wave of flowers, or even a pixel-perfect, three-dimensional representation of a face or symbol. This collective sculpture holds its form for a beat before the individual origami figures begin to gently, gracefully, and silently settle back down onto the surface, now arranged in their magnificent new configuration. This entire rapid, impossible, and beautiful transformation – from simple squares to a synchronized swarm of self-folding forms creating a complex collective artwork – is the core, eye-popping, and meticulously detailed VFX spectacle of the 8-second sequence. The visual is one of magical precision, emergent complexity, and the beauty of mass synchronized action.Prompt: The scene opens with a top-down or wide-angle shot showcasing a vast, perfectly flat, neutral-colored surface – perhaps the polished concrete floor of an enormous, empty aircraft hangar, or a giant, minimalist tabletop stretching beyond the frame, under bright, even, shadowless studio lighting. This surface is meticulously covered with thousands upon thousands of small, identical, brightly colored paper squares, arranged in a simple, orderly grid. Each square is a single, vibrant, uncreased sheet – a sea of reds, blues, yellows, greens, oranges, creating a stunning, static mosaic of pure potential. The atmosphere is one of quiet anticipation, a sense of immense latent energy waiting to be unleashed. There is no visible mechanism, no hint of how these papers might be manipulated. Within an 8-second sequence, initiated by an unseen cue – perhaps a subtle, almost inaudible, low-frequency hum that ripples almost invisibly across the surface, or a sudden, soft flash of diffused light – all the thousands of paper squares simultaneously, and with breathtaking precision, leap a few inches into the air as if startled into life. Then, in a mesmerizing, perfectly synchronized, and incredibly high-speed aerial ballet, they begin to fold themselves in mid-air. With impossible, almost magical celerity and accuracy, unseen forces guide each individual square through a complex series of sharp creases, neat tucks, and intricate folds. The swarm of fluttering, self-constructing papers is a blur of color and motion, a chaotic yet utterly controlled explosion of activity. Within a mere five to six seconds, this frenetic process of airborne origami completes. Each of the thousands of squares has transformed into an identical, perfectly formed, complex origami figure – perhaps graceful cranes with outstretched wings, delicate multi-petaled lotus flowers, or miniature, intricately detailed dragons. In the final two to three seconds of the sequence, these newly formed origami figures, still hovering in mid-air, then smoothly and rapidly arrange themselves, like a flock of perfectly trained birds or a sophisticated, self-organizing swarm of nanobots, into a stunning, larger, three-dimensional collective pattern or a recognizable mosaic image – perhaps a giant, hovering sphere composed of countless tiny birds, or a complex, flowing wave of flowers, or even a pixel-perfect, three-dimensional representation of a face or symbol. This collective sculpture holds its form for a beat before the individual origami figures begin to gently, gracefully, and silently settle back down onto the surface, now arranged in their magnificent new configuration. This entire rapid, impossible, and beautiful transformation – from simple squares to a synchronized swarm of self-folding forms creating a complex collective artwork – is the core, eye-popping, and meticulously detailed VFX spectacle of the 8-second sequence. The visual is one of magical precision, emergent complexity, and the beauty of mass synchronized action. Your browser does not support the video tag. Prompt: In rural Ireland, circa 1860s, two women, their long, modest dresses of homespun fabric whipping gently in the strong coastal wind, walk with determined strides across a windswept cliff top. The ground is carpeted with hardy wildflowers in muted hues. They move steadily towards the precipitous edge, where the vast, turbulent grey-green ocean roars and crashes against the sheer rock face far below, sending plumes of white spray into the air. Prompt: In rural Ireland, circa 1860s, two women, their long, modest dresses of homespun fabric whipping gently in the strong coastal wind, walk with determined strides across a windswept cliff top. The ground is carpeted with hardy wildflowers in muted hues. They move steadily towards the precipitous edge, where the vast, turbulent grey-green ocean roars and crashes against the sheer rock face far below, sending plumes of white spray into the air.Prompt: In rural Ireland, circa 1860s, two women, their long, modest dresses of homespun fabric whipping gently in the strong coastal wind, walk with determined strides across a windswept cliff top. The ground is carpeted with hardy wildflowers in muted hues. They move steadily towards the precipitous edge, where the vast, turbulent grey-green ocean roars and crashes against the sheer rock face far below, sending plumes of white spray into the air.Prompt: In rural Ireland, circa 1860s, two women, their long, modest dresses of homespun fabric whipping gently in the strong coastal wind, walk with determined strides across a windswept cliff top. The ground is carpeted with hardy wildflowers in muted hues. They move steadily towards the precipitous edge, where the vast, turbulent grey-green ocean roars and crashes against the sheer rock face far below, sending plumes of white spray into the air. Slide 1 of 9 Your browser does not support the video tag. Prompt: A breathtaking, painterly 2D animated continuous visual narrative, rendered with the lush, vibrant, and slightly surreal, almost dreamlike, infused with the intricate, delicate detail of traditional Japanese woodblock prints (Ukiyo-e), follows a young, adventurous, and kind-hearted girl (perhaps with bright, curious eyes and wearing simple, practical, beautifully patterned traditional Japanese farm attire) as she befriends a colossal, gentle, ancient Forest Spirit. The Spirit is a magnificent, awe-inspiring creature, its form a harmonious blend of animal and plant – perhaps with moss-covered, antler-like branches, fur like shimmering leaves that change color with its mood, and eyes like deep, tranquil forest pools. They meet in a sun-dappled, sacred grove deep within an ancient, primeval forest, where impossibly tall, gnarled trees form a living cathedral and tiny, glowing, friendly forest sprites (Kodama-like) peek from behind mossy rocks and giant, fantastical mushrooms. The girl, initially awestruck, offers the massive Spirit a small, carefully cultivated offering – perhaps a perfectly ripe persimmon or a handful of wild berries – her gesture one of pure, innocent respect and affection. The Forest Spirit responds with a slow, gentle inclination of its massive head, its leafy fur rustling like a thousand whispers, and perhaps causes a shower of magical, luminous flower petals to drift down from the canopy, or a tiny, new sapling to sprout at the girl's feet. The animation captures the incredible, detailed textures of the forest, the Spirit's majestic yet gentle presence, and the profound, unspoken emotional connection forming between the child and this ancient guardian of nature. The color palette is a rich symphony of deep forest greens, earthy browns, vibrant floral hues, and the soft, magical glow of the sprites and the Spirit's own subtle luminescence. This continuous, sweeping visual journey is a celebration of the profound, often mystical, bond between humanity and nature, the innocence and courage of childhood, and the power of kindness and respect to bridge even the most fantastical of divides, an affectionate, visually intoxicating ode to ecological harmony and interspecies understanding. The only implied sounds are the gentle rustling of leaves, the distant calls of unseen forest birds, the girl's soft, respectful breathing, the Spirit's deep, resonant, almost inaudible hum, and a soaring, emotionally resonant, orchestral score. Prompt: A breathtaking, painterly 2D animated continuous visual narrative, rendered with the lush, vibrant, and slightly surreal, almost dreamlike, infused with the intricate, delicate detail of traditional Japanese woodblock prints (Ukiyo-e), follows a young, adventurous, and kind-hearted girl (perhaps with bright, curious eyes and wearing simple, practical, beautifully patterned traditional Japanese farm attire) as she befriends a colossal, gentle, ancient Forest Spirit. The Spirit is a magnificent, awe-inspiring creature, its form a harmonious blend of animal and plant – perhaps with moss-covered, antler-like branches, fur like shimmering leaves that change color with its mood, and eyes like deep, tranquil forest pools. They meet in a sun-dappled, sacred grove deep within an ancient, primeval forest, where impossibly tall, gnarled trees form a living cathedral and tiny, glowing, friendly forest sprites (Kodama-like) peek from behind mossy rocks and giant, fantastical mushrooms. The girl, initially awestruck, offers the massive Spirit a small, carefully cultivated offering – perhaps a perfectly ripe persimmon or a handful of wild berries – her gesture one of pure, innocent respect and affection. The Forest Spirit responds with a slow, gentle inclination of its massive head, its leafy fur rustling like a thousand whispers, and perhaps causes a shower of magical, luminous flower petals to drift down from the canopy, or a tiny, new sapling to sprout at the girl's feet. The animation captures the incredible, detailed textures of the forest, the Spirit's majestic yet gentle presence, and the profound, unspoken emotional connection forming between the child and this ancient guardian of nature. The color palette is a rich symphony of deep forest greens, earthy browns, vibrant floral hues, and the soft, magical glow of the sprites and the Spirit's own subtle luminescence. This continuous, sweeping visual journey is a celebration of the profound, often mystical, bond between humanity and nature, the innocence and courage of childhood, and the power of kindness and respect to bridge even the most fantastical of divides, an affectionate, visually intoxicating ode to ecological harmony and interspecies understanding. The only implied sounds are the gentle rustling of leaves, the distant calls of unseen forest birds, the girl's soft, respectful breathing, the Spirit's deep, resonant, almost inaudible hum, and a soaring, emotionally resonant, orchestral score.Prompt: A breathtaking, painterly 2D animated continuous visual narrative, rendered with the lush, vibrant, and slightly surreal, almost dreamlike, infused with the intricate, delicate detail of traditional Japanese woodblock prints (Ukiyo-e), follows a young, adventurous, and kind-hearted girl (perhaps with bright, curious eyes and wearing simple, practical, beautifully patterned traditional Japanese farm attire) as she befriends a colossal, gentle, ancient Forest Spirit. The Spirit is a magnificent, awe-inspiring creature, its form a harmonious blend of animal and plant – perhaps with moss-covered, antler-like branches, fur like shimmering leaves that change color with its mood, and eyes like deep, tranquil forest pools. They meet in a sun-dappled, sacred grove deep within an ancient, primeval forest, where impossibly tall, gnarled trees form a living cathedral and tiny, glowing, friendly forest sprites (Kodama-like) peek from behind mossy rocks and giant, fantastical mushrooms. The girl, initially awestruck, offers the massive Spirit a small, carefully cultivated offering – perhaps a perfectly ripe persimmon or a handful of wild berries – her gesture one of pure, innocent respect and affection. The Forest Spirit responds with a slow, gentle inclination of its massive head, its leafy fur rustling like a thousand whispers, and perhaps causes a shower of magical, luminous flower petals to drift down from the canopy, or a tiny, new sapling to sprout at the girl's feet. The animation captures the incredible, detailed textures of the forest, the Spirit's majestic yet gentle presence, and the profound, unspoken emotional connection forming between the child and this ancient guardian of nature. The color palette is a rich symphony of deep forest greens, earthy browns, vibrant floral hues, and the soft, magical glow of the sprites and the Spirit's own subtle luminescence. This continuous, sweeping visual journey is a celebration of the profound, often mystical, bond between humanity and nature, the innocence and courage of childhood, and the power of kindness and respect to bridge even the most fantastical of divides, an affectionate, visually intoxicating ode to ecological harmony and interspecies understanding. The only implied sounds are the gentle rustling of leaves, the distant calls of unseen forest birds, the girl's soft, respectful breathing, the Spirit's deep, resonant, almost inaudible hum, and a soaring, emotionally resonant, orchestral score. Your browser does not support the video tag. Prompt: The camera slowly pushes forward into a breathtaking ice cave, its jagged walls sculpted by nature into intricate patterns of blues and whites, reflecting the ethereal light from an opening ahead. The crunch of ice underfoot and the drip-drip of melting water create a serene, echoing soundscape. As the camera moves closer, a gentle, ambient melody begins, swelling with the light from the cave's exit. The camera emerges from the narrow opening into a vast, sun-drenched valley, revealing a group of polar bears playfully sliding down an ice slope, their roars echoing with joy. Prompt: The camera slowly pushes forward into a breathtaking ice cave, its jagged walls sculpted by nature into intricate patterns of blues and whites, reflecting the ethereal light from an opening ahead. The crunch of ice underfoot and the drip-drip of melting water create a serene, echoing soundscape. As the camera moves closer, a gentle, ambient melody begins, swelling with the light from the cave's exit. The camera emerges from the narrow opening into a vast, sun-drenched valley, revealing a group of polar bears playfully sliding down an ice slope, their roars echoing with joy.Prompt: The camera slowly pushes forward into a breathtaking ice cave, its jagged walls sculpted by nature into intricate patterns of blues and whites, reflecting the ethereal light from an opening ahead. The crunch of ice underfoot and the drip-drip of melting water create a serene, echoing soundscape. As the camera moves closer, a gentle, ambient melody begins, swelling with the light from the cave's exit. The camera emerges from the narrow opening into a vast, sun-drenched valley, revealing a group of polar bears playfully sliding down an ice slope, their roars echoing with joy. Your browser does not support the video tag. Prompt: Camping (Stop Motion): Camper: "I'm one with nature now!" Bear: "Nature would prefer some personal space." Prompt: Camping (Stop Motion): Camper: "I'm one with nature now!" Bear: "Nature would prefer some personal space."Prompt: Camping (Stop Motion): Camper: "I'm one with nature now!" Bear: "Nature would prefer some personal space." Your browser does not support the video tag. Prompt: A handheld shot follows a wok as it’s expertly flicked, sending vibrant, sizzling vegetables tumbling over themselves in a flash of motion and steam. Audio: a metallic clank and a sharp whoosh. Prompt: A handheld shot follows a wok as it’s expertly flicked, sending vibrant, sizzling vegetables tumbling over themselves in a flash of motion and steam. Audio: a metallic clank and a sharp whoosh.Prompt: A handheld shot follows a wok as it’s expertly flicked, sending vibrant, sizzling vegetables tumbling over themselves in a flash of motion and steam. Audio: a metallic clank and a sharp whoosh. Your browser does not support the video tag. Prompt: A keyboard whose keys are made of different types of candy. Typing makes sweet, crunchy sounds. Audio: Crunchy, sugary typing sounds, delighted giggles. Prompt: A keyboard whose keys are made of different types of candy. Typing makes sweet, crunchy sounds. Audio: Crunchy, sugary typing sounds, delighted giggles.Prompt: A keyboard whose keys are made of different types of candy. Typing makes sweet, crunchy sounds. Audio: Crunchy, sugary typing sounds, delighted giggles. Your browser does not support the video tag. Prompt: The camera begins with a slow, elegant track along the richly paneled walls of a dimly lit, sophisticated hallway, the warm glow of the ornate wall sconces casting inviting reflections on the polished floor. Soft jazz music plays in the background. As we approach an arched entryway, the camera performs a graceful push-in, revealing a grand mirror and flickering candles, then smoothly pivots to the right, opening up to a luxurious home bar. The clinking of ice and the murmur of conversation become audible. The camera settles on a close-up of a perfectly crafted cocktail. "Welcome," a smooth, baritone voice says. "Care for a taste?" Suddenly, a renowned mixologist, known for his eccentric creations, steps into frame, followed by a playful, mischievous cat that jumps onto the bar, batting at a cocktail stirrer. Prompt: The camera begins with a slow, elegant track along the richly paneled walls of a dimly lit, sophisticated hallway, the warm glow of the ornate wall sconces casting inviting reflections on the polished floor. Soft jazz music plays in the background. As we approach an arched entryway, the camera performs a graceful push-in, revealing a grand mirror and flickering candles, then smoothly pivots to the right, opening up to a luxurious home bar. The clinking of ice and the murmur of conversation become audible. The camera settles on a close-up of a perfectly crafted cocktail. "Welcome," a smooth, baritone voice says. "Care for a taste?" Suddenly, a renowned mixologist, known for his eccentric creations, steps into frame, followed by a playful, mischievous cat that jumps onto the bar, batting at a cocktail stirrer.Prompt: The camera begins with a slow, elegant track along the richly paneled walls of a dimly lit, sophisticated hallway, the warm glow of the ornate wall sconces casting inviting reflections on the polished floor. Soft jazz music plays in the background. As we approach an arched entryway, the camera performs a graceful push-in, revealing a grand mirror and flickering candles, then smoothly pivots to the right, opening up to a luxurious home bar. The clinking of ice and the murmur of conversation become audible. The camera settles on a close-up of a perfectly crafted cocktail. "Welcome," a smooth, baritone voice says. "Care for a taste?" Suddenly, a renowned mixologist, known for his eccentric creations, steps into frame, followed by a playful, mischievous cat that jumps onto the bar, batting at a cocktail stirrer. Your browser does not support the video tag. Prompt: A snow-covered plain of iridescent moon-dust under twilight skies. Thirty-foot crystalline flowers bloom, refracting light into slow-moving rainbows. A fur-cloaked figure walks between these colossal blossoms, leaving the only footprints in untouched dust. Prompt: A snow-covered plain of iridescent moon-dust under twilight skies. Thirty-foot crystalline flowers bloom, refracting light into slow-moving rainbows. A fur-cloaked figure walks between these colossal blossoms, leaving the only footprints in untouched dust.Prompt: A snow-covered plain of iridescent moon-dust under twilight skies. Thirty-foot crystalline flowers bloom, refracting light into slow-moving rainbows. A fur-cloaked figure walks between these colossal blossoms, leaving the only footprints in untouched dust. Your browser does not support the video tag. Prompt: A woman, classical violinist with intense focus plays a complex, rapid passage from a Vivaldi concerto in an ornate, sunlit baroque hall during a rehearsal. Their bow dances across the strings with virtuosic speed and precision. Audio: Bright, virtuosic violin playing, resonant acoustics of the hall, distant footsteps of crew, conductor's occasional soft count-in (muffled), rustling sheet music. Prompt: A woman, classical violinist with intense focus plays a complex, rapid passage from a Vivaldi concerto in an ornate, sunlit baroque hall during a rehearsal. Their bow dances across the strings with virtuosic speed and precision. Audio: Bright, virtuosic violin playing, resonant acoustics of the hall, distant footsteps of crew, conductor's occasional soft count-in (muffled), rustling sheet music.Prompt: A woman, classical violinist with intense focus plays a complex, rapid passage from a Vivaldi concerto in an ornate, sunlit baroque hall during a rehearsal. Their bow dances across the strings with virtuosic speed and precision. Audio: Bright, virtuosic violin playing, resonant acoustics of the hall, distant footsteps of crew, conductor's occasional soft count-in (muffled), rustling sheet music. Your browser does not support the video tag. Prompt: A close up in a smooth, slow pan focuses intently on diced onions hitting a scorching hot pan, instantly creating a dramatic sizzle. Audio: distinct sizzle. Prompt: A close up in a smooth, slow pan focuses intently on diced onions hitting a scorching hot pan, instantly creating a dramatic sizzle. Audio: distinct sizzle.Prompt: A close up in a smooth, slow pan focuses intently on diced onions hitting a scorching hot pan, instantly creating a dramatic sizzle. Audio: distinct sizzle. Greater control, consistency, and creativity than ever before. Your browser does not support the video tag. Add ingredients to your videoMake sure videos align with your creative vision by giving Veo reference images of a scene, a character, or an object to guide its generation. Now includes audio.Slide 1 of 10 Your browser does not support the video tag. Prompt: Camera dramatically dollies around the subject in this striking cinematic scene. It captures a high-tension moment within a long, sterile, monochromatic green corridor. A lone woman, dressed in a dark, flowing trench coat and trousers that billow dramatically, is suspended mid-air in a powerful, graceful leap. Her arms are outstretched as if bracing for impact or propelling herself forward. Her sharp profile reveals an intense, focused expression, suggesting profound determination. Prompt: Camera dramatically dollies around the subject in this striking cinematic scene. It captures a high-tension moment within a long, sterile, monochromatic green corridor. A lone woman, dressed in a dark, flowing trench coat and trousers that billow dramatically, is suspended mid-air in a powerful, graceful leap. Her arms are outstretched as if bracing for impact or propelling herself forward. Her sharp profile reveals an intense, focused expression, suggesting profound determination.Prompt: Camera dramatically dollies around the subject in this striking cinematic scene. It captures a high-tension moment within a long, sterile, monochromatic green corridor. A lone woman, dressed in a dark, flowing trench coat and trousers that billow dramatically, is suspended mid-air in a powerful, graceful leap. Her arms are outstretched as if bracing for impact or propelling herself forward. Her sharp profile reveals an intense, focused expression, suggesting profound determination. Your browser does not support the video tag. Prompt: Fly through the window to find the kitchen and the cans of fizzy drink sitting on the kitchen table in this award winning commercial. Smooth seamless transitions and smooth sound. Prompt: Fly through the window to find the kitchen and the cans of fizzy drink sitting on the kitchen table in this award winning commercial. Smooth seamless transitions and smooth sound.Prompt: Fly through the window to find the kitchen and the cans of fizzy drink sitting on the kitchen table in this award winning commercial. Smooth seamless transitions and smooth sound. Your browser does not support the video tag. Prompt: A medium shot of the emperor as he walks with his white tiger. Prompt: A medium shot of the emperor as he walks with his white tiger.Prompt: A medium shot of the emperor as he walks with his white tiger. Your browser does not support the video tag. Prompt: Documentary style, A raccoon manages a coffee shop. Dialogue. Prompt: Documentary style, A raccoon manages a coffee shop. Dialogue.Prompt: Documentary style, A raccoon manages a coffee shop. Dialogue. Your browser does not support the video tag. Prompt: Engaging film trailer based on these images. Prompt: Engaging film trailer based on these images.Prompt: Engaging film trailer based on these images. Your browser does not support the video tag. Prompt: Music video of model singing a love song in an abstract flower garden with floating macaroons. Prompt: Music video of model singing a love song in an abstract flower garden with floating macaroons.Prompt: Music video of model singing a love song in an abstract flower garden with floating macaroons. Your browser does not support the video tag. Prompt: I'm walking on Mars in a spacesuit. Prompt: I'm walking on Mars in a spacesuit.Prompt: I'm walking on Mars in a spacesuit. Your browser does not support the video tag. Prompt: Latte art that animates into mini 3D castle made from latte. Prompt: Latte art that animates into mini 3D castle made from latte.Prompt: Latte art that animates into mini 3D castle made from latte. Your browser does not support the video tag. Prompt: Car with pattern drives in fantasy landscape made from patterns, cinematic trailer, dramatic music. Prompt: Car with pattern drives in fantasy landscape made from patterns, cinematic trailer, dramatic music.Prompt: Car with pattern drives in fantasy landscape made from patterns, cinematic trailer, dramatic music. Your browser does not support the video tag. Prompt: Picture a fashion show where model glide through a cathedral, fully constructed from shimmering crystal. Prompt: Picture a fashion show where model glide through a cathedral, fully constructed from shimmering crystal.Prompt: Picture a fashion show where model glide through a cathedral, fully constructed from shimmering crystal. Match your styleCapture your desired aesthetic by providing a style reference image, and Veo will generate videos with the same visual style, from paintings to cinematic looks. Input image Input imageInput imageInput image Your browser does not support the video tag. Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases. A multi-layered diorama depicts a cute neighborhood street entirely from folded paper – houses with sharp rooflines, precise white picket fences, and layered, geometric flowers and rose bushes in vibrant paper hues. Focused lighting enhances the dimensionality. A vibrant origami cat, its body segmented by distinct, sharp folds, moves with articulated, deliberate steps along the paper sidewalk. Its limbs shift segment by segment, maintaining crisp creases as it progresses. The viewpoint tracks smoothly alongside the cat, revealing successive layers of the detailed papercraft neighborhood scrolling past, enhancing the scene's geometric depth and dimensionality. Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases. A multi-layered diorama depicts a cute neighborhood street entirely from folded paper – houses with sharp rooflines, precise white picket fences, and layered, geometric flowers and rose bushes in vibrant paper hues. Focused lighting enhances the dimensionality. A vibrant origami cat, its body segmented by distinct, sharp folds, moves with articulated, deliberate steps along the paper sidewalk. Its limbs shift segment by segment, maintaining crisp creases as it progresses. The viewpoint tracks smoothly alongside the cat, revealing successive layers of the detailed papercraft neighborhood scrolling past, enhancing the scene's geometric depth and dimensionality.Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases. A multi-layered diorama depicts a cute neighborhood street entirely from folded paper – houses with sharp rooflines, precise white picket fences, and layered, geometric flowers and rose bushes in vibrant paper hues. Focused lighting enhances the dimensionality. A vibrant origami cat, its body segmented by distinct, sharp folds, moves with articulated, deliberate steps along the paper sidewalk. Its limbs shift segment by segment, maintaining crisp creases as it progresses. The viewpoint tracks smoothly alongside the cat, revealing successive layers of the detailed papercraft neighborhood scrolling past, enhancing the scene's geometric depth and dimensionality.Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases. A multi-layered diorama depicts a cute neighborhood street entirely from folded paper – houses with sharp rooflines, precise white picket fences, and layered, geometric flowers and rose bushes in vibrant paper hues. Focused lighting enhances the dimensionality. A vibrant origami cat, its body segmented by distinct, sharp folds, moves with articulated, deliberate steps along the paper sidewalk. Its limbs shift segment by segment, maintaining crisp creases as it progresses. The viewpoint tracks smoothly alongside the cat, revealing successive layers of the detailed papercraft neighborhood scrolling past, enhancing the scene's geometric depth and dimensionality. Your browser does not support the video tag. Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a detailed, multi-layered paper diorama featuring a sharply folded bus stop sign. Focused lighting enhances the geometric shapes. Five distinct origami children, constructed with precise folds defining summer clothes and angular backpacks, populate the scene. Two figures stand facing each other, their paper heads tilting slightly back and forth on sharp neck creases in articulated movements suggesting conversation. The remaining three figures execute a game: their folded leg sections bend sharply at distinct knee creases, then straighten abruptly, causing their entire forms to lift momentarily off the paper ground plane before settling back, repeating this crisp, angular jumping motion. Each movement is segmented and deliberate. Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a detailed, multi-layered paper diorama featuring a sharply folded bus stop sign. Focused lighting enhances the geometric shapes. Five distinct origami children, constructed with precise folds defining summer clothes and angular backpacks, populate the scene. Two figures stand facing each other, their paper heads tilting slightly back and forth on sharp neck creases in articulated movements suggesting conversation. The remaining three figures execute a game: their folded leg sections bend sharply at distinct knee creases, then straighten abruptly, causing their entire forms to lift momentarily off the paper ground plane before settling back, repeating this crisp, angular jumping motion. Each movement is segmented and deliberate.Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a detailed, multi-layered paper diorama featuring a sharply folded bus stop sign. Focused lighting enhances the geometric shapes. Five distinct origami children, constructed with precise folds defining summer clothes and angular backpacks, populate the scene. Two figures stand facing each other, their paper heads tilting slightly back and forth on sharp neck creases in articulated movements suggesting conversation. The remaining three figures execute a game: their folded leg sections bend sharply at distinct knee creases, then straighten abruptly, causing their entire forms to lift momentarily off the paper ground plane before settling back, repeating this crisp, angular jumping motion. Each movement is segmented and deliberate.Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a detailed, multi-layered paper diorama featuring a sharply folded bus stop sign. Focused lighting enhances the geometric shapes. Five distinct origami children, constructed with precise folds defining summer clothes and angular backpacks, populate the scene. Two figures stand facing each other, their paper heads tilting slightly back and forth on sharp neck creases in articulated movements suggesting conversation. The remaining three figures execute a game: their folded leg sections bend sharply at distinct knee creases, then straighten abruptly, causing their entire forms to lift momentarily off the paper ground plane before settling back, repeating this crisp, angular jumping motion. Each movement is segmented and deliberate. Your browser does not support the video tag. Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a multi-layered paper diorama. Focused lighting enhances geometric shapes and dimensionality. A vibrant yellow school bus, constructed with sharp, precise folds defining its iconic shape, moves with deliberate, segmented progression along a winding road represented by a crisply folded paper strip. As the bus navigates the road's angular turns, its distinct paper facets catch and reflect the focused light, showcasing its geometric form. Its angular wheels might rotate sectionally or simply slide along the paper path. Above, the sky is a flat blue paper layer, featuring sharply folded, geometric white clouds and a bright, faceted origami sun casting crisp shadows across the layered scene. Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a multi-layered paper diorama. Focused lighting enhances geometric shapes and dimensionality. A vibrant yellow school bus, constructed with sharp, precise folds defining its iconic shape, moves with deliberate, segmented progression along a winding road represented by a crisply folded paper strip. As the bus navigates the road's angular turns, its distinct paper facets catch and reflect the focused light, showcasing its geometric form. Its angular wheels might rotate sectionally or simply slide along the paper path. Above, the sky is a flat blue paper layer, featuring sharply folded, geometric white clouds and a bright, faceted origami sun casting crisp shadows across the layered scene.Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a multi-layered paper diorama. Focused lighting enhances geometric shapes and dimensionality. A vibrant yellow school bus, constructed with sharp, precise folds defining its iconic shape, moves with deliberate, segmented progression along a winding road represented by a crisply folded paper strip. As the bus navigates the road's angular turns, its distinct paper facets catch and reflect the focused light, showcasing its geometric form. Its angular wheels might rotate sectionally or simply slide along the paper path. Above, the sky is a flat blue paper layer, featuring sharply folded, geometric white clouds and a bright, faceted origami sun casting crisp shadows across the layered scene.Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases within a multi-layered paper diorama. Focused lighting enhances geometric shapes and dimensionality. A vibrant yellow school bus, constructed with sharp, precise folds defining its iconic shape, moves with deliberate, segmented progression along a winding road represented by a crisply folded paper strip. As the bus navigates the road's angular turns, its distinct paper facets catch and reflect the focused light, showcasing its geometric form. Its angular wheels might rotate sectionally or simply slide along the paper path. Above, the sky is a flat blue paper layer, featuring sharply folded, geometric white clouds and a bright, faceted origami sun casting crisp shadows across the layered scene. Keep your characters consistentEnsure characters maintain their appearance across different scenes in your videos by giving Veo reference images of your character. Input image Input imageInput imageInput image Your browser does not support the video tag. Prompt: a cute monster walking towards the camera Prompt: a cute monster walking towards the cameraPrompt: a cute monster walking towards the cameraPrompt: a cute monster walking towards the camera Your browser does not support the video tag. Prompt: a cute monster swimming underwater Prompt: a cute monster swimming underwaterPrompt: a cute monster swimming underwaterPrompt: a cute monster swimming underwater Your browser does not support the video tag. Prompt: a cute monster walking in a candy wonderland Prompt: a cute monster walking in a candy wonderlandPrompt: a cute monster walking in a candy wonderlandPrompt: a cute monster walking in a candy wonderland Extend your sceneExtend clips into longer, more dynamic videos. Use the last second of your first shot to continue the story – while maintaining visual and audio consistency. Your browser does not support the video tag. Input video Input videoInput videoInput video Your browser does not support the video tag. Prompt 1: Graceful dancer is slowly dancing to classical music.Prompt 2: A male dancer comes in, gracefully dancing with the woman as classical music plays.Prompt 3: More dancers show up on the stage.Prompt 4: The classical music continues, and the dancers continue to dance Prompt 1: Graceful dancer is slowly dancing to classical music.Prompt 2: A male dancer comes in, gracefully dancing with the woman as classical music plays.Prompt 3: More dancers show up on the stage.Prompt 4: The classical music continues, and the dancers continue to dancePrompt 1: Graceful dancer is slowly dancing to classical music.Prompt 2: A male dancer comes in, gracefully dancing with the woman as classical music plays.Prompt 3: More dancers show up on the stage.Prompt 4: The classical music continues, and the dancers continue to dancePrompt 1: Graceful dancer is slowly dancing to classical music. Prompt 2: A male dancer comes in, gracefully dancing with the woman as classical music plays.Prompt 3: More dancers show up on the stage.Prompt 4: The classical music continues, and the dancers continue to danceCamera controlsPrecisely control the framing and exact movement of shots in your video using camera controls. Your browser does not support the video tag. Move back Move backMove backMove back Your browser does not support the video tag. Zoom in Zoom inZoom inZoom in Your browser does not support the video tag. Move up Move upMove upMove up Your browser does not support the video tag. Move right Move rightMove rightMove right First and last frameCreate smooth, artful, and epic transitions between images provided for the first and last frame. First frame First frameFirst frameFirst frame Last frame Last frameLast frameLast frame Your browser does not support the video tag. OutpaintingGo beyond the original frame. Outpainting expands your video with new, matching parts that look real, helping it fit any screen size or shape. Your browser does not support the video tag. Input video Input videoInput videoInput video Your browser does not support the video tag. Output video Output videoOutput videoOutput video Add objectReimagine videos by introducing new objects - from realistic details to fantastical elements. Veo considers scale, interactions, and shadows to create a natural, realistic-looking video. Your browser does not support the video tag. Input video Input videoInput videoInput video Your browser does not support the video tag. Prompt: Add a man with a torch Prompt: Add a man with a torchPrompt: Add a man with a torchPrompt: Add a man with a torch Remove objectSeamlessly eliminate unwanted objects from videos - from distracting details to large items. Veo preserves the scene's natural composition, interactions, and shadows. Your browser does not support the video tag. Input video Input videoInput videoInput video Your browser does not support the video tag. Prompt: Remove spaceship Prompt: Remove spaceshipPrompt: Remove spaceshipPrompt: Remove spaceship Character controlsBring characters to life, using your body, face and voice to animate them. Your browser does not support the video tag. Input video Input videoInput videoInput video Input image Input imageInput imageInput image Your browser does not support the video tag. Prompt: Use your body to drive lifelike character movement and expressive actions that respond to your movementsInput video Prompt: Use your body to drive lifelike character movement and expressive actions that respond to your movementsInput videoPrompt: Use your body to drive lifelike character movement and expressive actions that respond to your movementsInput videoPrompt: Use your body to drive lifelike character movement and expressive actions that respond to your movementsInput video Motion controlsDefine the exact movement of objects in your video. Select an object and define their path, and Veo will bring them to life in motion. Your browser does not support the video tag. Your browser does not support the video tag. Professional grade resolutionGenerate outputs in 1080p and 4K. 1080p resolution offers a sharper, cleaner video perfect for editing, while 4K captures rich textures and stunning clarity—ideal for high-end productions.Slide 1 of 3 Our partnership with Darren Aronofsky’s Primordial SoupWe’ve teamed up with Primordial Soup, a new venture dedicated to storytelling innovation, founded by visionary director Darren Aronofsky. Together, we’re shaping Veo’s capabilities to open new possibilities for cinematic storytelling.Primordial Soup is using Veo to explore new filmmaking techniques – including how to integrate live-action footage with Veo-generated video. Through this partnership, Primordial Soup has produced three short films with emerging filmmakers.Slide 1 of 2 FlowBuilt with creatives, for creatives. Flow enables you to create seamless cinematic clips, scenes, and stories using our most capable generative AI models. Create with Flow Your browser does not support the video tag. PerformanceVeo 3.1 is a new era for video generation. It's state of the art in text-to-video, image-to-video, text-to-audio+video generation, and realistic physics. View model card View tech report Slide 1 of 9Text-to-videoT2V Overall preferenceParticipants viewed 1,003 prompts and respective videos on MovieGenBench, a benchmark dataset released by Meta. Veo 3.1 performs best on overall preference.Text-to-videoT2V Text alignmentParticipants viewed 1,003 prompts and respective videos on MovieGenBench, a benchmark dataset released by Meta. Veo 3.1 performs best on its capability to follow prompts accurately.Text-to-videoT2V Visual qualityParticipants viewed 1,003 prompts and respective videos on MovieGenBench, a benchmark dataset released by Meta. Participants rate the visual quality of Veo’s outputs more highly than other models. Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images. Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images.Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images. Image-to-videoI2V Overall preferenceWhen participants viewed 355 image and text pairs from the VBench I2V benchmark, Veo 3’s outputs were preferred overall compared to other models. Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images. Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images.Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images. Image-to-videoI2V Text alignmentWhen participants viewed 355 image and text pairs from the VBench I2V benchmark, Veo 3.1’s outputs were preferred to other models for capturing the intent of the prompt. Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images. Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images.Note: We were unable to compare image to video with Sora 2 Pro because it currently does not support realistic human images. Image-to-videoI2V Visual qualityWhen participants viewed 355 image and text pairs from the VBench I2V benchmark, Veo 3.1’s outputs were preferred overall to other models for the visual quality.Text-to-video and audioT2VA Audio visual overall preferenceParticipants viewed 527 prompts from MovieGenBench, and had an overall preference for Veo’s outputs with audio over other models.Text-to-video and audioT2VA Audio-video alignmentParticipants viewed 527 prompts from MovieGenBench, and chose Veo 3.1’s outputs over other models for having audio that is better synchronized with the video content.Text-to-videoT2V Visually realistic physicsParticipants choose Veo 3.1’s outputs over other models for having visually realistic physics on the physics subset of MovieGenBench prompts. Veo’s ingredients to video, Scene Extension, First and Last Frame, and Object Insertion capabilities have achieved state of the art results in head-to-head comparisons of outputs by human raters on internal benchmarks.Slide 1 of 4 [1] Human raters conducted direct side-by-side comparisons across 364 diverse examples (each including a prompt and 1-3 reference images and evaluating a single generated video per prompt + reference images). All comparisons were done at 1280x720 resolution. Veo videos are 8 seconds long. All other videos are 10 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart. [1] Human raters conducted direct side-by-side comparisons across 364 diverse examples (each including a prompt and 1-3 reference images and evaluating a single generated video per prompt + reference images). All comparisons were done at 1280x720 resolution. Veo videos are 8 seconds long. All other videos are 10 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart.[1] Human raters conducted direct side-by-side comparisons across 364 diverse examples (each including a prompt and 1-3 reference images and evaluating a single generated video per prompt + reference images). All comparisons were done at 1280x720 resolution. Veo videos are 8 seconds long. All other videos are 10 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart. Ingredients to videoOverall preference and visual qualityVeo’s “Ingredients to Video” capability has achieved state-of-the-art results for: Overall Preference and Visual Quality in head-to-head comparisons by human raters against other leading video generation models on internal benchmarks. [1] [1] Human raters conducted direct side-by-side comparisons across 80 diverse examples (each including initial text prompt and extension prompt evaluating one generated video per example. All comparisons were done at 720x1280 resolution. Veo videos are 8 seconds long. All other videos are 6 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart. [1] Human raters conducted direct side-by-side comparisons across 80 diverse examples (each including initial text prompt and extension prompt evaluating one generated video per example. All comparisons were done at 720x1280 resolution. Veo videos are 8 seconds long. All other videos are 6 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart.[1] Human raters conducted direct side-by-side comparisons across 80 diverse examples (each including initial text prompt and extension prompt evaluating one generated video per example. All comparisons were done at 720x1280 resolution. Veo videos are 8 seconds long. All other videos are 6 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart. To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart.Ingredients to videoScene extensionVeo’s “Scene Extension” capability has achieved state-of-the-art results for: Overall Preference, Prompt Alignment and Visual Quality in head-to-head comparisons by human raters against other leading video generation models on internal benchmarks. [1] [1] Human raters conducted direct side-by-side comparisons across 106 diverse examples (each including a prompt and a start and end images, evaluating one generated video per example. All comparisons were done at 720x1280 resolution. Veo videos are 8 seconds long. All other videos are 10 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart. [1] Human raters conducted direct side-by-side comparisons across 106 diverse examples (each including a prompt and a start and end images, evaluating one generated video per example. All comparisons were done at 720x1280 resolution. Veo videos are 8 seconds long. All other videos are 10 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart.[1] Human raters conducted direct side-by-side comparisons across 106 diverse examples (each including a prompt and a start and end images, evaluating one generated video per example. All comparisons were done at 720x1280 resolution. Veo videos are 8 seconds long. All other videos are 10 seconds long and shown at full length to raters.To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart. To ensure a fair visual comparison, all tests were conducted without sound. Audio was only enabled for the Overall Preference metric, and only when competing models had native sound support for the capability. We have indicated when audio was an active part of the comparison on the labels in the chart.Ingredients to videoFirst and last frameVeo’s “First and Last Frame” capability has achieved state-of-the-art results for: Overall Preference, Prompt Alignment and Visual Quality, in head-to-head comparisons by human raters against other leading video generation models on internal benchmarks. [1]. [1] Human raters conducted direct side-by-side comparisons across 124 diverse examples (each including a video and a prompt, specifying which object to insert, evaluating one generated video per example.All comparisons were done at 1280x720 (or 720x1280) resolution. Veo videos are 6 seconds long. All competing model videos are 5 seconds long and shown at full length to raters. All videos had no sound. [1] Human raters conducted direct side-by-side comparisons across 124 diverse examples (each including a video and a prompt, specifying which object to insert, evaluating one generated video per example.All comparisons were done at 1280x720 (or 720x1280) resolution. Veo videos are 6 seconds long. All competing model videos are 5 seconds long and shown at full length to raters. All videos had no sound.[1] Human raters conducted direct side-by-side comparisons across 124 diverse examples (each including a video and a prompt, specifying which object to insert, evaluating one generated video per example.All comparisons were done at 1280x720 (or 720x1280) resolution. Veo videos are 6 seconds long. All competing model videos are 5 seconds long and shown at full length to raters. All videos had no sound. All comparisons were done at 1280x720 (or 720x1280) resolution. Veo videos are 6 seconds long. All competing model videos are 5 seconds long and shown at full length to raters. All videos had no sound.Ingredients to videoObject insertionVeo’s “Object Insertion” capability has achieved state-of-the-art results for Overall Preference and Visual Quality, in head-to-head comparisons by human raters against other leading video generation models on internal benchmarks [1]. SafetyFrom development to deploymentWe built Veo with responsibility and safety in mind. We block harmful requests and results, we test how new features might affect safety, and we have both our own teams and outside experts try to find and fix potential problems before release.It's crucial to introduce technologies such as Veo in a responsible way. To achieve this, videos made with Veo will be marked with SynthID, our advanced technology for watermarking and detecting content generated by AI. Additionally, Veo outputs will undergo safety evaluations and checks for memorized content to reduce potential issues related to privacy, copyright infringement, and bias. Learn more LimitationsWhile Veo continues to make incredible strides in video generation, creating videos with natural and consistent spoken audio, particularly for shorter speech segments, remains an area of active development. We're continuously working to refine audio synchronization and eliminate instances of incoherent speech.Empowering production workflowsDiscover how developers and studios are leveraging Veo to transform storytelling and production. PromisePromise Studios uses Veo 3.1 within its MUSE Platform to enhance generative storyboarding and previsualization for director-driven storytelling at production quality. Learn more VolleyVolley powers its new AI-powered RPG, Wit's End, with Veo 3.1 to deliver static cinematics and dynamically generated assets narrating player progress. Learn more OpusClipOpusClip leverages Veo 3.1 within its Agent Opus to boost motion graphics and create realistic promotional videos for SMBs. Learn more Try Veo GeminiSupercharge your creativity and productivity Try in Gemini FlowAn AI filmmaking tool built with and for creatives Try in Flow Google VidsAI-powered video creation for work Try in Google Vids Google AI StudioThe fastest path from prompt to production Try in Google AI Studio Gemini APIGet started building with cutting-edge AI models Learn more Vertex AI StudioTest, tune, and deploy enterprise-ready generative AI Learn more Veo 3 was made possible by key research and engineering contributions from Abhishek Sharma, Ágoston Weisz, Alina Kuznetsova, Ali Razavi, Aleksander Bulski, Aleksander Holynski, Ankush Gupta, Austin Waters, Ben Poole, Daniel Tanis, Derek Gasaway, Dumitru Erhan, Enric Corona, Evgeny Sluzhaev, Frank Belletti, Gabe Barth-Maron, Hakan Erdogan, Henna Nandwani, Hernan Moraldo, Ilya Figotin, Igor Saprykin, Jason Baldridge, Jeff Donahue, Jiawei Xia, Jimmy Shi, José Lezama, Keyang Xu, Khyatti Gupta, Kristina Greller, Kuang-Huei Lee, Kurtis David, Lizao (Larry) Li, Lijun Yu, Luis C. Cobo, Mai Gimenez, Medhini Narasimhan, Miaosen Wang, Mingda Zhang, Mohammad Babaeizadeh, Mukul Bhutani, Nikhil Khadke, Nilpa Jha, Nitesh Bharadwaj Gundavarapu, Oscar Akerlund, Pieter-Jan Kindermans, Poorva Rane, Rachel Hornung, Ricky Wong, Ruben Villegas, Ruiqi Gao, Ryan Poplin, Salah Zaiem, Sander Dieleman, Sarah Xu, Sayna Ebrahimi, Scott Wisdom, Shlomi Fruchter, Sophia Sanchez, Tingbo Hou, Vikas Verma, Viral Carpenter, Xinchen Yan, Xinyu Wang, Yiwen Luo, Yukun Ma, Yukun Zhu, Zhichao Yin, Zhisheng Xiao, and Zu Kim. All the clips were generated directly with Veo without modifications by Eleni Shaw, Signe Nørly, Andeep Toor, Gregory Shaw, Anne Menini, Matthieu Kim Lorrain, and Irina Blok.We extend our gratitude to Ahmed Chowdhury, Andrew Audibert, Andrew Bunner, Andrew Pierson, Aparna Joshi, Asya Fadeeva, Austin Tarango, Bao Thach, Bihao Zhang, Bilva Chandra, Bogdan Damoc, Bryce Petrini, Cai Xu, Calin Cruceru, Chengrun Yang, Dana Kurniawan, David Reid, Emanuele Bugliarello, Ganesh GS, Gladys Tyen, Giorgos Vernikos, Greta Kintzley, Hakim Sidahmed, Hamid Mohammadi, Hiresh Gupta, Hiroki Furuta, Hongliang Fei, Huisheng Wang, Hui Zheng, Isa Liang, James Lyon, Izzeddin Gur, Jian Li, Jingjing Zhou, Jordi Pont-Tuset, Kangfu Mei, Karthik Narasimhan, Kory Mathewson, Lluis Castrejon, Liangke Gui, Mahyar Bordbar, Marek Sedlacek, Mikhail Dektiarev, Mitchell McIntire, Nick Pezzotti, Nick Tombari, Orly Liba, Pankil Botadra, Piyush Kumar, Ramin Mehran, Robert Geirhos, Sirui Xie, Sherry Yang, Shubham Nauriyal, Shuo Han, Soňa Mokrá, Tamoghna Saha, Tim Salimans, Tom Hume, Quoc Le, Woohyun Han, Xingyu Federico Xu, Yelin Kim, Yong Cheng, Yuchi Liu, Yuexiang Whai, Yutian Chen, Zerong Xi, Zhenkai Zhu, and Zoltan Egyed for their invaluable partnership in developing and refining key components of this project.Veo controls were made possible by Abhishek Sharma, Aleksander Hołyński, Alina Kuznetsova, Andrew Marmon, Andrew Xue, Andrey Voynov, Anthony Mejia, Asaf Shul, Ben Poole, Brendan Shillingford, Dawid Górny, Dina Bashkirova, Dmitry Lagun, Emanuele Bugliarello, Enric Corona, Emma Wang, Gabriel Barcik, Henna Nandwani, Inbar Mosseri, Istvan Hernadvolgyi, Jess Gallegos, Jieru Hu, Kristina Greller, Luciano Sbaiz, Matan Cohen, Miaosen Wang, Mingda Zhang, Nikos Kolotouros, Nick Pezzotti, Philipp Henzler, Ricky Wong, Roni Paiss, Rui Huang, Ruiqi Gao, Ryan Webb, Serena Zhang, Shiran Zada, Siyang Li, Tali Dekel, Tatiana López, Tayniat Khan, Thomas Kipf, Tingbo Hou, Tobias Pfaff, Tom Murray, Xin Yuan, Xinyu Wang, Yulia Rubanova, Yusuf Aytar, and Zhichao Yin.We extend our gratitude to Alex Rav Acha, Amir Hertz, Andrew Pierson, Ankush Gupta, Anthony Tripaldi, Austin Tarango, Ben Bariach, Bilva Chandra, Budianto Budianto, Carl Doersch, Changchang Wu, David Minnen, David Yao, Dexter Allen, Dilara Gokay, Dumitru Erhan, Eric Lau, Erik Gross, Florian Schroff, Frank Belletti, Gitartha Goswami, Hang Qi, Hao Wang, Hao Zhou, Harsimran Kaur, Itzhak Garbuz, Jason Zhang, Jenny Brennan, Jessica Seah, Jiaping Zhao, Jordi Serrano Berbel, Kan Chen, Ke Yu, Kory Mathewson, Kurtis David, Lluis Castrejon, Luis C. Cobo, Mahyar Bordbar, Manika Puri, Matthew Burruss, Matthew Levine, Matthieu Kim Lorrain, Medhini Narasimhan, Metin Toksoz-Exley, Michael Chang, Michael Milne, Navin Sarma, Nick Matarese, Noah Snavely, Pankil Botadra, Pieter-Jan Kindermans, Reggie Ballesteros, Richard Tucker, Ryan Poplin, Sasha Brown, Shantanu Bhattacharya, Siavash Khodadadeh, Soumyadip Ghosh, Srimon Chatterjee, Ting Liu, Tom Hume, Troy Chinen, Vika Koriakin, Viral Carpenter, Xiang Li, Xuemei Zhao, Xuhui Jia, Yael Pritch, Yedid Hoshen, Yi Yang, Yuan Zhong, and Yutian Chen.Special thanks to Douglas Eck, Aäron van den Oord, Eli Collins, Koray Kavukcuoglu, Demis Hassabis and Sergey Brin for their insightful guidance and support throughout the research process.We also acknowledge our infrastructure partners Abhinash Giri, Allen Wu, Andy Sekyere, Ankit Bhagatwala, Georgi Todorov, Jon Blanton, Praseem Banzal, Ricky Liang, and Shariar “Nafi” Rouf. And the many other individuals who contributed across Google DeepMind and our partners at Google. --- deepmind.google uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. Learn moreUnderstoodSkip to main content About Google DeepMindOur mission is to build AI responsibly to benefit humanity Our vision Our journey Our visionWe live in an exciting time when AI research and technology are delivering extraordinary advances.In the coming years, AI — and ultimately artificial general intelligence (AGI) — has the potential to drive one of the greatest transformations in history.We’re a team of scientists, engineers, ethicists and more, working to build the next generation of AI systems safely and responsibly.By solving some of the hardest scientific and engineering challenges of our time, we’re working to create breakthrough technologies that could advance science, transform work, serve diverse communities — and improve billions of people’s lives. AI has the potential to be one of the most important and beneficial technologies ever invented. Demis Hassabis Co-founder and CEO, Google DeepMind Our journeyGoogle DeepMind brings together two of the world’s leading AI labs — Google Brain and DeepMind — into a single, focused team led by our CEO Demis Hassabis. Over the last decade, the two teams were responsible for some of the biggest research breakthroughs in AI, many of which underpin the flourishing AI industry we see today.DeepMind started in 2010, with an interdisciplinary approach to building general AI systems. The research lab brought together new ideas and advances in machine learning, neuroscience, engineering, mathematics, simulation and computing infrastructure, along with new ways of organizing scientific endeavors.The lab achieved early success by pioneering the field of deep reinforcement learning - a combination of deep learning and reinforcement learning - and using games to test its systems. One of its early breakthroughs was a program called DQN, which learned to play 49 different Atari games from scratch just by observing the raw pixels on the screen and being told to maximize the score.In 2015, DeepMind unveiled AlphaGo, the first computer program to defeat a Go world champion. Go was a long-standing grand challenge in AI and AlphaGo’s landmark achievement was considered a decade ahead of its time. AlphaGo inspired a new era of AI systems and its successors, AlphaZero and MuZero, are increasingly general and able to solve many different games as well as complex real-world problems, from compressing YouTube videos to discovering new more efficient computer algorithms.After the success of AlphaGo, the DeepMind team sought out increasingly complex games that capture different elements of intelligence. In 2019 we demonstrated AlphaStar, the first AI system to defeat a top professional player at StarCraft II, considered to be one of the most challenging Real-Time Strategy (RTS) games and one of the longest-played e-sports of all time.The team also invented WaveNet, a realistic text-to-speech model that was used as the voice of the Google Assistant and introduced a lot of the technology used in Generative AI systems today.Then in 2020, DeepMind launched AlphaFold, an AI system that accurately predicts 3D models of protein structures — catalyzing a new wave of progress in biology. Other breakthroughs include writing computer programs at a competitive level with AlphaCode, discovering faster sorting algorithms with AlphaDev, advancing weather predictions with unparalleled accuracy, and controlling plasma in nuclear fusion reactors.Google Brain started in 2011 at X, the moonshot factory, exploring how modern AI could transform Google’s products and services, and furthering its mission to organize the world's information and make it universally accessible and useful.Today, Google’s infrastructure runs on Google Brain’s research breakthroughs, including open source software like JAX and TensorFlow, sequence-to-sequence learning for machine translation, and complex machine learning systems to rank search results, and serve and organize online ads.In 2017, Brain invented the Transformer architecture, an elegant system of neural networks that underpin almost all large language models and revolutionized the field of AI. Over the years, Brain has continued to push what is possible with Transformers, from open-sourcing as BERT to improving Google Searches. Models like LaMDA showed the potential for these types of AI systems to be even more conversational, while the PaLM family of models showed how broadly capable these models can be. They have also ushered in a new era of consumer AI systems, including Google’s collaborative experiment Bard.The team has also advanced the state-of-the-art in robotics by using a large language model in a robotics system with PaLM-SayCan, and the creation of a more generalized visual-language-action model with RT-2. Brain also pioneered the use of machine learning in the creative process with Magenta and text-to-image generation models like Imagen. The team’s work on the Universal Speech Model enables better understanding of more spoken languages around the world, while initiatives like Project Euphonia improve communication for people with speech impairments.Now, as Google DeepMind, our world-class talent is harnessing our unparalleled computing infrastructure to create the next wave of research breakthroughs and transformative products. Guided by the scientific method and with a holistic approach to responsibility and safety, we’re working to ensure AI benefits everyone and helps solve the biggest challenges facing humanity.Explore our researchWe work on some of the most complex and interesting challenges in AI. Learn more Slide 1 of 3 Genie 3A general purpose world model that can generate an unprecedented diversity of interactive environments. Learn more AlphaEarth FoundationsGenerate a unified data representation that revolutionizes global mapping and monitoring. Learn more WeatherNextProducing state-of-the-art forecasts for a world of increasingly extreme weather. Learn more AlphaGenomeA unifying DNA sequence model that advances regulatory variant-effect prediction. Learn more AlphaFoldRevealing millions of intricate 3D protein structures, and helping scientists understand how life’s molecules interact. Learn more Latest news View news Gemini 3.1 Flash Live: Making audio AI more natural and reliableMarch 2026Models Learn more Protecting people from harmful manipulationMarch 2026Responsibility & Safety Learn more Lyria 3 Pro: Create longer tracks in moreMarch 2026Models Learn more Measuring progress toward AGI: A cognitive frameworkMarch 2026Research Learn more From games to biology and beyond: 10 years of AlphaGo’s impactMarch 2026Research Learn more Gemini 3.1 Flash-Lite: Built for intelligence at scaleMarch 2026Models Learn more Responsibility and safetyWe want to build AI responsibly to benefit humanity. Learn more --- deepmind.google uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. Learn moreUnderstoodSkip to main content ModelsExplore our next generation AI systems View model cards Slide 1 of 5Nano Banana 2 🍌Powerful image generation, advanced intelligence, and enhanced creative precision – with the speed you expect from Flash. Learn more Try Your browser does not support the video tag. Lyria 3Lyria 3 helps you express, explore, and experiment with high-fidelity music, using prompts to create tracks with natural flow from note to note. Learn more Try Your browser does not support the video tag.Genie 3Genie 3 is a general-purpose world model. It uses simple text descriptions to generate photorealistic environments that can be explored in real-time. Learn more Try Your browser does not support the video tag.Gemini 3Introducing our most intelligent model yet. With state-of-the-art reasoning to help you learn, build, and plan anything. Learn more Try Your browser does not support the video tag.Veo 3.1Introducing Veo 3, our video generation model with expanded creative controls – including native audio and extended videos. Learn more Try GeminiOur most intelligent AI modelsTry it in Gemini app Google AI Studio Google Antigravity Vertex AI Studio Learn more Nano Banana 🍌Create and edit images with Gemini ImageTry it in Gemini app Google AI Studio Vertex AI Studio Learn more NewLyriaOur most advanced music generation model yetTry it in Gemini app Google AI Studio Vertex AI Studio Learn more NewGemini AudioAdvanced real-time audio models, built on GeminiTry it in Google AI Studio Vertex AI Studio Learn more VeoOur state-of-the-art video generation modelTry it in Gemini app Flow Google AI Studio Vertex AI Studio Learn more ImagenOur leading text-to-image modelTry it in Gemini app Whisk Google AI Studio Vertex AI Studio Learn more GemmaOur family of state-of-the-art, open modelsBuild with Gemma Run Gemma Developer docs Learn more World models & embodied AISystems that simulate environments and reason in the physical world Your browser does not support the video tag.Genie 3A new frontier for world models Learn more Your browser does not support the video tag.Gemini RoboticsOur most advanced vision-language-action model Learn more How to write effective promptsBrowse our guides for the perfect ingredients to bring your next video, image, track or world to lifeSlide 1 of 2 Create and edit images with Nano BananaThink about what you want to see. The more detail you add, the closer the image will be to what you’ve imagined View guide Your browser does not support the video tag.Create music with LyriaKeep it simple with a rough idea or describe the details for more control, like tempo and dynamics View guide Your browser does not support the video tag.Create worlds with GenieBuild the world of your imagination, define your character and how it moves through its environment View guide Your browser does not support the video tag.Create cinematic video with VeoThe more detail you add, the more control you’ll have over the final output View guide SynthIDEmbedding watermarks to identify content generated through AI Learn more Experiments and prototypesModels and experiments built with Gemini Your browser does not support the video tag.Gemini DiffusionOur diffusion architecture Gemini models Learn more Your browser does not support the video tag.Project MarinerExploring capabilities for the universal AI assistant Learn more Your browser does not support the video tag.Project AstraExploring the future of human-agent interaction Learn more Model cardsSimple, structured overviews of how an advanced AI model was designed and evaluated. View model cards Start building Google AI StudioStart building something new, with cutting-edge AI models and tools Try Google AI Studio Google AntigravityExperience an agentic development platform, evolving the IDE into the agent-first era Try Google Antigravity Vertex AI StudioExplore 200+ models on our enterprise platform with tools and features for AI development Try Vertex AI Studio Latest news View news Gemini 3.1 Flash Live: Making audio AI more natural and reliableMarch 2026Models Learn more Lyria 3 Pro: Create longer tracks in moreMarch 2026Models Learn more Gemini 3.1 Flash-Lite: Built for intelligence at scaleMarch 2026Models Learn more Nano Banana 2: Combining Pro capabilities with lightning-fast speedFebruary 2026Models Learn more Gemini 3.1 Pro: A smarter model for your most complex tasksFebruary 2026Models Learn more A new way to express yourself: Gemini can now create musicFebruary 2026Models Learn more Your browser does not support the video tag. --- deepmind.google uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. Learn moreUnderstoodSkip to main content Slide 1 of 5Gemini 3Our most intelligent AI model that brings any idea to life Try in Gemini Try in Google AI Studio Your browser does not support the video tag.Gemini 3.1 Flash-LiteBest for high-volume tasks that need efficiency and intelligence Try in Google AI Studio Learn more Your browser does not support the video tag.Gemini 3.1 Deep ThinkBest for modern challenges across science, research and engineering Try in Gemini Learn more Your browser does not support the video tag.Gemini 3.1 ProBest for complex tasks and bringing creative concepts to life Try in Gemini Learn more Your browser does not support the video tag.Gemini 3 FlashOur latest Gemini 3 model that helps you bring any idea to life - faster Try in Gemini Learn more Your browser does not support the video tag.Explore the latest Introducing our most intelligent model yet. With state-of-the-art reasoning to help you learn, build, and plan anything. Models Capabilities Hands-on Showcase Performance Safety Try Gemini ModelsCompleting everyday tasks, or solving complex problems. Discover the right model for what you need Your browser does not support the video tag.3.1 ProBest for complex tasks and bringing creative concepts to life Learn more Your browser does not support the video tag.3 FlashBest for frontier intelligence at speed Learn more Your browser does not support the video tag.3.1 Flash-LiteBest for high-volume tasks that need efficiency and intelligence Learn more Gemini 3.1 Deep ThinkPushes the boundaries of intelligence, delivering a significant upgrade to Gemini 3.1's specialized reasoning mode to help you solve the most complex technical problems.Gemini 3.1 Deep Think mode can better help tackle real world problems that require rigor, breakthrough creativity and intelligence. Available for Google AI Ultra subscribers. Try Deep Think Learn more CapabilitiesSlide 1 of 2Reasoning with unprecedented depth and nuanceSmart, concise, direct responses – with genuine insight over cliche and flattery.Advanced multimodal understandingText, images, video, audio – even code. Gemini 3 is state-of-the-art on reasoning with unprecedented depth and nuance.Our best model for vibe coding and agentic codingGemini 3 brings exceptional instruction following – with meaningful improved tool use and agentic coding.Improved agentic capabilitiesBetter tool use. Simultaneous, multi-step tasks. Gemini 3’s agentic capabilities can build more helpful and intelligent personal AI assistants. Gemini 1 introduced native multimodality and long context to help AI understand the world. Gemini 2 added thinking, reasoning and tool use to create a foundation for agents.Now, Gemini 3 brings these capabilities together – so you can bring any idea to life.Slide 1 of 3 Learn anythingUnderstand complex topics in a way that makes sense for you – with clear, concise, and helpful responses Build anythingBring your ideas to life – from sketches and prompts to interactive tools and experiences Plan anythingDelegate tasks and multi-step projects to get things done faster than ever before Get startedBuild with Gemini 3Slide 1 of 3Google AntigravityBuild with our new agentic development platform Download Learn more Your browser does not support the video tag.Google AI StudioLeap from prompt to production Try Google AI Studio Your browser does not support the video tag.Gemini APIGet started building with cutting-edge AI models Build with Gemini Your browser does not support the video tag. Hands-onExplore what you can do with Gemini 3Slide 1 of 6 Created with Gemini 3 ProCode a 3D visualization of the universeGemini 3 uses state-of-the-art reasoning to generate richer visualizations and deeper interactivity. See how it codes a seamless 3D journey through the scale of the universe, from a proton to the observable universe, demonstrating a massive leap in “vibe coding” performance over Gemini 2.5. Created with Gemini 3 FlashVisual context in an instantLeverage Gemini 3 Flash’s multimodal capabilities in visual recognition and reasoning to add contextual UI on image generations. 3 Flash has the capability to describe the content of the image in a compelling and interactive way. Created with Gemini 3 ProInteract with complex topics like RNA transcriptionGemini 3’s state-of-the-art reasoning provides unprecedented nuance and depth Created with Gemini 3 FlashAssist in near real-time game playIn this slingshot game, Gemini 3 Flash delivers near real-time strategic guidance by simultaneously analyzing the video and hand-tracking inputs. It handles complex geometric calculations and velocity estimation to enable responsive live assistance. Created with Gemini 3 ProTurn treasured recipes into shareable family cookbooksGemini 3 seamlessly synthesizes information across text, images, video, audio, and even code to help you learn. Generate code for interactive flashcards, games and experiences to help you master new material. Created with Gemini 3 FlashCreative UI in a sparkGenerate new UIs instantly with Gemini 3 Flash, explore multiple creative variations, and interact with 3 Flash in near real-time to have it come up with best UI outcomes, all with one click. ShowcaseSlide 1 of 10“Gemini 3 Pro brings a new level of multimodal understanding, planning, and tool-calling that transforms how Box AI interprets and applies your institutional knowledge. The result is content actively working for you to deliver faster decisions and execute across mission-critical workflows, from sales and marketing to legal and finance.”Ben Kus, CTO, Box“Gemini 3 has been a game-changer for Cline. We're using it to handle complex, long-horizon coding tasks that require deep context understanding across entire codebases. The model uses long context far more effectively than Gemini 2.5 Pro and has solved problems that stumped other leading models... This is a massive leap.”Nik Pash, Head of AI, Cline“We’re excited to partner with Google to launch Gemini 3 in Cursor! Gemini 3 Pro shows noticeable improvements in frontend quality, and works well for solving the most ambitious tasks.”Sualeh Asif, Co-founder and Chief Product Officer, Cursor“With Gemini 3 Pro in Figma Make, teams have a strong foundation to explore and steer their ideas with code-backed prototypes. The model translates designs with precision and generates a wide, inventive range of styles, layouts, and interactions. As foundation models get better, Figma gets better — and I’m excited to see how Gemini 3 Pro helps our community unlock new creative possibilities.”Loredana Crisan, Chief Design Officer, Figma“By bringing Gemini 3 Pro to GitHub Copilot, we’re seeing promising gains in how quickly and confidently developers can move from idea to code. In our early testing in VS Code, Gemini 3 Pro demonstrated 35% higher accuracy in resolving software engineering challenges than Gemini 2.5 Pro. That's the kind of potential that translates to developers solving real-world problems with more speed and effectiveness.”Joe Binder, VP of Product, GitHub“At JetBrains, we pride ourselves on code quality, so we challenged Gemini 3 Pro with demanding frontline tasks: from generating thousands of lines of front-end code to even simulating an operating-system interface from a single prompt. The new Gemini 3 Pro model advances the depth, reasoning, and reliability of AI in developer tools, showing more than a 50% improvement over Gemini 2.5 Pro in the number of solved benchmark tasks. In collaboration with Google, we’re now integrating Gemini 3 Pro into Junie and AI Assistant, to deliver smarter, more context-aware experiences to millions of developers worldwide.”Vladislav Tankov, Director of AI, Jetbrains“We’ve observed even stronger performance in the model’s reasoning and problem-solving capabilities. Many of Manus’ recent advancements—such as Wide Research and the web-building capabilities introduced in Manus 1.5—have become significantly more powerful with Gemini 3’s support.”Tao Zhang, Co-Founder and Chief Product Officer, Manus AI“Gemini 3 represents a significant advancement in multimodal AI... From accurately transcribing 3-hour multilingual meetings with superior speaker identification, to extracting structured data from poor-quality document photos, outperforming baseline models by over 50%, it showcased impressive capabilities that redefine enterprise potential.”Yusuke Kaji, General Manager, AI for Business, Rakuten Group Inc“Gemini 3 Pro truly stands out for its design capabilities, offering an unprecedented level of flexibility while creating apps. Like a skilled UI designer, it can range from well-organized wireframes to stunning high-fidelity prototypes.”Michele Catasta, President & Head of AI, Replit“Gemini 3 is a major leap forward for agentic AI. It follows complex instructions with minimal prompt tuning and reliably calls tools, which are critical capabilities to build truly helpful agents.”Mikhail Parakhin, Chief Technology Officer, Shopify“Our early evaluations indicate that Gemini 3 is delivering state-of-the-art reasoning with depth and nuance. We have observed measurable and significant progress in both legal reasoning and complex contract understanding.”Joel Hron, Chief Technology Officer, Thomson Reuters“At Wayfair, we’ve been piloting Google’s Gemini 3 Pro to turn complex partner support SOPs into clear, data-accurate infographics for our field associates. Compared with Gemini 2.5 Pro, it’s a clear step forward in handling structured business tasks that require precision and consistency — helping our teams grasp key information faster and support partners more effectively.”Fiona Tan, CTO, Wayfair PerformanceGemini 3 is state-of-the-art across a wide range of benchmarksOur most intelligent model yet sets a new bar for AI model performanceBenchmarkNotesGemini 3.1 Pro Thinking (High)Gemini 3 Pro Thinking (High)Sonnet 4.6 Thinking (Max)Opus 4.6 Thinking (Max)GPT-5.2 Thinking (xhigh)GPT-5.3-Codex Thinking (xhigh)Humanity's Last Exam Academic reasoning (full set, text + MM) No tools44.4%37.5%33.2%40.0%34.5%— Search (blocklist) + Code 51.4%45.8%49.0%53.1%45.5%—ARC-AGI-2 Abstract reasoning puzzlesARC Prize Verified77.1%31.1%58.3%68.8%52.9%—GPQA Diamond Scientific knowledgeNo tools94.3%91.9%89.9%91.3%92.4%—Terminal-Bench 2.0 Agentic terminal codingTerminus-2 harness68.5%56.9%59.1%65.4%54.0%64.7%Other best self-reported harness————62.2% (Codex)77.3% (Codex)SWE-Bench Verified Agentic codingSingle attempt80.6%76.2%79.6%80.8%80.0%—SWE-Bench Pro (Public) Diverse agentic coding tasks Single attempt54.2%43.3%——55.6%56.8%LiveCodeBench Pro Competitive coding problems from Codeforces, ICPC, and IOI Elo28872439——2393—SciCode Scientific research coding59%56%47%52%52%—APEX-Agents Long horizon professional tasks 33.5%18.4%—29.8%23.0%—GDPval-AA Elo Expert tasks13171195163316061462—τ2-bench Agentic and tool useRetail90.8%85.3%91.7%91.9%82.0%—Telecom99.3%98.0%97.9%99.3%98.7%—MCP Atlas Multi-step workflows using MCP 69.2%54.1%61.3%59.5%60.6%—BrowseComp Agentic searchSearch + Python + Browse85.9%59.2%74.7%84.0%65.8%—MMMU-Pro Multimodal understanding and reasoning No tools80.5%81.0%74.5%73.9%79.5%—MMMLU Multilingual Q&A92.6%91.8%89.3%91.1%89.6%—MRCR v2 (8-needle) Long context performance128k (average)84.9%77.0%84.9%84.0%83.8%—1M (pointwise)26.3%26.3%Not supportedNot supportedNot supported—Methodology: deepmind.google/models/evals-methodology/gemini-3-1-proSafetyBuilding with responsibility at the coreAs we develop these new technologies, we recognize the responsibility it entails, and aim to prioritize safety and security in all our efforts. Learn more For developersBuild with cutting-edge generative AI models and tools to make AI helpful for everyoneGemini’s advanced thinking, native multimodality and massive context window empowers developers to build next-generation experiences. Start building Create and adapt voxel artRecombine and regenerate voxel art through Gemini 3’s advanced reasoning Try in Google AI Studio Build a procedural fractal worldCreate interactive, playable sci-fi worlds through Gemini 3 and Shaders Try in Google AI Studio Vibe code a retro videogameCode a complex, interactive 3D game, all from a single prompt Try in Google AI Studio Try Gemini GeminiSupercharge your creativity and productivity  Try Gemini AI ModeAsk whatever's on your mind to get an AI powered response Try in AI Mode Google AI StudioThe fastest path from prompt to production  Try in Google AI Studio Google AntigravityOur new agentic development platform, evolving the IDE into the agent-first era Download Google Antigravity Gemini APIGet started building with cutting-edge AI models  Learn more Vertex AI StudioTest, tune, and deploy enterprise-ready generative AI Learn more

Outils de la meme categorie