Latest news with #Genie2

Google's new Genie 3 could be a watershed moment for AI and gaming — here's why

Tom's Guide

4 days ago

Generative AI has come a long way. In the past few years, we have learned to generate images, poems, videos, and even entire apps and websites. However, the next wave could be the most impressive.

Google DeepMind recently released its latest tool, Genie 3. Google describes it as a "world model": a tool that can generate an entire explorable digital world from a single prompt. This could change gaming, education, movies, and more.

This isn't the first version of the tool (hence the 3 in the name), but it is the first to reach a level where it could have genuine real-world use cases. So what exactly is this tool, and what does it mean for the future of AI?

Genie, short for the less catchy Generative Interactive Environments, is a world model that lets a user generate and then explore virtual worlds. It is trained on internet videos and, like other generative AI tools, needs only a text prompt to get started.

The first version of Genie was limited. It could generate worlds and allowed frame-by-frame interaction, but it operated primarily in 2D game-like environments and offered nowhere near the quality available now.

Then DeepMind came out with Genie 2, which scaled the tool up to immersive 3D environments, with improved control and noticeably better real-world physics and graphics.

Now Genie 3 has gone a step further. Worlds are generated at 720p resolution and play at 24 frames per second, and a huge amount of work has gone into making them even more interactive.

It is worth noting that going from Genie 1 to Genie 3 took only about a year and a half. In other words, the next version of Genie is unlikely to be far off.

Once you give Genie 3 a prompt, what will it actually generate? Effectively, Genie creates a "world." You can control it with a keyboard or a touch screen, and the world stays coherent for a few minutes. It even remembers off-screen objects that you have moved away from for up to a minute, with any changes you made to them staying intact.

These worlds are generated on the fly. In theory, that makes them infinitely explorable, with new parts loading as you move around. In practice, the limit is how long the model can keep things consistent and in memory: details begin to drift and fall apart after a few minutes. For now, this is better suited to small regions like a house than to entire explorable worlds and cities.

An important feature that DeepMind has explored with Genie 3 is how editable these worlds are. You can trigger world events mid-play, such as adding objects or changing the weather, and the model should, in theory, keep up with the change.

DeepMind has indicated that one of the key differences in this model is its understanding of real-world physics. It can generate vibrant ecosystems and replicate animal behavior and intricate plant life.

There are still plenty of limitations. Genie 3 can't always simulate real-world locations with absolute accuracy. It also struggles to render text in these worlds and to accurately recreate more complicated events. However, the progress across the Genie models suggests that these challenges are likely temporary, with improvements arriving quickly.

Currently, Genie 3 is only available to certain developers for testing. However, DeepMind sees multiple uses for the technology. One of the more obvious is game generation, since Genie 3 can quickly and effectively generate large, detailed, interactive worlds.

It could be used in other areas too. DeepMind explains how it could be used to train robots for factory floors, something that is becoming more and more common. The technology could create environments in which such robots could be trained effectively.

Outside of these areas, Genie 3 could make training programs more interactive at a more affordable cost. A simpler use would be letting people explore the world virtually: the technology could generate historical landmarks that no longer exist, or recreate cities as they looked in different time periods. Currently, the technology isn't capable enough for these tasks, but it marks a huge jump towards them.

Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds

Times of India

08-08-2025

Revealed in August 2025, Genie 3 takes a basic text or image prompt and instantly generates a playable 3D world, complete with objects that you can move, weather that shifts on command, and environments that remember what you've done, even when you walk away. We're talking 720p visuals, 24 FPS performance, and persistent memory over several minutes of continuous, glitch-free exploration.

Unlike Genie 2, which was impressive but limited to short, grainy video loops, Genie 3 is built for immersion. It supports real-time editing on the fly: just type in 'spawn a storm' or 'build a cave,' and it happens instantly, no reload required. This level of interactivity is powered by what DeepMind calls an 'autoregressive world model,' which isn't hardcoded with rules. Instead, Genie 3 learns how the world works (gravity, water, shadows) just by watching video data. That means the system doesn't fake physics; it internalizes them, leading to emergent, realistic behaviour without manual programming.

What really elevates Genie 3 is its spatiotemporal consistency. If you paint a wall or drop a sword somewhere, leave the scene, and return, the AI remembers the state exactly as you left it. That's a massive step toward AI that understands continuity, something even big game engines struggle with.

DeepMind isn't pitching this as a toy; it sees Genie 3 as a training ground for general-purpose intelligence. These hyper-realistic, memory-rich environments are where future AI agents can learn safely, away from real-world risk.

Despite its potential, Genie 3 isn't open to the public yet. It's currently in limited research preview, accessible only to a select group of developers and researchers while DeepMind fine-tunes its safety and governance. Still, the implications are crystal clear. Genie 3 is no longer just about creative play; it's a foundational step toward artificial general intelligence (AGI), offering a simulated world where machines can learn, adapt, and possibly outpace human intuition.

Simply put, Genie 3 doesn't just build worlds; it builds the infrastructure for AI to truly live in them.

Economic Times

08-08-2025

Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds

Revealed in August 2025, Genie 3 takes a basic text or image prompt and instantly generates a playable 3D world, complete with objects that you can move, weather that shifts on command, and environments that remember what you've done, even when you walk away. We're talking 720p visuals, 24 FPS performance, and persistent memory over several minutes of continuous, glitch-free exploration.

Unlike Genie 2, which was impressive but limited to short, grainy video loops, Genie 3 is built for immersion. It supports real-time editing on the fly: just type in 'spawn a storm' or 'build a cave,' and it happens instantly, no reload required. This level of interactivity is powered by what DeepMind calls an 'autoregressive world model,' which isn't hardcoded with rules. Instead, Genie 3 learns how the world works (gravity, water, shadows) just by watching video data. That means the system doesn't fake physics; it internalizes them, leading to emergent, realistic behaviour without manual programming.

What really elevates Genie 3 is its spatiotemporal consistency. If you paint a wall or drop a sword somewhere, leave the scene, and return, the AI remembers the state exactly as you left it. That's a massive step toward AI that understands continuity, something even big game engines struggle with.

DeepMind isn't pitching this as a toy; it sees Genie 3 as a training ground for general-purpose intelligence. These hyper-realistic, memory-rich environments are where future AI agents can learn safely, away from real-world risk.

Despite its potential, Genie 3 isn't open to the public yet. It's currently in limited research preview, accessible only to a select group of developers and researchers while DeepMind fine-tunes its safety and governance. Still, the implications are crystal clear. Genie 3 is no longer just about creative play; it's a foundational step toward artificial general intelligence (AGI), offering a simulated world where machines can learn, adapt, and possibly outpace human intuition.

Simply put, Genie 3 doesn't just build worlds; it builds the infrastructure for AI to truly live in them.

What is Genie 3, Google's latest interactive 3D AI model?

Indian Express

07-08-2025

Google DeepMind has unveiled Genie 3, a new AI model that can create interactive 3D worlds. After a user submits a text prompt describing an environment, the model generates it in real time at 24 frames per second in 720p and keeps it consistent for a few minutes. Unlike earlier versions, Genie 3 supports continuous interaction for a few minutes, remembers where objects were placed, and allows dynamic changes such as adding characters or altering weather conditions.

According to a blog post that accompanied the release, world models can comprehend and recreate settings, which lets agents anticipate how an environment will change and what effects their actions may have. The post also describes world models as a crucial step on the path to AGI, since they enable AI agents to be trained in an unlimited curriculum of rich simulation environments.

The company says that whereas Genie 2's interactive window lasted 10 to 20 seconds, Genie 3 offers a 'few minutes' of interaction. Furthermore, if a user leaves a location and returns later, the spot will still look the same, because the model keeps its visuals more consistent. Genie 3 isn't yet publicly available, however; it will initially be made available to a small group of academics and creators for testing.

Key features of Google Genie 3

Rather than producing static content, Genie 3 belongs to a class of AI systems known as world models, which simulate dynamic environments. These models can be applied to robotics, video games, training simulations, and education.

Given a prompt such as 'a forest during a thunderstorm', the model is meant to create a playable 3D environment that you can explore with simple movement controls. The output stays consistent at 24 frames per second in 720p resolution. According to The Verge, that is a significant improvement over Genie 2, where interaction lasted only ten to twenty seconds.

Remembers what you saw: Visual memory is one of Genie 3's greatest improvements. Most earlier world models lacked the ability to leave an object behind and find it unchanged on returning later. According to Google, this visual memory lasts for approximately one minute.

Triggers events on demand: According to the DeepMind blog, Genie 3 has 'promptable world events,' which let users add rain, add characters, or transform objects just by entering new commands.

Despite significant progress, Genie 3 has several limitations that Google DeepMind is addressing. The model cannot simulate real-world locations with geographic accuracy, and legible text often appears only if it was included in the original prompt. Its range of interactions is currently limited, with multi-agent interactions still under development. While more stable than previous versions, it supports only a few minutes of continuous exploration. The technology also presents new safety and responsibility challenges, which is why it is being rolled out gradually.

What is Genie 3? Google DeepMind Launches Text-to-World Model to Build Human-like AI

International Business Times

06-08-2025

August 7, 2025 00:16 +08

Google DeepMind has announced Genie 3, its most advanced 3D world model to date, with the aim of expanding the possibilities of generative AI. The new tool lets users create interactive 3D scenes with lifelike realism from simple prompts. The text-to-world model creates 3D worlds that look like something out of science fiction.

Genie 3 is an upgraded version of Google's previous 3D world model, Genie 2, with better results. For example, it allows for longer interactions of up to 60 seconds, compared with the previous 20-second limit. It also introduces visual enhancements, rendering at 720p resolution to give users a more realistic experience.

Memory retention is one of the key features of Genie 3. For example, if a user is exploring a virtual garden and returns to a previously visited location, the AI remembers the path and the interactions along the way. This memory adds a new dimension of depth and realism to digital environments. It also hints at practical applications in fields like educational tools, virtual assistants, and even therapeutics.

Genie 3 also introduces real-time adaptability, letting users change the virtual environment spontaneously using text prompts. Whether it's adding a new character or shifting the weather from a bright sunny day to heavy snowfall, the model responds immediately. This dynamic interaction opens up great potential in game design, as AI can now help create elements of storytelling and world-building.

Genie 3 is now available to researchers and creators at Google for controlled experimentation, to help ensure the technology remains safe. Google has made no official announcement on when it will be released for broader public use, but as the platform stabilizes and its ethical impact becomes clearer, Google may make it available to others.

Genie 3 is part of Google's broader AI ecosystem, which also includes its Gemini AI platform, designed for tasks like deep research and advanced reasoning. Google plans to reveal more, as yet unspecified, Gemini-powered features in the next-generation Pixel 10.

In a post on X, the company wrote: "What if you could not only watch a generated video, but explore it too? Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless."
