logo
#

Latest news with #Genie3

Matrix-Game 2.0 Launches as a Powerful Open-Source Alternative to Genie 3
Matrix-Game 2.0 Launches as a Powerful Open-Source Alternative to Genie 3

Malaysian Reserve

time8 hours ago

  • Entertainment
  • Malaysian Reserve

Matrix-Game 2.0 Launches as a Powerful Open-Source Alternative to Genie 3

SINGAPORE, Aug. 12, 2025 /PRNewswire/ — The SkyWork AI Technology Release Week officially kicked off on August 11. From August 11 to August 15, a new model will be unveiled each day, covering cutting-edge models for core multimodal AI scenarios. A week ago, DeepMind released a major update to its interactive world model—Genie 3—enabling real-time, long-sequence generation. This advancement has drawn significant attention to world models. However, Genie 3 was not open-sourced, leaving the community to speculate about its implementation. On August 12, Skywork unveiled an upgraded version of the self-developed Matrix series' interactive world model—Matrix-Game 2.0. It also delivers interactive, real-time, long-sequence generation in general scenarios. To drive progress in interactive world modeling, Matrix-Game 2.0 has been fully open-sourced, marking the industry's first open-source solution for real-time, long-sequence, interactive generation in general scenarios. Matrix-Game 2.0 open source addresses: Technical report: Project homepage: HuggingFace: GitHub: Matrix-Game 2.0 achieves a breakthrough in real-time generation and long-sequence handling. Compared to its predecessor, the 2.0 version prioritizes low-latency, high-frame-rate performance for extended interactions, enabling stable 25 FPS continuous video generation across complex scenes. Its generation length scales to minute-long sequences, drastically improving temporal coherence and real-world usability. While delivering a significant boost in inference speed, Matrix-Game 2.0 maintains precise comprehension of physical laws and scene semantics. It enables users to freely explore, manipulate, and construct virtual environments in real time through simple instructions—yielding well-structured, detail-rich, and logically coherent virtual spaces. With these capabilities, Matrix-Game 2.0 not only breaks down the barriers between content generation and interaction but also unlocks new possibilities for cutting-edge applications such as virtual humans, game engines, and embodied AI. It provides a robust technical foundation for building a universal virtual world. Currently, Matrix-Game 2.0 boasts three core advantages: High-frame-rate, real-time long-sequence generation: The model supports fluid movement (forward/backward, left/right) and camera/view rotation. Users can intuitively control characters in the scene via simple commands. The system generates seamless footage in real time at 25 FPS, enabling minute-long interactive sequences in a single session. Character movements are lifelike, smooth, and precisely responsive. Cross-scenario generalization capability: The model demonstrates exceptional cross-domain adaptability. It is not only suitable for specific task scenarios but also supports simulations of diverse styles and environments—including urban, wilderness, and other spatial types, as well as realistic, oil-painting, and various visual styles. Enhanced physical consistency: The model demonstrates a deeper understanding of physical rules. Characters generated by the model exhibit physically plausible movements when navigating complex terrains such as steps and obstacles, which improves immersion and controllability. The open-source release of Matrix-Game for interactive video generation underscores Skywork's strategic foresight in AI development. This initiative will accelerate development across Skywork's multi-model AI ecosystem. Moving forward, Skywork remains committed to pioneering and open-sourcing advanced AI solutions. By collaborating with global developers and users, we aim to build next-generation platforms that accelerate the global advancement of AGI.

Matrix-Game 2.0 Launches as a Powerful Open-Source Alternative to Genie 3
Matrix-Game 2.0 Launches as a Powerful Open-Source Alternative to Genie 3

Associated Press

time9 hours ago

  • Entertainment
  • Associated Press

Matrix-Game 2.0 Launches as a Powerful Open-Source Alternative to Genie 3

SINGAPORE, Aug. 12, 2025 /PRNewswire/ -- The SkyWork AI Technology Release Week officially kicked off on August 11. From August 11 to August 15, a new model will be unveiled each day, covering cutting-edge models for core multimodal AI scenarios. A week ago, DeepMind released a major update to its interactive world model—Genie 3—enabling real-time, long-sequence generation. This advancement has drawn significant attention to world models. However, Genie 3 was not open-sourced, leaving the community to speculate about its implementation. On August 12, Skywork unveiled an upgraded version of the self-developed Matrix series' interactive world model—Matrix-Game 2.0. It also delivers interactive, real-time, long-sequence generation in general scenarios. To drive progress in interactive world modeling, Matrix-Game 2.0 has been fully open-sourced, marking the industry's first open-source solution for real-time, long-sequence, interactive generation in general scenarios. Matrix-Game 2.0 open source addresses: Matrix-Game 2.0 achieves a breakthrough in real-time generation and long-sequence handling. Compared to its predecessor, the 2.0 version prioritizes low-latency, high-frame-rate performance for extended interactions, enabling stable 25 FPS continuous video generation across complex scenes. Its generation length scales to minute-long sequences, drastically improving temporal coherence and real-world usability. While delivering a significant boost in inference speed, Matrix-Game 2.0 maintains precise comprehension of physical laws and scene semantics. It enables users to freely explore, manipulate, and construct virtual environments in real time through simple instructions—yielding well-structured, detail-rich, and logically coherent virtual spaces. With these capabilities, Matrix-Game 2.0 not only breaks down the barriers between content generation and interaction but also unlocks new possibilities for cutting-edge applications such as virtual humans, game engines, and embodied AI. It provides a robust technical foundation for building a universal virtual world. Currently, Matrix-Game 2.0 boasts three core advantages: High-frame-rate, real-time long-sequence generation: The model supports fluid movement (forward/backward, left/right) and camera/view rotation. Users can intuitively control characters in the scene via simple commands. The system generates seamless footage in real time at 25 FPS, enabling minute-long interactive sequences in a single session. Character movements are lifelike, smooth, and precisely responsive. Cross-scenario generalization capability: The model demonstrates exceptional cross-domain adaptability. It is not only suitable for specific task scenarios but also supports simulations of diverse styles and environments—including urban, wilderness, and other spatial types, as well as realistic, oil-painting, and various visual styles. Enhanced physical consistency: The model demonstrates a deeper understanding of physical rules. Characters generated by the model exhibit physically plausible movements when navigating complex terrains such as steps and obstacles, which improves immersion and controllability. The open-source release of Matrix-Game for interactive video generation underscores Skywork's strategic foresight in AI development. This initiative will accelerate development across Skywork's multi-model AI ecosystem. Moving forward, Skywork remains committed to pioneering and open-sourcing advanced AI solutions. By collaborating with global developers and users, we aim to build next-generation platforms that accelerate the global advancement of AGI. View original content to download multimedia: SOURCE Skywork AI pte ltd

Google Genie 3 AI Creates Interactive Digital Worlds in Real-Time
Google Genie 3 AI Creates Interactive Digital Worlds in Real-Time

Geeky Gadgets

timea day ago

  • Entertainment
  • Geeky Gadgets

Google Genie 3 AI Creates Interactive Digital Worlds in Real-Time

What if you could create an entire world—complete with lifelike physics, dynamic lighting, and seamless interactions—at the speed of thought? With the unveiling of Google Genie 3, this once-futuristic vision is now a tangible reality. Unlike traditional tools that rely on painstakingly pre-designed models, Genie 3 uses advanced AI to generate immersive environments in real time, adapting instantly to user input. Whether you're designing a video game, simulating robotics tasks, or training AI systems, this new technology promises to redefine the boundaries of creativity and innovation. But as with any innovative leap, it raises intriguing questions: How far can AI go in mimicking reality? And what challenges lie ahead in perfecting such a powerful tool? In this exploration of Google's Genie 3, Matthew Berman uncovers the key features that set it apart, from its real-time generation capabilities to its seamless visual transitions. You'll discover how this innovative system is transforming industries like entertainment, AI training, and robotics by bridging the gap between virtual and physical realities. But Genie 3 isn't without its hurdles—high computational demands and occasional visual inconsistencies hint at the complexities of pushing technology to its limits. As we delve deeper, consider this: Could Genie 3 be the stepping stone to Artificial General Intelligence, or is it merely a glimpse of what's to come? Key Features That Set Genie 3 Apart Genie 3 distinguishes itself by creating dynamic, interactive environments that adapt instantly to user input. Unlike traditional systems that depend on pre-designed 3D models, Genie 3 employs advanced AI algorithms to generate environments on demand. This innovative approach delivers unparalleled flexibility and scalability in simulation. Some of the standout features of Genie 3 include: Real-time generation: Produces high-quality, immersive visuals that adapt seamlessly to changes. Produces high-quality, immersive visuals that adapt seamlessly to changes. User-driven modifications: Enables users to alter environments dynamically through 'prompt events,' offering a new level of interactivity. Enables users to alter environments dynamically through 'prompt events,' offering a new level of interactivity. Enhanced realism: Integrates accurate physics and advanced lighting to create lifelike experiences. Integrates accurate physics and advanced lighting to create lifelike experiences. Seamless transitions: Uses auto-regressive frame generation to ensure smooth visual continuity, even in complex scenarios. These features empower users to interact with digital environments in unprecedented ways, whether designing a game, training AI systems, or simulating robotics tasks. The ability to create and modify environments in real time opens up new possibilities for innovation and creativity. Watch this video on YouTube. Applications Across Diverse Industries The versatility of Genie 3 unlocks fantastic opportunities across multiple sectors, making it a valuable tool for professionals in various fields. Entertainment: Video game developers can craft dynamic, lifelike worlds that respond to player actions in real time, enhancing gameplay experiences. Filmmakers and TV producers can generate immersive, computer-generated scenes without the need for extensive post-production work, saving time and resources. Video game developers can craft dynamic, lifelike worlds that respond to player actions in real time, enhancing gameplay experiences. Filmmakers and TV producers can generate immersive, computer-generated scenes without the need for extensive post-production work, saving time and resources. AI Training: Genie 3 provides diverse simulated environments for training AI agents. For example, self-driving car algorithms can be tested in a wide range of traffic scenarios, improving their adaptability and performance in real-world conditions. Genie 3 provides diverse simulated environments for training AI agents. For example, self-driving car algorithms can be tested in a wide range of traffic scenarios, improving their adaptability and performance in real-world conditions. Robotics: Robotics engineers can simulate tasks in controlled environments, allowing robots to learn and adapt autonomously. This accelerates the development of more efficient and capable robotic systems. By allowing these applications, Genie 3 bridges the gap between virtual and physical realities, driving innovation and efficiency across industries. Create Virtual Worlds at Lightning Speed with Google Genie 3 Watch this video on YouTube. Gain further expertise in AI world creation by checking out these recommendations. Technical Advancements Driving Genie 3 Building on the foundation of its predecessor, Genie 2, Genie 3 introduces several new advancements that enhance its capabilities: Scaled AI training: Allows the system to generate environments with greater detail and realism, improving the quality of simulations. Allows the system to generate environments with greater detail and realism, improving the quality of simulations. Auto-regressive frame generation: Ensures smooth transitions between frames, even in highly complex scenes, maintaining visual consistency. Ensures smooth transitions between frames, even in highly complex scenes, maintaining visual consistency. No reliance on pre-built 3D models: Eliminates the need for extensive pre-design work, reducing the time and resources required for environment creation. These innovations make Genie 3 a powerful tool for creating rich, dynamic worlds, setting a new benchmark for AI-driven simulations. Its ability to generate environments on demand significantly enhances efficiency and adaptability, making it a valuable asset for professionals across various domains. Challenges and Future Potential While Genie 3 showcases impressive capabilities, it also faces certain challenges that need to be addressed to maximize its potential: High computational demands: The system requires substantial processing power, which may limit its accessibility to users with high-performance setups. The system requires substantial processing power, which may limit its accessibility to users with high-performance setups. Visual inconsistencies: In highly complex scenes, occasional blurriness or visual artifacts may occur, indicating areas for further refinement. Currently, Genie 3 is undergoing internal testing at Google, with no public release date announced. This testing phase provides an opportunity to optimize the system and address its limitations before it becomes widely available. Looking ahead, the potential of Genie 3 extends far beyond its current capabilities. Future developments could include: Real-time sound generation: Integrating auditory elements to complement visual simulations, creating fully immersive sensory experiences. Integrating auditory elements to complement visual simulations, creating fully immersive sensory experiences. Advancing AGI: By training AI agents in diverse, adaptive environments, Genie 3 could play a pivotal role in the development of Artificial General Intelligence. By training AI agents in diverse, adaptive environments, Genie 3 could play a pivotal role in the development of Artificial General Intelligence. Broader accessibility: As computational efficiency improves, Genie 3 may become accessible to a wider range of users and industries, providing widespread access to its benefits. These advancements have the potential to redefine how we interact with digital environments, making Genie 3 a cornerstone of future AI-driven technologies. Media Credit: Matthew Berman Filed Under: AI, Technology News, Top News Latest Geeky Gadgets Deals Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, Geeky Gadgets may earn an affiliate commission. Learn about our Disclosure Policy.

Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds
Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds

Time of India

time4 days ago

  • Entertainment
  • Time of India

Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds

Live Events Revealed in August 2025, Genie 3 takes a basic text or image prompt and instantly generates a playable 3D world that is complete with objects that you can move, weather that shifts with commands, and environments that remember what you've done, even when you walk away. We're talking 720p visuals, 24 FPS performance, and persistent memory over several minutes of continuous, glitch-free Genie 2, which was impressive but limited to short, grainy video loops, Genie 3 is built for immersion. It supports real-time editing on the fly, just type in 'spawn a storm' or 'build a cave,' and it happens instantly, no reload required. This level of interactivity is powered by what DeepMind calls an 'autoregressive world model,' which isn't hardcoded with rules. Instead, Genie 3 learns how the world works, gravity, water, and shadows just by watching video data. That means the system doesn't fake physics; it internalizes them, leading to emergent, realistic behaviour without manual really elevates Genie 3 is its spatiotemporal consistency. If you paint a wall or drop a sword somewhere, leave the scene, and return, the AI remembers the state exactly as you left it. That's a massive step toward AI that understands continuity, something even big game engines struggle with. DeepMind isn't pitching this as a toy; they see Genie 3 as a training ground for general-purpose intelligence. These hyper-realistic, memory-rich environments are where future AI agents can learn safely, without risking real-world its potential, Genie 3 isn't open to the public yet. It's currently in limited research preview, accessible only to a select group of developers and researchers while DeepMind fine-tunes its safety and governance the implications are crystal 3 is no longer just about creative play; it's a foundational step toward artificial general intelligence (AGI), offering a simulated world where machines can learn, adapt, and possibly outpace human intuition. Simply put, Genie 3 doesn't just build worlds; it builds the infrastructure for AI to truly live in them.

Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds
Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds

Economic Times

time5 days ago

  • Economic Times

Genie 3: Google DeepMind's New AI Turns Prompts into Living, Breathing 3D Worlds

Revealed in August 2025, Genie 3 takes a basic text or image prompt and instantly generates a playable 3D world that is complete with objects that you can move, weather that shifts with commands, and environments that remember what you've done, even when you walk away. We're talking 720p visuals, 24 FPS performance, and persistent memory over several minutes of continuous, glitch-free exploration. Unlike Genie 2, which was impressive but limited to short, grainy video loops, Genie 3 is built for immersion. It supports real-time editing on the fly, just type in 'spawn a storm' or 'build a cave,' and it happens instantly, no reload required. This level of interactivity is powered by what DeepMind calls an 'autoregressive world model,' which isn't hardcoded with rules. Instead, Genie 3 learns how the world works, gravity, water, and shadows just by watching video data. That means the system doesn't fake physics; it internalizes them, leading to emergent, realistic behaviour without manual programming. What really elevates Genie 3 is its spatiotemporal consistency. If you paint a wall or drop a sword somewhere, leave the scene, and return, the AI remembers the state exactly as you left it. That's a massive step toward AI that understands continuity, something even big game engines struggle with. DeepMind isn't pitching this as a toy; they see Genie 3 as a training ground for general-purpose intelligence. These hyper-realistic, memory-rich environments are where future AI agents can learn safely, without risking real-world its potential, Genie 3 isn't open to the public yet. It's currently in limited research preview, accessible only to a select group of developers and researchers while DeepMind fine-tunes its safety and governance the implications are crystal clear. Genie 3 is no longer just about creative play; it's a foundational step toward artificial general intelligence (AGI), offering a simulated world where machines can learn, adapt, and possibly outpace human intuition. Simply put, Genie 3 doesn't just build worlds; it builds the infrastructure for AI to truly live in them.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store