Latest news with #Matrix3D

Matrix-3D Goes Open-Source: A New Benchmark for 3D World Generation

Associated Press

4 days ago

Business
Associated Press

Matrix-3D Goes Open-Source: A New Benchmark for 3D World Generation

SINGAPORE, Aug. 12, 2025 /PRNewswire/ -- The SkyWork AI Technology Release Week officially kicked off on August 11. From August 11 to August 15, a new model will be unveiled each day, covering cutting-edge models for core multimodal AI scenarios. On August 12, the world model Matrix-3D for 3D world generation and exploration was officially open-sourced. Starting from a single input image, it generates high-quality, trajectory-consistent panoramic videos and directly reconstructs navigable 3D spaces. Compared to WorldLabs' output, Matrix-3D enables exploration across significantly larger virtual environments. Matrix-3D open source addresses: By integrating panoramic representation, conditional video generation, and 3D reconstruction modules, Matrix-3D surpasses existing methods in field-of-view range, geometric consistency, and visual quality. It accepts both text and image inputs and generates freely explorable 3D scenes. Matrix-3D achieves state-of-the-art generation quality on panoramic video benchmark datasets, while also attaining industry-leading performance in camera motion control precision. World models like Google's Genie 3 paint a compelling vision of the future. They reveal AI's evolution beyond mere content generation tools into world simulators—systems capable of constructing and simulating entire environments. As AI technology progresses, these models are poised to become critical infrastructure for understanding our world, shaping tomorrow, and ultimately realizing artificial general intelligence (AGI). The open-source release of Matrix-3D for 3D world generation and exploration underscores Skywork's strategic foresight in AI development. This initiative will accelerate development across Skywork's multi-model AI ecosystem. Moving forward, Skywork remains committed to pioneering and open-sourcing advanced AI solutions. By collaborating with global developers and users, we aim to build next-generation platforms that accelerate the global advancement of AGI. View original content to download multimedia: SOURCE Skywork AI pte ltd

Microsoft unveils AI tool to turn 2D images into 3D models: How to use

Business Standard

5 days ago

Business Standard

Microsoft unveils AI tool to turn 2D images into 3D models: How to use

Microsoft has unveiled an artificial intelligence-powered tool to transform 2D images into real-life-looking 3D models. The US technology giant has dubbed it Copilot 3D and said that it is designed to make 3D creations fast, accessible, and intuitive. Copilot 3D is part of Copilot Labs and is currently available to users globally who have signed in with their personal Microsoft Account. The company recommends using it on a PC. However, it is also accessible via mobile browsers. Notably, other companies like Meta, Apple, and Nvidia are also in the race to create such a tool. What is Copilot 3D Copilot 3D, available only through Copilot Labs, is an AI-driven tool that converts a single image into a complete 3D model – no expertise required. Built to simplify and speed up the 3D design process, it aims to make creation more accessible and intuitive. From exploring creative concepts to prototyping ideas or aiding interactive learning, Copilot 3D encourages experimentation without the steep learning curve of conventional 3D programs. For the ones who are worried about privacy, Microsoft has clarified: 'Uploaded images are used by Copilot to generate your 3D models and process your request. At this time, Microsoft does not use uploaded images for training or personalisation.' Notably, at this time, Copilot 3D supports only 2D image-to-3D image generation. This essentially means that it cannot create a 3D image just on the basis of a text prompt. How to use Copilot 3D Open the Copilot 3D website on your PC's browser Register with a Microsoft account Upload an image within 10 MB, and click on Generate to create the 3D model of the image. Download the 3D model in GLB format. Any other players in the market? Apple Earlier in May, Apple previewed an AI model – Matrix3D – that can build 3D scenes using images. Furthermore, with the arrival of iOS 26 public beta, the Photos app will let users convert any regular image into a spatial image, even if the photo was originally taken with a different smartphone. Meta It had published a research paper detailing 'Meta 3D Gen AI' system, which it said is capable of generating 3D models using a text prompt. A research paper from Meta outlines Meta 3D Gen, a dual-AI system combining Meta 3D AssetGen and Meta 3D TextureGen. It can generate 3D characters, props, or scenes from text prompts, or apply textures to an existing 3D mesh provided by the user. Nvidia Nvidia's neural radiance fields, or NeRF, can also create 3D images based on 2D images. Nvidia, in a blog earlier, said that NeRF uses neural networks to represent and render realistic 3D scenes based on an input collection of 2D images.

Apple previews AI model that builds 3D scenes using images: How it works

Business Standard

14-05-2025

Science
Business Standard

Apple previews AI model that builds 3D scenes using images: How it works

Apple has published a new research paper detailing an artificial intelligence (AI) model called Matrix3D. Developed in collaboration with researchers from Nanjing University and The Hong Kong University of Science and Technology, Matrix3D enables the reconstruction of detailed 3D scenes and objects using only a few 2D images. This marks a significant shift in how photogrammetry – an established technique for reconstructing 3D structures from photos – is approached. What is photogrammetry In its research paper, Apple noted that photogrammetry is a process of using 2D photographs to measure and recreate 3D structures or environments. Traditionally, this process has required hundreds of images taken from various angles and involves a multi-step pipeline using different algorithms for tasks like camera pose estimation (figuring out where each camera was when the photo was taken), depth prediction, and 3D model construction. How Matrix3D streamline photogrammetry process Apple's Matrix3D addresses two major challenges in traditional photogrammetry: the need for a large number of images from multiple angles, and the use of separate models for each stage of reconstruction. Matrix3D solves both problems by unifying the entire process into a single model. It can estimate camera positions, generate depth maps, and even synthesize novel views — all from just a few input images. How Matrix3D works At the heart of Matrix3D is a generative AI system based on diffusion transformers, similar to the models powering tools like OpenAI's DALL-E and ChatGPT. During training, the model uses a technique called masked learning, where parts of the input are deliberately hidden so the model learns to predict the missing data. This approach helps Matrix3D effectively handle sparse or incomplete input and significantly expands the range of usable training samples. As a result, Matrix3D can reconstruct detailed 3D objects or entire scenes using just two or three images. Availability and use case The researchers have published their work on arXiv and released the source code on GitHub. A companion website also features demo videos and interactive 3D reconstructions. While there's no official word yet, Matrix3D could eventually be integrated into Apple's Vision Pro headset, allowing users to transform regular 2D photos into immersive 3D experiences.

Latest news with #Matrix3D

Matrix-3D Goes Open-Source: A New Benchmark for 3D World Generation

Microsoft unveils AI tool to turn 2D images into 3D models: How to use

Apple previews AI model that builds 3D scenes using images: How it works

Get Started Now: Download the App