Home > News list > Tech >> Industry dynamics

DeepMind Unveils Genie2: An AI Model Generating 'Endless' 3D Worlds

Industry dynamics 2024-12-05 08:23:32 Source:

DeepMind Unveils Genie2: An AI Model Generating 'Endless' 3D WorldsOn December 5th, Google's AI research arm, DeepMind, launched Genie2, a novel AI model capable of generating 3D worlds with seemingly limitless possibilities and diverse styles. An upgrade to the Genie model released earlier this year, Genie2 creates interactive, real-time 3D environments from just an image and a text prompt

DeepMind Unveils Genie2: An AI Model Generating 'Endless' 3D Worlds

On December 5th, Google's AI research arm, DeepMind, launched Genie2, a novel AI model capable of generating 3D worlds with seemingly limitless possibilities and diverse styles. An upgrade to the Genie model released earlier this year, Genie2 creates interactive, real-time 3D environments from just an image and a text prompt. For instance, inputting "a cute robot in a lush forest" instantly generates an explorable virtual world.

Genie2's capabilities are similar to those of models developed by Fei-Fei Li's WorldLabs and Israeli startup Decart. However, DeepMind claims Genie2 surpasses them in the richness and diversity of its generated worlds. Users can navigate these environments freely using a mouse or keyboard, performing actions like jumping and swimming.

DeepMind Unveils Genie2: An AI Model Generating

Trained on video data, Genie2 accurately simulates various physical phenomena, including object interactions, animations, lighting, reflections, and non-player character (NPC) behavior. Many Genie2-generated scenes rival the visuals of AAA video games, possibly due to the inclusion of extensive data from popular game experiences in its training dataset. However, DeepMind hasn't disclosed its data sources and training methods in detail, citing potential business competition or other reasons.

This lack of transparency raises intellectual property concerns. As a Google subsidiary, DeepMind has unrestricted access to YouTube, and Google has previously hinted at agreements allowing the use of YouTube videos for model training. However, whether Genie2 inadvertently replicates unauthorized content from video games it "watched" during training remains a critical question, with ultimate adjudication likely resting with the courts.

DeepMind states that Genie2 generates coherent 3D worlds from different perspectives (e.g., first-person and isometric views), which can run for up to a minute, though most scenes last 10-20 seconds. The DeepMind team notes in a blog post that Genie2 intelligently responds to keyboard input, accurately identifying characters and moving them accordingly. "For example, our model understands that directional keys should control the robot's movement, not the trees or clouds."

Compared to similar models, Genie2 shows significant improvements in addressing issues like artificiality, incoherence, and hallucinations. Many models, such as Decart's Minecraft simulator Oasis, suffer from low resolution and rapid "forgetting" of level layouts. Genie2, however, remembers parts of the simulated scene not directly rendered and accurately renders them upon reappearance, echoing the capabilities of Li Fei-Fei's WorldLabs model.

Despite these advancements, games created using Genie2 currently lack sufficient playability due to the clearing of player progress every minute. Therefore, DeepMind positions Genie2 as a research and innovation tool primarily for prototyping "interactive experiences" and evaluating AI agents. DeepMind's blog post states: "Genie2's excellent generalization capabilities allow for the seamless translation of concept art and blueprints into fully interactive environments. This allows our researchers to rapidly build diverse and rich environments for AI agents, enabling the generation of evaluation tasks unseen during training to test agent capabilities."

For creative professionals, especially in the video game industry, this technology presents complex implications. A recent Wired investigation revealed that major game companies like Activision Blizzard are using AI to reduce costs, increase efficiency, and address employee attrition indeed, Activision Blizzard has laid off dozens of employees.

Nevertheless, Google continues to invest heavily in world model research, a field poised to become the next major breakthrough in AI. Last October, DeepMind successfully recruited Tim Brooks, formerly of OpenAI's Sora video generator, to lead its video generation technology and world simulation efforts. Two years prior, DeepMind also acquired Tim Rocktschel from Meta, known for his experiments with the "open-endedness" of games like NetHack.

Genie2 marks a significant advancement in AI-generated 3D world technology. While challenges remain, such as limited game playability and potential intellectual property issues, Genie2, as a powerful research tool, will undoubtedly have a profound impact on the AI field and creative industries. It unlocks new possibilities for future game development, AI agent evaluation, and the creation of other interactive experiences, highlighting the enormous potential of AI in content creation. Future developments remain to be seen. DeepMind's continued investment in AI technology demonstrates its ambition and confidence in future technologies. The success of Genie2 will undoubtedly further propel AI advancements and bring more innovation and possibilities to the world. The application range is broad, potentially extending beyond game development to encompass film production, architectural design, and urban planning. Genie2 also sparks discussion regarding the ethical and societal implications of AI technology, requiring collective societal attention and deliberation. Alongside technological progress, we must address potential risks and establish appropriate regulations to ensure AI serves humanity effectively. The challenges and achievements DeepMind faced during Genie2's development offer valuable experience and guidance for other AI research institutions. We can anticipate the emergence of more AI models similar to Genie2, continuously driving AI advancements and enriching our digital worlds. Simultaneously, we must monitor technological directions and guarantee its service to humanity and the benefit of all mankind.

Tag: DeepMind Unveils Genie2 An AI Model Generating Endless 3D


Disclaimer: The content of this article is sourced from the internet. The copyright of the text, images, and other materials belongs to the original author. The platform reprints the materials for the purpose of conveying more information. The content of the article is for reference and learning only, and should not be used for commercial purposes. If it infringes on your legitimate rights and interests, please contact us promptly and we will handle it as soon as possible! We respect copyright and are committed to protecting it. Thank you for sharing.

AdminSo

http://www.adminso.com

Copyright @ 2007~2025 All Rights Reserved.

Powered By AdminSo

Open your phone and scan the QR code on it to open the mobile version


Scan WeChat QR code

Follow us for more hot news

AdminSo Technical Support