OpenAI's Breakthrough New Model: OpenAIo1, Surpasses GPT-4 in Reasoning Abilities, Ushering in a New Era of Artificial Intelligence
OpenAI's Breakthrough New Model: OpenAIo1, Surpasses GPT-4 in Reasoning Abilities, Ushering in a New Era of Artificial IntelligenceSeptember 13th: Following the groundbreaking release of GPT-4 last year, OpenAI has unveiled a new model, OpenAIo1, marking a significant shift in its research direction. OpenAIo1 possesses unique logical "reasoning" capabilities, enabling it to solve intricate problems that elude existing models, including OpenAI's current powerhouse, GPT-4
OpenAI's Breakthrough New Model: OpenAIo1, Surpasses GPT-4 in Reasoning Abilities, Ushering in a New Era of Artificial Intelligence
September 13th: Following the groundbreaking release of GPT-4 last year, OpenAI has unveiled a new model, OpenAIo1, marking a significant shift in its research direction. OpenAIo1 possesses unique logical "reasoning" capabilities, enabling it to solve intricate problems that elude existing models, including OpenAI's current powerhouse, GPT-4. Unlike conventional large language models that typically provide answers in one step, OpenAIo1, like a human, arrives at correct conclusions through step-by-step reasoning.
"We see this as a new paradigm in AI model development," Mira Murati, OpenAI's Chief Technology Officer, said in an interview. "It demonstrates unprecedented advantages, particularly in tackling complex reasoning tasks."
Internally codenamed "Strawberry," OpenAIo1 is not intended to replace GPT-4 but rather complement it. Murati also revealed that OpenAI is currently building GPT-5, the next generation of its flagship model, which will be significantly larger than its predecessors. While the company still believes that scaling model size can yield breakthroughs, GPT-5 is expected to integrate the reasoning techniques showcased in OpenAIo1. "We have two avenues of development: scaling and enhancing reasoning capabilities," Murati stated. "Ultimately, we aim to combine both."
Large language models typically rely on vast neural networks and massive datasets for training, exhibiting remarkable language and logical abilities but struggling with fundamental reasoning problems, such as simple math problems. Murati pointed out that OpenAIo1 employs reinforcement learning techniques, progressively improving the model's reasoning strategies by providing positive feedback for correct answers and negative feedback for incorrect ones. "This model not only sharpens its thinking," she added, "but it also fine-tunes its strategies for tackling problems."
Reinforcement learning has already propelled computers to excel in games and complex tasks like chip design and is a key technology for developing efficient and engaging chatbots. Mark Chen, OpenAI's Vice President of Research, demonstrated OpenAIo1's capabilities by solving several problems that GPT-4 couldn't handle, including intricate chemistry questions and a challenging mathematical reasoning problem. "The new model doesn't just mimic human thought processes," Chen said. "It has learned to think independently."
OpenAI claims that the new model performs exceptionally well in various fields, including programming, mathematics, physics, biology, and chemistry. In the American Invitational Mathematics Examination (AIME), GPT-4o achieved an average solution rate of 12%, while OpenAIo1 boasted an 83% solution rate.
However, OpenAIo1 is slower than GPT-4o, and its advantages are not always evident. This is partly because it lacks web search capabilities and multimodality, rendering it incapable of handling image or audio data.
Improving the reasoning abilities of large language models has been a research hotspot in recent years, with competitors actively exploring similar avenues. In July, Google announced AlphaProof, a project combining language models and reinforcement learning focused on tackling complex mathematical problems. While AlphaProof can learn mathematical reasoning from correct answers, it faces the challenge that real-world problems don't always have clear-cut answers. OpenAI asserts that its new reasoning system has wider applicability. "We've achieved a breakthrough in this area, which is our advantage," Chen believes. "The system excels in reasoning tasks across various domains."
Noah Goodman, a professor at Stanford University, has published research on enhancing the reasoning capabilities of large language models. He believes that carefully designed prompt engineering combined with hand-optimized data could be key to achieving reasoning improvements. Goodman also states that striking a balance between speed and accuracy would be "a significant advancement."
Yoon Kim, an assistant professor at MIT, points out that the reasoning mechanisms of large language models remain enigmatic. Despite their ability to reason step-by-step, they fundamentally differ from human thought processes or existence. This difference could become crucial when the technology is widely adopted. "These systems could make decisions affecting numerous individuals," he said. "The bigger question is whether we can trust their decision-making processes."
OpenAI's new technology could also contribute to ensuring the "benign" behavior of AI models. Murati suggests that the new reasoning model can prevent undesirable or harmful outputs by inferring the consequences. "Like educating children," she said, "once they understand why they should do something, they are better at adhering to norms, behavioral codes, and values."
Oren Etzioni, a professor emeritus at the University of Washington and an AI expert, notes, "Endowing large language models with the ability to engage in multi-step reasoning, use tools, and solve complex problems is crucial, and scaling alone cannot achieve this." He adds that even with improved reasoning abilities, issues like hallucinations and fact verification persist.
Chen reveals that the new reasoning approach developed by the company indicates that artificial intelligence advancement doesn't necessarily require immense computational power. "This transformation is exciting. We believe it can deliver intelligent outputs at lower costs, which aligns with our company's core vision."
Tag: New in OpenAI Breakthrough Model OpenAIo1 Surpasses GPT-4 Reasoning
Disclaimer: The content of this article is sourced from the internet. The copyright of the text, images, and other materials belongs to the original author. The platform reprints the materials for the purpose of conveying more information. The content of the article is for reference and learning only, and should not be used for commercial purposes. If it infringes on your legitimate rights and interests, please contact us promptly and we will handle it as soon as possible! We respect copyright and are committed to protecting it. Thank you for sharing.