Unlock Free Access to OpenAI's New O3 Mini Model – ChatGPT 2025 Update

OpenAI Announces Innovative Models: O3 and O3-Mini

In a groundbreaking announcement on the final day of OpenAI’s annual event, Shipmas, OpenAI revealed their cutting-edge models, o3 and o3-mini. These models are designed to excel in complex reasoning tasks, setting new benchmarks in fields such as mathematics and science, and surpassing the capabilities of previous models like o1. OpenAI’s CEO, Sam Altman, noted that the o3 model was scheduled to be released at the end of January, a promise that has now been fulfilled.

O3-Mini: A Cost-Efficient Solution

On Friday, OpenAI launched the o3-mini model, which is the latest addition to their reasoning series, previously including models o1 and o1-mini. This model has been highlighted for its impressive proficiency in scientific, mathematical, and coding applications, making it the most cost-effective option in OpenAI’s advanced series.

OpenAI o3-mini is now integrated within ChatGPT and the API. Users subscribed to the Pro version enjoy unrestricted access, while Plus and Team users benefit from thrice the rate limits compared to o1-mini. Free users can explore o3-mini on ChatGPT by opting for the 'Reason' button located under the message composer.

Enhanced Performance and Speed

Upon selecting o3-mini, users experience medium reasoning efforts that balance rapid responses with high accuracy. Although o1 maintains a broader general knowledge scope than o3-mini, the new model significantly outpaces o1-mini in terms of speed and performance.

Benchmark Performance Overview

Our expert testers have rigorously compared the o3-mini against the o1-mini, and findings demonstrated that o3-mini generates more accurate and articulate responses. An impressive 56% preference rate was observed in favor of o3-mini, alongside a notable 39% decrease in substantial errors.

Beyond subjective assessments, o3-mini excelled in several STEM-related benchmarks, including Competition Math (AIME 2024), PhD-level Science Questions (GPQA Diamond), and Competition Code (Codeforces), outperforming o1-mini in many instances. It’s worth mentioning that o3-mini's high reasoning effort closely aligns with o1’s performance, occasionally surpassing expectations as illustrated in the AIME 2024 and Software Engineering (SWE-bench Verified) benchmarks.

Benchmark O1-Mini O3-Mini (Medium Reasoning) O3-Mini (High Reasoning)
Competition Math (AIME 2024) Average Above Average Exceptional
PhD-level Science Questions (GPQA Diamond) Satisfactory High Very High
Competition Code (Codeforces) Good Excellent Outstanding

Safety Evaluations

OpenAI has diligently assessed the safety of the o3-mini model prior to public release using jailbreak and disallowed content evaluations. Findings indicate that o3-mini's safety metrics substantially exceed those of GPT-4o. Detailed evaluation results have been published, complemented by the release of a comprehensive o3-mini System Card, spanning 37 pages, which elaborates on the evaluation methodology and findings.

Accessing O3-Mini

All paid subscribers to OpenAI’s services, including ChatGPT Plus, Team, and Pro, now have the privilege to utilize o3-mini. Users within the Plus and Team tiers experience tripled rate limits, transitioning from 50 messages daily with o1-mini to a noteworthy 150 messages daily. ChatGPT Enterprise access will be rolled out within the next week.

For users without subscriptions, access to o3-mini is possible via free accounts. Simply click the "Reason" option in the message textbox or regenerate a response to test its effectiveness. OpenAI’s CEO, Sam Altman, assured free user access in a post on X, underscoring the removal of historical paywall restrictions for advanced reasoning capabilities of this model.

Industry Insights: AI for Coding 2025

In the sphere of artificial intelligence and coding, o3-mini is setting a new standard for innovations in 2025. Users are introduced to the "Think Deeper" feature at no cost, representing a paradigm shift in AI capability utilization. Meanwhile, discussions are circulating regarding the less recommended options, including DeepSeek R1, in contrast to superior options such as o3-mini.

On testing options like DeepSeek’s R1 and V3 coding skills, initial impressions indicate that there remains significant potential for further refinement. Conversely, the simplicity of deploying an LLM on MacOS offers intriguing benefits, warranting consideration for technology enthusiasts.

For Microsoft 365 users contemplating the utility of Copilot, explicit instructions are available for its removal, enabling broader integration of OpenAI’s advanced models into their daily workflows.

Kari

Kari

An expert in home and lifestyle products. With a background in interior design and a keen eye for aesthetics, Author Kari provides readers with stylish and practical advice. Their blogs on home essentials and décor tips are both inspiring and informative, helping readers create beautiful spaces effortlessly.