
Despite months of rumored development, OpenAI’s release of its Project Strawberry last week came as something of a surprise, with many analysts believing the model wouldn’t be ready for weeks at least, if not later in the fall.
The new o1-preview model, and its o1-mini counterpart, are already available for use and evaluation, here’s how to get access for yourself.
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.
These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. https://t.co/peKzzKX1bu
— OpenAI (@OpenAI) September 12, 2024
What is o1?
OpenAI has made no secret of its artificial general intelligence (AGI) aspirations, and Project Strawberry (now known as “o1”) is the company’s next step toward that goal. It’s the first in a new line of “reasoning” models, “designed to spend more time thinking before they respond,” per an OpenAI announcement post. That strategy enables the model to, “reason through complex tasks and solve harder problems than previous models in science, coding, and math.”
The models reportedly reason in a human-like manner, allowing them to “refine their thinking process, try different strategies, and recognize their mistakes,” as they gain experience through training. According to OpenAI, o1-preview operates on par with Ph.D. students in physics, chemistry, and biology, and performs similarly on benchmark tests in those subjects. o1 is also adept at coding and math problems, scoring 83% in a International Mathematics Olympiad (IMO) qualifying exam where GPT-4o only scored 13% and reaching the 89th percentile in a Codeforces competition against human opponents.
here is o1, a series of our most capable and aligned models yet:https://t.co/yzZGNN8HvD
o1 is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it. pic.twitter.com/Qs1HoSDOz1
— Sam Altman (@sama) September 12, 2024
o1-mini is a lightweight version of the standard o1-preview model. It reportedly is 80% less expensive to operate than the larger iteration, making it especially capable in coding analysis and generation tasks.
Is o1-preview available to try?
Yes, the o1-preview models launched on September 12 for ChatGPT Plus and Teams subscribers. Enterprise and Educational users will have access at the start of the following week.