
o1

Flagship

by OpenAI

OpenAI o1 is a reasoning-focused model that uses internal chain-of-thought to solve complex problems in math, science, and code. It trades speed for accuracy, producing some of the highest scores on MATH and GPQA benchmarks.

Input Price: $15.00 per 1M tokens

Output Price: $60.00 per 1M tokens

Context Window: 200K tokens

Released: 2024-12

API access

Capabilities

text, reasoning, code

Key Strengths

  • Advanced chain-of-thought reasoning
  • Top math and science scores
  • 200K context
  • Strong code generation

Best For

  • Scientific research
  • Complex math problems
  • Advanced code debugging
  • Logical reasoning tasks

Benchmark Scores

| Benchmark | Score | Description |
|---|---|---|
| MMLU-Pro | 91.8 | General knowledge and reasoning across 57 subjects |
| HumanEval | 94.2 | Python code generation and problem solving |
| GPQA Diamond | 72.5 | Graduate-level science questions verified by domain experts |
| MATH | 94.6 | Competition-level mathematics problems |
| SWE-bench | 58.9 | Real-world software engineering tasks from GitHub issues |

Scores are sourced from public benchmark datasets. See the full benchmark leaderboard for all models.

Pricing Details

Input tokens: $15.00 per 1M tokens

Output tokens: $60.00 per 1M tokens

Estimated cost per 1K requests: $45.00

Assumes an average of ~1K input and ~500 output tokens per request.

Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.
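
The $45.00 estimate can be reproduced from the listed per-token prices. The following is a minimal sketch, not the site's actual calculator: the function name `estimate_cost` and its parameters are hypothetical, and the prices are hard-coded from this page, so they may be out of date.

```python
from decimal import Decimal

# Prices as listed on this page, in USD per 1M tokens (may have changed since).
INPUT_PRICE_PER_MTOK = Decimal("15.00")
OUTPUT_PRICE_PER_MTOK = Decimal("60.00")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(requests: int, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate USD cost for `requests` calls, given average tokens per call.

    Hypothetical helper for illustration only.
    """
    input_cost = Decimal(requests * input_tokens) / TOKENS_PER_MTOK * INPUT_PRICE_PER_MTOK
    output_cost = Decimal(requests * output_tokens) / TOKENS_PER_MTOK * OUTPUT_PRICE_PER_MTOK
    return input_cost + output_cost


# 1K requests at ~1K input and ~500 output tokens each:
# 1M input tokens ($15.00) + 0.5M output tokens ($30.00).
cost = estimate_cost(requests=1_000, input_tokens=1_000, output_tokens=500)
print(f"${cost:.2f}")  # $45.00
```

`Decimal` is used instead of floats so that currency amounts do not pick up binary rounding error. Output tokens dominate the total here: they make up a third of the token volume but two thirds of the cost.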
