AI Timeline

A chronological record of major AI model releases, industry milestones, and pivotal events from 2024 to present.

The pace of AI advancement is accelerating. Every month brings new model releases, price drops, capability breakthroughs, and industry consolidation. Tracking this timeline helps you understand the trajectory: where we came from, where we are now, and what's coming next.

From April 2024 to April 2026, we've seen Claude evolve from Opus to Sonnet to a new generation. OpenAI released GPT-4o with vision and function calling. Google launched Gemini 2.5 with 2 million token context. Mistral and Llama launched open-source models that changed the economics of AI deployment. Pricing dropped dramatically as providers competed for market share. Policy shifted too, with regulation discussions in the EU and debates over AI safety standards.

What patterns are visible in this data? Model releases cluster around major announcements. Pricing announcements typically decrease (rarely increase). Open-source releases create competitive pressure on commercial models. Major companies iterate quarterly. Filter this timeline by category to focus on what matters most to you: model releases, pricing changes, policy shifts, acquisitions, or research breakthroughs.

2026

Jun 9, 2026

Anthropic Releases Claude Fable 5, a New Frontier Tier Above Opus

Anthropic

Anthropic launched Claude Fable 5 (API id claude-fable-5), a new frontier tier positioned above Claude Opus 4.8, which remains the default model. Pricing is $10 per million input tokens and $50 per million output with no long-context surcharge, on a 1 million token default context window with up to 128K output tokens and text plus vision input. Adaptive thinking is always on and cannot be disabled, with effort levels spanning low, medium, high, xhigh, and max. Anthropic's launch table reports 80.3 on SWE-bench Pro, 29.3 on FrontierCode Diamond, 85.0 on OSWorld-Verified, and a GDPval-AA ELO of 1932, all vendor-reported. Always-on safety classifiers can reroute flagged requests to Opus 4.8, billed at Opus rates for the rerouted portion. The model is available on the Claude API, Amazon Bedrock, Google Vertex, and Microsoft Foundry at launch.

Model Release

Jun 2, 2026

Microsoft Launches MAI-Code-1-Flash and MAI-Thinking-1 at Build 2026

Microsoft

Microsoft announced its first in-house coding and reasoning models at Build 2026 in San Francisco. MAI-Code-1-Flash rolls out across all GitHub Copilot tiers with a 256K context window, priced at $0.75 per million input tokens ($0.075 cached) and $4.50 per million output, with Microsoft claiming better price to performance than Claude Haiku 4.5 and 60% fewer tokens used on hard tasks. MAI-Thinking-1, a 35B active parameter MoE reasoning model with a 256K context window, posted 97% on AIME 25 and 53% on SWE-Bench Pro, which Microsoft says matches Claude Opus 4.6. It enters private preview on Microsoft Foundry with distribution planned through Fireworks AI, Baseten, and OpenRouter. The launches mark a deliberate step away from Microsoft's dependence on OpenAI models inside Copilot.

Model Release

Jun 1, 2026

Anthropic Confidentially Files S-1 for IPO

Anthropic

Anthropic confidentially submitted a draft S-1 registration statement to the SEC for a proposed initial public offering, less than a week after closing a $65 billion Series H at a $965 billion post-money valuation. Annualized revenue run-rate recently crossed $47 billion on enterprise adoption of Claude for coding and agentic workflows. Share count and pricing are not yet determined. The move puts Anthropic ahead of OpenAI, which is reportedly preparing its own confidential filing, in the race to a landmark AI listing.

Jun 1, 2026

MiniMax Releases M3 Open-Weight Coding Model

MiniMax

MiniMax launched M3, an open-weight coding and agentic model built on MiniMax Sparse Attention, which replaces full attention with KV-block selection and cuts per-token compute at 1M context to roughly one twentieth of the previous generation. M3 accepts text, image, and video input across a 1,048,576 token context window with up to 512K output tokens, priced at $0.30 per million input and $1.20 per million output. MiniMax reports 59% on SWE-Bench Pro and 83.5 on BrowseComp, though headline runs used its own infrastructure with agent scaffolding, so independent verification is pending. Weights and a technical report are due on Hugging Face within about ten days.

Open Source

May 28, 2026

Anthropic Raises $65B Series H at $965B Valuation

Anthropic

Anthropic closed a $65 billion Series H funding round at a $965 billion post-money valuation, surpassing OpenAI's reported $852 billion to become the most valuable private AI company. The round landed the same day as the Claude Opus 4.8 release and set the stage for the confidential IPO filing that followed on June 1.

May 28, 2026

Anthropic Releases Claude Opus 4.8

Anthropic

Anthropic shipped Claude Opus 4.8 just six weeks after Opus 4.7, keeping the 1 million token context window. Anthropic's own measures put agentic coding at 69.2% (up from 64.3%) and knowledge work at 1890 (up from 1753), with the model roughly 4x less likely than 4.7 to let flaws in its own code slip through while using about 35% fewer tokens per task. A new fast mode runs roughly 2.5x quicker and about three times cheaper than before, and Dynamic Workflows adds large-scale parallel subagent support.

Model Release

May 21, 2026

Trump Postpones AI Executive Order Hours Before Signing

US Government

President Trump pulled a landmark AI executive order from the schedule hours before its planned signing ceremony on May 21. The order would have established a voluntary 90-day pre-launch review framework for frontier AI models with NSA involvement and tasked federal agencies with using AI models to harden government network defenses. Trump cited concerns the framework could slow US competitiveness against China. Former AI czar David Sacks, Mark Zuckerberg, and Elon Musk are reported to have lobbied against it directly. No new signing date announced.

Policy

May 21, 2026

Coupa Acquires Tonkean for Agentic Procurement

Coupa

Coupa acquired Tonkean, a Palo Alto-based no-code workflow orchestration platform with 250+ native connectors and multi-agent coordination. Financial terms were not disclosed. The deal is Coupa's second AI-focused acquisition in two weeks, following the May 12 Rossum (intelligent document processing) deal, and feeds Coupa's emerging agentic trade network across procurement, invoicing, and supplier transactions.

Acquisition

May 20, 2026

Cohere Releases Command A+ Open Source Under Apache 2.0

Cohere

Cohere released Command A+, a 218B-total / 25B-active mixture-of-experts model with a 128K input context and 64K max generation, under an Apache 2.0 license. Multimodal text and image input, tool use, 48-language coverage, available on Hugging Face in BF16, FP8, and W4A4 quantizations. Runs on a single NVIDIA Blackwell B200 or two H100s at W4A4. Scores 37 on the Artificial Analysis Intelligence Index, 75.1% on MMMU, and 80.6% on MathVista. Cohere's first fully Apache 2.0 enterprise model and first multimodal reasoning model.

Open Source

May 19, 2026

Google Launches Gemini 3.5 Flash at I/O 2026

Google

Google ships Gemini 3.5 Flash to general availability at I/O 2026. Priced at $1.50 per million input and $9.00 per million output tokens, with a 1,048,576 token context window. The first Flash-tier release that outscores the previous Pro flagship (Gemini 3.1 Pro) on agentic coding suites including Terminal-Bench 2.1 (76.2%) and MCP Atlas (83.6%), at roughly 4x the throughput.

Model Release

May 19, 2026

Google Announces Gemini Spark Agent at I/O 2026

Google

Google introduces Gemini Spark, a general-purpose AI agent inside the Gemini app that can reason across connected apps and take actions on the user's behalf. Available first to trusted testers and Google AI Ultra subscribers in the week following the keynote.

May 20, 2026

Alibaba Unveils Qwen3.7-Max at Cloud Summit

Alibaba

Alibaba unveils Qwen3.7-Max at the 2026 Alibaba Cloud Summit, the flagship proprietary model in the Qwen3.7 family. 1 million token context window, extended thinking mode, claims of autonomous operation up to 35 hours on long-horizon agentic tasks. Scored 57 on the Artificial Analysis Intelligence Index (first place on the public leaderboard) and roughly 1,475 Elo on the LM Arena text leaderboard. Landed on the Alibaba API platform on May 19, formally announced May 20. Priced at $2.50 per million input and $7.50 per million output tokens via OpenRouter.

Model Release

Apr 28, 2026

NVIDIA Releases Nemotron 3 Nano Omni

NVIDIA

NVIDIA ships Nemotron 3 Nano Omni 30B-A3B-Reasoning, an open-weight multimodal model that processes text, image, video, and audio in a unified sequence. Hybrid Mamba-Transformer-MoE backbone (30B total, 3B active per token), 256K context window, native audio handling up to 20 minutes per clip. Tops six leaderboards for document intelligence and video/audio understanding including OCRBenchV2-En (65.8), MMLongBench-Doc (57.5), OSWorld (47.4), Video-MME (72.2), and VoiceBench (89.4). Available on Hugging Face in BF16, FP8, and NVFP4 quantizations.

Open Source

May 7, 2026

OpenAI Launches GPT-5.5-Cyber Preview

OpenAI

OpenAI rolls out GPT-5.5-Cyber, a variant of GPT-5.5 tuned for cybersecurity workflows, in limited preview to vetted defenders. Launch partners include Cisco, CrowdStrike, Palo Alto Networks, Cloudflare, Intel, Snyk, and SentinelOne. The model offers reduced classifier-based refusals for authorized red team and vulnerability research.

Model Release

May 5, 2026

OpenAI Releases GPT-5.5 Instant for ChatGPT

OpenAI

OpenAI ships GPT-5.5 Instant as the new default ChatGPT model, replacing GPT-5.4 Instant. The update tightens accuracy, cuts gratuitous emoji output, and rolls out to free, Plus, Pro, Business, and Edu tiers.

May 5, 2026

Google Ships Gemini 3.1 Flash-Lite Preview

Google

Google releases Gemini 3.1 Flash-Lite in preview through the Gemini API and Vertex AI. Priced at $0.25 per million input and $1.50 per million output, with a 1M context window, the model targets high-volume workloads at half the cost of Gemini 3 Flash.

Model Release

Apr 25, 2026

Amazon Commits $25B to Anthropic

Anthropic

Amazon invests up to $25 billion in additional funding to Anthropic, bringing its total commitment to $33 billion. The deal includes over $100 billion in AWS infrastructure spending over the next decade.

Apr 24, 2026

DeepSeek V4 Pro & Flash Released

DeepSeek

DeepSeek releases V4 Pro (1.6T parameters, 49B active) and V4 Flash (284B total, 13B active) under the MIT license with native 1M context windows. V4 Pro scores 80.6% on SWE-bench Verified.

Open Source

Apr 23, 2026

GPT-5.5 Released

OpenAI

OpenAI launches GPT-5.5, its first fully retrained base model since GPT-4.5. Features 1M context, native omnimodal capabilities, and benchmark leadership at $5/$30 per million tokens.

Model Release

Apr 22, 2026

Claude Design Launched

Anthropic

Anthropic launches Claude Design as an Anthropic Labs research preview. It reads codebases, generates design systems, and hands off finished prototypes to Claude Code.

Apr 21, 2026

SpaceX Acquires xAI for $250B

xAI

SpaceX acquires Elon Musk's xAI in a $250 billion deal, the largest acquisition in AI history. The merger brings Grok and the Colossus GPU cluster under SpaceX.

Acquisition

Apr 17, 2026

Claude Opus 4.7 Released

Anthropic

Anthropic releases Claude Opus 4.7 with a 1 million token context window at flagship pricing and incremental gains across reasoning, code, and SWE-bench.

Model Release

Mar 25, 2026

Claude Sonnet 4.6 Released

Anthropic

Anthropic releases Claude Sonnet 4.6, a faster and more affordable model in the 4.6 family with strong coding and reasoning performance.

Model Release

Mar 15, 2026

Claude Opus 4.6 Released

Anthropic

Anthropic releases Claude Opus 4.6 with extended thinking and a 1M context window option, setting new benchmarks across reasoning tasks.

Model Release

Mar 12, 2026

Gemini 2.5 Pro Released

Google

Google launches Gemini 2.5 Pro with a native 1M token context window and improved multimodal reasoning capabilities.

Model Release

Mar 5, 2026

Llama 4 Scout & Maverick Released

GPT-4.5 Released

OpenAI

OpenAI launches GPT-4.5 with a 256K context window and significantly improved creative writing and emotional intelligence.

Model Release

Feb 14, 2026

Grok 3 Released

xAI

xAI releases Grok 3 with strong reasoning performance, trained on the Colossus 200K GPU cluster.

Model Release

Jan 20, 2026

DeepSeek V3 Released

DeepSeek

DeepSeek releases V3, a powerful open-source model trained with a fraction of the compute used by competitors, sparking industry debate about efficiency.

Open Source

2025

Oct 22, 2025

Claude 3.5 Haiku Released

Anthropic

Anthropic releases Claude 3.5 Haiku, a fast and affordable model that outperforms the original Claude 3 Opus on many benchmarks.

Model Release

Oct 20, 2025

Claude 3.5 Sonnet v2 Released

Anthropic

Anthropic ships an upgraded Claude 3.5 Sonnet with computer use capabilities and significant improvements in coding and tool use.

Model Release

Oct 1, 2025

EU AI Act Enforcement Begins

The European Union begins enforcing the AI Act, requiring risk assessments and transparency for high-risk AI systems deployed in EU markets.

Policy

Sep 12, 2025

OpenAI o1 Released

OpenAI

OpenAI launches o1, a reasoning-focused model that uses chain-of-thought at inference time to solve complex math and coding problems.

Model Release

Jul 23, 2025

Llama 3.1 405B Released

GPT-4o Mini Released

OpenAI

OpenAI launches GPT-4o Mini at $0.15 per million input tokens, dramatically undercutting existing pricing and pressuring the entire market.

Pricing

Jun 20, 2025

Claude 3.5 Sonnet Released

Anthropic

Anthropic releases Claude 3.5 Sonnet, surpassing GPT-4o on most benchmarks while maintaining fast response times and lower costs.

Model Release

May 13, 2025

GPT-4o Released

OpenAI

OpenAI releases GPT-4o with native multimodal capabilities including real-time voice and vision, available free to all ChatGPT users.

Model Release

Apr 18, 2025

Llama 3 Released

Claude 3 Family Released

Anthropic

Anthropic launches the Claude 3 model family (Haiku, Sonnet, Opus) with vision capabilities and a 200K context window.

Model Release

Feb 21, 2025

Gemini 1.5 Pro Released

Google

Google launches Gemini 1.5 Pro with a 1M token context window, a major leap in long-context processing for production use.

Model Release

Feb 15, 2025

Gemini 2.0 Flash Released

Google

Google releases Gemini 2.0 Flash with native tool use and multimodal output, optimized for agentic workflows.

Model Release

Jan 15, 2025

Mistral Large 2 Released

Mistral

Mistral releases Large 2 with a 256K context window, strong multilingual support, and competitive pricing for enterprise use.

Model Release

2024

Nov 6, 2024

OpenAI Valued at $157B

OpenAI

OpenAI closes a $6.6B funding round at a $157B valuation, making it the most valuable private AI company in the world.

Acquisition

Aug 1, 2024

EU AI Act Enters Into Force

The EU AI Act officially enters into force, establishing the world's first comprehensive legal framework for artificial intelligence regulation.

Policy

Mar 28, 2024

Anthropic Raises $2.75B from Amazon

Anthropic

Amazon completes its $2.75B additional investment in Anthropic, bringing its total commitment to $4B and deepening the AWS partnership.

Acquisition

Timeline begins here