127 models · 52 providers · 199 mappings

Open Registry
for AI Infrastructure

Discover and compare models, providers, MCP servers, and agent skills with source-transparent pricing, context limits, capabilities, access terms, and deployment data.

Explore Models Compare Models Explore MCP

Tracking providers across the ecosystem

OpenAI

Anthropic

Google

AWS Bedrock

Popular Models

Top-ranked models by relevance and provider availability

View all →

Claude Fable 5

Anthropic's first publicly available Mythos-class model, exceeding the capabilities of any model the company has previously made generally available. State-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, vision, and scientific research. Its lead grows on longer and more complex tasks. Ships with built-in safeguards that route sensitive cybersecurity, biology, chemistry, and distillation queries to Claude Opus 4.8.

Context

300K

Released

Jun 2026

Providers Compare

GPT-5.6 Terra

The balanced tier of OpenAI's GPT-5.6 series, trading a small amount of peak quality for markedly lower latency and cost. Retains strong reasoning, coding, and agentic tool use with configurable reasoning effort, making it a default choice for production workloads that need frontier capability at scale.

Context

1.0M

Released

Jul 2026

Providers Compare

Grok 4.5

xAI's strongest model to date, built to excel at coding, agentic tasks, and knowledge work and co-developed alongside coding tools for real-world software engineering. Features real-time information access, extended reasoning, and large-context tool use with an OpenAI-compatible API.

Context

500K

Released

Jul 2026

Providers Compare

Claude Opus 4.8

Anthropic's most advanced model, building on Opus 4.7 with improvements across benchmarks in coding, agentic skills, reasoning, and knowledge work. Features enhanced honesty, better tool use efficiency, dynamic workflows support, and improved alignment.

Context

300K

Released

May 2026

Providers Compare

GPT-5.5 Pro

OpenAI's premium tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.

Context

256K

Released

Mar 2026

Providers Compare

Gemini 3.1 Pro

Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and million-token context window.

Context

2.0M

Released

Mar 2026

Providers Compare

DeepSeek R1

DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.

Context

131K

Released

Jan 2025

Providers Compare

DeepSeek V4

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

Context

256K

Released

Feb 2026

Providers Compare

Nemotron 3 Ultra

NVIDIA's flagship open 550B-parameter Mixture-of-Experts model with 55B active parameters, built for frontier reasoning and orchestration in long-running agentic systems. Features hybrid Mamba-Transformer architecture, LatentMoE routing, multi-token prediction, and NVFP4 precision for 5x higher throughput. Achieves 30% lower cost-to-task-completion on agentic benchmarks. Supports 1M+ token context window with 95% accuracy on Ruler@1M.

Context

1.0M

Released

Jun 2026

Providers Compare

Gemma 4 31B

Google's flagship open-weight dense model with 31B parameters. All parameters active per forward pass. Ranks among top open models with strong performance on AIME 2026 (89.2%) and MMLU Pro (85.2%). Supports vision and extended context.

Context

262K

Released

Apr 2026

Providers Compare

DeepSeek V4 Flash

DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports 1M token context window.

Context

1.0M

Released

Apr 2026

Providers Compare

Mistral Medium 3.5

Mistral AI's balanced model offering strong multilingual performance with excellent price-performance ratio. Optimized for production workloads requiring reliable quality across European and global languages.

Context

128K

Released

Feb 2026

Providers Compare

Model comparison

Start with the workload. Then choose the model.

Turn a registry of models into a decision. Filter the full catalog, compare trustworthy values side by side, and calculate cost with your own assumptions.

Compare the facts that matter

Context, modalities, capabilities, access terms, and provider availability in one view.

Keep price attached to a provider

Deployment-specific input and output prices are never presented as global model properties.

Estimate your workload cost

Use your own token volumes, cache assumptions, and request count instead of an abstract score.

Find the right model How comparison works

Live registry snapshot

Largest documented context windows

Model-level capacity from the registry. Capacity is not quality.

Explore context →

Input price Output price Providers Open weights

Latest Insights

Analysis, benchmarks and comparisons across the LLM ecosystem

View all →

Jul 9, 2026·4 min read

GPT-5.6 and ChatGPT Work: From AI Assistant to AI Worker

OpenAI is no longer positioning ChatGPT as a conversational assistant. With GPT-5.6 and ChatGPT Work, the company is moving toward a full work execution layer across apps, files, code, and business workflows.

May 27, 2026·4 min read

The AI Race Is Shifting From IQ to Agentic Economics

The AI race is shifting from benchmark scores to agentic economics. Why inference costs, latency, and open-weight models are reshaping the industry in 2026.

May 15, 2026·3 min read

Stanford AI Index 2026: AI Is Scaling Faster Than Society Can Adapt

The release of the 2026 AI Index Report by Stanford HAI paints a very clear picture: artificial intelligence is no longer an emerging technology — it has become global infrastructure.

Open Registryfor AI Infrastructure

Popular Models

Claude Fable 5

GPT-5.6 Terra

Grok 4.5

Claude Opus 4.8

GPT-5.5 Pro

Gemini 3.1 Pro

DeepSeek R1

DeepSeek V4

Nemotron 3 Ultra

Gemma 4 31B

DeepSeek V4 Flash

Mistral Medium 3.5

Start with the workload. Then choose the model.

Compare the facts that matter

Keep price attached to a provider

Estimate your workload cost

Largest documented context windows

Latest Insights

GPT-5.6 and ChatGPT Work: From AI Assistant to AI Worker

The AI Race Is Shifting From IQ to Agentic Economics

Stanford AI Index 2026: AI Is Scaling Faster Than Society Can Adapt

Open Registry
for AI Infrastructure