103 models · 49 providers · 162 mappings

Open Registry & Telemetry
for AI Infrastructure

Discover, validate and compare LLM models, inference providers, MCP servers, and agent skills using open data and real-time telemetry. Track latency, uptime, pricing, capabilities, and provider mappings in one place.

Tracking providers across the ecosystem

OpenAI
Anthropic
Google
AWS Bedrock
Meta
Groq
Mistral
DeepSeek
xAI
IBM
Azure AI
Cohere
NVIDIA
Alibaba
Xiaomi
Hugging Face
Together AI
Fireworks
Replicate
SambaNova
Scaleway
Nebius
OpenAI
Anthropic
Google
AWS Bedrock
Meta
Groq
Mistral
DeepSeek
xAI
IBM
Azure AI
Cohere
NVIDIA
Alibaba
Xiaomi
Hugging Face
Together AI
Fireworks
Replicate
SambaNova
Scaleway
Nebius

103

Models

49

Providers

162

Provider Mappings

$5.45

Avg $/1M Tokens

Popular Models

Top-ranked models by relevance and provider availability

View all →

DeepSeek R1

131K ctx

DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.

chatcompletioncode-generationreasoning
textcode

GPT-5.5 Pro

256K ctx

OpenAI's premium tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.

chatcompletionfunction-callingvision+3
textimageaudiocode

Gemini 3.1 Pro

2.0M ctx

Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and million-token context window.

chatcompletionfunction-callingvision+3
textimageaudiovideocode

Claude Opus 4.8

300K ctx

Anthropic's most advanced model, building on Opus 4.7 with improvements across benchmarks in coding, agentic skills, reasoning, and knowledge work. Features enhanced honesty, better tool use efficiency, dynamic workflows support, and improved alignment.

chatcompletionfunction-callingvision+2
textimagecode

DeepSeek V4

256K ctx

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

chatcompletionfunction-callingcode-generation+1
textcode

Gemma 4 31B

262K ctx

Google's flagship open-weight dense model with 31B parameters. All parameters active per forward pass. Ranks among top open models with strong performance on AIME 2026 (89.2%) and MMLU Pro (85.2%). Supports vision and extended context.

chatcompletionvisioncode-generation+1
textimagecode

Mistral Large 3

256K ctx

Mistral AI's largest open-weight model with 41B active parameters (675B total MoE). State-of-the-art general-purpose multimodal model with 256K context window and powerful agentic capabilities. Released under Apache 2.0.

chatcompletionfunction-callingvision+2
textimagecode

MiniMax M3

1.0M ctx

MiniMax's frontier open-weight model with 1M-token context window, native multimodality (text, image, video), and strong coding capabilities. Built on MiniMax Sparse Attention (MSA) architecture, achieving 59% on SWE-Bench Pro with significantly improved efficiency at long context.

chatcompletionfunction-callingcode-generation+2
textimagevideocode

Mistral Medium 3.5

128K ctx

Mistral AI's balanced model offering strong multilingual performance with excellent price-performance ratio. Optimized for production workloads requiring reliable quality across European and global languages.

chatcompletionfunction-callingcode-generation+1
textcode

DeepSeek V4 Flash

1.0M ctx

DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports 1M token context window.

chatcompletionfunction-callingcode-generation
textcode

Gemini 3.1 Flash-Lite

1.0M ctx

Google's most cost-efficient Gemini model optimized for high-volume, low-latency use cases. Delivers 2.5x faster time to first token versus Gemini 2.5 Flash with full multimodal support. Ideal for agentic tasks, data extraction, translation, and classification.

chatcompletionfunction-callingvision+2
textimageaudiovideocode

Claude Opus 4.6

300K ctx

Anthropic's most capable model in the Claude 4 family, excelling at complex analysis, extended reasoning, scientific research, and advanced code generation. Features significantly improved accuracy and reduced hallucinations.

chatcompletionfunction-callingvision+2
textimagecode

OpenModels CLI

Browse models, compare providers, and check telemetry directly from your terminal. JSON and YAML output for scripting and CI/CD.

npm install -g openmodels-cli
terminal