LLM Rank.top

The independent LLM leaderboard

Composite score, raw benchmark numbers, current API pricing, and direct links to try every major model — refreshed continuously.

30 models tracked · 6 benchmarks · No paywalls · No vendor bias
# Model Score Arena MMLU-Pro GPQA MATH Code SWE $ in / out Ctx Try
Try every model on this page from one API.

OpenRouter routes a single API key across GPT-5, Claude, Gemini, DeepSeek, Llama, and 100+ others — pay-as-you-go, no monthly minimum. Try OpenRouter → (affiliate · supports this site)


How the composite score is calculated

Each benchmark is normalised onto a 0–100 scale (Arena Elo is rescaled from a 1000–1500 band; percent benchmarks pass through unchanged). The composite is a weighted average across the available benchmarks for each model. Models with fewer than three published benchmarks are listed without a composite score so that single-benchmark coding specialists cannot displace well-rounded frontier models.
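The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the site's actual implementation: the page does not publish its weights, so the equal `WEIGHTS` below are hypothetical, as are the benchmark keys, the function names, the linear form of the Elo rescaling, the clamping of Elo values outside the 1000–1500 band, and the renormalisation of weights over whichever benchmarks a model has.

```python
from typing import Optional

# Hypothetical equal weights: the page does not state the real ones.
WEIGHTS = {
    "arena": 1.0,
    "mmlu_pro": 1.0,
    "gpqa": 1.0,
    "math": 1.0,
    "code": 1.0,
    "swe": 1.0,
}

# Models with fewer published benchmarks than this get no composite.
MIN_BENCHMARKS = 3


def normalise(name: str, raw: float) -> float:
    """Map a raw benchmark value onto a 0-100 scale."""
    if name == "arena":
        # Assumed linear rescale of the 1000-1500 Elo band to 0-100.
        scaled = (raw - 1000.0) / (1500.0 - 1000.0) * 100.0
        # Clamping out-of-band values is an assumption.
        return max(0.0, min(100.0, scaled))
    # Percent benchmarks pass through unchanged.
    return raw


def composite(scores: dict[str, Optional[float]]) -> Optional[float]:
    """Weighted average over the benchmarks a model actually has."""
    available = {
        name: value
        for name, value in scores.items()
        if name in WEIGHTS and value is not None
    }
    if len(available) < MIN_BENCHMARKS:
        return None  # listed without a composite score
    total_weight = sum(WEIGHTS[name] for name in available)
    weighted_sum = sum(
        WEIGHTS[name] * normalise(name, value)
        for name, value in available.items()
    )
    return weighted_sum / total_weight
```

With these assumed weights, a model reporting an Arena Elo of 1250, MMLU-Pro of 80, and GPQA of 60 would average the normalised values 50, 80, and 60, while a model reporting only Code and SWE results would return `None` and appear without a composite.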

Scores are compiled from provider technical reports, public papers, and Chatbot Arena snapshots. Submit corrections via the GitHub issue tracker.

Popular head-to-head comparisons

Pick any two models — composite score, raw benchmark numbers, API pricing, and a one-click route to try both behind the same key.
