Claude (Anthropic)

A Tier · 8.5/10

Anthropic's flagship LLM family. Claude Fable 5 (launched June 9, 2026) was the first publicly available Mythos-class model -- but on June 12, 2026 a US government export-control directive ordered access suspended, and Anthropic disabled Fable 5 + Mythos 5 for ALL customers to comply (every other Claude model is unaffected). Opus 4.8 (May 28) is the available flagship: $5/$25 per 1M, 1M-token context, effort control, and a cheap fast mode

Last updated: 2026-06-18Free tier available

Score Breakdown

9.0

Ease of Use

9.0

Output Quality

8.0

Value

8.0

Features

Benchmark Scores

Benchmarks for Claude Fable 5 (launched 2026-06-09; access SUSPENDED 2026-06-12 by US gov order -- scores are the pre-suspension record) -- vendor SWE-Bench Pro 80.3% (vs GPT-5.5 58.6%); #1 LMArena Elo 1510 and #1 Artificial Analysis Index 65 as of 6/11; legacy Opus-line reasoning-suite scores shown below as baseline pending full third-party suites

Chatbot Arena ELOHuman preference rating1510

Benchmark	Description	Score
MMLU	Knowledge across 57 subjects	91.3%
GPQA Diamond	Graduate-level science questions	91.3%
AIME 2024	Competition math problems	99.8%
HumanEval	Python code generation	94%
SWE-bench	Real GitHub issue fixing	80.8%
ARC-AGI	Abstract reasoning puzzles	75.2%

Last updated: 2026-06-11

Visit Claude (Anthropic)

Personality & Tone

The thoughtful consultant

Tone: Measured, careful, and slightly formal. Claude explains tradeoffs rather than handing back one-liner answers, asks clarifying questions when a request is ambiguous, and hedges openly when it is not confident.

Quirks: More willing than most models to refuse edgy or ambiguous requests, pushes back on premises it disagrees with, and will flag when you are probably asking the wrong question instead of just answering the one you typed.

The Good and the Bad

What we like

+Best writing quality of any LLM -- Opus 4.8 outputs read like a human wrote them, not a robot, and instruction-following stays sharpest in class
+1M token context window for enterprise API means it can process entire codebases, huge document sets, or long agent traces without chunking
+Opus 4.8 is built for agentic work -- Anthropic says it is a more effective collaborator with notably improved judgment in agent scenarios and is roughly 4x less likely than 4.7 to let code flaws slip through
+New user-facing effort control (claude.ai + Cowork) lets you trade depth for speed, and fast mode now runs at 2.5x speed while costing 3x less than the previous fast mode -- a real latency/cost lever short of full reasoning
+High-res vision (3.75MP images, 2,576px long edge) means charts, diagrams, whiteboards, and dense UIs work properly

What could be better

−Free tier is more limited than ChatGPT's -- you hit the cap faster
−No image generation built in (unlike ChatGPT with DALL-E)
−Fewer third-party integrations and plugins compared to OpenAI's ecosystem
−Can be overly cautious and refuse requests that are perfectly fine

Pricing

Free

✓Limited messages/day
✓Claude Sonnet 4.6
✓Basic features

Pro

$20/month

✓5x more usage than Free
✓Claude Opus 4.8 + Sonnet 4.6
✓Effort control + extended thinking
✓Priority access

Max (5x)

$100/month

✓5x Pro usage
✓Priority queue
✓Opus 4.8 with full effort control + fast mode

Max (20x)

$200/month

✓20x Pro usage
✓Highest priority
✓All generally-available models
✓Best for power users and agents

API (Opus 4.8)

$5 / $25/per 1M tokens (input/output)

✓Unchanged from Opus 4.7 pricing
✓1M context window
✓Fast mode at $10 / $50 per 1M (2.5x speed, 3x cheaper than prior fast mode)
✓Tool use, MCP, high-res vision; Bedrock, Vertex AI, Foundry

API (Fable 5)

$10 / $50/per 1M tokens (input/output)

✓First publicly available Mythos-class model (launched 2026-06-09)
✓Included on Pro/Max/Team/Enterprise at no extra cost through 2026-06-22, usage credits after
✓Auto-fallback to Opus 4.8 on cyber/bio/chem-flagged requests (<5% of sessions)
✓Mandatory 30-day retention on all Mythos-class traffic (not used for training)

Known Issues

ACCESS SUSPENDED BY US GOVERNMENT (2026-06-12): A US government export-control directive ordered Anthropic to **suspend all access to Claude Fable 5 and Claude Mythos 5** -- for any foreign national whether inside or outside the US, including foreign-national Anthropic employees. To comply, Anthropic **disabled Fable 5 and Mythos 5 for ALL customers** (the directive's scope made selective enforcement impractical). **Access to every other Anthropic model -- Opus 4.8, Sonnet 4.6, Haiku 4.5 -- is unaffected and remains fully available.** Stated cause: the Commerce Department acted after another company claimed it had found a way to 'jailbreak' Mythos, raising national-security concerns. Anthropic publicly disagrees that a narrow potential jailbreak should justify recalling a commercially deployed model, arguing the same standard 'would essentially halt all new model deployments for all frontier model providers,' and met with the Trump administration on 2026-06-15 to contest it. Net effect for users TODAY: Fable 5 is not selectable on claude.ai or the API; use Opus 4.8 instead. This is the first time a frontier model has been pulled from public access by US-government order -- watch for restoration terms.Source: Anthropic (anthropic.com/news/fable-mythos-access), CNBC (2026-06-12 + 2026-06-15), TechCrunch, Axios · 2026-06-12
FABLE 5 DAY-3 (2026-06-11, PRE-SUSPENSION): **#1 on LMArena** -- claude-fable-5 now holds the top text/overall Elo at **1510±11** (next: Opus 4.6-thinking 1504, Opus 4.7-thinking 1502; GPT-5.5-high sits at 1481). Also **#1 on the Artificial Analysis Intelligence Index at 65** (Opus 4.8 second at 61, GPT-5.5-xhigh 60) -- note AA scores the 'Adaptive Reasoning, Max Effort, Opus 4.8 Fallback' configuration. Counterpoint worth knowing: Endor Labs published a critique finding 'mid-tier results on coding tasks' (109 points on HN) -- benchmark dominance is not unanimous across third-party evals. API housekeeping from the deprecations page: `temperature`/`top_p`/`top_k` now return HTTP 400 on Opus 4.7+ models (remove them from request bodies), and **Claude Mythos Preview retires June 30, 2026** (migrate to claude-mythos-5). Still NOT in Cursor as of day 3Source: LMArena leaderboard (lmarena.ai, now redirecting to arena.ai), Artificial Analysis (artificialanalysis.ai/models), Endor Labs via HN, Anthropic deprecations page · 2026-06-11
MODEL LAUNCH (2026-06-09): **Claude Fable 5** -- Anthropic's most powerful generally available model, described by Anthropic as 'a Mythos-class model that we've made safe for general use.' Available via API immediately at $10/$50 per 1M tokens (2x Opus 4.8). Subscription rollout is staged: included at no extra cost on Pro/Max/Team/Enterprise **through June 22, 2026**, then requires usage credits while capacity scales. Safety mechanics: classifiers route cybersecurity, biology/chemistry, and distillation-attempt requests to Opus 4.8 instead (affects <5% of sessions on average). All Mythos-class traffic carries mandatory 30-day retention (overrides zero-data-retention agreements; not used for training). **Claude Mythos 5** -- the same model with safeguards lifted in some areas -- launched simultaneously but is restricted to Project Glasswing partners and select biology researchers; Mythos Preview users can upgrade immediately. Context: Anthropic confidentially filed its S-1 on 2026-06-01, days before this launch.Source: Anthropic news (anthropic.com/news/claude-fable-5-mythos-5), TechCrunch, CNBC · 2026-06-09
FABLE 5 DAY-2 ROLLUP (2026-06-10): platform availability landed fast -- **GitHub Copilot GA 6/9** (github.blog changelog), **Amazon Bedrock** (us-east-1 + eu-north-1 only at launch, more regions coming), **Microsoft Foundry**, **Snowflake Cortex AI**, and **Vertex AI**; notably **NOT in Cursor yet** as of 6/10. Pricing detail: the $10/$50 rate carries a **90% prompt-caching discount**. Vendor benchmark published: **SWE-Bench Pro 80.3%** (vs GPT-5.5's 58.6% -- an 11-point lead over the next-best model); LMArena Elo followed on 6/11 (see day-3 entry). Early user friction: (a) **Claude Desktop spawns a ~1.8GB Hyper-V VM on every launch** even for chat-only sessions -- GitHub issue hit the Hacker News front page (259 points); (b) scattered reports of the safety classifiers flagging benign biology questions -- consistent with the documented <5% Opus 4.8 fallback, but expect occasional false positives on science topicsSource: GitHub blog (github.blog/changelog/2026-06-09-claude-fable-5-is-generally-available-for-github-copilot/), AWS news blog, Azure blog, Snowflake blog, Anthropic news (SWE-Bench Pro), Hacker News · 2026-06-10
DISTRIBUTION (2026-06-08, ships fall 2026): Apple's iOS 27 / macOS 27 'Extensions' framework lets users select a third-party AI model -- Apple's developer materials name **Claude** and Gemini explicitly, plus 'any other provider that implements the new language model protocol' -- as the assistant behind Siri, Writing Tools, and Image Playground. Ends ChatGPT's exclusive integration position on Apple platforms. Xcode 27 also integrates Anthropic coding agents natively alongside Google's and OpenAI's. Big default-assistant distribution opening for Claude on ~2B Apple devices; nothing user-facing until the fall OS releases (public betas July).Source: Apple newsroom (apple.com/newsroom, WWDC 2026 developer announcements), MacRumors, TechCrunch · 2026-06-08
MODEL LAUNCH (2026-05-28): **Claude Opus 4.8** shipped as Anthropic's new flagship. Anthropic frames it as 'a more effective collaborator' with notably improved judgment in agent scenarios and meaningful gains across coding, agentic, and reasoning tasks (per third-party coverage, ~4x less likely than Opus 4.7 to let code flaws pass). Two practical changes for users: (1) **effort control** on claude.ai and Cowork -- higher effort makes Claude think more frequently and deeply, lower effort prioritizes speed and rate-limit efficiency; 4.8 defaults to high effort. (2) **Fast mode** for Opus 4.8 runs at 2.5x speed and is now 3x cheaper than fast mode on previous models. API pricing unchanged at $5/$25 per 1M (standard) / $10/$50 (fast). Vendor-cited benchmarks span Terminal-Bench 2.1, OSWorld-Verified, CursorBench, Legal Agent Benchmark, Online-Mind2Web (84%), and Finance Agent v2 -- specific scores are shown only in vendor charts at launch, so third-party verification is pending. New recommended API model id: claude-opus-4-8.Source: Anthropic news (anthropic.com/news/claude-opus-4-8), Axios, MacRumors · 2026-05-28
UPCOMING (surfaced 2026-05-06 at Code with Claude; not yet GA as of this sweep): **Orbit** -- a proactive assistant layer for Claude / Claude Code / Claude Cowork that syncs Gmail, Slack, GitHub, Calendar, Drive, and Figma to deliver opt-in, time-zone-aware personalized briefings with actionable insights (aimed at developers, designers, PMs). As of late May 2026 it exists only as a settings-panel toggle in staging -- no public rollout or firm ship date. Real product (not a rumor), but PRE-LAUNCH on availability; watch for a GA announcementSource: TestingCatalog (Anthropic Orbit), InfoQ (Code with Claude 2026) · 2026-05-06
PARTNERSHIP (2026-05-14): PwC announced an **expanded strategic alliance** to deploy Claude (Code + Cowork + full product suite) across PwC US first, scaling to PwC's global workforce. Headline metrics from Anthropic's launch post: **30,000 PwC professionals to be trained and certified on Claude**, plus a joint Center of Excellence for industry-specific solutions. Dario Amodei pull-quote: 'Insurance underwriting that took 10 weeks now takes 10 days. Security work that took hours now takes minutes.' Advocate Health is the first co-named flagship deployment (167K teammates). Material because (a) it puts a Big 4 firm fully on Claude as the reference frontier model, (b) creates a ~30K-trained labor base evangelizing Claude inside Fortune 500 audits / advisory engagements, (c) competitive pressure on OpenAI Deployment Company (5/11 spin-up) which is targeting the same enterprise-services layer. Distinct event from the same-day Gates Foundation partnership; both were posted to Anthropic newsroom on 5/14.Source: Anthropic news (anthropic.com/news/pwc-expanded-partnership) · 2026-05-14
PARTNERSHIP (2026-05-14): Anthropic + Gates Foundation announced a 4-year, $200M partnership -- approximately half grant funding from the Gates Foundation, half Claude credits + Anthropic technical staff time. Program portfolio: **global health** (polio, HPV vaccines, eclampsia/preeclampsia), **education** (K-12 US + sub-Saharan Africa + India: math tutoring, college advising, curriculum design + benchmark development), **African-language data collection**, **life sciences** (vaccines + therapies), **economic mobility**. Distinct from a standard customer engagement -- philanthropic + product-development partnership with Anthropic technical staff embedded. Material because it positions Claude as the foundation's frontier-model partner of choice over OpenAI / Google / Meta. Practical implication for buyers: nothing direct, but signals Anthropic's continued investment in mission-aligned partnerships that fund model + safety improvements upstream.Source: Anthropic news (anthropic.com/news/gates-foundation-partnership), Gates Foundation co-announcement (gatesfoundation.org), Reuters, PYMNTS · 2026-05-14
PRODUCT (2026-05-13): Anthropic launched 'Claude for Small Business' -- a packaged offering inside Claude Cowork that bundles 15 ready-to-run agentic workflows + 15 repeatable skills across finance, operations, sales, marketing, HR, and customer service. First-party integrations: Intuit QuickBooks, PayPal, HubSpot, Canva, Docusign, Google Workspace, Microsoft 365. Mechanics: toggle Claude for Small Business inside Claude Cowork, connect existing tools, pick a job -- Claude does the work, human approves before anything sends/posts/pays. Anthropic also launched the 'Claude SMB Tour' -- 10-city free AI fluency training kicking off 2026-05-14 in Chicago, then Tulsa / Dallas / Newark / Baton Rouge / Birmingham / SLC / Baltimore / San Jose / Indianapolis; attendees get a one-month Claude Max subscription. PRICING NOT DISCLOSED in the launch post -- no per-seat number, no flat fee, no tier delta vs. Teams ($25-30/seat) or Enterprise published. Target framing: '44% of U.S. GDP / nearly half the private-sector workforce' has lagged AI adoption. First Anthropic SKU explicitly aimed at SMB segment.Source: Anthropic news (anthropic.com/news/claude-for-small-business), TechCrunch coverage · 2026-05-13
PRODUCT + CAPACITY (2026-05-06 Code with Claude SF keynote): Anthropic announced a SpaceX compute partnership at Colossus 1 (300+ MW, 220,000+ NVIDIA GPUs, online 'within the month'). Concurrent product changes shipped TODAY: (a) DOUBLED Claude Code 5-hour rate limits for Pro / Max / Team / seat-based Enterprise plans, (b) REMOVED peak-hours reduction for Pro and Max (peak-hours throttling no longer applies), (c) RAISED API rate limits for Opus models (Opus 4.7 + Opus 4.6 throughput improved). Plus Claude Managed Agents shipped: 'Dreaming' (research preview -- agents review past sessions for self-improvement patterns), 'Outcomes' (public beta -- rubric-graded task success, lifted up to 10 points in tests), and 'Multiagent Orchestration' (public beta -- lead-agent delegates to subagents, e.g. Haiku lead with Opus subagents). Practical impact: existing Pro / Max users see materially more headroom on Claude Code overnight. NOTE: Sonnet 4.8 / Jupiter / Cardinal / KAIROS / Cowork / Undercover Mode -- speculated from the 2026-03-31 source-map leak -- did NOT ship at this keynote. Models page still lists Opus 4.7 / Sonnet 4.6 / Haiku 4.5 as the current trioSource: Anthropic news (anthropic.com/news/higher-limits-spacex), Anthropic Managed Agents (claude.com/blog/new-in-claude-managed-agents), Simon Willison live blog, TheNewStack · 2026-05-06
SECURITY (CVE-2026-41686, NVD-published 2026-05-04, GHSA-p7fg-763f-g4gf): Anthropic TypeScript SDK (`@anthropic-ai/sdk`) `BetaLocalFilesystemMemoryTool` writes memory files with mode 0o666 (world-readable) and directories with mode 0o777 (world-readable + writable). On shared hosts a local attacker can read persisted agent state; in containers with permissive umasks (typical Docker base images) an attacker with container access can poison memory to steer subsequent model behavior. Affects versions 0.79.0 through 0.91.0. **Fix: upgrade to >= 0.91.1**. CVSS 4.8 (moderate). CWE-732 Incorrect Permission Assignment. Reported by lucasfutures, disclosed 2026-04-24Source: GitHub Security Advisory (github.com/anthropics/anthropic-sdk-typescript/security/advisories/GHSA-p7fg-763f-g4gf), NVD CVE-2026-41686 · 2026-05-04
PRODUCT (2026-04-28): Anthropic launched Claude for Creative Work with 9 first-party connectors -- Ableton (Live + Push), Adobe Creative Cloud (Photoshop / Premiere / Express via 'Adobe for creativity'), Affinity by Canva, Autodesk Fusion, Blender, Resolume Arena, Resolume Wire, SketchUp, and Splice. The Blender connector is built on MCP and is explicitly accessible to other LLMs -- not Claude-only. Educational pilots also announced with RISD, Ringling, and Goldsmiths. Tier requirements not specified at launch. This is Anthropic's biggest creative-pro market push to date and pairs naturally with the Opus 4.7 launch on 4/16 (vision quality required for visual workflows)Source: Anthropic news (anthropic.com/news/claude-for-creative-work), 9to5mac, Adobe blog · 2026-04-28
POLICY (2026-04-04, enforced 2026-04-10): Anthropic excluded third-party agent harnesses (OpenClaw cited specifically) from Claude Pro and Max flat-rate plans. Routing Pro/Max via OpenClaw, Claude-on-Cline, or similar frameworks now triggers separate pay-as-you-go 'extra usage' billing rather than the flat plan rate. ~135K OpenClaw instances were impacted at the time of the change. Anthropic temporarily banned OpenClaw's creator from the platform on 2026-04-10 and stated subscriptions 'weren't built to handle the usage patterns' of harnesses that 'run continuous reasoning loops, automatically repeat or retry tasks, and tie into a lot of other third-party tools.' If you run agentic workloads on Claude, expect the API path to be the only viable model going forwardSource: TechCrunch (techcrunch.com/2026/04/10/anthropic-temporarily-banned-openclaws-creator-from-accessing-claude/), The Next Web, PYMNTS · 2026-04-10
ENTERPRISE PRICING (2026-04-16): Anthropic dropped Claude Enterprise's bundled-token model. Plan moved from ~$200/seat with discounted token allotment to $20/seat base + standard API rates with no token allotment and no usage cap. Customary 10-15% enterprise API discounts also pulled. Heavy users see 2-3x bill increases. Rolling out to enterprises with 150+ seats first. Material for any team evaluating Claude as their primary AI provider at scale -- confirm finance modeling against the new structure before committing seat countsSource: The Register (theregister.com/2026/04/16/anthropic_ejects_bundled_tokens_enterprise/), The Information, PYMNTS · 2026-04-16
Claude Haiku 3 (claude-3-haiku-20240307) RETIRED 2026-04-20 -- deprecated -> retired flip confirmed on Anthropic's deprecations page (verified 2026-04-24). If your API code still targets the 2024 Haiku snapshot, requests are now failing -- migrate to claude-haiku-4-5-20251001Source: Anthropic model deprecations page · 2026-04
Claude Sonnet 4 (claude-sonnet-4-20250514) and Claude Opus 4 (claude-opus-4-20250514) RETIRED 2026-06-15 -- deprecated -> retired flip confirmed on Anthropic's deprecations page (verified 2026-06-15; the page now lists both as 'Retired' and the history note reads 'These models were retired June 15, 2026'). Announced 2026-04-14. If your product still targets those specific snapshots, requests are now failing -- migrate to Sonnet 4.6 (`claude-sonnet-4-6`) or Opus 4.8 (`claude-opus-4-8`, the current recommended Opus replacement). NOTE: the SEPARATE programmatic-billing change once slated for the same day (Agent SDK / `claude -p` / GitHub Actions onto a metered credit pool) was PAUSED before it shipped -- 'nothing changes for now' -- see claude-code.ts. NEXT IN LINE: Claude Opus 4.1 (claude-opus-4-1-20250805) was deprecated 2026-06-05 and retires 2026-08-05 -- same migration targetSource: Anthropic model deprecations page (platform.claude.com/docs/en/about-claude/model-deprecations) · 2026-06
Free tier rate limits feel aggressive -- heavy users get throttled within a few conversationsSource: Reddit r/ClaudeAI · 2026-03
Occasionally refuses benign creative writing requests due to safety filtersSource: Reddit r/ClaudeAI · 2026-02
SUPERSEDED (2026-06-09): the April-era 'Mythos Preview is gated and will not be generally available' framing no longer holds -- Fable 5 brings Mythos-class capability to the public tier (with safety fallbacks), while Mythos 5 replaces Mythos Preview inside Project Glasswing (expanded to ~150 orgs as of 2026-06-02). See the claude-mythos page for the gated-track detailSource: Anthropic news (anthropic.com/news/claude-fable-5-mythos-5) · 2026-06-09
Opus 4.7 uses an updated tokenizer -- input tokens may increase roughly 1.0-1.35x depending on content type, slightly raising per-request cost even though the published per-token rate is unchangedSource: Anthropic release notes · 2026-04
Project Deal published 2026-04-25 (anthropic.com/features/project-deal, with TechCrunch + PYMNTS + Legal IT Insider analysis): Anthropic ran a one-week internal marketplace where Claude agents bought, sold, and negotiated on behalf of SF-office employees with no human-in-the-loop. 186 deals closed at ~$4K total volume. Headline finding for Claude API buyers: participants assigned Opus 4.5 got measurably better economic outcomes than those on Haiku 4.5 -- and Haiku-assigned users didn't notice they were losing. Practical takeaway: in agentic workflows where Claude transacts on a user's behalf, model-tier selection has measurable downstream economic cost, not just latency or quality. Treat this as a public signal that Anthropic is moving toward productized agent-as-representative use casesSource: anthropic.com/features/project-deal, TechCrunch, PYMNTS · 2026-04-25
Anthropic published an explicit ad-free commitment ('Claude is a space to think', 2026-02-04) -- but the differentiation matters now because OpenAI began rolling ads to ChatGPT Free + Go tiers in Feb 2026 (Plus/Pro/Business/Enterprise still ad-free) and Google AI Overviews already carry ad placements. Anthropic's verbatim language: no sponsored links adjacent to conversations, no advertiser-influenced responses, no third-party product placements. Claude's monetization stays enterprise + subscription only. Practically relevant for B2B / regulated / trust-sensitive deployments (legal, healthcare, finance, research) where ad-incentive contamination in outputs is a deal-breakerSource: anthropic.com/news/claude-is-a-space-to-think (2026-02-04), openai.com/index/testing-ads-in-chatgpt, Axios · 2026-02-04

Best for

Writers, analysts, developers, and anyone who values quality of output over quantity of features. If you care about how good the actual text is, Claude is the best.

Not for

People who want an all-in-one platform with image generation, plugins, and browsing built in. ChatGPT's ecosystem is bigger.

Our Verdict

Claude is the LLM you pick when quality matters more than features. For three days in June it looked like the ceiling had moved: Fable 5 (June 9, 2026) was the first Mythos-class model anyone could actually use, topping LMArena and the Artificial Analysis Index. Then on June 12 a US government export-control directive forced Anthropic to suspend Fable 5 -- and Mythos 5 -- for all customers, and as of this review it remains unavailable. So the practical flagship today is Opus 4.8: the $5/$25 workhorse with effort control, a cheap fast mode, a 1M context window, high-res vision, and MCP support, plus Apple naming Claude a selectable system assistant in iOS 27 this fall. Opus 4.8 is still arguably the best writing-and-reasoning model you can buy. Whether Fable-tier capability returns to the public depends on a regulatory fight Anthropic is actively contesting -- worth watching, but don't plan around Fable 5 right now.

Sources

Anthropic: Statement on the US government directive to suspend access to Fable 5 and Mythos 5 (2026-06-12) (accessed 2026-06-18)
CNBC: Anthropic disables access to Fable 5 and Mythos 5 to comply with government directive (2026-06-12) (accessed 2026-06-18)
CNBC: Anthropic to meet with Trump administration over Mythos dispute (2026-06-15) (accessed 2026-06-18)
Anthropic: Introducing Claude Fable 5 and Claude Mythos 5 (2026-06-09) (accessed 2026-06-09)
TechCrunch: Anthropic releases Claude Fable 5 (accessed 2026-06-09)
Apple newsroom: WWDC 2026 intelligence frameworks (Extensions / LanguageModel protocol) (accessed 2026-06-09)
Anthropic: Introducing Claude Opus 4.8 (2026-05-28) (accessed 2026-06-02)
Anthropic: Claude for Small Business (2026-05-13) (accessed 2026-05-13)
GitHub Security Advisory: GHSA-p7fg-763f-g4gf (CVE-2026-41686, 2026-05-04) (accessed 2026-05-05)
Anthropic: Project Deal (2026-04-25) (accessed 2026-04-27)
TechCrunch: Anthropic created a test marketplace for agent-on-agent commerce (accessed 2026-04-27)
Anthropic: Claude is a space to think (ad-free policy, 2026-02-04) (accessed 2026-04-27)
Anthropic: Introducing Claude Opus 4.7 (accessed 2026-04-16)
CNBC: Anthropic rolls out Claude Opus 4.7 (accessed 2026-04-16)
Axios: Opus 4.7 trails unreleased Mythos (accessed 2026-04-16)
Claude Mythos Preview / Project Glasswing (accessed 2026-04-16)
LMSYS Chatbot Arena rankings (accessed 2026-04-16)
Hands-on testing (Opus 4.7 via claude.ai and API) (accessed 2026-04-16)

Explore more Claude (Anthropic) rankings

Deeper leaderboards, benchmarks, task-specific tier lists, and status/pricing pages for Claude (Anthropic).

Full AI LLMs & Models tier list

Where Claude (Anthropic) ranks vs every competitor in its category

MMLU leaderboard

The 57-subject knowledge test that became the default LLM benchmark.

GPQA Diamond leaderboard

Graduate-level physics, biology, and chemistry written to defeat Google-search.

AIME leaderboard

The American Invitational Math Exam, used as a rolling frontier-math benchmark.

HumanEval leaderboard

164 Python programming problems: does the generated code pass unit tests?

Best AI tools to research a topic

Research assistants that gather, cite, and synthesize sources across the web into a structured answer.

Best AI tools to answer questions from documents

Chat-with-your-docs tools that build a retrieval layer over PDFs, transcripts, and knowledge bases.

Is Claude (Anthropic) down?

Outage check plus rolling log of known issues

Claude (Anthropic) pricing

Every tier and what's included

Claude (Anthropic) alternatives

Comparable tools at every tier

The Tier List Tuesday

Weekly newsletter: tier movers, new entrants, and the VS of the week. Built from our daily AI-tool sweeps. No spam, unsubscribe anytime.

Alternatives to Claude (Anthropic)

Claude Mythos 5

Anthropic's unrestricted frontier model -- launched June 9, 2026 alongside Claude Fable 5 (the same model made safe for general use). ACCESS SUSPENDED June 12, 2026: a US government export-control directive forced Anthropic to disable both Mythos 5 and Fable 5 for all customers; all other Claude models are unaffected. Mythos 5 had been gated to ~150 Project Glasswing orgs and select biology researchers.

6.5/10

From Invite only

The most capable Anthropic model availab...73% success rate on expert-level Capture...

Updated 2026-06-18

Gemini (Google)

Google's LLM with deep Google Workspace integration, 2M token context window, and native code execution -- Gemini 3.5 Flash GA 2026-05-19 (I/O 2026), Gemini 3.5 Pro rolling out June 2026, Gemini Spark agent + Managed Agents public preview in the Gemini API

8.3/10

Free tierFrom $0

2 million token context window is the la...Best Google Workspace integration (Gmail...

Updated 2026-06-18

Grok

xAI's irreverent chatbot with a direct line to X/Twitter -- real-time data meets unfiltered personality. Grok 4.3 production launched 2026-05-02 with Custom Voices cloning + Imagine Agent Mode + ~40% API price cut to $1.25/$2.50 per 1M tokens

7.5/10

Free tierFrom $0

Real-time access to X/Twitter data is ge...Grok 3 benchmarks are competitive with G...

Updated 2026-06-11

Muse Spark (Meta)

Meta's first model from its Superintelligence Lab -- natively multimodal with Contemplating mode for multi-agent reasoning

8.8/10

Free tierFrom $0

Completely free to use via Meta AI app a...Natively multimodal: handles text, image...

Updated 2026-04-19

GPT-Rosalind (OpenAI)

OpenAI's first domain-specific model -- life sciences, drug discovery, translational medicine. Launched 2026-04-16 as a Trusted Access research preview. Launch partners: Amgen, Moderna, Allen Institute, Thermo Fisher. Paired with a Life Sciences Codex plugin (50+ scientific tool integrations)

6.8/10

From Invite only

OpenAI's first named vertical/domain-spe...Launch partners Amgen, Moderna, Allen In...

Updated 2026-04-17

GPT-5.4-Cyber (OpenAI)

OpenAI's defensive-cybersecurity variant of GPT-5.4, launched 2026-04-16. Lowered refusal boundary for security-research tasks and native binary reverse-engineering. Access gated via Trusted Access for Cyber (TAC) program -- thousands of verified defenders, hundreds of teams, no public pricing

7.2/10

From Not publicly disclosed

Directly competes with Claude Mythos Pre...Lowered refusal boundary on defensive-se...

Updated 2026-04-19

Microsoft MAI-Thinking-1

Microsoft's first in-house reasoning model -- launched 2026-06-02 at Build as the flagship of seven new MAI models. 35B-active / ~1T-total sparse Mixture-of-Experts, 256K context. AIME 2025 97.0%, matches leading models on SWE-Bench Pro, and beat Claude Sonnet 4.6 in human-preference testing. Available on Microsoft Foundry + OpenRouter / Fireworks / Baseten

7.5/10

From Not disclosed

Microsoft's first in-house frontier-clas...Strong published reasoning numbers: AIME...

Updated 2026-06-02

Hunyuan 3 (Tencent Hy3)

Tencent's Hy3 Preview launched 2026-04-23 -- 295B total / 21B active MoE, 256K context, open-sourced on HuggingFace under tencent/Hy3-preview. Cheapest frontier-class API at ~1.2 RMB per million input tokens. Integrated into Yuanbao, WeChat, QQ

8.1/10

Free tierFrom $0

Open weights from a top-3 Chinese tech c...Pricing is aggressive. ~1.2 RMB per mill...

Updated 2026-04-25

MiMo (Xiaomi)

Xiaomi's MiMo-V2.5 family launched 2026-04-22 -- Pro (1T total / 42B active MoE, 1M context, native vision+audio reasoning), Multimodal base, TTS (3 sub-models: base, VoiceDesign, VoiceClone), and ASR (open-source, English + Chinese + major dialects). Full voice pipeline for the agent era. Extra-charge 1M-context tier removed at launch

8.3/10

Free tierFrom $0

Full voice pipeline shipped together: a ...Native multimodal in MiMo-V2.5-Pro is th...

Updated 2026-06-12