How Agent Rank works
We score every agent in our catalog on six dimensions, sum to a 0–100 aggregate, and classify into tiers. The formula is public, deterministic, and re-evaluated on every deploy.
The six dimensions
Autonomy fit
How we compute it: Mapped from the agent's autonomy level: Assistant → 7, Semi-autonomous → 8, Autonomous → 9.
Why: Higher autonomy isn't always better, but it's what buyers ask about. The +2-point spread captures that distinction without overweighting it.
Capabilities
How we compute it: Two points per native capability (tool use, code execution, RAG, memory, vision, voice, browser use, multi-agent). Five or more capabilities caps the dimension at 10.
Why: Breadth across capability types is a stronger predictor of fit than depth within one type.
Integrations
How we compute it: Two points per distinct integration category (IDE, CRM, ticketing, messaging, storage, etc.). Five or more categories caps at 10.
Why: An agent with 10 sales-only integrations is less broadly useful than one with five integrations across three categories. We reward category breadth, not raw count.
Pricing value
How we compute it: Open source: 10. Free: 9. Under $25/mo: 8. Under $50/mo: 7. Under $100/mo: 6. Under $200/mo: 5. Higher or contact-sales: 3–4.
Why: Pricing value isn't 'cheapest wins.' It's 'do I get fair power for what I pay.' Open source scores highest because licensing is the unlimited tier.
Polish & maturity
How we compute it: Baseline of 4, plus 1 each for: a published version number, featured status, ≥50K views, ≥150K views, having a product preview image or video, having multiple pricing tiers. Capped at 10.
Why: Mature, well-known agents with thoughtful product surface score higher. These are the signals that a real engineering team is behind the product.
Verifiability
How we compute it: Baseline of 3, plus 4 for our Verified badge, 2 for authored pricing tiers, 1 for authored pros/cons. Capped at 10.
Why: Verifiability captures how much editorial work we've done on the listing. An agent we've personally vetted, with structured pricing and pros/cons, beats a placeholder record.
Aggregate score
The six per-dimension scores (each 1–10) are summed, divided by 60, multiplied by 100, and rounded. The result is the agent's Agent Rank score.
Tier classification:
- S-tier: 85 or higher — top of the category, well-rounded across all six dimensions
- A-tier: 72–84 — strong, with at most one weak dimension
- B-tier: 58–71 — solid, with multiple trade-offs to consider
- C-tier: below 58 — niche or early-stage; check the breakdown carefully
What the score is not
Agent Rank measures fit across our six dimensions — it doesn't measure your fit. A C-tier agent can be perfect for a specific workflow that values one dimension we underweight. The score is a starting point; the breakdown table on each agent page is where the real decision lives.
Editorial overrides
For agents where the automated score visibly diverges from hands-on experience, the editorial team can override a single dimension. All overrides are public — they appear in the per-dimension table on the agent page with a small note.
Refresh cadence
The score is recomputed on every deploy of this site. When we add a pricing tier, ship a new pros/cons block, or update a capability list, the score moves the next time we deploy. Major catalog updates ship weekly; structural changes to the formula are versioned and announced in our newsletter.
User reviews are separate
Agent Rank is the editorial score. Community user reviews surface separately on each agent page with their own aggregate. We don't blend them — they answer different questions ("is this agent well-built" vs. "did real users like it").
Frequently asked
How is Agent Rank calculated?+
Six weighted dimensions (autonomy fit, capabilities, integrations, pricing value, polish, verifiability) each scored 0-100; the composite is a weighted average. Weights are listed in the table above. The full per-agent dimension breakdown is visible on every /agent/<slug> page so the score is reproducible from public inputs.
Is AI Agent Rank pay-to-play?+
No. Vendors cannot buy a higher Agent Rank, a featured slot, or removal of negative notes. We disclose affiliate relationships at /about; affiliate links do not change which agent ranks higher. Editorial independence is the entire product — compromising it ends the brand.
How often is the ranking refreshed?+
Pricing is re-verified weekly. Scoring updates ship whenever a vendor releases a material change (new tier, new capability, deprecated feature). Sitewide methodology refreshes happen quarterly; the date stamp at /methodology and the per-agent "Updated" pill both reflect the most recent edit.
Why isn't [popular agent] in your shortlist?+
Three reasons agents drop from a shortlist: (1) they haven't yet shown durable production deployments — a demo doesn't qualify; (2) pricing or licensing is incompatible with the use case the shortlist targets; (3) capability claims aren't verifiable against public benchmarks or customer cases. Vendor disagreement is welcome — we publish corrections at /contact.
Can vendors influence their score?+
Vendors can provide factual corrections (pricing errors, capability misstatements). They cannot influence the dimension weights or score interpretation. All corrections are dated and logged on the agent page so the audit trail is public.
How does Agent Rank differ from user ratings?+
Agent Rank is an editorial composite — it answers "is this agent well-engineered for the workload". User ratings (when present, separately surfaced on each agent page) answer "did real users like it". We don't blend them — they measure different things and blending masks both signals.