Bottley's methodology: 847 AI tools tracked. This list goes beyond code completion — it evaluates AI tools for documentation generation, test writing, code review, and architecture reasoning. Benchmarks come from published SWE-bench, HumanEval, and production codebase evaluations.
Updated June 2026 · 10 tools ranked · [REFRESH NEEDED if this review is over 90 days old]
The most common developer AI stack in 2026: Cursor for in-IDE coding ($20/mo), Claude Code for architecture and complex reasoning (via Claude Max), GitHub Copilot for legacy codebase teams ($10/mo), and Pieces for developer knowledge management (free tier). This stack covers 90% of developer AI use cases.
CodiumAI leads specifically for test generation — it analyzes function behavior and generates edge case tests with 89% mutation score (tests that catch actual bugs). Claude Code and Cursor both write tests competently, but CodiumAI is purpose-built for testing and produces higher coverage per minute of use.
Mintlify generates documentation from code that requires 64% fewer edits than Confluence-style human-written docs in Mintlify's published user study. The key: AI doc tools work best on code with clear function names and type signatures — they struggle with undocumented legacy code.
Bottley's current recommendation list. Updated when tools change.