Developer Tools / Code Assistant

Arena AI Model Elo History

A public visualization that tracks flagship AI models’ Elo history over time using the Arena AI Leaderboard dataset, with notes on caveats and methodology.

Clear: 27/30
Useful: 24/30
Specific: 14/20
Complete: 12/20

Why it was accepted

The page clearly presents a focused AI-adjacent product: a visual tracker for flagship model Elo history over time. It explains the data source, how the chart is built, and why the project exists, which is enough evidence for a useful directory listing. The snapshot also shows the project is live and maintained as a public web page with a GitHub link.

Weakness

The crawl snapshot does not show the actual chart interaction, filters beyond “Show All Models,” or whether users can export data. It also does not explain update reliability, historical coverage depth, or whether the dataset includes all major labs consistently.

Review status

Last evaluated 37 days ago. Current rank #631. Holding steady in the rankings.

Score history

77

Related listings

CodeGraph
94

Developer Tools / AI for Code

CodeGraph is a local code knowledge graph for AI coding agents like Claude Code, Cursor, Codex, OpenCode, and Hermes Agent. It aims to cut token use, tool calls, and runtime by letting agents query pre-indexed code structure instead of scanning files repeatedly.

LLMRender
92

Developer Tools / React Libraries

A lightweight React Markdown renderer with built-in LaTeX, syntax highlighting, streaming-safe rendering, and security-focused defaults.

Version Sentinel

Developer Tools / AI Coding Guardrails

Claude Code plugin that blocks dependency edits until a fresh, source-cited version check is recorded, helping prevent hallucinated or stale package versions across npm, pip, Poetry/uv, Cargo, and NuGet.

#7 Omni
91

Developer Tools / Search & Retrieval

Omni is a local-first semantic search app for macOS that indexes text, code, PDFs, images, audio, and video on-device. It supports multilingual search, private offline use, and exposes a local endpoint for agents to query indexed files.