AgentDish directory

Recently Evaluated

The newest checks across accepted listings.

Listing Category Score Trend Checked

A research write-up on detecting AI agents through process differences in CAPTCHA and related cognitive tasks. It outlines the CogCAPTCHA30 approach, reports human-vs-model differences, and connects the findings to Roundtable’s Proof of Human product.

Research / Knowledge Work 77 → 0 22 days ago Details
#175 ↑ +2
zot

A lightweight, single-binary terminal coding agent harness written in Go. It supports multiple model providers, local OpenAI-compatible models, built-in file and shell tools, extensions, session management, side chat, and swarm subagents.

Developer Tools / Code Assistant 86 ↑ +2 22 days ago Details
#83 ↓ -3
tiny-vllm

Open-source C++ and CUDA LLM inference engine inspired by vLLM, with a teaching-focused course that walks through model serving, batching, KV cache, and attention kernels.

Developer Tools / AI Inference / LLM Serving 88 ↓ -3 22 days ago Details

arXiv paper on a self-speculative decoding framework for speeding up reasoning LLM inference on edge hardware, with hardware co-design and reported speedups.

Research / AI/ML Paper 77 → 0 23 days ago Details

A practical guide to building an always-on personal AI assistant with Pi, OpenCode Go, Telegram, Resend, Hetzner, Tailscale, GitHub, and Nginx.

Writing / Copywriting 77 → 0 23 days ago Details

AppFactor’s blog post explains an AI-powered MCP Bridge feature that makes legacy API tools easier for agents to find and use by combining keyword search, vector search, and AI-generated enrichment from schemas and sample responses.

AI Development Tool / Agent tooling 78 ↑ +6 23 days ago Details
#44 → 0
claude-code-replay

A CLI tool that replays Claude Code session logs to reconstruct lost project files commit by commit. It includes deterministic replay from JSONL logs, an optional Claude-based classifier for Bash events, and install/run instructions.

Developer Tools / Code Assistant 89 → 0 23 days ago Details

A Windows desktop agent that lets AI interact with any app through visual GUI control, with a portable single-EXE build and support for ChatGPT, Gemini, and Claude.

AI Agent / Computer Use / Desktop Automation 86 ↑ +2 23 days ago Details

A GitHub repository for a latency-separated AI memory retrieval and RAG system. The README describes fetch, compute, and ANN search stages, includes benchmark ranges, and exposes a public test endpoint.

AI Infrastructure / Retrieval / RAG 71 → 0 23 days ago Details
#511 ↑ +2
Tab Council

Chrome MV3 extension that organizes existing AI tabs into a structured model council for cross-checking prompts across ChatGPT, Claude, Gemini, Perplexity, Merlin, and Grok.

Developer Tools / Browser Extension 81 ↑ +2 23 days ago Details
#590 ↑ +6
Repolog

Repolog scans a live website and produces a ranked audit covering on-page SEO, Core Web Vitals, security checks, and AI readiness for major AI search and assistant platforms.

AI-powered product / Website audit / SEO and security 78 ↑ +6 23 days ago Details
#84 ↓ -3
OpenHive

OpenHive is a shared knowledge base for AI agents to search and post problem-solution pairs. The page shows multiple integration paths, including Agent Skills, MCP, REST API, and discovery endpoints.

Developer Tools / Code Assistant 88 ↓ -3 23 days ago Details
#9 ↓ -3
aislop

CLI for catching AI-generated code smells and regressions in code. It scans changes with 40+ rules across 7 languages, offers fixes, CI gating, hooks, and MCP tools.

Developer Tools / Code Quality 91 ↓ -3 23 days ago Details
#177 ↑ +2
ktx

Open-source context layer for data and analytics agents that adds warehouse context, semantic search, and MCP/CLI access for tools like Claude Code and Codex.

Developer Tools / AI Developer Tool 86 ↑ +2 23 days ago Details
#708 ↑ +1
defensive-hook

A Claude Code stop hook that detects negative feedback in the conversation transcript and plays a random local sound from a folder, with a fallback voice prompt if no sound files are present.

Developer Tools / AI Agents / Automation 72 ↑ +1 24 days ago Details
#178 ↑ +2
diplomat-agent-ts

A TypeScript static scanner that finds AI agent tool calls with no checks before they can trigger real-world side effects. It supports CLI scanning, JSON output, a generated tool-call registry, and CI/pre-commit enforcement.

Developer Tools / AI Code Analysis 86 ↑ +2 24 days ago Details
#308 ↓ -6
DeepSWE

DeepSWE is a benchmark for measuring frontier coding agents on original, long-horizon software engineering tasks. The page shows a leaderboard, methodology overview, task examples, and a full blog explaining the benchmark design and results.

Developer Tools / AI Benchmarking 84 ↓ -6 24 days ago Details

arXiv paper describing AVA, a GenAI platform for policy and development research built on 4,000+ World Bank reports. The abstract highlights multilingual support, evidence-based synthesis, citation verifiability, and reasoned abstention when queries cannot be supported.

AI Research / Trustworthy Generative AI 78 ↑ +6 24 days ago Details
#681 ↓ -1
x-commit

A Claude skill for generating better commit messages with gitmoji + Conventional Commits, atomic commit enforcement, and a hook guard to keep commits on-format.

Developer Tools / Git / Commit Automation 74 ↓ -1 24 days ago Details

A Superconductor blog post showing how background coding agents were used to reproduce, diagnose, and fix a Rails memory leak using derailed_benchmarks, with a reusable Agent Skill workflow included.

Developer Tools / Code Assistant 74 ↓ -1 24 days ago Details
#592 ↑ +6
Enough

Enough is a beta personal language system for planning, writing, reviewing, and translation. It supports local models and OpenRouter, and is aimed at users who want more control over their data while building a flexible personal knowledge workflow.

Writing / Copywriting 78 ↑ +6 24 days ago Details

Agent skills for VillageSQL that help AI coding tools build a VillageSQL extension, including a 7-phase workflow for the vsql-extension-builder skill and install support for Claude Code, Gemini CLI, Codex, Cursor, Amp, and Kiro.

Developer Tools / AI Developer Tooling 77 → 0 24 days ago Details
#179 ↑ +2
Dream Server

Dream Server is an open-source stack that turns a PC, Mac, or Linux machine into a private AI server with local model inference, chat UI, voice, agents, workflows, RAG, and image generation. The page shows setup instructions, platform support, release notes, and a clear positioning for self-hosted use.

Developer Tools / Code Assistant 86 ↑ +2 24 days ago Details
#403 ↓ -3
Multiplayer

Multiplayer is a local debugging agent for developers that connects coding agents to production data so they can fix application bugs automatically. The site highlights runtime debugging, issue deduplication, and integrations with tools like Claude Code, Codex, and Copilot.

Developer Tools / Debugging 83 ↓ -3 24 days ago Details