AgentDish directory

llm

Accepted listings with this tag.

Rank	Listing	Description	Category	Score	Trend	Checked
#2	Claude API	Anthropic’s Claude API platform for building AI products, apps, and agent workflows with Claude models, built-in tools, pricing tiers, and developer docs.	Development / API	93	↑ +1	45 days ago
#4	WebLLM	WebLLM is a high-performance in-browser LLM inference engine that runs locally in the browser with WebGPU acceleration. It exposes an OpenAI-compatible API, supports streaming and JSON mode, and includes examples for building chat apps and browser extensions.	Developer Tool / AI SDK / In-browser LLM inference	92	↑ +93	45 days ago
#26	Docker AI Stack	A self-hosted Docker Compose stack for running local AI services including Ollama, LiteLLM, Whisper, Kokoro, embeddings, and an MCP Gateway.	Developer Tools / Self-hosting / AI Infrastructure	90	↓ -1	45 days ago
#28	Open Bias	Open Bias is an open-source reliability harness for LLM apps and agents that enforces rules at runtime. It sits between your app and an LLM provider, using RULES.md policies to trace, block, or fix off-policy behavior.	Developer Tools / Code Assistant	90	↑ +117	45 days ago
#42	Redact	Browser extension that scans pastes for credentials and PII before they reach LLM chat sites, with local on-device inference and no network calls.	Developer Tool / Browser Extension	89	→ 0	17 days ago
#58	AutoRound	AutoRound is an open-source quantization toolkit for LLMs and VLMs, focused on high-accuracy low-bit inference across CPU, XPU, CUDA, and multiple deployment backends.	Developer Tools / AI Infrastructure	89	↓ -26	46 days ago
#60	BeamWeaver	An OTP-native Elixir library for building AI agents, durable LLM workflows, retrieval pipelines, and production LLM services.	Developer Tools / AI Agent Framework	88	↓ -3	32 hours ago
#77	lowfat	Lowfat is a lightweight CLI tool that filters noisy command output before it reaches an AI agent, with plugins, shell integration, and usage stats for token savings.	Developer Tools / CLI Tools	88	↓ -3	15 days ago
#87	Gemini 3.5 Flash	Google’s Gemini 3.5 Flash announcement describes a frontier model aimed at agents and coding, with notes on speed, multimodal generation, and availability through the Gemini app, Google Antigravity, Gemini API, AI Studio, Android Studio, and Gemini Enterprise.	AI Models / Foundation Models	88	↓ -3	31 days ago
#88	LLMCap	LLMCap is a proxy for LLM API calls that hard-stops requests when a dollar cap is reached, with support for major providers and companion tools for VS Code, CLI, and Windows.	Developer Tools / API Tooling	88	↓ -3	32 days ago
#101	LLM-test-kit	An open-source CLI for testing LLM prompts across consistency, latency, cost, and behavior, with HTML reports and support for OpenAI and Anthropic.	Developer Tools / Testing	88	↓ -78	45 days ago
#109	oruk	oruk is a live broadcast intelligence API for real-time news. It offers REST endpoints, SSE streaming, webhooks, and an MCP server for use in AI apps and developer workflows.	Developer Tools / API	88	↓ -78	46 days ago
#125	VT Code	Open-source terminal coding agent in Rust with LLM-native code understanding, shell safety, multi-provider support, and automatic failover.	Developer Tools / Code Assistant	87	↓ -4	21 days ago
#134	llm-mock	Python package for recording real LLM API responses and replaying them in tests so LLM-driven code can run deterministically without live API calls.	Developer Tools / Testing	87	↓ -4	30 days ago
#159	Nenya	An AI API gateway/proxy in Go that sits between coding clients and upstream LLM providers, with request routing, secret redaction, context handling, MCP tool integration, and transparent SSE streaming.	Developer Tools / AI API Gateway / Proxy	86	↑ +2	7 days ago
#172	tokentoll	tokentoll is a GitHub Action and CLI that scans Python and JavaScript/TypeScript code for LLM API calls, estimates spend, and blocks pull requests when cost or model-policy thresholds are exceeded.	Developer Tools / CI/CD	86	↑ +2	21 days ago
#177	Dream Server	Dream Server is an open-source stack that turns a PC, Mac, or Linux machine into a private AI server with local model inference, chat UI, voice, agents, workflows, RAG, and image generation. The page shows setup instructions, platform support, release notes, and a clear positioning for self-hosted use.	Developer Tools / Code Assistant	86	↑ +2	23 days ago
#197	SwarmWright	SwarmWright is a self-hosted multi-agent AI orchestration platform for designing, running, and inspecting swarms of agents from markdown and config. The page highlights Docker-based setup, topology-enforced agent graphs, human approval steps, local storage, and detailed audit logs.	Developer Tools / Code Assistant	86	↑ +2	35 days ago
#222	agented	A Go-based text editor built for LLM agents, with branching undo, conflict handling, audit logs, MCP support, and agent skills installation.	Developer Tools / Copywriting	86	↓ -1	46 days ago
#230	AutomatiQ	AutomatiQ is a Python CLI that records a browsing session and uses vision and LLM steps to generate a standalone HTTP-based automation script.	Developer Tools / Automation	85	↓ -3	3 days ago
#237	Better Prompting LLMs Through Analogy	An interactive guide and demo for prompt engineering that shows how analogies, examples, and format-matching can improve LLM outputs.	Writing / Copywriting	85	↓ -3	14 days ago
#240	Division Swarm	Open-source Go runtime for autonomous multi-agent systems. Swarm models work as durable state machines, persists state, supports replay/fork, and integrates with multiple LLM backends.	Developer Tools / AI Agents / Multi-Agent Systems	85	↓ -3	17 days ago
#252	gox	Strict Go static analyzer built for LLM-written code, with opinionated checks, annotation-based exceptions, caching, and Claude Code hook integration.	Developer Tools / Code Quality / Static Analysis	85	↓ -3	37 days ago
#266	Rudi	Rudi is a Python tool for LLM memory management that replaces a growing chat transcript with a causal dependency graph, aiming to keep token usage flat across long sessions. The README shows benchmarking results, fold behavior, and example usage with get_slice, store_decisions, and run_turn.	Developer Tools / AI Memory / Context Management	84	↓ -6	32 hours ago