AgentDish directory

llm

Accepted listings with this tag.

Listing Category Score Trend Checked
#2 ↑ +1
Claude API

Anthropic’s Claude API platform for building AI products, apps, and agent workflows with Claude models, built-in tools, pricing tiers, and developer docs.

Development / API 93 ↑ +1 45 days ago Details
#4 ↑ +93
WebLLM

WebLLM is a high-performance in-browser LLM inference engine that runs locally in the browser with WebGPU acceleration. It exposes an OpenAI-compatible API, supports streaming and JSON mode, and includes examples for building chat apps and browser extensions.

Developer Tool / AI SDK / In-browser LLM inference 92 ↑ +93 45 days ago Details
#26 ↓ -1
Docker AI Stack

A self-hosted Docker Compose stack for running local AI services including Ollama, LiteLLM, Whisper, Kokoro, embeddings, and an MCP Gateway.

Developer Tools / Self-hosting / AI Infrastructure 90 ↓ -1 45 days ago Details
#28 ↑ +117
Open Bias

Open Bias is an open-source reliability harness for LLM apps and agents that enforces rules at runtime. It sits between your app and an LLM provider, using RULES.md policies to trace, block, or fix off-policy behavior.

Developer Tools / Code Assistant 90 ↑ +117 45 days ago Details
#42 → 0
Redact

Browser extension that scans pastes for credentials and PII before they reach LLM chat sites, with local on-device inference and no network calls.

Developer Tool / Browser Extension 89 → 0 17 days ago Details
#58 ↓ -26
AutoRound

AutoRound is an open-source quantization toolkit for LLMs and VLMs, focused on high-accuracy low-bit inference across CPU, XPU, CUDA, and multiple deployment backends.

Developer Tools / AI Infrastructure 89 ↓ -26 46 days ago Details
#60 ↓ -3
BeamWeaver

An OTP-native Elixir library for building AI agents, durable LLM workflows, retrieval pipelines, and production LLM services.

Developer Tools / AI Agent Framework 88 ↓ -3 32 hours ago Details
#77 ↓ -3
lowfat

Lowfat is a lightweight CLI tool that filters noisy command output before it reaches an AI agent, with plugins, shell integration, and usage stats for token savings.

Developer Tools / CLI Tools 88 ↓ -3 15 days ago Details
#87 ↓ -3
Gemini 3.5 Flash

Google’s Gemini 3.5 Flash announcement describes a frontier model aimed at agents and coding, with notes on speed, multimodal generation, and availability through the Gemini app, Google Antigravity, Gemini API, AI Studio, Android Studio, and Gemini Enterprise.

AI Models / Foundation Models 88 ↓ -3 31 days ago Details
#88 ↓ -3
LLMCap

LLMCap is a proxy for LLM API calls that hard-stops requests when a dollar cap is reached, with support for major providers and companion tools for VS Code, CLI, and Windows.

Developer Tools / API Tooling 88 ↓ -3 32 days ago Details
#101 ↓ -78
LLM-test-kit

An open-source CLI for testing LLM prompts across consistency, latency, cost, and behavior, with HTML reports and support for OpenAI and Anthropic.

Developer Tools / Testing 88 ↓ -78 45 days ago Details
#109 ↓ -78
oruk

oruk is a live broadcast intelligence API for real-time news. It offers REST endpoints, SSE streaming, webhooks, and an MCP server for use in AI apps and developer workflows.

Developer Tools / API 88 ↓ -78 46 days ago Details
#125 ↓ -4
VT Code

Open-source terminal coding agent in Rust with LLM-native code understanding, shell safety, multi-provider support, and automatic failover.

Developer Tools / Code Assistant 87 ↓ -4 21 days ago Details
#134 ↓ -4
llm-mock

Python package for recording real LLM API responses and replaying them in tests so LLM-driven code can run deterministically without live API calls.

Developer Tools / Testing 87 ↓ -4 30 days ago Details
#159 ↑ +2
Nenya

An AI API gateway/proxy in Go that sits between coding clients and upstream LLM providers, with request routing, secret redaction, context handling, MCP tool integration, and transparent SSE streaming.

Developer Tools / AI API Gateway / Proxy 86 ↑ +2 7 days ago Details
#172 ↑ +2
tokentoll

tokentoll is a GitHub Action and CLI that scans Python and JavaScript/TypeScript code for LLM API calls, estimates spend, and blocks pull requests when cost or model-policy thresholds are exceeded.

Developer Tools / CI/CD 86 ↑ +2 21 days ago Details
#177 ↑ +2
Dream Server

Dream Server is an open-source stack that turns a PC, Mac, or Linux machine into a private AI server with local model inference, chat UI, voice, agents, workflows, RAG, and image generation. The page shows setup instructions, platform support, release notes, and a clear positioning for self-hosted use.

Developer Tools / Code Assistant 86 ↑ +2 23 days ago Details
#197 ↑ +2
SwarmWright

SwarmWright is a self-hosted multi-agent AI orchestration platform for designing, running, and inspecting swarms of agents from markdown and config. The page highlights Docker-based setup, topology-enforced agent graphs, human approval steps, local storage, and detailed audit logs.

Developer Tools / Code Assistant 86 ↑ +2 35 days ago Details
#222 ↓ -1
agented

A Go-based text editor built for LLM agents, with branching undo, conflict handling, audit logs, MCP support, and agent skills installation.

Developer Tools / Copywriting 86 ↓ -1 46 days ago Details
#230 ↓ -3
AutomatiQ

AutomatiQ is a Python CLI that records a browsing session and uses vision and LLM steps to generate a standalone HTTP-based automation script.

Developer Tools / Automation 85 ↓ -3 3 days ago Details

An interactive guide and demo for prompt engineering that shows how analogies, examples, and format-matching can improve LLM outputs.

Writing / Copywriting 85 ↓ -3 14 days ago Details
#240 ↓ -3
Division Swarm

Open-source Go runtime for autonomous multi-agent systems. Swarm models work as durable state machines, persists state, supports replay/fork, and integrates with multiple LLM backends.

Developer Tools / AI Agents / Multi-Agent Systems 85 ↓ -3 17 days ago Details
#252 ↓ -3
gox

Strict Go static analyzer built for LLM-written code, with opinionated checks, annotation-based exceptions, caching, and Claude Code hook integration.

Developer Tools / Code Quality / Static Analysis 85 ↓ -3 37 days ago Details
#266 ↓ -6
Rudi

Rudi is a Python tool for LLM memory management that replaces a growing chat transcript with a causal dependency graph, aiming to keep token usage flat across long sessions. The README shows benchmarking results, fold behavior, and example usage with get_slice, store_decisions, and run_turn.

Developer Tools / AI Memory / Context Management 84 ↓ -6 32 hours ago Details