AgentDish directory

multimodal

Accepted listings with this tag.

Listing Category Score Trend Checked
#7 ↓ -3
Omni

Omni is a local-first semantic search app for macOS that indexes text, code, PDFs, images, audio, and video on-device. It supports multilingual search, private offline use, and exposes a local endpoint for agents to query indexed files.

Developer Tools / Search & Retrieval 91 ↓ -3 14 days ago Details
#19 ↓ -2
Gemma 4 12B

Google’s Gemma 4 12B is a multimodal model for local and developer-focused AI use, with native audio and vision support, agentic workflows, and broad tooling support.

Developer Tools / AI Models 90 ↓ -2 16 days ago Details
#87 ↓ -3
Gemini 3.5 Flash

Google’s Gemini 3.5 Flash announcement describes a frontier model aimed at agents and coding, with notes on speed, multimodal generation, and availability through the Gemini app, Google Antigravity, Gemini API, AI Studio, Android Studio, and Gemini Enterprise.

AI Models / Foundation Models 88 ↓ -3 31 days ago Details
#103 ↑ +116
WallasAPI

An OpenAI-compatible AI router that sends requests across 12+ providers and 100+ models with automatic fallback and multimodal routing.

AI Development / API 88 ↑ +116 45 days ago Details
#136 ↓ -4
MulmoClaude

Open-source multi-modal Claude Code client that runs locally and lets Claude compose across plugins, GUIs, documents, charts, wikis, automations, and messaging bridges.

Developer Tools / AI Development 87 ↓ -4 31 days ago Details
#434 ↓ -2
KINETK

KINETK is an AI infrastructure product that turns social-web content into a real-time IP graph for creators, communities, and narratives. The site highlights an API and MCP for agents, multimodal embeddings, and a query layer for grounded retrieval.

Developer Tools / APIs 82 ↓ -2 4 days ago Details

A detailed walkthrough for running a local coding agent on macOS with llama.cpp, Gemma 4, MTP speculative decoding, image support, and Pi as the agent interface.

Developer Tools / Code Assistant 80 ↓ -1 7 days ago Details