AgentDish directory

Research AI Tools

Accepted listings in this category.

Listing Category Score Trend Checked

An interactive dashboard that analyzes New York Times coverage since 2000 using the NYT Archive API, with views for reporters, beats, sections, subjects, geography, obituaries, and corrections.

Research / Data Visualization 89 ↑ +164 45 days ago Details
#94 ↓ -3
CAD-Bench

An open benchmark and leaderboard for AI CAD agents, with 308 prompts across 20 categories and layered scoring for geometry, engineering, manufacturability, and cognition.

Research / Knowledge Work 88 ↓ -3 42 days ago Details

A research article from Applied Compute on how agentic, tool-using workloads differ from traditional LLM benchmarks, with production observations, workload profiles, and an open-source harness for replaying traces.

Research / Knowledge Work 87 ↓ -47 45 days ago Details
#223 ↑ +346
Alignment Whack-a-Mole

A research code repository for studying how fine-tuning can trigger verbatim recall of copyrighted books in large language models. It includes preprocessing, fine-tuning, generation, and memorization-evaluation scripts, with setup notes and example data.

Research / Copywriting 86 ↑ +346 46 days ago Details

arXiv paper describing QUEST, an open family of deep research agents from 2B to 35B parameters, plus a synthetic-task training recipe and released models, data, and scripts.

Research / AI Agents 83 ↓ -3 25 days ago Details
#404 ↓ -3
wwwatch

A daily AI intelligence journal for builders, covering notable model, tooling, and release updates in a short sourced digest.

Research / Knowledge Work 83 ↓ -3 29 days ago Details
#408 ↓ -3
Physics AI

Physics AI is a physics homework and study tool that solves problems from photos or typed prompts, with step-by-step explanations, tutor mode, and visual breakdowns for diagrams and vectors.

Research / Knowledge Work 83 ↓ -3 30 days ago Details

A research report on the current MCP ecosystem, with live crawl numbers, verification rates, category breakdowns, and examples of both strong and weak MCP-positive sites.

Research / AI research 83 ↑ +57 45 days ago Details
#441 ↓ -2
BigTech AI News

Chrome extension that tracks major AI companies, pulls in AI news and research, and generates daily summaries with Gemini, including language-aware summaries and article deep dives.

Research / Knowledge Work 82 ↓ -2 10 days ago Details
#481 ↑ +85
ShadowBrokers

AI-powered trade signal product for retail traders that turns financial news into ranked trade plans with entries, stops, targets, and tracked accuracy.

Research / Knowledge Work 82 ↑ +85 45 days ago Details

PaperProfit explains an AI-assisted stock evaluation approach that combines fundamentals, technical signals, and qualitative analysis from transcripts and SEC filings into a weighted score.

Research / Knowledge Work 77 → 0 18 days ago Details

A research write-up on detecting AI agents through process differences in CAPTCHA and related cognitive tasks. It outlines the CogCAPTCHA30 approach, reports human-vs-model differences, and connects the findings to Roundtable’s Proof of Human product.

Research / Knowledge Work 77 → 0 21 days ago Details

arXiv paper on a self-speculative decoding framework for speeding up reasoning LLM inference on edge hardware, with hardware co-design and reported speedups.

Research / AI/ML Paper 77 → 0 22 days ago Details
#642 ↓ -1
PDF to MD Converter

AI-powered PDF to Markdown converter focused on preserving structure like headings, tables, images, and reading order for research, docs, and knowledge workflows.

Research / Knowledge Work 76 ↓ -1 17 days ago Details
#648 ↓ -1
ITB Engine

A research repository and local web app for testing quantum gravity theory-space exclusions against encoded consistency constraints. It includes a CLI, a localhost interface, test coverage, and an LLM-powered research agent for running searches and report generation.

Research / Scientific Computing 76 ↓ -1 40 days ago Details

arXiv paper on distilling multi-agent debate into a single LLM with a two-stage fine-tuning pipeline. The abstract reports lower token use, comparable or better benchmark performance, and an analysis of agent-specific activation subspaces, with code linked from the page.

Research / AI/LLM Reasoning 74 ↓ -1 15 days ago Details

An arXiv paper on reducing LLM inference cost for web automation by compiling browser tasks into a deterministic JSON workflow and executing them without repeated model calls.

Research / Paper 72 ↑ +1 13 days ago Details