AgentDish directory

Research AI Tools

Accepted listings in this category.

Listing	Category	Score	Trend	Checked
#53 ↑ +164 Below the Fold — A New York Times X-Ray Dashboard An interactive dashboard that analyzes New York Times coverage since 2000 using the NYT Archive API, with views for reporters, beats, sections, subjects, geography, obituaries, and corrections.	Research / Data Visualization	89	↑ +164	45 days ago	Details
#94 ↓ -3 CAD-Bench An open benchmark and leaderboard for AI CAD agents, with 308 prompts across 20 categories and layered scoring for geometry, engineering, manufacturability, and cognition.	Research / Knowledge Work	88	↓ -3	42 days ago	Details
#147 ↓ -47 Benchmarking Inference Engines on Agentic Workloads A research article from Applied Compute on how agentic, tool-using workloads differ from traditional LLM benchmarks, with production observations, workload profiles, and an open-source harness for replaying traces.	Research / Knowledge Work	87	↓ -47	45 days ago	Details
#223 ↑ +346 Alignment Whack-a-Mole A research code repository for studying how fine-tuning can trigger verbatim recall of copyrighted books in large language models. It includes preprocessing, fine-tuning, generation, and memorization-evaluation scripts, with setup notes and example data.	Research / Copywriting	86	↑ +346	46 days ago	Details
#400 ↓ -3 QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks arXiv paper describing QUEST, an open family of deep research agents from 2B to 35B parameters, plus a synthetic-task training recipe and released models, data, and scripts.	Research / AI Agents	83	↓ -3	25 days ago	Details
#404 ↓ -3 wwwatch A daily AI intelligence journal for builders, covering notable model, tooling, and release updates in a short sourced digest.	Research / Knowledge Work	83	↓ -3	29 days ago	Details
#408 ↓ -3 Physics AI Physics AI is a physics homework and study tool that solves problems from photos or typed prompts, with step-by-step explanations, tutor mode, and visual breakdowns for diagrams and vectors.	Research / Knowledge Work	83	↓ -3	30 days ago	Details
#429 ↑ +57 Q2 2026 MCP Ecosystem Health A research report on the current MCP ecosystem, with live crawl numbers, verification rates, category breakdowns, and examples of both strong and weak MCP-positive sites.	Research / AI research	83	↑ +57	45 days ago	Details
#441 ↓ -2 BigTech AI News Chrome extension that tracks major AI companies, pulls in AI news and research, and generates daily summaries with Gemini, including language-aware summaries and article deep dives.	Research / Knowledge Work	82	↓ -2	10 days ago	Details
#481 ↑ +85 ShadowBrokers AI-powered trade signal product for retail traders that turns financial news into ranked trade plans with entries, stops, targets, and tracked accuracy.	Research / Knowledge Work	82	↑ +85	45 days ago	Details
#618 → 0 Why asking AI "is this stock a good buy?" is useless – and what to do instead PaperProfit explains an AI-assisted stock evaluation approach that combines fundamentals, technical signals, and qualitative analysis from transcripts and SEC filings into a weighted score.	Research / Knowledge Work	77	→ 0	18 days ago	Details
#621 → 0 CAPTCHAs can still detect AI agents A research write-up on detecting AI agents through process differences in CAPTCHA and related cognitive tasks. It outlines the CogCAPTCHA30 approach, reports human-vs-model differences, and connects the findings to Roundtable’s Proof of Human product.	Research / Knowledge Work	77	→ 0	21 days ago	Details
#622 → 0 Cassandra: Enabling Reasoning LLMs at Edge via Self-Speculative Decoding arXiv paper on a self-speculative decoding framework for speeding up reasoning LLM inference on edge hardware, with hardware co-design and reported speedups.	Research / AI/ML Paper	77	→ 0	22 days ago	Details
#642 ↓ -1 PDF to MD Converter AI-powered PDF to Markdown converter focused on preserving structure like headings, tables, images, and reading order for research, docs, and knowledge workflows.	Research / Knowledge Work	76	↓ -1	17 days ago	Details
#648 ↓ -1 ITB Engine A research repository and local web app for testing quantum gravity theory-space exclusions against encoded consistency constraints. It includes a CLI, a localhost interface, test coverage, and an LLM-powered research agent for running searches and report generation.	Research / Scientific Computing	76	↓ -1	40 days ago	Details
#666 ↓ -1 Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate arXiv paper on distilling multi-agent debate into a single LLM with a two-stage fine-tuning pipeline. The abstract reports lower token use, comparable or better benchmark performance, and an analysis of agent-specific activation subspaces, with code linked from the page.	Research / AI/LLM Reasoning	74	↓ -1	15 days ago	Details
#695 ↑ +1 Agentic Compilation: Mitigating the LLM Rerun Crisis for Minimized-Inference-Cost Web Automation An arXiv paper on reducing LLM inference cost for web automation by compiling browser tasks into a deterministic JSON workflow and executing them without repeated model calls.	Research / Paper	72	↑ +1	13 days ago	Details