AgentDish directory

AI Research AI Tools

Accepted listings in this category.

Rank	Listing	Description	Category	Score	Trend	Checked
#495	EuroMesh	A sourced model and short report exploring whether Europe could train a sovereign frontier AI model using public compute it already owns, with reproducible code, datasets, and a PDF report.	AI Research / Analysis / Reports	81	↑ +2	5 days ago
#512	MarCognity-AI	An open-source research framework for structured LLM evaluation, claim verification, and source-grounded reflective reasoning. The repo describes modular components for retrieval, semantic scoring, skeptical claim checking, and benchmark-style epistemic assessment.	AI Research / Evaluation / Verification Framework	81	↓ -21	45 days ago
#571	MiroThinker	MiroThinker is a science-focused AI research app that emphasizes prediction, verification, and evidence-backed answers. The page also points to a MiroMind app and suggests use cases across finance, medicine, and regulation.	AI Research / Deep Research Agent	78	↑ +6	8 days ago
#582	Learning from AVA: Early Lessons from a Curated and Trustworthy Generative AI for Policy and Development Research	arXiv paper describing AVA, a GenAI platform for policy and development research built on 4,000+ World Bank reports. The abstract highlights multilingual support, evidence-based synthesis, citation verifiability, and reasoned abstention when queries cannot be supported.	AI Research / Trustworthy Generative AI	78	↑ +6	23 days ago
#595	Agora-1: The Multi-Agent World Model	Agora-1 is a multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research.	AI Research / World Models	78	↑ +6	32 days ago
#607	LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning	Apple Machine Learning Research paper proposing LaDiR, a reasoning framework that combines a VAE-based latent space with latent diffusion to improve LLM text reasoning and iterative refinement.	AI Research / LLM Reasoning	78	↑ +5	45 days ago
#646	Hyperagents	Research paper introducing hyperagents, a self-referential agent framework that combines a task agent and a meta agent into one editable program. The abstract describes a DGM-based system that improves both task performance and its own improvement process across domains.	AI Research / Self-Improving Agents	76	↓ -1	28 days ago
#656	A 400-hour forensic audit of LLMs using multi-model context saturation	A GitHub research project documenting a long-form, multi-model analysis of LLM behavior across Claude, Gemini, ChatGPT, and Grok. The repo includes an executive summary, screenplay, technical white paper, and archive of logs and chat records.	AI Research / LLM Evaluation & Analysis	75	→ 0	25 days ago
#675	GPT Guesses Between 1 and 100	A GitHub research project that measures how gpt-4.1 responds when asked to pick a random number between 1 and 100, using 10,000 API calls and comparing the results to a uniform baseline.	AI Research / Model Behavior Analysis	74	↓ -1	26 days ago