AgentDish directory

reinforcement-learning

Accepted listings with this tag.

Listing	Category	Score	Trend	Checked
#560 ↑ +2 GenZ LLM A small post-trained Qwen2.5-0.5B-Instruct model tuned to write in Gen Z slang, with training and inference notebooks plus dataset files in the repo.	AI Models / Fine-Tuned LLMs	79	↑ +2	40 days ago	Details
#595 ↑ +6 Agora-1: The Multi-Agent World Model Agora-1 is a multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research.	AI Research / World Models	78	↑ +6	32 days ago	Details
#667 ↓ -1 Show HN: 178K Parameter Neural Net That Wins Poke(rogue)like A blog post describing a small reinforcement-learning agent trained with PPO to play and beat a Pokelike/Pokerogue-style game, including the input representation, model architecture, and training approach.	Developer Tools / Code Assistant	74	↓ -1	16 days ago	Details
#687 ↑ +1 World Models for Planning Agents An educational article explaining world models, latent states, dynamics learning, and planning for agents, with examples from gridworld, Dreamer, and MuZero.	Writing / Copywriting	73	↑ +1	34 days ago	Details