AgentDish directory
reinforcement-learning
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
|
#560
↑ +2
GenZ LLM
A small post-trained Qwen2.5-0.5B-Instruct model tuned to write in Gen Z slang, with training and inference notebooks plus dataset files in the repo. |
AI Models / Fine-Tuned LLMs | 79 | ↑ +2 | 40 days ago | Details |
|
#595
↑ +6
Agora-1: The Multi-Agent World Model
Agora-1 is a multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research. |
AI Research / World Models | 78 | ↑ +6 | 32 days ago | Details |
|
A blog post describing a small reinforcement-learning agent trained with PPO to play and beat a Pokelike/Pokerogue-style game, including the input representation, model architecture, and training approach. |
Developer Tools / Code Assistant | 74 | ↓ -1 | 16 days ago | Details |
|
#687
↑ +1
World Models for Planning Agents
An educational article explaining world models, latent states, dynamics learning, and planning for agents, with examples from gridworld, Dreamer, and MuZero. |
Writing / Copywriting | 73 | ↑ +1 | 34 days ago | Details |