AgentDish directory

reinforcement-learning

Accepted listings with this tag.

Listing Category Score Trend Checked
#560 ↑ +2
GenZ LLM

A small post-trained Qwen2.5-0.5B-Instruct model tuned to write in Gen Z slang, with training and inference notebooks plus dataset files in the repo.

AI Models / Fine-Tuned LLMs 79 ↑ +2 40 days ago Details

Agora-1 is a multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research.

AI Research / World Models 78 ↑ +6 32 days ago Details

A blog post describing a small reinforcement-learning agent trained with PPO to play and beat a Pokelike/Pokerogue-style game, including the input representation, model architecture, and training approach.

Developer Tools / Code Assistant 74 ↓ -1 16 days ago Details

An educational article explaining world models, latent states, dynamics learning, and planning for agents, with examples from gridworld, Dreamer, and MuZero.

Writing / Copywriting 73 ↑ +1 34 days ago Details