AgentDish directory

speculative decoding

Accepted listings with this tag.

Listing Category Score Trend Checked

A detailed walkthrough for running a local coding agent on macOS with llama.cpp, Gemma 4, MTP speculative decoding, image support, and Pi as the agent interface.

Developer Tools / Code Assistant 80 ↓ -1 7 days ago Details

Google Developers Blog post about integrating DFlash, a diffusion-style speculative decoding framework, into the vLLM TPU ecosystem to improve LLM serving speed on TPU v5p.

Developer Tools / Code Assistant 78 ↓ -127 45 days ago Details

arXiv paper on a self-speculative decoding framework for speeding up reasoning LLM inference on edge hardware, with hardware co-design and reported speedups.

Research / AI/ML Paper 77 → 0 22 days ago Details