Tiny-A2D Collection Small diffusion language models adapted from AR models • 4 items • Updated 20 days ago • 11
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published Jan 20 • 109
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574
view article Article Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face +5 Dec 11, 2023 • 13