Sinkformers: Transformers with Doubly Stochastic Attention Paper • 2110.11773 • Published Oct 22, 2021
Routers in Vision Mixture of Experts: An Empirical Study Paper • 2401.15969 • Published Jan 29, 2024 • 2
Implicit Diffusion: Efficient Optimization through Stochastic Sampling Paper • 2402.05468 • Published Feb 8, 2024 • 6
Direct Language Model Alignment from Online AI Feedback Paper • 2402.04792 • Published Feb 7, 2024 • 34