Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 168
view article Article The Falcon has landed in the Hugging Face ecosystem +6 lvwerra, ybelkada, smangrul, lewtun, olivierdehaene, pcuenq, philschmid, osanseviero • Jun 5, 2023 • 17
view article Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages +7 Quent-01, nilabhra, rcojocaru, Mughaira, gcampesan, SanathNarayan, griffintaur, clefourrier, SaylorTwift • May 24, 2024 • 28
view article Article How to generate text: using different decoding methods for language generation with Transformers patrickvonplaten • Mar 1, 2020 • 297