FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published 11 days ago • 14
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published 11 days ago • 14
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models Paper • 2412.15978 • Published Dec 20, 2024 • 1
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models Paper • 2504.14366 • Published Apr 19 • 1
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements Paper • 2511.05560 • Published Nov 4 • 1
PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models Paper • 2510.24792 • Published Oct 27
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data Paper • 2412.10121 • Published Dec 13, 2024 • 2
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 35
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper • 2206.15076 • Published Jun 30, 2022 • 5
Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition Paper • 2403.14222 • Published Mar 21, 2024 • 1
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs Paper • 2309.09582 • Published Sep 18, 2023 • 4