Jonas Golde's picture

6 8 8

Jonas Golde

whoisjones

·

AI & ML interests

Data-efficient transfer learning

Recent Activity

new activity 5 days ago

whoisjones/finerweb-multilabel-classifier-xlmr-4o:Improve model card: Add pipeline tag, paper link, code, description, and usage example

authored a paper 7 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

submitted a paper 8 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

View all activity

Organizations

authored a paper 7 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Paper • 2512.13884 • Published 11 days ago • 14

submitted a paper to Daily Papers 8 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Paper • 2512.13884 • Published 11 days ago • 14

authored 5 papers 10 days ago

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models

Paper • 2412.15978 • Published Dec 20, 2024 • 1

Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models

Paper • 2504.14366 • Published Apr 19 • 1

Question Decomposition for Retrieval-Augmented Generation

Paper • 2507.00355 • Published Jul 1 • 1

Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements

Paper • 2511.05560 • Published Nov 4 • 1

PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models

Paper • 2510.24792 • Published Oct 27

authored a paper 7 months ago

MastermindEval: A Simple But Scalable Reasoning Benchmark

Paper • 2503.05891 • Published Mar 7 • 1

authored 2 papers about 1 year ago

PECC: Problem Extraction and Coding Challenges

Paper • 2404.18766 • Published Apr 29, 2024

Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data

Paper • 2412.10121 • Published Dec 13, 2024 • 2

authored 3 papers over 1 year ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 35

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

Paper • 2206.15076 • Published Jun 30, 2022 • 5

Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition

Paper • 2403.14222 • Published Mar 21, 2024 • 1

authored a paper almost 2 years ago

Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

Paper • 2309.09582 • Published Sep 18, 2023 • 4