Noob

noobmldude

AI & ML interests

Explainable AI

Recent Activity

upvoted an article about 12 hours ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

liked a model about 15 hours ago

mistralai/Devstral-Small-2-24B-Instruct-2512

liked a model about 15 hours ago

mistralai/Devstral-2-123B-Instruct-2512

Organizations

upvoted an article about 12 hours ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11 • 168

upvoted a paper 8 days ago

TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar

Paper • 2510.14972 • Published Oct 16 • 33

upvoted a collection 3 months ago

Granite 2.0 Code Models

Collection

A series of code models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 24 days ago • 202

upvoted a collection 5 months ago

H-Net

Collection

The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11 • 20

upvoted 2 articles 5 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8 • 736

Article

Welcome Gemma 2 - Google’s new open LLM

Jun 27, 2024 • 132

upvoted a paper 6 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 128

upvoted an article 6 months ago

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

Jul 11, 2024

upvoted a paper 6 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 15

upvoted an article 6 months ago

Article

Selective fine-tuning of Language Models with Spectrum

Sep 3, 2024

upvoted 2 papers 6 months ago

Magistral

Paper • 2506.10910 • Published Jun 12 • 66

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

Paper • 2506.12278 • Published Jun 13 • 16

upvoted an article 8 months ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Apr 29, 2024

upvoted 2 papers 8 months ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published Mar 12 • 13

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

upvoted 3 papers 9 months ago

2BP: 2-Stage Backpropagation

Paper • 2405.18047 • Published May 28, 2024 • 26

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Paper • 2410.21438 • Published Oct 28, 2024 • 2

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 29

upvoted a paper 10 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 250

upvoted a collection about 1 year ago

Code Evaluation

Collection

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 16
