Eni Grand's picture

Eni Grand

Enigrand

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

upvoted an article about 13 hours ago

Safetensors is Joining the PyTorch Foundation

upvoted a collection about 16 hours ago

View all activity

Organizations

upvoted a paper about 12 hours ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published 4 days ago • 37

upvoted an article about 13 hours ago

Article

Safetensors is Joining the PyTorch Foundation

2 days ago

•

26

upvoted 2 collections about 16 hours ago

VoxCPM

5 items • Updated 3 days ago • 9

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 3 items • Updated about 18 hours ago • 25

upvoted a collection 1 day ago

DFlash

Block Diffusion for Flash Speculative Decoding • 13 items • Updated 4 days ago • 47

upvoted a paper 1 day ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 46

upvoted a collection 2 days ago

Ace-Step 1.5-xl

3 items • Updated 7 days ago • 60

upvoted a collection 7 days ago

Gemma 4

8 items • Updated 7 days ago • 519

upvoted a paper 8 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 15 days ago • 51

upvoted 2 collections 9 days ago

Bonsai-Auxiliary

3 items • Updated 9 days ago • 7

Bonsai

1-bit Bonsai models • 6 items • Updated 9 days ago • 162

upvoted 2 collections 16 days ago

Open Coding Agents

13 items • Updated Mar 5 • 52

MolmoWeb

This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 5 items • Updated 16 days ago • 22

upvoted a paper 23 days ago

Attention Residuals

Paper • 2603.15031 • Published 24 days ago • 176

upvoted a paper 25 days ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published about 1 month ago • 150

upvoted a collection about 1 month ago

Qwen3.5

21 items • Updated Mar 9 • 1.47k

upvoted 2 papers about 2 months ago

SERA: Soft-Verified Efficient Repository Agents

Paper • 2601.20789 • Published Jan 28 • 13

NOSA: Native and Offloadable Sparse Attention

Paper • 2510.13602 • Published Oct 15, 2025 • 7

upvoted 2 collections about 2 months ago

Devstral 2

A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 2 items • Updated Mar 2 • 52

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 4 days ago • 84