Eni Grand's picture

Eni Grand

Enigrand

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

upvoted an article about 5 hours ago

Safetensors is Joining the PyTorch Foundation

upvoted a collection about 7 hours ago

View all activity

Organizations

upvoted a paper about 4 hours ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published 3 days ago • 34

upvoted an article about 5 hours ago

Article

Safetensors is Joining the PyTorch Foundation

1 day ago

•

21

upvoted 2 collections about 7 hours ago

VoxCPM

5 items • Updated 2 days ago • 9

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 3 items • Updated about 10 hours ago • 24

liked a model about 20 hours ago

z-lab/Qwen3.5-27B-DFlash

Text Generation • 2B • Updated 2 days ago • 2.66k • 32

upvoted a collection about 21 hours ago

DFlash

Block Diffusion for Flash Speculative Decoding • 13 items • Updated 4 days ago • 46

upvoted a paper about 21 hours ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 46

upvoted a collection 2 days ago

Ace-Step 1.5-xl

3 items • Updated 7 days ago • 60

upvoted a collection 6 days ago

Gemma 4

8 items • Updated 7 days ago • 511

liked a model 6 days ago

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 7 days ago • 1.33M • • 1.51k

upvoted a paper 7 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 15 days ago • 51

liked 2 models 8 days ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 69.1k • • 2.89k

byteshape/Qwen3.5-9B-GGUF

Image-Text-to-Text • 9B • Updated 9 days ago • 9.99k • 26

upvoted 2 collections 8 days ago

Bonsai-Auxiliary

3 items • Updated 9 days ago • 7

Bonsai

1-bit Bonsai models • 6 items • Updated 9 days ago • 162

upvoted 2 collections 15 days ago

Open Coding Agents

13 items • Updated Mar 5 • 52

MolmoWeb

This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 5 items • Updated 16 days ago • 22

liked a model 21 days ago

Qwen/Qwen-Image-2512

Text-to-Image • Updated Dec 31, 2025 • 91.7k • • 760

upvoted a paper 23 days ago

Attention Residuals

Paper • 2603.15031 • Published 24 days ago • 176

New activity in mistralai/Mistral-Small-4-119B-2603 23 days ago

Dense models?

#8 opened 23 days ago by