Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sergei B's picture
5 13 7

Sergei B

Serega6678
vishnuo2's profile picture Antalela's profile picture AthulSathyapal's profile picture
·
  • serega6678

AI & ML interests

None yet

Recent Activity

upvoted an article about 17 hours ago
TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell
upvoted an article 8 months ago
I trained a Language Model to schedule events with GRPO!
upvoted an article 9 months ago
ColPali: Efficient Document Retrieval with Vision Language Models 👀
View all activity

Organizations

huggingPartyParis's profile picture

New activity in Qwen/Qwen2.5-VL-72B-Instruct-AWQ 11 months ago

VLLM部署报错

3
#8 opened 11 months ago by
classdemo

is this bug? "image_processor_type": "Qwen2_5_VLImageProcessor",

5
#5 opened 11 months ago by
artheru
New activity in allenai/longformer-large-4096 over 1 year ago

Gradient is nan when Finetuning Pytorch Model

6
#2 opened almost 3 years ago by
willieseun
New activity in numind/NuNER-v1.0 over 1 year ago

Fine-tuning the model on any dataset gives OOM

2
#1 opened over 1 year ago by
vibhas09
commented 2 papers almost 2 years ago

Multilingual E5 Text Embeddings: A Technical Report

Paper • 2402.05672 • Published Feb 8, 2024 • 22 •
4

Multilingual E5 Text Embeddings: A Technical Report

Paper • 2402.05672 • Published Feb 8, 2024 • 22 •
4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs