Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
14
2
1
sai_reddy
saireddy
Follow
21world's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
about 2 months ago
moonshotai/Kimi-Linear-48B-A3B-Instruct:
insights on comparisons with Qwen/Qwen3-Next-80B-A3B-Instruct ?
new
activity
2 months ago
Qwen/Qwen3-VL-235B-A22B-Instruct-FP8:
function calling
new
activity
4 months ago
Qwen/Qwen3-30B-A3B-Instruct-2507-FP8:
possible to extend context to 1m tokens ?
View all activity
Organizations
saireddy
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
moonshotai/Kimi-Linear-48B-A3B-Instruct
about 2 months ago
insights on comparisons with Qwen/Qwen3-Next-80B-A3B-Instruct ?
➕
6
#14 opened about 2 months ago by
saireddy
New activity in
Qwen/Qwen3-VL-235B-A22B-Instruct-FP8
2 months ago
function calling
#4 opened 2 months ago by
saireddy
New activity in
Qwen/Qwen3-30B-A3B-Instruct-2507-FP8
4 months ago
possible to extend context to 1m tokens ?
#5 opened 4 months ago by
saireddy
upvoted
an
article
about 1 year ago
view article
Article
Hugging Face x LangChain : A new partner package
+1
May 14, 2024
•
159
New activity in
google/gemma-2-9b
about 1 year ago
RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.
➕
4
13
#24 opened over 1 year ago by
saireddy
New activity in
google/gemma-2-9b
over 1 year ago
model.generate is throwing AttributeError: 'HybridCache' object has no attribute 'float'
7
#18 opened over 1 year ago by
saireddy
base vs instruct model
1
#17 opened over 1 year ago by
saireddy
Inference error
9
#20 opened over 1 year ago by
gsasikiran
New activity in
google/gemma-7b
over 1 year ago
8-bit precision error
17
#32 opened almost 2 years ago by
saireddy
New activity in
google/gemma-7b-it
over 1 year ago
ValueError with multi A100 GPUS
2
#28 opened almost 2 years ago by
saireddy
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
over 1 year ago
ValueError: You can't train a model that has been loaded in 8-bit precision on a different device than the one you're training on.
12
#35 opened over 1 year ago by
madhurjindal
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
over 1 year ago
Base vs instruct
5
#17 opened over 1 year ago by
saireddy
New activity in
google/gemma-7b-it
almost 2 years ago
Could not find GemmaForCausalLM neither in <module 'transformers.models.gemma'
6
#36 opened almost 2 years ago by
chenwei1984
liked
a model
over 2 years ago
meta-llama/Llama-2-13b-chat-hf
Text Generation
•
13B
•
Updated
Apr 17, 2024
•
210k
•
•
1.11k