sai_reddy
saireddy
AI & ML interests
None yet
Organizations
Fine tune with lora
1
#26 opened 2 months ago
by
saireddy
Memory Requirements to run `Qwen/Qwen3.5-397B-A17B`
๐โค๏ธ 10
5
#20 opened 3 months ago
by
alvarobartt
can we deploy this using tp (H100-80GB each)=6 ?
1
#3 opened 3 months ago
by
saireddy
insights on comparisons with Qwen/Qwen3-Next-80B-A3B-Instruct ?
โ 6
#14 opened 6 months ago
by
saireddy
function calling
#4 opened 7 months ago
by
saireddy
possible to extend context to 1m tokens ?
#5 opened 9 months ago
by
saireddy
model.generate is throwing AttributeError: 'HybridCache' object has no attribute 'float'
7
#18 opened almost 2 years ago
by
saireddy
base vs instruct model
1
#17 opened almost 2 years ago
by
saireddy
Inference error
9
#20 opened almost 2 years ago
by
gsasikiran
8-bit precision error
17
#32 opened about 2 years ago
by
saireddy
ValueError with multi A100 GPUS
2
#28 opened about 2 years ago
by
saireddy
Base vs instruct
5
#17 opened about 2 years ago
by
saireddy
Could not find GemmaForCausalLM neither in <module 'transformers.models.gemma'
6
#36 opened about 2 years ago
by
chenwei1984