Armen Jeddi

armenjeddi

https://armenjeddi.github.io/

AI & ML interests

VLMs, test-time scaling, post-training

Organizations

None yet

New activity in armenjeddi/MedBridgeRL-OctoMed-7B-PMC-VQA-RL 12 days ago

Add model card and metadata

#1 opened 13 days ago by

authored a paper 18 days ago

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

Paper • 2603.01301 • Published 19 days ago • 8

submitted a paper to Daily Papers 18 days ago

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

Paper • 2603.01301 • Published 19 days ago • 8

upvoted a paper 18 days ago

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

Paper • 2603.01301 • Published 19 days ago • 8

updated a model 18 days ago

armenjeddi/MedBridgeRL-OctoMed-7B-PMC-VQA-RL

Image-Text-to-Text • 8B • Updated 12 days ago • 30 • 2

published a model 18 days ago

armenjeddi/MedBridgeRL-OctoMed-7B-PMC-VQA-RL

Image-Text-to-Text • 8B • Updated 12 days ago • 30 • 2

upvoted a paper 29 days ago

KeDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments

Paper • 2504.15364 • Published Apr 21, 2025 • 4

updated a collection 30 days ago

LoopFormer

Models trained in the ICLR 2026 paper: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • 17 items • Updated 30 days ago • 2

updated a model 30 days ago

armenjeddi/NanoGPT-Base24-FineWeb300K

1B • Updated 30 days ago • 50

published a model 30 days ago

armenjeddi/NanoGPT-Base24-FineWeb300K

1B • Updated 30 days ago • 50

updated a model 30 days ago

armenjeddi/LoopFormer-3block-8iterations-FineWeb300K

0.3B • Updated 30 days ago • 83

published a model 30 days ago

armenjeddi/LoopFormer-3block-8iterations-FineWeb300K

0.3B • Updated 30 days ago • 83

authored 2 papers about 1 month ago

A Simple Fine-tuning Is All You Need: Towards Robust Deep Learning Via Adversarial Fine-tuning

Paper • 2012.13628 • Published Dec 25, 2020

LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Paper • 2602.11451 • Published Feb 11 • 15

updated a collection about 1 month ago

LoopFormer

Models trained in the ICLR 2026 paper: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • 17 items • Updated 30 days ago • 2

submitted a paper to Daily Papers about 1 month ago

LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Paper • 2602.11451 • Published Feb 11 • 15

upvoted a paper about 1 month ago

LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Paper • 2602.11451 • Published Feb 11 • 15

upvoted a collection about 1 month ago

LoopFormer

Models trained in the ICLR 2026 paper: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • 17 items • Updated 30 days ago • 2

updated a collection about 1 month ago

PuzzleCraft

Qwen2.5-VL-3B & 7B models trained with PuzzleCraft • 9 items • Updated 1 day ago • 3