13 10

木村優斗

hehaoran47

AI & ML interests

None yet

Recent Activity

liked a dataset about 14 hours ago

jat-project/jat-dataset

upvoted a paper 1 day ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

upvoted a paper 1 day ago

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

View all activity

Organizations

None yet

liked a dataset about 14 hours ago

jat-project/jat-dataset

Viewer • Updated Feb 16, 2024 • 258M • 717k • 52

upvoted 2 papers 1 day ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Paper • 2605.16928 • Published 8 days ago • 83

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Paper • 2605.17757 • Published 6 days ago • 61

liked a dataset 1 day ago

KakologArchives/KakologArchives

Updated 2 minutes ago • 3.38M • 47

upvoted a paper 2 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 10 days ago • 142

liked a model 2 days ago

Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 13.2M • • 1.29k

liked a dataset 5 days ago

cf-group-4/cloth_folding_second_fold1

Viewer • Updated 5 days ago • 35.2k • 188 • 1

upvoted a paper 9 days ago

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published 12 days ago • 120

upvoted a paper 12 days ago

What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

Paper • 2605.07915 • Published 16 days ago • 8

upvoted a paper 17 days ago

How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum

Paper • 2604.25907 • Published 26 days ago • 3

upvoted a paper 22 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 24 days ago • 218

liked 2 models about 1 month ago

BeaverAI/Artemis-31B-v1d-GGUF-BROKEN

31B • Updated Apr 12 • 293 • 1

Albertoo12/IndoBertV2-finetune

0.1B • Updated Apr 12 • 18 • 1

upvoted a paper about 1 month ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

liked a dataset about 1 month ago

galaxythereal/competition-frames-dataset1

Viewer • Updated Apr 10 • 1.11k • 147 • 1

liked 3 models about 2 months ago

upvoted 2 papers about 2 months ago

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

Paper • 2603.26728 • Published Mar 20 • 12

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

木村優斗

AI & ML interests

Recent Activity

Organizations

hehaoran47's activity