SageAttention2++: A More Efficient Implementation of SageAttention2 Paper • 2505.21136 • Published 3 days ago • 35
Shifting AI Efficiency From Model-Centric to Data-Centric Compression Paper • 2505.19147 • Published 5 days ago • 136
MMaDA: Multimodal Large Diffusion Language Models Paper • 2505.15809 • Published 9 days ago • 83
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Paper • 2505.04410 • Published 23 days ago • 43
Fast Text-to-Audio Generation with Adversarial Post-Training Paper • 2505.08175 • Published 18 days ago • 22
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder Paper • 2505.07916 • Published 18 days ago • 119
formospeech/whisper-large-v2-formosan-all Automatic Speech Recognition • Updated 14 days ago • 77
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published 22 days ago • 76
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model Paper • 2505.03739 • Published 24 days ago • 8