Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published 15 days ago • 77
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA Paper • 2505.12805 • Published 19 days ago • 22
Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction Paper • 2505.11254 • Published 21 days ago • 48