Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 21 days ago • 42
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis Paper • 2503.08741 • Published Mar 11 • 1
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark Paper • 2504.13143 • Published 25 days ago • 8
Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
EgoPet: Egomotion and Interaction Data from an Animal's Perspective Paper • 2404.09991 • Published Apr 15, 2024
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published Feb 21 • 2
Sonata: Self-Supervised Learning of Reliable Point Representations Paper • 2503.16429 • Published Mar 20 • 11
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Paper • 2503.15478 • Published Mar 19 • 10
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Paper • 2503.06960 • Published Mar 10 • 3
Linguini: A benchmark for language-agnostic linguistic reasoning Paper • 2409.12126 • Published Sep 18, 2024