On Path to Multimodal Generalist: General-Level and General-Bench Paper β’ 2505.04620 β’ Published 4 days ago β’ 62
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation Paper β’ 2505.04512 β’ Published 4 days ago β’ 31
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence β’ 15 items β’ Updated 6 days ago β’ 50
πΈ April 2025 - Open releases from the Chinese community Collection 41 items β’ Updated 11 days ago β’ 12
Towards Understanding Camera Motions in Any Video Paper β’ 2504.15376 β’ Published 20 days ago β’ 155
Skywork-R1V2 Collection Multimodal Hybrid Reinforcement Learning for Reasoning β’ 4 items β’ Updated 12 days ago β’ 10
SkyReels-V2: Infinite-length Film Generative Model Paper β’ 2504.13074 β’ Published 24 days ago β’ 7
SkyReels-V2 Collection Infinite-length Film Generative Model β’ 9 items β’ Updated 17 days ago β’ 34
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper β’ 2504.10479 β’ Published 27 days ago β’ 255
Kimina Prover Preview Collection State-of-the-Art Models for Formal Mathematical Reasoning β’ 5 items β’ Updated 13 days ago β’ 29