paper maybe useful - a dorni Collection

dorni 's Collections

paper maybe useful

paper maybe useful

updated 15 days ago

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published Feb 12 • 44
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 49
Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

Paper • 2503.09419 • Published Mar 12 • 6
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 142
Can Vision-Language Models Answer Face to Face Questions in the Real-World?

Paper • 2503.19356 • Published Mar 25 • 2
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals

Paper • 2503.19953 • Published Mar 25 • 3
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13 • 54
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Paper • 2503.05638 • Published Mar 7 • 19
Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79
Segment Any Motion in Videos

Paper • 2503.22268 • Published Mar 28 • 18
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Paper • 2504.17789 • Published Apr 24 • 23
Reinforcement Pre-Training

Paper • 2506.08007 • Published 23 days ago • 238
Dreamland: Controllable World Creation with Simulator and Generative Models

Paper • 2506.08006 • Published 23 days ago • 7
Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published 23 days ago • 26
PlayerOne: Egocentric World Simulator

Paper • 2506.09995 • Published 21 days ago • 33

	
		OSZAR »