Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

liked a model 2 days ago

mradermacher/Promt-generator-i1-GGUF

liked a Space 2 days ago

NyxKrage/LLM-Model-VRAM-Calculator

Organizations

None yet

gatepoet's activity

upvoted a paper 1 day ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published 2 days ago • 65

upvoted a paper 3 days ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published 9 days ago • 20

upvoted a collection 11 days ago

DeepSeek-Prover

Collection

DeepSeek-Prover-Series • 10 items • Updated 14 days ago • 52

upvoted a paper 26 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 29 days ago • 60

upvoted a paper 29 days ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10 • 42

upvoted an article about 2 months ago

Training and Finetuning Reranker Models with Sentence Transformers v4

Article • Mar 26 • 126

upvoted 3 papers about 2 months ago

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

Paper • 2503.21620 • Published Mar 27 • 62

MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving

Paper • 2503.16905 • Published Mar 21 • 54

Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction

Paper • 2503.16194 • Published Mar 20 • 8

upvoted 8 papers 2 months ago

upvoted 3 papers 3 months ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

Paper • 2502.17535 • Published Feb 24 • 8

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 70

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 30