Research
ELLIS Unit Freiburg is dedicated to advancing autonomous learning, uniting expertise in efficient learning, trustworthy ML, and physical AI.
Programs
Explore programs that our ELLIS unit offers — connecting researchers, providing training and mobility opportunities, and contributing to policy work.
Talk
Imagine, Verify, Act: Robot Planning with World Models and Vision-Language Evaluators
June 3rd, 2026
11:00
IMBIT, NEXUS Lab, Georges Köhler Allee 201, 79110 Freiburg
Humans and animals anticipate the consequences of their actions before acting, yet most robot policies remain reactive and limited to short horizons. In this talk, Dr. Gumbsch will sketch an approach to building embodied agents that learn to solve tasks by thinking ahead, rather than relying on expert demonstrations or extensive real-world interaction.
Revisiting model-based reinforcement learning through the lens of recent advances in world modelling and foundation models, he will propose three ingredients. First, a compositional world model that represents scenes as objects and interactions as events, enabling selective imagination over multiple time scales. Second, an internal critic to judge predicted outcomes. Here, he will show how pre-trained Vision-Language Models can serve as zero-shot evaluators of behavior, both to guide exploration and to assess task success. Third, an imagined policy improvement loop that ties these together: the agent rehearses behaviors inside the world model, the VLM scores them, and the policy is updated from these synthetic rollouts. The dream is to build robots that have already mastered a task in imagination before attempting the first step.
Christian Gumbsch is a postdoctoral researcher at the University of Amsterdam, working with Prof. Stratis Gavves. He completed his PhD at the Max Planck Institute for Intelligent Systems and the University of Tübingen, followed by a postdoc at TU Dresden. His research focuses on how autonomous embodied agents can learn adaptive, goal-directed behavior from experience, with an emphasis on long-horizon planning, temporal abstraction, world models, generalization, and intrinsic motivation. His work bridges AI and cognitive modeling, aiming to understand and build agents that can anticipate, plan, and act intelligently.