I am Ksenia, a research scientist at DeepMind. My broad research interests are in embodied AI, vision language models, and alignment.

Biography

At DeepMind, I am currently part of the Gemini Robotics team. I have been involved in the success detection and progress understanding workstream in our Embodied Reasoning model ER-1.5 and in the design of the Physical Agent on the robot. In 2024 I was in the Generative Media team working on the Imagen 3 model focusing on aligning it with human intent. Prior to this, I was in the Machine Learning team led by Nando de Freitas.

I completed my PhD at EPFL's CVLab in January 2019, where I was supervised by Prof. Pascal Fua and Prof. Raphael Sznitman. In autumn 2017 during my internship at Google Research in Zurich I had a chance to work with Vittorio Ferrari and Jasper Uijlings.

I obtained my M.Sc. degree in Algorithms and Machine Learning from University of Helsinki. During that time, I also worked as a research assistant in the CoSCo group at HIIT. Previously, I studied in Russia at the Higher School of Economics in the Faculty of Business Informatics and Applied Mathematics.

News

November, 2025: Gemini Robotics 1.5 is out! Check out our blogpost and technical report. You can also try ER-1.5 in AI Studio.

October, 2024: Image generation with Imagen 3 is now available to all Gemini users around the world!

Ksenia Konyushkova