I am Ksenia, a research scientist at DeepMind. My broad research interests are in embodied AI, vision language models, and alignment.
At DeepMind, I am currently part of the Gemini Robotics team. I have been involved in the success detection and progress understanding workstream in our Embodied Reasoning model ER-1.5 and in the design of the Physical Agent on the robot. In 2024 I was in the Generative Media team working on the Imagen 3 model focusing on aligning it with human intent. Prior to this, I was in the Machine Learning team led by Nando de Freitas.
I completed my PhD at EPFL's CVLab in January 2019, where I was supervised by Prof. Pascal Fua and Prof. Raphael Sznitman. In autumn 2017 during my internship at Google Research in Zurich I had a chance to work with Vittorio Ferrari and Jasper Uijlings.
I obtained my M.Sc. degree in Algorithms and Machine Learning from University of Helsinki. During that time, I also worked as a research assistant in the CoSCo group at HIIT. Previously, I studied in Russia at the Higher School of Economics in the Faculty of Business Informatics and Applied Mathematics.