Ashkan Khakzar

I’m a postdoctoral researcher in machine learning and computer vision working with Philip Torr at the University of Oxford. I am driven by curiosity about how intelligence emerges in neural networks through learning. My PhD research therefore focused on interpreting neural networks, and I was lucky to carry it out at the Technical University of Munich (TUM). My supervisor, Nassir Navab, inspired me throughout my PhD journey. I am very thankful to Bernt Schiele for reviewing my PhD research and inspiring me with his ideas. These days I am pursuing the same vision in the context of vision-language foundation models.

news

Sep 19, 2025	We have a paper at NeurIPS 2025 on a mechanistic study of multimodal knowledge recall in multimodal LLMs (find the preliminary version here).
Jun 2, 2025	We are organizing the second edition of Emergent Visual Abilities and Limits of Foundation Models (EVAL FoMo 2) at CVPR 2025.
Jun 2, 2025	We are organizing the Mechanistic Interpretability for Vision workshop at CVPR 2025.
Jun 1, 2025	We have done some digging into how multimodal LLMs process visual information. Check out our workshop paper at the CVPR 2025 Mechanistic Interpretability for Vision workshop and our paper at CVPR 2025 XAI4CV.
Jun 1, 2025	Check out our ICML 2025 paper on making interpretable sparse wide neural networks possible by leveraging sparsity and mixture-of-experts tricks.
Jun 1, 2025	Check out our ICML 2025 paper on concept erasure for diffusion and flow models.
Jun 1, 2025	We have a scalable concept erasure solution based on model merging and model arithmetic for diffusion models (which we will present at ICCV 2025).
Apr 20, 2025	We have a clever trick to identify (and remove) the neurons important for a concept across time steps in diffusion models, which we present at the ICLR 2025 Scope workshop.
Sep 28, 2024	Check out our ECCV 2024 workshop: Emergent Visual Abilities and Limits of Foundation Models.
Sep 27, 2024	We have a paper at NeurIPS 2024 on evaluating abstract shape recognition in vision-language models.
Sep 12, 2024	Was awarded a grant by the Google Gemma 2 Academic Program to do research on GemmaScope.
Aug 12, 2024	We have a perspective paper on the cognitive revolution in interpreting neural networks.
Jul 4, 2024	Check out our ECCV 2024 paper on safe text-to-image generation.
Jun 6, 2024	Check out our paper on guiding the attention of vision transformers.
Apr 25, 2024	Invited speaker at Trustworthy Multimodal Learning with Foundation Models at the British Machine Vision Association. Talk title: Understanding Foundation Models through Interpretation and Evaluation.

Favorite PhD Publications

CVPR

Do explanations explain? model knows best

Ashkan Khakzar, Pedram Khorsandi, Rozhin Nobahari, and 1 more author

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CVPR

Neural Response Interpretation through the Lens of Critical Pathways

Ashkan Khakzar, Soroosh Baselizadeh, Saurabh Khanduja, and 3 more authors

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021

NeurIPS

Fine-grained neural network explanation by identifying input features with predictive information

Yang Zhang*, Ashkan Khakzar*, Yawei Li, and 3 more authors

Advances in Neural Information Processing Systems, 2021
