Publications

Papers are listed below. * denotes joint first authors.

2024

  1. How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
    Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Jameel Hassan, Muzammal Naseer, Federico Tombari, Fahad Shahbaz Khan, and Salman Khan
    arXiv preprint arXiv:2405.03690, 2024
  2. Learning to Prompt with Text Only Supervision for Vision-Language Models
    Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Muzammal Naseer, Luc Van Gool, and Federico Tombari
    arXiv preprint arXiv:2401.02418, 2024

2023

  1. MaPLe: Multi-modal Prompt Learning
    Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, Salman Khan, and Fahad Shahbaz Khan
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
  2. Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
    Jameel Hassan, Hanan Gani, Noor Hussein, Muhammad Uzair Khattak, Muzammal Naseer, Fahad Shahbaz Khan, and Salman Khan
    Advances in Neural Information Processing Systems, 2023
  3. Fine-tuned CLIP Models are Efficient Video Learners
    Hanoona Rasheed*, Muhammad Uzair Khattak*, Muhammad Maaz, Salman Khan, and Fahad Shahbaz Khan
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
  4. Self-regulating Prompts: Foundational Model Adaptation without Forgetting
    Muhammad Uzair Khattak*, Syed Talal Wasim*, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, and Fahad Shahbaz Khan
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2023
  5. Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
    Syed Talal Wasim*, Muhammad Uzair Khattak*, Muzammal Naseer, Salman Khan, Mubarak Shah, and Fahad Shahbaz Khan
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2023

2022

  1. Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
    Hanoona Bangalath*, Muhammad Maaz*, Muhammad Uzair Khattak, Salman H Khan, and Fahad Shahbaz Khan
    Advances in Neural Information Processing Systems, Oct 2022
  2. Investigating and Improving Common Loop Closure Failures in Visual SLAM
    Saran Khaliq, Muhammad Latif Anjum, Wajahat Hussain, Muhammad Uzair Khattak, and Momen Rasool
    Autonomous Robots, Oct 2022