Yiming Wang

Senior Researcher @ Fondazione Bruno Kessler

profile-ywang-round.png

Hi! I am Yiming, a senior researcher at Fondazione Bruno Kessler. I am enthusiastic on robotic perception, covering diverse topics related to 2D/3D scene representation, scene semantic understanding and embodied navigation/manipulation. My recent research focuses on leveraging multimodal foundation models for embodied perception and reasoning. I am a member of ELLIS.

I work and live in Trento, a beautiful mountain city in northern Italy, providing me with great balance between nature peace and research stress :P

News

Aug 16, 2025 🧑‍💻 I will serve as Area Chair for CVPR 2026!
Aug 11, 2025 🔔 Our workshop on Human-aware Embodied AI (HEAI@IROS’25) is calling for submissions! Check our site for more info or just write us: heai.iros25@gmail.com
Jul 26, 2025 🎉 Extremely grateful that LOTS of Fashion! is picked as Oral presentation at ICCV 2025! We put a lot of love into this project and glad it was appreciated by the community 🫶
Jun 26, 2025 🎉 4/5 papers accepted to ICCV 2025! It was a pleasant experience with rebuttals this year 🫶
Jun 16, 2025 🎉 2 papers will be presented at IROS’25! See you @ Hangzhou, China!

Selected publications

  1. iccv25-lots.png
    LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
    Federico Girella, Davide Talon, Ziyue Liu, Zanxi Ruan, Yiming Wang, and Marco Cristani
    In Proceedings of International Conference on Computer Vision (ICCV), 2025
  2. iccv25-openworld.png
    On large multimodal models as open-world image classifiers
    Alessandro Conti, Massimiliano Mancini, Enrico Fini, Yiming Wang, Paolo Rota, and Elisa Ricci
    In Proceedings of International Conference on Computer Vision (ICCV), 2025
  3. iccv25-personalization.png
    Training-Free Personalization via Retrieval and Reasoning on Fingerprints
    Deepayan Das, Davide Talon, Yiming Wang, Massimiliano Mancini, and Elisa Ricci
    In Proceedings of International Conference on Computer Vision (ICCV), 2025
  4. iccv25-coin.png
    Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues
    Francesco Taioli, Edoardo Zorzi, Gianni Franchi, Alberto Castellini, Alessandro Farinelli, Marco Cristani, and Yiming Wang
    In Proceedings of International Conference on Computer Vision (ICCV), 2025
  5. iros25-freegrasp.png
    Free-form language-based robotic reasoning and grasping
    Runyu Jiao, Alice Fasoli, Francesco Giuliari, Matteo Bortolon, Sergio Povoli, Guofeng Mei, Yiming Wang, and Fabio Poiesi
    In Proceedings of International Conference on Intelligent Robots and Systems (IROS), 2025
  6. cvpr25-perla.png
    PerLA: Perceptive 3D language assistant
    Guofeng Mei, Wei Lin, Luigi Riz, Yujiao Wu, Fabio Poiesi, and Yiming Wang
    In Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  7. cviu-cnp.png
    Collaborative Neural Painting
    Nicola Dall’Asen, Willi Menapace, Elia Peruzzo, Enver Sangineto, Yiming Wang, and Elisa Ricci
    Computer Vision and Image Understanding (CVIU), 2025
  8. cvpr25-synvita.png
    Can Text-to-Video Generation help Video-Language Alignment?
    Luca Zanella, Massimiliano Mancini, Willi Menapace, Sergey Tulyakov, Yiming Wang, and Elisa Ricci
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
  9. cvpr25-act.png
    Seeing the Abstract: Translating the Abstract Language for Vision Language Models
    Davide Talon, Federico Girella, Ziyue Liu, Marco Cristani, and Yiming Wang
    In Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  10. tpami-pompuncertainty.png
    Unsupervised active visual search with monte carlo planning under uncertain detections
    Francesco Taioli, Francesco Giuliari, Yiming Wang, Riccardo Berra, Alberto Castellini, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, and Francesco Setti
    Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
  11. iros24-mindtheerror.png
    Mind the error! detection and localization of instruction errors in vision-and-language navigation
    Francesco Taioli, Stefano Rosa, Alberto Castellini, Lorenzo Natale, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, and Yiming Wang
    In Proceedings of International Conference on Intelligent Robots and Systems (IROS), 2024
  12. cvpr24-lavad.png
    Harnessing Large Language Models for Training-free Video Anomaly Detection
    Luca Zanella, Willi Menapace, Massimiliano Mancini, Yiming Wang, and Elisa Ricci
    In Proceedings of IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024
  13. cvpr24-t3al.png
    Test-Time Zero-Shot Temporal Action Localization
    Benedetta Liberatori, Alessandro Conti, Paolo Rota, Yiming Wang, and Elisa Ricci
    In Proceedings of IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024
  14. cvpr24-geoze.png
    Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding
    Guofeng Mei, Luigi Riz, Yiming Wang, and Fabio Poiesi
    In Proceedings of IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024
  15. neurips23-vic.png
    Vocabulary-free Image Classification
    Alessandro Conti, Enrico Fini, Massimiliano Mancini, Paolo Rota, Yiming Wang, and Elisa Ricci
    In Proceedings of Conference on Neural Information Processing Systems (NeurIPS), 2023
  16. tpami-scenegraph.png
    Leveraging commonsense for object localisation in partial scenes
    Francesco Giuliari, Geri Skenderi, Marco Cristani, Alessio Del Bue, and Yiming Wang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
  17. wacv23-confmix.png
    ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing
    Giulio Mattolin, Luca Zanella, Elisa Ricci, and Yiming Wang
    In Winter Conference on Applications of Computer Vision (WACV), 2023
  18. cvpr22-scenegraph.png
    Spatial Commonsense Graph for Object Localisation in Partial Scenes
    Francesco Giuliari, Geri Skender, Marco Cristani, Yiming Wang, and Alessio Del Bue
    In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  19. eccv20-exhistcnn.png
    Where to Explore Next? ExHistCNN for History-aware Autonomous 3D Exploration
    Yiming Wang and Alessio Del Bue
    In Proceedings of European Conference on Computer Vision (ECCV), 2020
  20. ral19-active.png
    Autonomous 3D reconstruction, mapping and exploration of indoor environments with a robotic arm
    Yiming Wang, Stuart James, Elisavet Konstantina Stathopoulou, Carlos Beltrán-González, Yoshinori Konishi, and Alessio Del Bue
    IEEE Robotics and Automation Letters (RA-L), 2019
  21. avss17-activetracking.png
    Active visual tracking in multi-agent scenarios
    Yiming Wang and Andrea Cavallaro
    In Proceedings of IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2017