PerSense: an end-to-end, training-free and model-agnostic one-shot framework for personalized instance segmentation in dense images.





Given a query image and support set, CLE leverages the masked support image to identify the object category which serves as a prompt to Grounding Detector.
A step-by-step overview of PerSense components



Incorporates feature and scale diversity to compute a weighted score for selecting representative exemplars, enhancing density map generation
Qualitative comparison with SOTA

If you use PerSense or PerSense++ in your research, please cite our papers
@article{siddiqui2024persense,
title = {PerSense: Personalized Instance Segmentation in Dense Images},
author = {Siddiqui, Muhammad Ibraheem and Sheikh, Muhammad Umer and Abid, Hassan and Khan, Muhammad Haris},
journal = {arXiv preprint arXiv:2405.13518},
year = {2024},
archivePrefix = {arXiv},
eprint = {2405.13518},
primaryClass = {cs.CV}
}@article{siddiqui2025persense++,
title={Towards PerSense++: Advancing Training-Free Personalized Instance Segmentation in Dense Images},
author={Siddiqui, Muhammad Ibraheem and Sheikh, Muhammad Umer and Abid, Hassan and Henry, Kevin and Khan, Muhammad Haris},
journal={arXiv preprint arXiv:2508.14660},
year={2025}
}