PerSense: an end-to-end, training-free and model-agnostic one-shot framework for personalized instance segmentation in dense images.
Given a query image and support set, CLE leverages the masked support image to identify the object category which serves as a prompt to Grounding Detector.
A step-by-step overview of PerSense components
Incorporates feature and scale diversity to compute a weighted score for selecting representative exemplars, enhancing density map generation
Qualitative comparison with SOTA
If you use PerSense or PerSense++ in your research, please cite our papers
@article{siddiqui2024persense, title = {PerSense: Personalized Instance Segmentation in Dense Images}, author = {Siddiqui, Muhammad Ibraheem and Sheikh, Muhammad Umer and Abid, Hassan and Khan, Muhammad Haris}, journal = {arXiv preprint arXiv:2405.13518}, year = {2024}, archivePrefix = {arXiv}, eprint = {2405.13518}, primaryClass = {cs.CV} }
@article{siddiqui2025persense++, title={Towards PerSense++: Advancing Training-Free Personalized Instance Segmentation in Dense Images}, author={Siddiqui, Muhammad Ibraheem and Sheikh, Muhammad Umer and Abid, Hassan and Henry, Kevin and Khan, Muhammad Haris}, journal={arXiv preprint arXiv:2508.14660}, year={2025} }