The Segment Anything Model (SAM) is a powerful foundation model for image segmentation, showing robust zero-shot generalization through prompt engineering. However, relying on manual prompts is impractical for real-world applications, particularly in scenarios where rapid prompt provision and resource efficiency are crucial. In this paper, we propose the Automation of Prompts for SAM (AoP-SAM), a novel approach that learns to generate essential prompts in optimal locations automatically. AoP-SAM enhances SAM's efficiency and usability by eliminating manual input, making it better suited for real-world tasks. Our approach employs a lightweight yet efficient Prompt Predictor model that detects key entities across images and identifies the optimal regions for placing prompt candidates. This method leverages SAM's image embeddings, preserving its zero-shot generalization capabilities without requiring fine-tuning. Additionally, we introduce a test-time instance-level Adaptive Sampling and Filtering mechanism that generates prompts in a coarse-to-fine manner. This notably enhances both prompt and mask generation efficiency by reducing computational overhead and minimizing redundant mask refinements. Evaluations of three datasets demonstrate that AoP-SAM substantially improves both prompt generation efficiency and mask generation accuracy, making SAM more effective for automated segmentation tasks.
Our contributions are as follows:
Overview of the AoP-SAM pipeline.
Performance Highlights:
Parameter Analysis:
This website's template was designed based on the AI4CO Template. This work was supported by Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korea governments(MSIT)(No. 2022-0-01036, Development of Ultra-Performance PIM Processor Soc with PFLOPS-Performance and GByte-Memory & No.2022-0-01037, Development of High Performance Processing-In-Memory Technology based on DRAM). The authors sincerely thank Adiwena Putra for feedback on an earlier draft. AoP-SAM is built on the foundation of SAM; please adhere to the SAM license. We are thankful to all the authors for their outstanding contributions.
This paper is published at the AAAI Conference on Artificial Intelligence (AAAI). The copyright is held by the Association for the Advancement of Artificial Intelligence. © 2025 AAAI. All rights reserved. No datasets or supplementary materials are released with this paper. For implementation or collaboration inquiries, please contact the authors.
If you find our work helpful, please kindly cite us using the following reference:
@inproceedings{chen2024aopsam,
title={AoP-SAM: Automation of Prompts for Efficient Segmentation},
author={Chen, Yi and Son, Muyoung and Hua, Chuanbo and Kim, Joo-Young},
booktitle={AAAI Conference on Artificial Intelligence},
year={2024},
organization={AAAI}
}