GENERATIVE PROMPT MODEL FOR WEAKLY SUPERVISED OBJECT LOCALIZATION
DURING TRAINING, GENPROMP CONVERTS IMAGE CATEGORY LABELS TO LEARNABLE PROMPT EMBEDDINGS WHICH ARE FED TO A GENERATIVE MODEL TO CONDITIONALLY RECOVER THE INPUT IMAGE WITH NOISE AND LEARN REPRESENTATIVE EMBEDDINGS.