Thursday Oct 26, 2023
arXiv Preprint - An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning
In this episode, we discuss "An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning" by Chen Jin, Ryutaro Tanno, Amrutha Saseendran, Tom Diethe, and Philip Teare. The paper proposes a framework called Multi-Concept Prompt Learning (MCPL) to address the challenge of integrating multiple object-level concepts within one scene using prompt learning. The authors introduce three regularization techniques to enhance word-concept correlation. MCPL is evaluated through image generation, editing, and attention visualization, and is compared to a previous approach that can only learn a single concept from each image.