Thursday Oct 26, 2023

arxiv Preprint - An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning

In this episode we discuss An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning by Chen Jin, Ryutaro Tanno, Amrutha Saseendran, Tom Diethe, Philip Teare. The paper proposes a framework called Multi-Concept Prompt Learning (MCPL) to address the challenge of integrating multiple object-level concepts within one scene using prompt learning. The authors introduce three regularization techniques to enhance word-concept correlation. The MCPL framework is evaluated through image generation, editing, and attention visualization, and is compared to a previous approach that can only learn a single concept from each image.

Comments (0)

To leave or reply to comments, please download free Podbean or

No Comments

Copyright 2023 All rights reserved.

Podcast Powered By Podbean

Version: 20241125