Tuesday Apr 30, 2024

arxiv preprint - Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation

In this episode, we discuss Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation by Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang. The abstract discusses the evaluation of text-to-image models, focusing on ensuring the accuracy between text prompts and generated images through a question generation and answering system. It introduces the Davidsonian Scene Graph (DSG), a strategy intended to improve question quality and answer consistency by creating a structured set of unique, semantic questions. Extensive testing and human assessments have shown DSG's effectiveness, and the release of DSG-1k provides a benchmark for wider usage and evaluation in the field.

Comment (0)

No comments yet. Be the first to say something!

Copyright 2023 All rights reserved.

Podcast Powered By Podbean

Version: 20241125