StableI2I: 이미지 간 전환 과정에서 발생하는 의도하지 않은 변화 탐지

초록

대부분의 실제 이미지 간 변환(I2I) 시나리오에서 기존 평가 방법은 주로 지시문 준수와 생성된 이미지의 지각적 품질 또는 심미성에 초점을 맞추고 있습니다. 그러나 이러한 평가는 출력 이미지가 입력 이미지의 의미적 일관성과 공간 구조를 보존하는지 여부를 제대로 평가하지 못하는 한계가 있습니다. 이러한 한계를 해결하기 위해 본 연구에서는 참조 이미지가 필요 없이도 이미지 편집 및 복원을 포함한 다양한 I2I 작업에서 콘텐츠 정확도와 변환 전후 일관성을 명시적으로 측정하는 통합적이고 동적인 평가 프레임워크인 StableI2I를 제안합니다. 또한 이러한 정확도와 일관성 평가 작업에서 MLLM의 성능을 체계적으로 평가하기 위해 설계된 벤치마크인 StableI2I-Bench를 구축하였습니다. 폭넓은 실험 결과를 통해 StableI2I가 콘텐츠 정확도와 일관성에 대해 정확하고 세분화된, 해석 가능한 평가를 제공하며 인간의 주관적 판단과 강한 상관관계를 가짐을 입증했습니다. 우리의 프레임워크는 실제 I2I 시스템에서 콘텐츠 일관성 진단 및 모델 성능 벤치마킹을 위한 실용적이고 신뢰할 수 있는 평가 도구로 활용될 수 있습니다.

English

In most real-world image-to-image (I2I) scenarios, existing evaluations primarily focus on instruction following and the perceptual quality or aesthetics of the generated images. However, they largely fail to assess whether the output image preserves the semantic correspondence and spatial structure of the input image. To address this limitation, we propose StableI2I, a unified and dynamic evaluation framework that explicitly measures content fidelity and pre--post consistency across a wide range of I2I tasks without requiring reference images, including image editing and image restoration. In addition, we construct StableI2I-Bench, a benchmark designed to systematically evaluate the accuracy of MLLMs on such fidelity and consistency assessment tasks. Extensive experimental results demonstrate that StableI2I provides accurate, fine-grained, and interpretable evaluations of content fidelity and consistency, with strong correlations to human subjective judgments. Our framework serves as a practical and reliable evaluation tool for diagnosing content consistency and benchmarking model performance in real-world I2I systems.

StableI2I: 이미지 간 전환 과정에서 발생하는 의도하지 않은 변화 탐지

StableI2I: Spotting Unintended Changes in Image-to-Image Transition

초록

Support