적응형 텍스트-이미지 생성을 위한 프롬프트 확장

초록

텍스트-이미지 생성 모델은 강력하지만 사용하기 어려운 편입니다. 사용자들은 더 나은 이미지를 얻기 위해 구체적인 프롬프트를 작성하지만, 생성된 이미지가 반복적일 수 있습니다. 본 논문은 사용자가 더 적은 노력으로 고품질의 다양한 이미지를 생성할 수 있도록 돕는 Prompt Expansion 프레임워크를 제안합니다. Prompt Expansion 모델은 텍스트 쿼리를 입력으로 받아, 텍스트-이미지 모델에 전달될 때 더 다양한 매력적인 이미지를 생성하도록 최적화된 확장된 텍스트 프롬프트 세트를 출력합니다. 우리는 인간 평가 연구를 통해 Prompt Expansion을 통해 생성된 이미지가 기존 방법들보다 더 미적으로 만족스럽고 다양하다는 것을 보여줍니다. 전반적으로, 본 논문은 텍스트-이미지 생성 경험을 개선하기 위한 새롭고 효과적인 접근 방식을 제시합니다.

English

Text-to-image generation models are powerful but difficult to use. Users craft specific prompts to get better images, though the images can be repetitive. This paper proposes a Prompt Expansion framework that helps users generate high-quality, diverse images with less effort. The Prompt Expansion model takes a text query as input and outputs a set of expanded text prompts that are optimized such that when passed to a text-to-image model, generates a wider variety of appealing images. We conduct a human evaluation study that shows that images generated through Prompt Expansion are more aesthetically pleasing and diverse than those generated by baseline methods. Overall, this paper presents a novel and effective approach to improving the text-to-image generation experience.

적응형 텍스트-이미지 생성을 위한 프롬프트 확장

Prompt Expansion for Adaptive Text-to-Image Generation

초록

Support