VecFusion: 확산 모델을 활용한 벡터 폰트 생성

초록

본 논문에서는 다양한 위상 구조와 정밀한 제어점 위치를 가진 벡터 폰트를 생성할 수 있는 새로운 신경망 구조인 VecFusion을 소개한다. 우리의 접근 방식은 래스터 확산 모델과 벡터 확산 모델로 구성된 캐스케이드 확산 모델이다. 래스터 모델은 폰트의 전반적인 스타일과 형태를 포착하며, 보조 제어점 정보를 포함한 저해상도의 래스터화된 폰트를 생성한다. 반면, 벡터 모델은 첫 단계에서 생성된 저해상도 래스터 폰트를 조건으로 하여 벡터 폰트를 합성한다. 길고 복잡한 곡선을 합성하기 위해, 우리의 벡터 확산 모델은 트랜스포머 아키텍처와 다양한 벡터 기하학을 모델링하고 제어점을 정확하게 예측할 수 있는 새로운 벡터 표현 방식을 사용한다. 실험 결과, 기존의 벡터 그래픽 생성 모델과 비교하여 우리의 새로운 캐스케이드 벡터 확산 모델이 복잡한 구조와 다양한 스타일을 가진 더 높은 품질의 벡터 폰트를 생성함을 보여준다.

English

We present VecFusion, a new neural architecture that can generate vector fonts with varying topological structures and precise control point positions. Our approach is a cascaded diffusion model which consists of a raster diffusion model followed by a vector diffusion model. The raster model generates low-resolution, rasterized fonts with auxiliary control point information, capturing the global style and shape of the font, while the vector model synthesizes vector fonts conditioned on the low-resolution raster fonts from the first stage. To synthesize long and complex curves, our vector diffusion model uses a transformer architecture and a novel vector representation that enables the modeling of diverse vector geometry and the precise prediction of control points. Our experiments show that, in contrast to previous generative models for vector graphics, our new cascaded vector diffusion model generates higher quality vector fonts, with complex structures and diverse styles.

VecFusion: 확산 모델을 활용한 벡터 폰트 생성

VecFusion: Vector Font Generation with Diffusion

초록

Support