平均に従う：参照誘導フローマッチング

要旨

従来の制御可能生成手法は、通常、ファインチューニング、補助ネットワーク、またはテスト時探索に依存している。本稿では、フローマッチングが異なる制御インターフェース、すなわち例を通じた適応を可能にすることを示す。決定論的補間において、速度場は条件付き終点平均のみによって決定される。この平均をシフトすると、フロー自体が変化する。このことから、制御可能生成のための単純な原理が得られる。すなわち、事前学習モデルが追従する参照セットを変更することで、そのモデルを誘導する。本稿では、このアイデアを2つの形で具体化する。参照平均ガイダンスは学習不要である。参照バンクから閉形式の終点平均補正を計算し、凍結されたFLUX.2-klein（4B）モデルに適用することで、プロンプト、シード、重みを固定したまま、色、同一性、スタイル、構造の制御を可能にする。半パラメトリックガイダンスは、明示的な平均アンカーと学習された残差リファイナーを通じて同じアイデアを具現化し、AFHQv2において非条件DiT-B/4品質に匹敵しつつ、推論時に参照セットの交換を可能にする。これらの結果は、より広範な方向性を示している。すなわち、パラメータ更新ではなくデータを通じて適応する生成モデルである。

English

Existing approaches to controllable generation typically rely on fine-tuning, auxiliary networks, or test-time search. We show that flow matching admits a different control interface: adaptation through examples. For deterministic interpolants, the velocity field is solely governed by a conditional endpoint mean; shifting this mean shifts the flow itself. This yields a simple principle for controllable generation: steer a pretrained model by changing the reference set it follows. We instantiate this idea in two forms. Reference-Mean Guidance is training-free: it computes a closed-form endpoint-mean correction from a reference bank and applies it to a frozen FLUX.2-klein (4B) model, enabling control of color, identity, style, and structure while keeping the prompt, seed, and weights fixed. Semi-Parametric Guidance amortizes the same idea through an explicit mean anchor and learned residual refiner, matching unconditional DiT-B/4 quality on AFHQv2 while allowing the reference set to be swapped at inference time. These results point to a broader direction: generative models that adapt through data, not parameter updates.