후기 상호작용 역학에 관한 연구 노트: 후기 상호작용 모델의 대상 행동 분석

초록

후기 상호작용 모델은 강력한 검색 성능을 보이지만, 그 내부 동역학 상당수는 아직 충분히 연구되지 않아 성능 병목 현상을 숨기고 있을 수 있습니다. 본 연구에서는 후기 상호작용 검색의 두 가지 주제, 즉 다중 벡터 점수 매기기를 사용할 때 발생하는 길이 편향과 MaxSim 연산자로 집계된 최고 점수 이상의 유사도 분포에 주목합니다. 우리는 NanoBEIR 벤치마크에서 최신 모델들의 이러한 행동을 분석합니다. 결과에 따르면 인과적 후기 상호작용 모델의 이론적 길이 편향이 실제로 나타나는 반면, 양방향 모델도 극단적인 경우 동일한 문제를 겪을 수 있음을 확인했습니다. 또한 상위 1위 문서 토큰을 넘어서는 유의미한 유사도 경향이 존재하지 않아, MaxSim 연산자가 토큰 수준 유사도 점수를 효율적으로 활용함을 검증했습니다.

English

While Late Interaction models exhibit strong retrieval performance, many of their underlying dynamics remain understudied, potentially hiding performance bottlenecks. In this work, we focus on two topics in Late Interaction retrieval: a length bias that arises when using multi-vector scoring, and the similarity distribution beyond the best scores pooled by the MaxSim operator. We analyze these behaviors for state-of-the-art models on the NanoBEIR benchmark. Results show that while the theoretical length bias of causal Late Interaction models holds in practice, bi-directional models can also suffer from it in extreme cases. We also note that no significant similarity trend lies beyond the top-1 document token, validating that the MaxSim operator efficiently exploits the token-level similarity scores.

후기 상호작용 역학에 관한 연구 노트: 후기 상호작용 모델의 대상 행동 분석

Working Notes on Late Interaction Dynamics: Analyzing Targeted Behaviors of Late Interaction Models

초록

Support