jina-reranker-v3: Last but Not Late Interaction for Document Reranking

Abstract

jina-reranker-v3 is a 0.6B-parameter multilingual document reranker that introduces a novel "last but not late" interaction mechanism. Late interaction models such as ColBERT encode the query and each document separately and only then perform multi-vector matching. Our approach instead applies causal self-attention to the query and the documents within the same context window, which enables rich cross-document interactions before the contextual embedding of each document is extracted from its last token. This compact architecture achieves state-of-the-art BEIR performance with 61.94 nDCG@10 while being ten times smaller than generative listwise rerankers.
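The extraction step described above can be illustrated with a minimal sketch. It assumes the model has already run one forward pass over a packed sequence containing the query and all candidate documents, and that the token span of each segment is known. The function names, the toy hidden states, the packing order, and the use of cosine similarity against a query embedding are illustrative assumptions; the abstract does not specify how scores are computed.

```python
import math

def last_token_embeddings(hidden_states, spans):
    """Return the hidden state at the final token of each span.

    hidden_states: per-token vectors from one forward pass over the packed
                   sequence (hypothetical model output).
    spans: (start, end) token index pairs, end exclusive.
    """
    return [hidden_states[end - 1] for _, end in spans]

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def rerank(hidden_states, query_span, doc_spans):
    """Score each document by similarity between its last-token embedding
    and the query's last-token embedding (scoring rule is an assumption)."""
    query_emb = last_token_embeddings(hidden_states, [query_span])[0]
    doc_embs = last_token_embeddings(hidden_states, doc_spans)
    scores = [cosine(query_emb, d) for d in doc_embs]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order, scores

# Toy stand-in for the hidden states of a 6-token packed sequence.
hidden = [
    [0.2, 0.2], [1.0, 0.0],   # query tokens, positions 0-1
    [1.0, 0.0], [0.0, 1.0],   # document 0 tokens, positions 2-3
    [0.5, 0.5], [1.0, 0.1],   # document 1 tokens, positions 4-5
]
order, scores = rerank(hidden, (0, 2), [(2, 4), (4, 6)])
print(order)  # document indices, best match first
```

Because every document is read out of the same forward pass, each last-token embedding can already reflect the query and any documents that precede it in the window. That is the difference from late interaction, where each document is encoded in isolation.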
