LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

October 13, 2024
Authors: Junyan Ye, Baichuan Zhou, Zilong Huang, Junan Zhang, Tianyi Bai, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li
cs.AI

Abstract

With the rapid development of AI-generated content, the future internet may be inundated with synthetic data, making the discrimination of authentic and credible multimodal data increasingly challenging. Synthetic data detection has thus garnered widespread attention, and the performance of large multimodal models (LMMs) in this task has attracted significant interest. LMMs can provide natural language explanations for their authenticity judgments, enhancing the explainability of synthetic content detection. Simultaneously, the task of distinguishing between real and synthetic data effectively tests the perception, knowledge, and reasoning capabilities of LMMs. In response, we introduce LOKI, a novel benchmark designed to evaluate the ability of LMMs to detect synthetic data across multiple modalities. LOKI encompasses video, image, 3D, text, and audio modalities, comprising 18K carefully curated questions across 26 subcategories with clear difficulty levels. The benchmark includes coarse-grained judgment and multiple-choice questions, as well as fine-grained anomaly selection and explanation tasks, allowing for a comprehensive analysis of LMMs. We evaluated 22 open-source LMMs and 6 closed-source models on LOKI, highlighting their potential as synthetic data detectors and also revealing some limitations in the development of LMM capabilities. More information about LOKI can be found at https://opendatalab.github.io/LOKI/
