ChatPaper.aiChatPaper

全景視野:具身人工智慧時代中全方位視覺的崛起

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

September 16, 2025
作者: Xu Zheng, Chenfei Liao, Ziqiao Weng, Kaiyu Lei, Zihao Dongfang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Lu Qi, Li Chen, Danda Pani Paudel, Kailun Yang, Linfeng Zhang, Luc Van Gool, Xuming Hu
cs.AI

摘要

全方位視覺,即利用360度視角來理解環境,在機器人、工業檢測及環境監測等領域變得日益重要。與傳統的針孔視覺相比,全方位視覺提供了全面的環境感知,顯著提升了場景感知的完整性和決策的可靠性。然而,該領域的基礎研究長期以來落後於傳統針孔視覺。本次演講將探討體現AI時代的一個新興趨勢:在日益增長的工業需求和學術興趣推動下,全方位視覺的快速發展。我們將重點介紹近期在全方位生成、全方位感知、全方位理解及相關數據集方面取得的突破。基於學術界和工業界的見解,我們提出了一個在體現AI時代的理想全景系統架構——PANORAMA,它由四個關鍵子系統組成。此外,我們還針對全景視覺與體現AI交叉領域的新興趨勢及跨社群影響,以及未來的發展路線圖和開放性挑戰,提供了深入的見解。本綜述整合了最新的技術進展,並為在體現AI時代構建強大、通用的全方位AI系統的未來研究,勾勒了挑戰與機遇。
English
Omnidirectional vision, using 360-degree vision to understand the environment, has become increasingly critical across domains like robotics, industrial inspection, and environmental monitoring. Compared to traditional pinhole vision, omnidirectional vision provides holistic environmental awareness, significantly enhancing the completeness of scene perception and the reliability of decision-making. However, foundational research in this area has historically lagged behind traditional pinhole vision. This talk presents an emerging trend in the embodied AI era: the rapid development of omnidirectional vision, driven by growing industrial demand and academic interest. We highlight recent breakthroughs in omnidirectional generation, omnidirectional perception, omnidirectional understanding, and related datasets. Drawing on insights from both academia and industry, we propose an ideal panoramic system architecture in the embodied AI era, PANORAMA, which consists of four key subsystems. Moreover, we offer in-depth opinions related to emerging trends and cross-community impacts at the intersection of panoramic vision and embodied AI, along with the future roadmap and open challenges. This overview synthesizes state-of-the-art advancements and outlines challenges and opportunities for future research in building robust, general-purpose omnidirectional AI systems in the embodied AI era.
PDF201September 18, 2025