ChatPaper.aiChatPaper

SoundCam:一個用於利用房間聲學找尋人類的數據集

SoundCam: A Dataset for Finding Humans Using Room Acoustics

November 6, 2023
作者: Mason Wang, Samuel Clarke, Jui-Hsien Wang, Ruohan Gao, Jiajun Wu
cs.AI

摘要

一個房間的聲學特性是房間的幾何形狀、房間內的物體以及它們的具體位置的結果。一個房間的聲學特性可以通過源位置和聆聽者位置之間的脈衝響應(RIR),或者從房間中存在的自然信號的錄音中粗略推斷。房間中物體的位置變化可以影響房間的聲學特性,如RIR所描述的那樣。現有的RIR數據集要麼沒有系統地變化環境中物體的位置,要麼只包含模擬的RIR。我們提出了SoundCam,這是迄今為止公開發布的最大的野外房間獨特RIR數據集。它包括5,000個10通道的真實世界房間脈衝響應測量和3個不同房間中音樂的2,000個10通道錄音,包括一個受控的聲學實驗室、一個野外客廳和一個會議室,每個房間中都有不同位置的人類。我們展示這些測量可以用於有趣的任務,例如檢測和識別人類,以及跟踪他們的位置。
English
A room's acoustic properties are a product of the room's geometry, the objects within the room, and their specific positions. A room's acoustic properties can be characterized by its impulse response (RIR) between a source and listener location, or roughly inferred from recordings of natural signals present in the room. Variations in the positions of objects in a room can effect measurable changes in the room's acoustic properties, as characterized by the RIR. Existing datasets of RIRs either do not systematically vary positions of objects in an environment, or they consist of only simulated RIRs. We present SoundCam, the largest dataset of unique RIRs from in-the-wild rooms publicly released to date. It includes 5,000 10-channel real-world measurements of room impulse responses and 2,000 10-channel recordings of music in three different rooms, including a controlled acoustic lab, an in-the-wild living room, and a conference room, with different humans in positions throughout each room. We show that these measurements can be used for interesting tasks, such as detecting and identifying humans, and tracking their positions.
PDF140December 15, 2024