ChatPaper.aiChatPaper

SoundCam:一个用于利用房间声学找到人类的数据集

SoundCam: A Dataset for Finding Humans Using Room Acoustics

November 6, 2023
作者: Mason Wang, Samuel Clarke, Jui-Hsien Wang, Ruohan Gao, Jiajun Wu
cs.AI

摘要

一个房间的声学特性是房间的几何形状、房间内的物体以及它们的具体位置的产物。一个房间的声学特性可以通过源位置和听者位置之间的脉冲响应(RIR)来表征,或者可以从房间中存在的自然信号的录音中粗略推断。房间中物体的位置变化可以影响房间的声学特性,如RIR所描述的那样。现有的RIR数据集要么没有系统地改变环境中物体的位置,要么仅包含模拟的RIR。我们介绍了SoundCam,这是迄今为止公开发布的最大的野外房间独特RIR数据集。它包括5,000个10通道的真实世界房间脉冲响应测量和2,000个10通道的音乐录音,涵盖三个不同房间,包括一个受控的声学实验室、一个野外客厅和一个会议室,每个房间中有不同位置的人类。我们展示了这些测量可以用于一些有趣的任务,比如检测和识别人类,并跟踪他们的位置。
English
A room's acoustic properties are a product of the room's geometry, the objects within the room, and their specific positions. A room's acoustic properties can be characterized by its impulse response (RIR) between a source and listener location, or roughly inferred from recordings of natural signals present in the room. Variations in the positions of objects in a room can effect measurable changes in the room's acoustic properties, as characterized by the RIR. Existing datasets of RIRs either do not systematically vary positions of objects in an environment, or they consist of only simulated RIRs. We present SoundCam, the largest dataset of unique RIRs from in-the-wild rooms publicly released to date. It includes 5,000 10-channel real-world measurements of room impulse responses and 2,000 10-channel recordings of music in three different rooms, including a controlled acoustic lab, an in-the-wild living room, and a conference room, with different humans in positions throughout each room. We show that these measurements can be used for interesting tasks, such as detecting and identifying humans, and tracking their positions.
PDF140December 15, 2024