Recovered Audio Results (Figure 5)

Click on a result to hear it. These are the raw, unfiltered signals.

Each row of the figure shows a photo of the setup, followed by a spectrogram and audio player for each of five columns: Single point, Average, Delay & Sum, Ours, and Input.

Recording            Object
drum6_golden         drum (Fig. 4)
frame10_stairway     picture frame
laptop12_never       laptop
trash15_hotel        trash can
guitar11_polyphia    guitar
binder7_never        wooden binder
plate9_stariway      plastic plate
drum14_beatles       drum (stereo)
yoga_hotel           yoga foam
physio_-27dBgun      physio ball
balloon_-27dBclap    balloon

Single point

This is a baseline method in which the audio is recovered from the vibrations of a single point on the object's surface. The result is often noisy and distorted, because a single point cannot capture the full spatial dynamics of the vibrations.

Average

This method takes a simple average of the vibration signals from all measurement points. Although averaging can reduce some uncorrelated noise, it does not account for phase differences or the modal behavior of the object, so signals can cancel and the recovery is suboptimal.
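
The two effects described above can be illustrated with a small synthetic sketch. It is not the code used for the figure: the sample rate, the 440 Hz test tone, the noise level, and the number of points are all assumptions chosen for illustration. The first case averages points that vibrate in phase with independent noise; the second averages two points that vibrate with opposite phase, as can happen on either side of a nodal line.

```python
import numpy as np

rng = np.random.default_rng(0)
fs = 8000                                # sample rate in Hz (assumed)
t = np.arange(fs) / fs                   # one second of samples
source = np.sin(2 * np.pi * 440 * t)     # hypothetical 440 Hz test tone


def rms(x):
    """Root-mean-square amplitude of a signal."""
    return float(np.sqrt(np.mean(np.square(x))))


def average_recovery(signals):
    """Plain average over measurement points (one point per row)."""
    return np.mean(signals, axis=0)


# Case 1: 16 hypothetical points, all in phase, each with independent noise.
n_points = 16
noise = rng.normal(0.0, 0.5, size=(n_points, t.size))
in_phase = source + noise
single_error = rms(in_phase[0] - source)                  # one point only
average_error = rms(average_recovery(in_phase) - source)  # all points

# Case 2: two hypothetical points that respond with opposite phase.
opposite = np.stack([source, -source])
cancelled_rms = rms(average_recovery(opposite))

print(f"single-point error RMS: {single_error:.3f}")
print(f"averaged error RMS:     {average_error:.3f}")
print(f"opposite-phase average: {cancelled_rms:.3f}")
```

In the first case the averaged error is well below the single-point error, since the noise is uncorrelated across points. In the second case the average is identically zero: the signal itself has been cancelled, which is the failure mode a plain average cannot avoid.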

Delay & Sum

Delay-and-sum beamforming shifts each signal by a single propagation delay so that all signals align at a reference point before summation. This works when the difference between signals is well modeled by a uniform time shift. Applied to our speckle measurements, this procedure yields low-pass-filtered audio, since the global shifts "lock" onto the dominant lower frequencies. The high frequencies, in contrast, are left misaligned and may cancel.
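
A minimal sketch of the generic delay-and-sum procedure is given below, for the favorable case in which each channel really is a uniformly shifted copy of the source. It is not the implementation behind the figure. The function names, the integer-sample delays, the cross-correlation delay estimator, and the use of circular shifts (`np.roll`) are all simplifying assumptions.

```python
import numpy as np


def estimate_delay(reference, signal, max_lag):
    """Integer lag (in samples) by which `signal` trails `reference`.

    Picks the lag that maximizes the circular cross-correlation.
    """
    lags = range(-max_lag, max_lag + 1)
    scores = [np.dot(reference, np.roll(signal, -lag)) for lag in lags]
    return lags[int(np.argmax(scores))]


def delay_and_sum(signals, max_lag=50):
    """Align every channel to the first one with a single shift, then average."""
    reference = signals[0]
    delays = [estimate_delay(reference, s, max_lag) for s in signals]
    aligned = [np.roll(s, -d) for s, d in zip(signals, delays)]
    return np.mean(aligned, axis=0), delays


# Synthetic test: a broadband source observed with three hypothetical delays.
rng = np.random.default_rng(0)
source = rng.normal(size=2000)
true_delays = [0, 7, 15]
signals = np.stack([
    np.roll(source, d) + 0.05 * rng.normal(size=source.size)
    for d in true_delays
])

recovered, estimated_delays = delay_and_sum(signals)
naive = np.mean(signals, axis=0)  # plain average without alignment

corr_das = float(np.corrcoef(recovered, source)[0, 1])
corr_naive = float(np.corrcoef(naive, source)[0, 1])
print("estimated delays:", estimated_delays)
print(f"correlation with source, delay & sum:   {corr_das:.3f}")
print(f"correlation with source, plain average: {corr_naive:.3f}")
```

Because the sketch applies one shift per channel, it can only succeed when a single time shift explains the difference between channels, as in this synthetic example. When the relation between channels varies with frequency, the estimated shift follows the strongest components and the remaining ones stay misaligned.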

Ours

Our proposed modal-guided method. We first estimate the object's modal basis from the multi-point vibration data. Then, we use this physical prior to guide the extraction of the sound source from the structural vibrations, resulting in a significantly clearer and more accurate audio recovery.

Input

This is the ground truth audio signal that was played in the room, causing the object to vibrate. It serves as the reference for evaluating the quality of the recovered audio from the different methods.