Spatial Room Impulse Response Processing for Virtual Acoustics
Doctoral thesis, 2026
The thesis first introduces a general microphone array signal model that separates room- and array-dependent contributions using spherical or circular harmonic representations. Building on this model, a blind SRIR estimation framework is proposed that reformulates blind multichannel system identification as an informed problem through the estimation of a pseudo-reference signal. Motivated by practical AR systems that often rely on wearable devices such as head-mounted displays or smartglasses, the thesis then specifically considers microphone arrays in motion.
The second part of the thesis focuses on the binaural rendering of estimated SRIRs for headphone reproduction. An array-aware end-to-end magnitude least-squares renderer is proposed to mitigate spatio-spectral coloration caused by limited spatial sampling and regularization. As an alternative to direct rendering, the thesis investigates the separation of direct sound and early reflections from an SRIR, a common processing step in parametric SRIR-based rendering that can facilitate virtual acoustic reproduction with increased directional sharpness. Two approaches are compared: one based on a physical array signal model and another based on subspace decomposition.
Together, these contributions advance practical SRIR estimation and rendering for virtual acoustics and provide foundations for robust, wearable, and perceptually convincing augmented and virtual reality audio systems.
Spatial Room Impulse Response
Room Acoustics
Room Impulse Response Estimation
Virtual Acoustics
Microphone Array
Binaural Rendering
Author
Thomas Deppisch
Chalmers, Architecture and Civil Engineering, Applied Acoustics
Blind Estimation of Spatial Room Impulse Responses Using a Pseudo Reference Signal
2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings,;(2024)p. 470-474
Paper in proceeding
Blind Identification of Binaural Room Impulse Responses From Smart Glasses
IEEE/ACM Transactions on Audio Speech and Language Processing,;Vol. 32(2024)p. 4052-4065
Journal article
Spatial Room Impulse Response Identification from Rotating Equatorial Microphone Arrays
European Signal Processing Conference,;(2024)p. 116-120
Paper in proceeding
Spatial Room Impulse Response Estimation from a Moving Microphone Array
European Signal Processing Conference,;(2025)p. 91-95
Paper in proceeding
T. Deppisch, S. V. Amengual Garí, P. Calamia and J. Ahrens, ”Identification and Matching of Room Acoustics With Moving Head-Worn Microphone Arrays,” 2026.
End-to-End Magnitude Least Squares Binaural Rendering of Spherical Microphone Array Signals
2021 Immersive and 3D Audio: From Architecture to Automotive, I3DA 2021,;(2021)
Paper in proceeding
Spatial Subtraction of Reflections from Room Impulse Responses Measured With a Spherical Microphone Array
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,;Vol. 2021-October(2021)p. 346-350
Paper in proceeding
Direct and Residual Subspace Decomposition of Spatial Room Impulse Responses
IEEE/ACM Transactions on Audio Speech and Language Processing,;Vol. 31(2023)p. 927-942
Journal article
Recreating a more natural experience requires capturing and reproducing how sound behaves in a room. This can be done using a spatial room impulse response, which can be understood as an acoustic fingerprint of a space. It describes how sound travels from a source to a listener in a specific environment, including how it reflects, reverberates, and from which directions it arrives. In other words, it captures both the temporal and spatial acoustic characteristics of a room.
When combined with models of how humans hear with two ears and played back over headphones, this makes it possible to add virtual sounds to a real environment in a convincing way. This is the basis of virtual acoustics and is important for applications such as augmented reality and telepresence.
In practical applications, it is typically not possible to perform dedicated measurements, and spatial room impulse responses must be estimated from naturally occurring sounds using small, wearable microphone arrays that may be moving. This thesis focuses on estimating spatial room impulse responses from such recordings and rendering them for headphone playback while preserving the cues needed to perceive sound direction and space in a natural way. By improving both how spatial room impulse responses are captured and reproduced, this work enables remote communication that feels more like being in the same room.
Areas of Advance
Information and Communication Technology
Driving Forces
Innovation and entrepreneurship
Subject Categories (SSIF 2025)
Signal Processing
Control Engineering
DOI
10.63959/chalmers.dt/5863
ISBN
978-91-8103-406-6
Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5863
Publisher
Chalmers
SB-H4
Opponent: Prof. Dr.-Ing. Sebastian Schlecht, Chair of Multimedia Communications and Signal Processing, Friedrich-Alexander University Erlangen-Nürnberg, Germany