Sound image externalization for headphone based real-time 3D audio
Yougen YUAN1, Lei XIE1, Zhong-Hua FU1, Ming XU2, Qi CONG1
1. School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
2. Chinese Aeronautical Radio Electronics Research Institute, Shanghai 200233, China
3D audio effects can provide an immersive auditory experience, but headphone sound reproduction often suffers from the so-called in-head localization (IHL) problem, in which the sound image is perceived inside the head. To address this problem, we propose an effective sound image externalization approach. Specifically, we consider several important factors related to sound propagation: early reflections based on the image-source model, with distance decay, wall absorption and air absorption; late reverberation; and dynamic factors such as head movement. We apply the approach to a headphone-based real-time 3D audio system. Subjective listening tests show that sound image externalization is significantly improved while the sound source direction is preserved. An A/B preference test further shows that, compared with a recent popular approach, the proposed approach is generally preferred by the listeners.
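To illustrate the early-reflection component named above, the following is a minimal sketch of first-order image-source reflections in a rectangular room. It is not the authors' implementation. The function names, the room geometry, the single frequency-independent wall reflection coefficient, the speed of sound and the 1/r distance decay are all assumptions made for illustration. Air absorption, HRTF filtering, late reverberation and head tracking are omitted, and the source and listener are assumed to be at different positions.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def first_order_images(source, room):
    """Mirror the source across each of the six walls of a shoebox room.

    `room` holds the (x, y, z) dimensions in metres; walls are assumed to lie
    at coordinate 0 and at room[axis] along every axis.
    """
    images = []
    for axis in range(3):
        low = list(source)
        low[axis] = -source[axis]                      # wall at coordinate 0
        high = list(source)
        high[axis] = 2.0 * room[axis] - source[axis]   # wall at room[axis]
        images.append(tuple(low))
        images.append(tuple(high))
    return images


def early_reflections(source, listener, room, wall_reflection=0.8):
    """Return (delay_seconds, gain) taps: direct path first, then six reflections.

    Each gain combines 1/r distance decay with one hypothetical,
    frequency-independent wall reflection coefficient.
    """
    paths = [(tuple(source), 1.0)]
    paths += [(image, wall_reflection) for image in first_order_images(source, room)]
    taps = []
    for position, reflection_gain in paths:
        distance = math.dist(position, listener)
        taps.append((distance / SPEED_OF_SOUND, reflection_gain / distance))
    return taps


if __name__ == "__main__":
    for delay, gain in early_reflections((1.0, 2.0, 1.5), (4.0, 2.0, 1.5), (5.0, 4.0, 3.0)):
        print(f"delay = {delay * 1000:.2f} ms, gain = {gain:.3f}")
```

Each tap could then be rendered as a delayed, scaled copy of the source signal, filtered with the HRTF for the direction of the corresponding image source. Higher-order reflections follow by mirroring the image sources again.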