My issue with use of the term 3D with reference to binaural voice conferencing service stems from the fact that I’ve been having some fun with real 3D audio over the past couple of months. This has come about since Hector Centeno released AmbiExplorer for Android, an application that lets us decode Ambisonic recordings for binaural playback in headphones.
AmbiExplorer plays back first-order B format files as well as UHJ encoded stereo files. As opposed to simply left, center, right, etc. a B format file is actually comprised of four channels known as W, X, Y & Z. These correspond to three directional signals and an omnidirectional reference signal.
By performing some matrix math on these signal in a coordinated fashion the entirety of the soundstage can be transformed in 3D space. AmbiExplorer decodes the four signals into a binaural feed, allowing the user to rotate their listening perspective in real-time. You can think of it as a 3-axis “balance” control.
Ok, this is me diving into the deep-end of something that very possibly literally no-one in the world cares about. That’s just so typical of me. Actually, I know that a few people are starting to clue in about this because I’ve heard it come up here and there in conversation, most recently at the Sept 15th HDComms event in NYC.
This post is actually the third in a series. In the first (Pink Floyd: The Making Of Money & Directionality) I took a quick look at pop music recording practices and specifically the practice of recording things “close-mic’d” the adding ambience through synthetic means. In the second in the series (Codecs, Wideband & Stereo: A Conversation At Amoocon) I followed a conversation in the hallway at AMOOCON 2009, noting aspects of the discussion pertaining to “stereo” or the conveyance of directionality.
Once we get beyond PSTN audio quality, when wideband is accepted as normal, then “dimensional” or “immersive” audio becomes a new frontier for exploration in telephony. In fact, in some limited ways we’re already doing this in larger video conference room & telepresence suites.
(this started as a quick comment on my Facebook page, but I’m moving it here so that people outside of FaceBook can join in)
With apologies to the McKenzie brothers. There appears to be an odd cross between two of my passions in the works. As I get more into the daily use of wideband telephony I wonder if there’s a potential to leverage some surround sound techniques to take conferencing to a new level?
It couldn’t be the puritanical kind of approach used in music recording. It would be more a matter of using surround panning to position participants in an synthetic soundfield. I wonder if this has been done to any degree elsewhere?