MPEG-H 3D Audio, specified as ISO/IEC 23008-3 (MPEG-H Part 3), is an audio coding standard developed by the ISO/IEC Moving Picture Experts Group (MPEG) to support coding audio as audio channels, audio objects, or higher order ambisonics (HOA). MPEG-H 3D Audio can support up to 64 loudspeaker channels and 128 codec core channels.

Objects may be used alone or in combination with channels or HOA components. The use of audio objects allows for interactivity or personalization of a program by adjusting the gain or position of the objects during rendering in the MPEG-H decoder. Audio is encoded using an improved modified discrete cosine transform (MDCT) algorithm.[1]

Channels, objects, and HOA components may be used to transmit immersive sound as well as mono, stereo, or surround sound. The MPEG-H 3D Audio decoder renders the bitstream to a number of standard speaker configurations as well as to misplaced speakers. Binaural rendering of sound for headphone listening is also supported.

These are the ISO standards relating to MPEG-H 3D Audio:

ISO/IEC 23008-3:2022 - Part 3: 3D audio

ISO/IEC 23008-6:2021/Amd 1:2024 - Part 6: 3D audio reference software

ISO/IEC 23008-9:2023 - Part 9: 3D Audio conformance testing

History

edit

In January 2013, the requirements were released for MPEG-H 3D Audio which was for an increase in the immersion of audio and to allow for a greater number of loudspeakers for audio localization.[2] The allowed audio types would be audio channels, audio objects, and HOA.[2]

On September 10, 2014, Fraunhofer IIS demonstrated a real time MPEG-H 3D audio encoder.[3]

In February 2015, MPEG announced that MPEG-H 3D Audio would be published as an International Standard.[4]

On March 10, 2015, the Advanced Television Systems Committee announced that MPEG-H 3D Audio was one of the three standards proposed for the audio system of ATSC 3.0.[5]

On April 10, 2015, Fraunhofer, Technicolor, and Qualcomm demonstrated a live broadcast signal chain consisting of all the elements needed to implement MPEG-H based audio in broadcast television. The demonstration featured a simulated remote truck at a sports event, a network control center, a local affiliate station, and a consumer living room. The audio was produced and encoded through an MPEG-H audio monitoring and authoring unit, mpeg-h real-time broadcast encoders, and real-time professional and consumer MPEG-H decoders. The audio was decoded in the consumer living room on a Technicolor set-top box.[6][7]

In April 2015, the Advanced Television Systems Committee announced that systems from Dolby Laboratories and the MPEG-H Audio Alliance (Fraunhofer, Technicolor, and Qualcomm) would be tested in the coming months for use as the audio layer for the ATSC 3.0 signal.[8]

In August 2015, the Advanced Television Systems Committee announced that systems from Dolby Laboratories and the MPEG-H Audio Alliance were demonstrated to the ATSC showing how they would work in both professional broadcast facilities and consumer home environments.[9][10]

On April 18, 2016, South Korean broadcast equipment manufacturers Kai Media and DS Broadcast announced the availability of MPEG-H 3D Audio in their latest 4K broadcast encoders.[11]

On May 2, 2016, the Advanced Television Systems Committee has elevated the A/342 audio standard for ATSC 3.0 to the status of a Candidate Standard. The MPEG-H Audio Alliance TV audio system and Dolby AC-4 are part of the A/342 standard.[12]

On June 24, 2016, the South Korean standardization organization "Telecommunications Technology Association" TTA published the standard for "Transmission and Reception of Terrestrial UHD TV Broadcasting Service" for the South Korean terrestrial UHD TV broadcasting service to be launched in February 2017. The TTA standard is based on ATSC 3.0 and specifies MPEG-H 3D Audio as the sole audio codec for the 4K TV system.[13][14][15]

On January 3, 2017, Fraunhofer IIS announced a trademark program to identify interoperable products that include MPEG-H.[16]

On January 8, 2019, Sony announced an immersive music service "360 Reality Audio" that uses MPEG-H.[17][18][19]

Profiles

edit

The Main profile of MPEG-H 3D Audio has five levels.[20]

Levels for the Main profile of MPEG-H 3D Audio[20]
Level Maximum number of
core channels
Maximum number of
loudspeaker channels
1 8 8
2 16 16
3 32 24
4 64 24
5 128 64

MPEG announced the availability of the MPEG-H 3D Audio Amendment 3 for late 2016. This amendment defines the Low Complexity Profile which includes technology that increases coding efficiency and also adds features designed for use in the broadcast industry.[21]

See also

edit

References

edit
  1. ^ Bleidt, R. L.; Sen, D.; Niedermeier, A.; Czelhan, B.; Füg, S.; et al. (2017). "Development of the MPEG-H TV Audio System for ATSC 3.0" (PDF). IEEE Transactions on Broadcasting. 63 (1): 202–236. doi:10.1109/TBC.2017.2661258. S2CID 30821673.
  2. ^ a b "Call for Proposals on 3D Audio". MPEG. Retrieved 2015-03-14.
  3. ^ "Fraunhofer IIS Demonstrates Real-Time MPEG-H Audio Encoder System for Broadcast Applications at IBC". Business Wire. 2014-09-10. Retrieved 2015-03-15.
  4. ^ "MPEG-H 3D Audio progresses to International Standard". MPEG. Retrieved 2015-03-14.
  5. ^ "Advanced Television Systems Committee Begins Review of ATSC 3.0 Audio System Proposals". Advanced Television Systems Committee. 2015-03-10. Archived from the original on 2015-03-13. Retrieved 2015-03-14.
  6. ^ "Fraunhofer IIS, Qualcomm and Technicolor to Demonstrate the World's First Live Broadcast of MPEG-H Interactive and Immersive TV Audio". Business Wire. Retrieved 9 October 2015.
  7. ^ MPEG-H Audio Brings New Features to TV and Streaming Sound, Electronic Design, July 10, 2015
  8. ^ "Evaluation of Proposed ATSC 3.0 Audio Systems Begins". Advanced Television Systems Committee. April 2015. Retrieved 9 October 2015.
  9. ^ "Listen Up! Atlanta Hears ATSC 3.0 Audio As Proponents Demonstrate Advantages". Advanced Television Systems Committee. 3 August 2015. Retrieved 9 October 2015.
  10. ^ "Demonstrations Show Off Potential of ATSC 3.0 Audio Standard". Sports Video Group. Retrieved 9 October 2015.
  11. ^ "First broadcast encoders with MPEG-H Audio launched | Fraunhofer Audio Blog". 2016-04-18. Retrieved 2016-08-02.
  12. ^ "More ATSC 3.0 Standards Progress! - ATSC". 3 May 2016. Retrieved 2016-08-02.
  13. ^ "Transmission and Reception for Terrestrial UHDTV Broadcasting Service". TTA. 2016-06-24. Retrieved 2016-08-02.
  14. ^ "Korea Reveals Its Plans for UHDTV at NAB Show". 4 May 2016. Retrieved 2016-08-02.
  15. ^ "World's 1st Terrestrial UHD TV Service With MPEG-H Audio | Fraunhofer Audio Blog". 2016-06-30. Retrieved 2016-08-02.
  16. ^ "Fraunhofer Announces MPEG-H Trademark to Identify Interoperable Products". Business Wire. 3 January 2017. Retrieved 11 January 2017.
  17. ^ Sony. "Sony Introduces All New "360 Reality Audio" Music Experience That Immerses Listeners in a Three-Dimensional Sound Field Powered by Object-Based Spatial Audio Technology". www.prnewswire.com (Press release). Retrieved 2019-01-26.
  18. ^ "Sony Unveils the 'Future of Music' With 360 Reality Audio at CES 2019". Billboard. 11 January 2019. Retrieved 2019-01-26.
  19. ^ "I want Sony's 360 Reality Audio to be the future of music". Engadget. Retrieved 2019-01-26.
  20. ^ a b "Text of ISO/IEC 23008-3/ DAM, 3D Audio Profiles". MPEG. Retrieved 2015-03-14.
  21. ^ "MPEG 115 - Geneva - MPEG-H 3D Audio AMD 3 reaches FDAM status | MPEG". mpeg.chiariglione.org. Retrieved 2016-08-02.
edit