Information on this
banner



MPEG-4 Systems June 19 AM
MPEG-4 Systems Introduction & Elementary Stream Management (Download)
BIFS: Theory and Practice (Download)
DMIF Overview & MP4 File Format (Download)
by Olivier Avaro, Carsten Herpel, Jean-Claude Dufourd, Guido Franceschini
This tutorial gives an overview of Part 1 and Part 6 of ISO/IEC 14496, respectively MPEG-4 Systems and DMIF. It starts from the general architecture down to the description of the individual tools. It first presents the objectives of the MPEG-4 Systems activity. In the MPEG-1 and MPEG-2 standards, "Systems" referred only to overall architecture, multiplexing, and synchronization. In MPEG-4, in addition to these issues, the Systems part encompasses audio-visual scene description, interactivity, content description, and programmability. The tutorial addresses also the MPEG-4 delivery framework, including DMIF, the MPEG-4 file format as well as the transmission of MPEG-4 over IP and MPEG-2 Systems. Finally, the future extensions of the specification, as well as a comparison between the solutions provided by MPEG-4 and some alternative technologies will be addressed.

MPEG-4 Visual June 19 PM
MPEG-4 Visual Standard (Download)
by Thomas Sikora and Touradj Ebrahimi
This tutorial addresses the MPEG-4 standard, as defined in ISO/IEC 14496-2: Visual, its amendments as well as some of the planned work items. The MPEG-4 visual standard has developed to provide users a new level of interaction with visual contents. It provides technologies to view, access and manipulate objects of any origin (synthetic or natural), with useful featuressuch as compression efficiency at a large range of bit rates, error robustness, various scalabilities, etc. Application areas range from digital television, and internet video streaming, to mobile multimedia and games.

The MPEG-4 visual standard can be devided into collection of natural and synthetic video tools supporting these application areas. The standard provides tools for shape coding, motion estimation and compensation, texture coding, error resilience, sprite coding and various scalability features, face animation, body animation, dynamic mesh coding as well as three dimensional photo-realistic coding. This tutorial will provide an overview of these tools with examples of possible applications where they can be used for, as well as issues to be taken into account when designing systems based in MPEG-4 visual standard.

MPEG-4 Audio (Download) June 19 PM
Short Course on MPEG-4 Audio: Introduction and Overview (Download)
Profiles and Levels in MPEG-4 Audio (Download)
MPEG-4 Audio Speech Coding (Download)
MPEG-4 Synthetic Audio Tools (Download)
MPEG-4 General Audio Coding (Download)
MPEG-4 Version 2 Audio: HILN - Parametric Audio Coding (Download)
MPEG-4 Audio Scalable Coding (Download)
by Karlheinz Brandenburg, Bernhard Grill, Jürgen Herre, Riita Väänänen
MPEG-4 audio represents a new kind of audio coding standard. Unlike its predecessors, MPEG-1 and MPEG-2 high quality audio coding, and unlike the speech standards which have been completed by the ITU-T, it does not describe a single or small set of highly efficient compression schemes but a complete toolbox to support everything from low bitrate speech coding to high quality audio coding or music synthesis. The natural coding part within MPEG-4 audio describes traditional type speech and high quality audio coding algorithms and their combination to enable new functionalities, such as scalability (hierarchical coding) across the boundaries of coding algorithms. The tutorial gives an overview of the basic algorithms and how they can be combined.

In addition to its sophisticated audio-compression capabilities, MPEG-4 contains extensive functions supporting synthetic sound and the synthetic/natural hybrid coding of sound. An overview of the Structured Audio format which allows efficient transmission and client-side synthesis of music and sound effects will be presented as well as an overview of the Text-to-Speech Interface, which standardizes a single format for communication with speech synthesizers. Finally, an overview of the AudioBIFS portion of the Binary Format for Scene Description, which allows the description of hybrid soundtracks, 3-D audio environments, and interactive audio programming will be presented.

The tools provided for advanced audio functionality in MPEG-4 are a new and important addition to the world of audio standards. The tutorial will include both tutorial information on MPEG-4 audio and a number of demonstrations illustrating the technological potential of the standard.


CVs

Olivier AVARO (France Telecom)
Olivier AVARO was born in Provence, France on July 27, 1968. He received his Dipl. Ing. degree in telecommunication engineering in 1992 from the Higher School of Telecommunication of Brittany. He joined France Telecom-CNET department on image communication in Paris in 1992. He worked on image representation techniques and analysis. His research areas cover video compression algorithms, error resiliency of video compression algorithms, invariant representation of images and shapes and model based representation for interpersonal communications. He has been early involved in the ISO/MPEG-4 project, in particular through the European platform MAVT and its successor MoMuSys. He later managed the France Telecom CNET project Oxygen on MPEG-4 multimedia teleconferencing systems. In 1999, he joined Deutsche Telekom - Berkom department on Multimedia Communication in Darmstadt within the context of joint Deutsche Telekom - France Telecom projects. He is currently chairman of the MPEG-4 Systems sub-group.

Carsten Herpel (Thomson Multimedia)
Carsten Herpel was born in Cologne, Germany in 1962. He received his diploma in communication engineering from RWTH Aachen, Germany, in 1988. He then joined Thomson Multimedia's research facility in Hannover, Germany, and developed video coding algorithms, with a focus on scalable coding tools. His research interests have then moved to multimedia systems with a current focus on MPEG-4. Mr. Herpel serves as a co-editor to the MPEG-4 Systems specification.

Jean-Claude Dufourd (ENST)
Jean-Claude Dufourd holds an engineering degree from Ecole Nationale Superieure des Telecommunications and a PhD in computer science from Ecole Normale Superieure. He was a researcher at France Telecom/CNET in Grenoble before coming to ENST in 1990. His research interests include multimedia authoring, multimedia standardization and VLSI CAD. He has been heavily involved in the specification, validation, implementation and conformance testing of MPEG-4 Systems since 1995. He has participated in European projects like ACTS Emphasis, ESPRIT MPEG-4 PC and IST SoNG. Mr. Dufourd serves as editor of both the MPEG-4 Conformance Test Specification and the MPEG-4 Reference Software document.

Guido Franceschini (CSELT)
Guido FRANCESCHINI received the Electronic Engineering Degree from the Politecnico of Torino in October 1989. In 1990 he joined CSELT, where he was originally involved in the design and implementation of early ATM terminals. He has then been involved in the implementation of a DAVIC compliant Video on Demand system. He is currently working at an MPEG-4 prototype, with a focus on delivery, synchronization and server aspects. Since 1993 he has been participating to the ISO/IEC Moving Picture Expert Group, to work on the MPEG-2 Systems and then on the MPEG-2 DSMCC specifications. He has been heavily involved in the specification of MPEG-4 Systems and MPEG-4 DMIF. He acted as editor for MPEG-4 DMIF Version 1. From 1994 to 1996 he has been also participating to the ATM Forum, in the Service Aspects and Applications subgroup.

Touradj Ebrahimi (EPFL)
Touradj Ebrahimi was born on July 30, 1965. He received his M.Sc. and Ph.D., both in Electrical Engineering, from the Swiss Federal Institute of Technology, Lausanne, Switzerland, in 1989 and 1992 respectively. From 1989 to 1992, he was a research assistant at the Signal Processing Laboratory of the Swiss Federal Institute of Technology (EPFL). During the summer 1990, he was a visiting researcher at the Signal and Image Processing Institute of the University of Southern California, Los Angeles, California. In 1993, he was a research engineer at the Corporate Research Laboratoriess of Sony Corporation in Tokyo, where he conducted research on advanced video compression techniques for storage applications. In 1994, he served as a research consultant at AT&T Bell Laboratories working on very low bitrate video coding. He is currently a Professor at the Signal Processing Laboratory of EPFL, where he is involved with various aspects of Digital Video and Multimedia applications and in charge of the Digital TV group. In 1989, he was the recipient of the IEEE and Swiss national ASE award. His research interests are multidimensional signal processing, image processing, information theory, and coding. He is the author or the co-author of more than 100 research publications, and 12 patents. He is a member of the IEEE, SPIE and EURASIP.
>

Thomas Sikora (HHI)
As the chairman of the ISO-MPEG video group (Moving Picture Experts Group), Thomas Sikora is responsible for the development and standardization of the MPEG-4 video coding algorithms. He received the Dipl.-Ing. degree and Dr.-Ing. degree in electrical engineering from Bremen University, Germany, in 1985 and 1989 respectively. In 1990 he joined Siemens Ltd. and Monash University, Melbourne, Australia, as a project leader responsible for video compression research activities in the Australian &wote;Universal Broadband Video Codec" consortium. Since 1994 he directs video coding research activities at Heinrich-Hertz-Institute (HHI) Berlin GmbH, Germany.
Dr. Sikora has been involved in ITU and ISO standardization activities as well as in several European research activities for a number of years and he currently serves as the chairman of the European COST 211ter video compression research group. He works frequently as an industry consultant on issues related to interactive digital audio and video. Dr. Sikora is recipient of the 1996 German ITG award (German Society for Information Technology). He has authored and co-authored more than 100 papers for refereed journals, conferences and standardization bodies in the area of digital signal processing and image communications. He is member of the Editorial Board for a number of international journals including the IEEE Transactions on Circuits and Systems for Video Technology, the IEEE Signal Processing Magazine and the EURASIP Signal Processing: Image Communication journal. He is a senior member of IEEE and a member of ITG.

Karlheinz Brandenburg (Fraunhofer IIS/AEMT, Ilmenau Technical University)
Karlheinz Brandenburg received M.S. (Diplom) degrees in Electrical Engineering and in Mathematics from Erlangen University in 1980 and 1982, respectively. In 1989 he earned his Ph.D. in Electrical Engineering, also from Erlangen University, for work on digital audio coding and perceptual measurement techniques. The techniques described in his thesis form the basis of MPEG Layer-3 (mp3), MPEG-2 Advanced Audio Coding and most other modern audio compression schemes.
From 1989 to 1990 he was with AT&T Bell Laboratories in Murray Hill, NJ, USA. He worked on the ASPEC perceptual coding technique and on the definition of the ISO/IEC MPEG Layer-3 system. In 1990 he returned to Erlangen University to continue the research on audio coding and to teach a course on digital audio technology. In 1993 he became head of the Audio/Multimedia department at the Fraunhofer Institute for Integrated Circuits (FhG-IIS). In 2000, he was appointed full professor and director of the "Institut fuer Medientechnik" at Ilmenau Technical University and director of the new Fraunhofer IIS-A AEMT (Arbeitsgruppe Elektronische Medientechnologie) in Ilmenau, Germany.
He has presented numerous papers at AES conventions and other international conferences. Together with Mark Kahrs, he edited the book "Applications of Digital Signal Processing to Audio and Acoustics".
In 1994 he received the AES Fellowship Award for his work on perceptual audio coding and pscychoacoustics. In 1998 he received the AES silver medal award for his continued contributions and leadership to the art and science of perceptual audio coding. In 2000, he received the AES Board of Governors Award for his cochairmanship of the AES 17th. International Conference, the first scientific international conference dedicated to the topic of high quality audio coding. In October 2000, he and two collegues received the "German Future Award" for their work on MP3. In December 2000, he received the IEEE Region 10 "Engineering Excellence Award" of the IEEE Consumer Electronics Society for his contributions to the field of consumer electronics.
Dr. Brandenburg is a member of the technical committee on Audio and Electroacoustics of the IEEE Signal Processing Society. He has worked within the MPEG-Audio committee since its beginnings in 1988. In recent years, he worked on MPEG-2 Advanced Audio Coding (standardized in 1997) and helped to organize the work for MPEG-4 Audio. He is a member of SDMI (the Secure Digital Music Initiative) and is chairing the AES Standards Committee working group AESSC-06-04 on Internet Audio Delivery Systems.
Dr. Brandenburg has been granted 25 patents and has several more pending.

Bernhard Grill (Fraunhofer IIS)
Bernhard Grill was born in Schwabach, Germany, in 1961. He received a M.S. (Diplom) degree in electrical engineering from Erlangen University. A member of the AES, Mr. Gill's interests center on music. At the Fraunhofer Institute for Integrated Circuits (FhG-IIS) and the University of Erlangen he has contributed to the development of several perceptual audio coder systems. These include ASPEC, ISO/IEC MPEG-1/2 Audio Layer-3 (mp3), and MPEG-2 Advanced Audio Coding (AAC). Recent work concentrated on scalable audio coding, now part of MPEG-4 Audio. He was the coordinating editor of the MPEG-4 Audio standard. Bernhard Grill has been granted appr. fifteen audio coding patents and has several more pending. Currently he is the head of the audio department at the IIS.
He has presented numerous papers at AES conventions and other international conferences. In September 2000 he received the AES Fellowship Award for his work on MPEG-4 Audio and scalable audio coding. In October 2000, he and two collegues received the "German Future Award" for their work on MP3. He has been granted 15 patents and has several more pending.

Jürgen Herre (Fraunhofer IIS)
After receiving his MS degree from Erlangen university, Jürgen Herre joined the Fraunhofer Institute for Integrated Circuits (IIS) in Erlangen, Germany, in 1989. Since then he has been involved into the development of perceptual coding algorithms for high quality audio, such as the well-known coding algorithms ASPEC and ISO/MPEG-Audio Layer III (aka "MP3"). In 1995, Dr. Herre joined Bell Laboratories for a PostDoc term working on the development of the MPEG-2 Advanced Audio Coding (AAC) scheme. Since the end of '96 he is back at Fraunhofer working on the development of advanced multimedia technology including MPEG-4, MPEG-7 and secure delivery of audiovisual content.
Currently, Dr. Herre is the chief scientist for the Audio & Multimedia activities at Fraunhofer IIS Erlangen, coordinating research activities on watermarking of audio signals, analysis/feature extraction/recognition of audio signals and advanced audio compression schemes. He has published numerous papers on these and other topics and was awarded 15 patents, several more are pending.
Dr. Herre is a fellow of the Audio Engineering Society, co-chair of the AES Technical Committee on Coding of Audio Signals and vice chair of the AES Technical Council. He served as an associate editor of the IEEE Transactions on Speech and Audio Processing and is an active member of the MPEG audio subgroup.

Riitta Väänänen (HUT)
Riitta Vaananen was born in Helsinki, Finland, in 1970. She received an M.Sc. degree in electrical engineering from the Helsinki University of Technology (HUT) in 1997. She is currently a Ph.D. student at the HUT majoring in acoustics and audio signal processing, and interactive digital media as a minor subject. Ms Vaananen has worked as a research assistant and a research scientist at the Laboratory of Acoustics and Audio Signal Processing in HUT since 1996. During the year 2001 she is working as a visiting PhD student researcher in IRCAM (Institut de Recherche et coordination Acoustique/Musique) in Paris. Ms. Vaananen's research activities include room reverberation modeling, and modeling of sound sources in acoustic environments in interactive virtual reality systems. During 1998-2000 she has also actively participated the MPEG-4 standardization process.


Why Join MPEGIF??


MPEGIF Logo Qualification Program



Download the NEW Whitepaper on MPEG-4

Quick Links

Search this site

Join MPEGIF's public mailing lists

Archive of Weekly News Digests


White Papers

White paper: High Efficiency AAC - World's Best Audio Codec

Intro to MPEG-7

Intro to MPEG-21


Brochures

What is AVC??

What is AAC?

What is (Advanced) Simple Profile?

On interactivity in MPEG-4


About MPEGIF's logo

Guidelines for Members and Non-members

MPEGIF Sponsored Events





©Copyright 2007 MPEG Industry Forum