[2] MPEG-4 has a versatile encoding and decoding process. A broad choice of transport protocols can be used with its interface, FlexMux [bottom]. At the sync layer, packetized elementary streams (ESs) are reassembled based on their timing information [middle]. The information in an ES [top] is a primitive audiovisual object [here the Fig. 1 objects], how it is to be decoded (the object descriptors), how organized in the scene (the scene graph), plus any interactive (upchannel) information from the terminal.
(c) Copyright 1999, The Institute of Electrical and Electronics Engineers, Inc.