CSA - CAN/CSA-ISO/IEC 14496-1-05 AMD 1
Information technology Coding of audio-visual objects Part 1: Systems
| Organization: | CSA |
| Publication Date: | 1 December 2006 |
| Status: | inactive |
| Page Count: | 8 |
scope:
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of the joint technical committee is to prepare International Standards. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Amendment 1 to ISO/IEC 14496-1:2004 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information.
ISO/IEC 14496 specifies a system for the communication of interactive audio-visual scenes. The specification includes the following elements:
1. the coded representation of natural or synthetic, two-dimensional (2D) or three-dimensional (3D) objects that can be manifested audibly and/or visually (audio-visual objects) (specified in part 1, 2 and 3 of ISO/IEC 14496);
2. the coded representation of the spatio-temporal positioning of audio-visual objects as well as their behaviour in response to interaction (scene description, specified in parts 11 and 20 of ISO/IEC 14496);
3. the coded representation of information related to the management of data streams (synchronization, identification, description and association of stream content, specified in parts 11 and 20 of ISO/IEC 14496);
4. a generic interface to the data stream delivery layer functionality (specified in part 6 of ISO/IEC 14496);
5. an application engine for programmatic control of the player: format, delivery of downloadable Java byte code as well as its execution lifecycle and behaviour through APIs (specified in part 11 of ISO/IEC 14496); and
6. a file format to contain the media information of an ISO/IEC 14496 presentation in a flexible, extensible format to facilitate interchange, management, editing, and presentation of the media.
The information representation, specified in ISO/IEC 14496-1, ISO/IEC 14496-11 and in ISO/IEC 14496-20, describes the means to create an interactive audio-visual scene in terms of coded audio-visual information and associated scene description information. The encoded content is presented to a terminal as the collection of elementary streams. Elementary streams contain the coded representation of either audio or visual data or scene description information or user interaction data. Elementary streams may as well themselves convey information to identify streams, to describe logical dependencies between streams, or to describe information related to the content of the streams. Each elementary stream contains only one type of data.
Elementary streams are decoded using their respective stream-specific decoders. The audio-visual objects are composed according to the scene description information and presented by the terminal's presentation device(s). All these processes are synchronized according to the systems decoder model (SDM) using the synchronization information provided at the synchronization layer.
The scene description stream identifies different types of objects, such as audio, visual, 2D and 3D graphics, etc. that define a scene composition of the content. Among these objects, the essential part of almost any multimedia presentation is text objects that are created utilizing specific custom fonts. Font selection determines the appearance of a text in multimedia content and it's the most critical factor that assures text legibility and readability. It also plays critical role in the overall scene composition since the metric properties of a font are used for textual parts of multimedia content layout. Many thousands of fonts are available today for use in content creation and in order to assure correct appearance and layout of a content the font data have to be included (embedded) with the text objects as part of the multimedia presentation.
ISO/IEC 14496-18 defines new set of tools for font streaming and text rendering. In order to enable faithful and reliable reproduction of textual content in MPEG-4 presentations the font data format, font stream, its stream type and its configuration is defined. The document also establishes new dimension of MPEG-4 Text profiles and defines the text profiles and levels.
New text profiles and levels shall be signalled on the MPEG-4 Systems to insure the conformance. The scope of this amendment is to define new tools that will enable the signalling of the newly established Text Profile dimension and to specify Text Profile and Level Indication values.
Document History