IEEE Std 1857.6:2018 pdf download.IEEE Standard for Digital Media Content Description.
IEEE Std 1857.6 for digital media content description specifies digital media visual content descriptors to satisfy the requirement of searching in large-scale multimedia data and support applications to visual survcillancc.
1.2 Purpose
The standard provides compact descriptors to represent the visual features and describe the category, attribute, property, and context infonnation of’ the multimedia data. which can facilitate content searching and indexing, save bandwidth for transmission, enable hardware Support for descriptor extraction and matching, ensure interoperability of multimedia applications and ubiquitous platforms. simplify design of visual application, and enhance performance on visual surveillance applications, which demand high bandwidth.
2. Normative references
The following referenced documents are indispensable for the application of this document (i.e., they must be understood and used, so each referenced document is cited in text and its relationship to this document is explained). For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments or corrigenda) applies.
ISO/JEC 15938-3, Information technology Multimedia content description interface Part 3:Visual.
3. Abbreviations and symbols
The following mnemonics are defined to describe the different data types used in the coded bitstream (refer to ISO/IEC l59383).
bslbf Bit string, left bit first, where “left” is the order in which bits are written in ISO/IEC 15938-3. Bit strings are generally written as a string of is and Os within single quote marks, e.g., ‘1000 0001’. Blanks within a bit string are for ease of reading and have no significance. For convenience, large strings are occasionally Titten in hexadecimal, in which case conversion to a binary in the conventional manner will yield the value of the bit string. Thus, the leftmost hexadecimal digit is first and in each hexadecimal digit the most significant of the t’our digits is first.
vluirnsbf Variable length unsigned integer representation consisting of two parts. The first part defines the number of octets (8-bit bit fields) used fbr the values representation, encoded by a sequence of “1” bits, followed by a “0” bit signaling its end. The second part contains the value of the integer encoded using the number of octets specified in the first part.
uimsbf Unsigned integer, most significant bit first.
simsbf Signed integer, in two’s complement format, most significant bit (sign) first.
fsbt’ Float (32 bit), sign bit first. The semantics of the bits within a float are specified in IEEE Std 754-1985.
4. Basic structure
This clause introduces two basic spatial structures that are used for giving definition and computing descriptors: spatial two-dimensional coordinate system is used to specify an image coordinate system; region localization is used to specify the rectangular region of interest (ROl) in an image or video frame.
4.1 Spatial two-dimensional coordinates
This descriptor defines a spatial two-dimensional coordinate system as an image coordinate system. The default image shape is rectangle. The origin is placed at the top left corner of the image, and the x axis and y axis are aligned to the top and left edges of the image, respectively. All units of the coordinate are pixels.
4.1.1 Binary representation syntax
The binary representation syntax of spatial two-dimensional coordinates is given in Table 1.IEEE Std 1857.6 pdf download.