The Internet Encyclopedia (Volume 3)

P1: IML/FFX P2: IML/FFX QC: IML/FFX T1: IML

Video ̇Compression ̇OLE WL040/Bidgolio-Vol I WL040-Sample.cls September 14, 2003 18:10 Char Count= 0

DIGITALVIDEOSIGNALCOMPRESSION 543

researchers compare the effectiveness of video compression algorithms by plotting empirically determined quality measures as a function of observed bit rate for a given set of test sequences. Almost invariably, the observed video quality improves with increased bit rate.

Principles of Digital Video Compression Digital video compression operates by eliminating re- dundant spatial, temporal, hyperspectral, statistical, or pyschovisual information. A brief inspection of Figure 1a shows considerable overlap between frames 1232 and 1233 but a discontinuous change of scene (likely as a re- sult of editing) between frames 1231 and 1232. All the frames shown exhibit strong local spatial correlation, the motion of the astronauts exhibits temporal correlation, the viewer’s attention is focused on the motion of astronauts and the expressions on their faces, and the space suit color contrasts well with the background. Figure 1b illus- trates the well-known video artifact of “dropped frames” when this movie is compressed at an average data rate less than the original 293 kbps.

Human Destination Compression of video sequences intended for human viewing may take place by a combination of lossy and lossless compression steps. Local spatial correlation may be removed by intraframe coding techniques such as block matching or transform coding followed by quan- tization of the transform coefficients (see Data Com- pression). Temporal correlation may be removed by interframe coding techniques predicting the motion vectors observed in differences between successive frames. Spectral correlation may be removed by applying temporal decor- relation or modeling techniques to the spectral dimen- sions of the hyperspectral imagery; coding redundancy may be removed through careful design of compression codes, while psychovisual redundancy may be addressed by techniques that drop frames and increase the bit rate in regions containing human faces, sharply contrasting regions and trajectories of distracting objects that move rapidly through the peripheral field of view.

Machine Destination In scientific and medical applications, imagery may be correlated in any of the preceding ways. There are likely to be additional constraints on the compression which may include fidelity criteria (such as 90% of the encircled en- ergy to remain within the pixel for remotely sensed land use data, or the peak difference in value reconstructed from the compressed imagery to be within 10% of the original value for 90% of the reconstructed pixels) together with processing constraints such as the bit rate must remain approximately constant within the communication channel capacity regardless of video quality or content and consume no more than 5% of the available processing resource. As an example, Freedman, Boggess, and Seiler (1993) and Freedman and Farrelle (1996) set forth criteria for the experimental compression system developed to optimize real-time calculations on large diverse data sets from

NASA’s Cosmic Background Explorer mission in a clus- tered computing environment in which the application software exceeded 1M lines of code:

Provide compression transparently without changing the application software. Compress instrument pipeline and science analysis data products to better than 16–50%. Process compressed data at a worst case throughput not less than 90% of uncompressed data processing. Preserve required accuracy of instrument housekeeping and scientific data according to specified validation criteria. Exceed bitwise reliability of 10–^13 on average (flawless compression of 380 GB). Several times this factor is desirable. Support full random access to data records. Provide a capability to select specific classes of data for compression. Provide a capability to select a compression scheme (representation) for each field of a data record. Preserve overlaps in separately processed data segments. Store search keys (e.g., time code and pixel address) in plain codes. Optimize choice of representation, combining a priori scientific knowledge with adaptive knowledge of data. Provide mechanisms for easy user configuration and database management of compressed data.

Pre- and Postprocessing Video source material is often pre-processed before encoding and post-processed before distribution. Com- mon preprocessing steps include digital nonlinear editing (Ohanian, 1998), transcoding from another compressed representation, and the conversion of film-originated material from 24 to 29.97 fps via the telecine process just discussed. Common postprocessing steps include digital nonlinear editing, the inverse telecine process, and the manual deletion or recoding of specific frames.

Motion Estimation and Compensation Strong temporal correlation between successive video frames suggest that the bit rate of a video sequence may be reduced by interframe coding methods such as conditional replenishment, motion-compensated coding, and three-dimensional transform coding. The conditional replenishment approach seeks to en- code only the difference in pixel values which change between successive frames and requires considerably less processing power and simpler algorithms than either motion-compensated coding or three-dimensional transform coding. Motion-compensated methods usually re- sult in data rates below that resulting from conditional replenishment and comprise determining motion vectors from frames preceding the current frame by methods such as block matching, region matching, or analysis of optical flow, predicting the motion vectors for successive frames, estimating the prediction error of these motion vectors, and encoding the predicted motion vectors

The Internet Encyclopedia (Volume 3)

Get our desktop app

Company

Features

Documentation

Resources