Parallel Processing in supportive formats

From PDI

Jump to: navigation, search

Many of the current and upcoming standards of video have a potential for parallel processing. This feature can be used in multi processor paradigms like multi core processors or massively parallel processing on GPU using NVIDIA's CUDA platform or OpenCL. A lot of the ideas are related to the upcoming H.265 standard.


Contents

Usual steps in decoding

Entropy decoding is the first stage of image reconstruction. The next step in reconstruction of a frame is prediction done which is intra / inter frame reconstruction. This is followed by inverse DCT transform to get the residual field, which when added to the reconstructed image, gives the decoded frame.

Possibility of Parallellization

Inter frame prediction can be parallelized trivially, especially on a GPU. This does not have dependence on currently decoded frame, and hence each of the inter predicted blocks can be decoded parallelly.

Intra frame prediction depends on the current frame. This cannot be parallelly. However many of the frames have only about 10-20% of the blocks intra predicted. This implies that several chains of independent intra blocks. These chains can be processed parallelly.

Inverse transform stage can be processed parallelly trivially, because each block is independent of the other.

Entropy decoding in parallel

Displaying directly

Frame reconstruction done on the GPU, gives rise to the possibility of displaying the frame directly on the screen. This is of great advantage because transferring data via the PCI bus could have been very costly otherwise.

Personal tools