Intra prediction with deep learning

Raz Birman, Yoram Segal, Avishay David-Malka, Ofer Hadar

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Abstract

    Intra-Prediction is a fundamental component of video compression standards. It exploits the redundancy among neighboring pixel values within a video frame to predict blocks of pixels from their surrounding pixels, so that the prediction errors can be transmitted instead of the pixel values themselves. The prediction errors are smaller in magnitude than the pixel values, which enables compression of the video stream. Prevalent standards take advantage of intra-frame pixel value dependencies to perform prediction at the encoder end and transfer only residual errors to the decoder. The standards use multiple "Modes", which are various linear combinations of pixels for predicting their neighbors within image Macro-Blocks (MBs). In this research, we used Deep Neural Networks (DNNs) to perform the predictions. Using twelve Fully Connected Networks, we reduced the Mean Square Error (MSE) of the prediction error by up to a factor of three compared with the results of the standard prediction modes. This substantial improvement comes at the expense of more extensive computations. However, these extra computations can be significantly mitigated by the use of dedicated Graphics Processing Units (GPUs).
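
The mode-based prediction described in the abstract can be illustrated with a minimal sketch. It assumes simplified vertical, horizontal, and DC modes in the spirit of H.264-style intra prediction; the function names `predict_block` and `best_mode` and the toy 4x4 block are hypothetical, and the sketch does not reproduce the neural networks used in the paper.

```python
import numpy as np

def predict_block(top, left, mode):
    """Predict an N x N block from its already-decoded neighbors.

    top  : the N pixels directly above the block
    left : the N pixels directly to the left of the block
    (Simplified, illustrative modes; not the exact standard definitions.)
    """
    n = len(top)
    if mode == "vertical":      # copy the row above downwards
        return np.tile(top, (n, 1))
    if mode == "horizontal":    # copy the left column rightwards
        return np.tile(left.reshape(n, 1), (1, n))
    if mode == "dc":            # flat block at the mean of all neighbors
        return np.full((n, n), (top.sum() + left.sum()) / (2 * n))
    raise ValueError(f"unknown mode: {mode}")

def best_mode(block, top, left, modes=("vertical", "horizontal", "dc")):
    """Return (mode, residual, mse) for the mode with the lowest residual MSE."""
    best = None
    for mode in modes:
        residual = block - predict_block(top, left, mode)
        mse = float(np.mean(residual ** 2))
        if best is None or mse < best[2]:
            best = (mode, residual, mse)
    return best

# Toy 4x4 block whose columns continue the row above, offset by 1.
top = np.array([10.0, 20.0, 30.0, 40.0])
left = np.array([12.0, 12.0, 12.0, 12.0])
block = np.tile(top, (4, 1)) + 1.0

mode, residual, mse = best_mode(block, top, left)
raw_energy = float(np.mean(block ** 2))   # mean square of the raw pixel values
print(mode, mse, raw_energy)
```

In this toy case the encoder would send the chosen mode and a residual whose mean square is far smaller than that of the raw pixels. The approach in the abstract replaces the fixed linear modes with learned predictors, which this sketch does not attempt to model.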

    Original language: American English
    Title of host publication: Applications of Digital Image Processing XLI
    Editors: Andrew G. Tescher
    Publisher: SPIE
    ISBN (Print): 9781510620759
    State: Published - 1 Jan 2018
    Event: Applications of Digital Image Processing XLI 2018 - San Diego, United States
    Duration: 20 Aug 2018 - 23 Aug 2018

    Publication series

    Name: Proceedings of SPIE - The International Society for Optical Engineering
    Volume: 10752

    Conference

    Conference: Applications of Digital Image Processing XLI 2018
    Country/Territory: United States
    City: San Diego
    Period: 20/08/18 - 23/08/18

    Keywords

    • AV1
    • Deep Learning
    • H.264
    • HEVC
    • Intra Prediction
    • Intra-Prediction Modes

    All Science Journal Classification (ASJC) codes

    • Electronic, Optical and Magnetic Materials
    • Condensed Matter Physics
    • Computer Science Applications
    • Applied Mathematics
    • Electrical and Electronic Engineering
