Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques

A Master of Science thesis in Computer Engineering by Mahitab Alaaeldin Hassan entitled, "Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques," submitted in May 2017. Thesis advisor is Dr. Tamer Shanableh. Soft and hard copy available.

Saved in:
Bibliographic Details
Main Author: Hassan, Mahitab Alaaeldin (author)
Format: doctoralThesis
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/11073/8879
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1864513434862747648
author Hassan, Mahitab Alaaeldin
author_facet Hassan, Mahitab Alaaeldin
author_role author
dc.contributor.none.fl_str_mv Shanableh, Tamer
dc.creator.none.fl_str_mv Hassan, Mahitab Alaaeldin
dc.date.none.fl_str_mv 2017-06-15T07:23:38Z
2017-06-15T07:23:38Z
2017-05
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv 35.232-2017.19
http://hdl.handle.net/11073/8879
dc.language.none.fl_str_mv en_US
dc.subject.none.fl_str_mv Video coding
HEVC (High Efficiency Video Coding)
Machine learning
Video compression
Machine learning
dc.title.none.fl_str_mv Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
dc.type.none.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/doctoralThesis
description A Master of Science thesis in Computer Engineering by Mahitab Alaaeldin Hassan entitled, "Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques," submitted in May 2017. Thesis advisor is Dr. Tamer Shanableh. Soft and hard copy available.
format doctoralThesis
id aus_e565344e3043dba88e2963ffa8c43beb
identifier_str_mv 35.232-2017.19
language_invalid_str_mv en_US
network_acronym_str aus
network_name_str aus
oai_identifier_str oai:repository.aus.edu:11073/8879
publishDate 2017
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning TechniquesHassan, Mahitab AlaaeldinVideo codingHEVC (High Efficiency Video Coding)Machine learningVideo compressionMachine learningA Master of Science thesis in Computer Engineering by Mahitab Alaaeldin Hassan entitled, "Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques," submitted in May 2017. Thesis advisor is Dr. Tamer Shanableh. Soft and hard copy available.The High Efficiency Video Coding (HEVC) standard presents a substantial video compression efficiency improvement at the expense of increasing the computational complexity. This enhancement is primarily due to the introduction of flexible quad-based-tree partitioning structures for motion estimation (ME) and image transformation. However, finding the optimum coding structure, which is done by an exhaustive rate-distortion optimization (RDO) process, is what contributes to increasing the computational complexity. In this thesis, we propose a set of early termination algorithms to reduce the HEVC video encoding complexity by predicting both the split decisions of Coding Units (CUs) and the coding modes of Prediction Units (PUs). A video sequence-dependent approach is used in which frames belonging to the video being encoded are utilized for generating a classification model. At each CU depth level, features representing the given CU are extracted from both the current and previously encoded CUs. The feature vectors (FVs) are then utilized for generating dimensionality reduction and classification models. These models are in turn used at each coding depth to predict the split and mode decisions of subsequence CUs. In this work, we use stepwise regression, random forest feature importance, and Principal Component Analysis (PCA) for dimensionality reduction. Moreover, polynomial networks, random forests, and J48 decision trees are used for classification. Using seventeen video sequences with four different spatial resolution classes, the proposed solution is assessed in terms of the classification accuracy, Bjontegaard Delta bitrate (BD-rate), BD Peak Signal-to-Noise Ratio (BD-PSNR) and computational complexity reduction (CCR). On average, the CU early termination scheme achieved a CCR of 38.5% with an average classification accuracy of 78.1% at a negligible cost of 0.539% and -0.021 dB in terms of BD-rate and BD-PSNR, respectively. The PU early termination scheme attained an overall CCR of 20.9% with an average classification accuracy of 86.5% at the cost of a BD-rate of 0.248% and a BD-PSNR of -0.01 dB. When jointly implemented, an overall CCR of 50.1% was achieved with a BD-rate increase of 2% and a BD-PSNR decrease of 0.079 dB.College of EngineeringDepartment of Computer Science and EngineeringMaster of Science in Computer Engineering (MSCoE)Shanableh, Tamer2017-06-15T07:23:38Z2017-06-15T07:23:38Z2017-05info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdf35.232-2017.19http://hdl.handle.net/11073/8879en_USoai:repository.aus.edu:11073/88792025-06-26T12:34:40Z
spellingShingle Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
Hassan, Mahitab Alaaeldin
Video coding
HEVC (High Efficiency Video Coding)
Machine learning
Video compression
Machine learning
status_str publishedVersion
title Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
title_full Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
title_fullStr Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
title_full_unstemmed Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
title_short Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
title_sort Predicting Compression Modes and Split Decisions for HEVC Video Coding Using Machine Learning Techniques
topic Video coding
HEVC (High Efficiency Video Coding)
Machine learning
Video compression
Machine learning
url http://hdl.handle.net/11073/8879