Real-time and energy-efficient face detection on CPU-GPU heterogeneous embedded platforms

Chanyoung Oh, Saehanseul Yi, Youngmin Yi

Research output: Contribution to journalArticlepeer-review

2 Scopus citations


As energy efficiency has become a major design constraint or objective, heterogeneous manycore architectures have emerged as mainstream target platforms not only in server systems but also in embedded systems. Manycore accelerators such as GPUs are getting also popular in embedded domains, as well as the heterogeneous CPU cores. However, as the number of cores in an embedded GPU is far less than that of a server GPU, it is important to utilize both heterogeneous multi-core CPUs and GPUs to achieve the desired throughput with the minimal energy consumption. In this paper, we present a case study of mapping LBP-based face detection onto a recent CPU-GPU heterogeneous embedded platform, which exploits both task parallelism and data parallelism to achieve maximal energy efficiency with a real-time constraint. We first present the parallelization technique of each task for the GPU execution, then we propose performance and energy models for both task-parallel and data-parallel executions on heterogeneous processors, which are used in design space exploration for the optimal mapping. The design space is huge since not only processor heterogeneity such as CPU-GPU and big.LITTLE, but also various data partitioning ratios for the data-parallel execution on these heterogeneous processors are considered. In our case study of LBP face detection on Exynos 5422, the estimation error of the proposed performance and energy models were on average −2.19% and −3.67% respectively. By systematically finding the optimal mappings with the proposed models, we could achieve 28.6% less energy consumption compared to the manual mapping, while still meeting the real-time constraint.

Original languageEnglish
Pages (from-to)2878-2888
Number of pages11
JournalIEICE Transactions on Information and Systems
Issue number12
StatePublished - Dec 2018


  • CPU-GPU heterogeneous execution
  • Face detection
  • Performance and energy estimation
  • Task mapping


Dive into the research topics of 'Real-time and energy-efficient face detection on CPU-GPU heterogeneous embedded platforms'. Together they form a unique fingerprint.

Cite this