Real-time face detection in Full HD images exploiting both embedded CPU and GPU

Chanyoung Oh, Saehanseul Yi, Youngmin Yi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

CPU-GPU heterogeneous systems have become a mainstream platform in both server and embedded domains with ever increasing demand for powerful accelerator. In this paper, we present parallelization techniques that exploit both data and task parallelism of LBP based face detection algorithm on an embedded heterogeneous platform. By running tasks in a pipelined parallel way on multicore CPUs and by offloading a data-parallel task to a GPU, we could successfully achieve 29 fps for Full HD inputs on Tegra K1 platform where quad-core Cortex-A15 CPU and CUDA supported 192-core GPU are integrated. This corresponds to 5.54x speedup over a sequential version and 1.69x speedup compared to the GPU-only implementations.

Original languageEnglish
Title of host publication2015 IEEE International Conference on Multimedia and Expo, ICME 2015
PublisherIEEE Computer Society
ISBN (Electronic)9781479970827
DOIs
StatePublished - 4 Aug 2015
EventIEEE International Conference on Multimedia and Expo, ICME 2015 - Turin, Italy
Duration: 29 Jun 20153 Jul 2015

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume2015-August
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

ConferenceIEEE International Conference on Multimedia and Expo, ICME 2015
Country/TerritoryItaly
CityTurin
Period29/06/153/07/15

Keywords

  • CPU-GPU heterogeneous platform
  • Face detection
  • Tegra K1
  • data-parallel
  • task-parallel

Fingerprint

Dive into the research topics of 'Real-time face detection in Full HD images exploiting both embedded CPU and GPU'. Together they form a unique fingerprint.

Cite this