Generative Adversarial Network 및 네트워크 압축을 활용한 표정 인식
Facial Expression Recognition with Generative Advanced Networks and Network Compression
- 주제(키워드) deep learning , image classification , facial expression recognition , generative adversarial network , network pruning , online learning
- 발행기관 서강대학교 일반대학원
- 지도교수 강석주
- 발행년도 2019
- 학위수여년월 2019. 2
- 학위명 석사
- 학과 및 전공 일반대학원 전자공학과
- 실제URI http://www.dcollection.net/handler/sogang/000000063768
- UCI I804:11029-000000063768
- 본문언어 영어
- 저작권 서강대학교 논문은 저작권보호를 받습니다.
초록/요약
As the interest of the deep neural networks (DNNs) increases, applications related to convolutional neural networks (CNNs) are evolving together. This thesis proposes a novel facial expression recognition for a customized system of a limited number of people by using a cascade convolutional neural network (CCNN) with generative adversarial networks (GANs) and network compression. The process for the proposed recognition system is as follows. First, we detect the facial region on the basis of Haar-like features from individual RGB images, and the region of interest is rearranged to reduce the variation of appearance changes after GANs process and raw image process. Second, we augment the images by using GANs and various image processing techniques. Third, the augmented data trains the CCNN categorizing a person recognition and facial expression recognition (FER) in advance, and then retrains each CNN after applying network pruning to the trained CCNN. In addition, the proposed system was applied to online and offline combination method with voting and batch-size input method. In the experimental results, the proposed system had the accuracy of 99.5019 % for human classification with the loss of 0.0387 and 96.8401 % for the FER with the loss of 0.0831. Additionally, as applying network compression, human and FER classification decreased by 23.2687 % and 23.2351 % for memory resource, and by 26.5291 % and 25.8651 % for computational cost. On/offline-combined system had the accuracy of 99.0063 % for human classification with the loss of 0.0426 and 91.0476 % for FER with 0.1256. Based on this study, it could construct a personalized human-state recognition system by only using facial images in online and offline environment.
more