TY - GEN
T1 - Sign language recognition with recurrent neural network using human keypoint detection
AU - Ko, Sang Ki
AU - Son, Jae Gi
AU - Jung, Hyedong
N1 - Publisher Copyright:
© 2018 Copyright is held by the owner/author(s).
PY - 2018/10/9
Y1 - 2018/10/9
N2 - We study the sign language recognition problem which is to translate the meaning of signs from visual input such as videos. It is well-known that many problems in the field of computer vision require a huge amount of dataset to train deep neural network models. We introduce the KETI sign language dataset which consists of 10,480 videos of high resolution and quality. Since different sign languages are used in different countries, the KETI sign language dataset can be the starting line for further research on the Korean sign language recognition. Using the sign language dataset, we develop a sign language recognition system by utilizing the human keypoints extracted from face, hand, and body parts. The extracted human keypoint vector is standardized by the mean and standard deviation of the keypoints and used as input to recurrent neural network (RNN). We show that our sign recognition system is robust even when the size of training data is not sufficient. Our system shows 89.5% classification accuracy for 100 sentences that can be used in emergency situations.
AB - We study the sign language recognition problem which is to translate the meaning of signs from visual input such as videos. It is well-known that many problems in the field of computer vision require a huge amount of dataset to train deep neural network models. We introduce the KETI sign language dataset which consists of 10,480 videos of high resolution and quality. Since different sign languages are used in different countries, the KETI sign language dataset can be the starting line for further research on the Korean sign language recognition. Using the sign language dataset, we develop a sign language recognition system by utilizing the human keypoints extracted from face, hand, and body parts. The extracted human keypoint vector is standardized by the mean and standard deviation of the keypoints and used as input to recurrent neural network (RNN). We show that our sign recognition system is robust even when the size of training data is not sufficient. Our system shows 89.5% classification accuracy for 100 sentences that can be used in emergency situations.
KW - Deep learning
KW - Keypoint detection
KW - Recurrent neural network
KW - Sign language recognition
UR - http://www.scopus.com/inward/record.url?scp=85056902445&partnerID=8YFLogxK
U2 - 10.1145/3264746.3264805
DO - 10.1145/3264746.3264805
M3 - Conference contribution
AN - SCOPUS:85056902445
T3 - Proceedings of the 2018 Research in Adaptive and Convergent Systems, RACS 2018
SP - 326
EP - 328
BT - Proceedings of the 2018 Research in Adaptive and Convergent Systems, RACS 2018
PB - Association for Computing Machinery, Inc
T2 - 2018 Conference Research in Adaptive and Convergent Systems, RACS 2018
Y2 - 9 October 2018 through 12 October 2018
ER -