TY - GEN
T1 - Multimodal hand gesture recognition based on the fusion of surface electromyography and vision
AU - Hao, Shunran
AU - Gao, Dongxu
AU - Ju, Zhaojie
AU - Gao, Qing
PY - 2024/11/12
Y1 - 2024/11/12
N2 - In existing hand gesture recognition research, single-modal recognition is commonly used. For example, visual hand gesture recognition uses image information, but it is easily affected by the shooting environment. Another example is using surface electromyography (sEMG) for recognition, but it is susceptible to signal noise. To address the above issues, this paper focuses on the fusion of sEMG and vision of the human hand. We propose a novel approach that fuses the two modalities by using convolutional neural networks (CNN) to improve recognition accuracy. Firstly, using an RGB camera and sEMG armband, we jointly collect sEMG signal and skeleton in real-time, creating our own multimodal dataset for training. Secondly, we design a multimodal recognition network with feature fusion of sEMG and skeleton, to achieve an increase in accuracy. Finally, we built a human-computer interaction system that realizes hand gestures to manipulate a dexterous hand and a robot arm. Experimental results demonstrate that the fusion of the two modalities has complementary effects and effectively improves recognition accuracy.
AB - In existing hand gesture recognition research, single-modal recognition is commonly used. For example, visual hand gesture recognition uses image information, but it is easily affected by the shooting environment. Another example is using surface electromyography (sEMG) for recognition, but it is susceptible to signal noise. To address the above issues, this paper focuses on the fusion of sEMG and vision of the human hand. We propose a novel approach that fuses the two modalities by using convolutional neural networks (CNN) to improve recognition accuracy. Firstly, using an RGB camera and sEMG armband, we jointly collect sEMG signal and skeleton in real-time, creating our own multimodal dataset for training. Secondly, we design a multimodal recognition network with feature fusion of sEMG and skeleton, to achieve an increase in accuracy. Finally, we built a human-computer interaction system that realizes hand gestures to manipulate a dexterous hand and a robot arm. Experimental results demonstrate that the fusion of the two modalities has complementary effects and effectively improves recognition accuracy.
KW - Multimodal
KW - Hand Gesture Recognition
KW - Surface Electromyography
KW - Deep Learning
KW - Feature Fusion
U2 - 10.1109/M2VIP62491.2024.10746196
DO - 10.1109/M2VIP62491.2024.10746196
M3 - Conference contribution
SN - 9798350391923
T3 - Proceedings of the International Conference on Mechatronics and Machine Vision in Practice (M2VIP)
BT - 2024 30th International Conference on Mechatronics and Machine Vision in Practice (M2VIP)
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 30th International Conference on Mechatronics and Machine Vision in Practice (M2VIP)
Y2 - 3 October 2024 through 5 October 2024
ER -