基于 Mask R-CNN 与 SG 滤波的手势识别 关键点特征提取方法
DOI:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP301;TP391

基金项目:

国家科技部项目(G20190201031)、北京信息科技大学科研内涵发展重点研究培育项目(2020KYNH226)资助


Gesture key point extraction method based on Mask R-CNN and SG filter
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    手势识别是人机交互的重要手段。 为了精确识别手势并摒除光照等环境干扰,同时减除由于手部高维运动造成的关键 点剧烈抖动的问题,提出一种基于基于蒙版区域的卷积神经网络(Mask Region-based convolutional neural network,Mask R-CNN) 与多项式平滑算法(Savitzky-Golay,SG)的手势关键点提取方法。 该方法首先对输入的红绿蓝(RGB)三通道图像进行特征提取 与区域分割,获得手部的实例分割与掩码。 然后利用 ROIAling 及功能性网络进行目标匹配,标记出 22 个关键点(21 个骨骼点+ 1 个背景点)。 将标记后结果送入 SG 滤波器进行数据平滑,并进行骨骼点的重新标定。 从而得到稳定的手势提取特征。 对模 型进行对比实验,结果表明,该方法能够最大程度摒除环境干扰,并精准提取关键点。 与传统基于轮廓分割的手势关键点提取 相比,模型的鲁棒性大大提高,识别精度达到 93. 48%。

    Abstract:

    Gesture recognition is an important means of human-computer interaction. In order to more accurately recognize gestures and eliminate the interference of environmental conditions such as lighting, and reduce the key point jitter recognition error caused by the high-dimensional space transformation of the hand at the meanwhile, a gesture key point method of extraction based on the Mask R-CNN model and Savitzky-Golay filter is proposed. This method uses the Mask R-CNN model to process RGB three-channel images, and performs object recognition and segmentation on each image, and obtains 21 bone points and background positions of the hand and performs model training. Then uses neural network features to match the video stream and mark 22 key points. Furthermore, the point data is smoothed by using Savitzky-Golay filter and then redraw the data to obtain stable gesture extraction and reconstruction results. This method is used in bone point extraction experiments. Experimental results show that the method can eliminate environmental interference to the greatest extent and accurately extract key points. Compared with traditional gesture key point extraction based on contour segmentation, the accuracy reaches 93. 48%. At the same time, the robustness of the model is greatly improved.

    参考文献
    相似文献
    引证文献
引用本文

王婧瑶,王红军.基于 Mask R-CNN 与 SG 滤波的手势识别 关键点特征提取方法[J].电子测量与仪器学报,2021,35(9):41-48

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2023-02-27
  • 出版日期: