A method for pattern recognition using bag-of-words model and neural network

Authors

  • Nguyễn Toàn Thắng
  • Đinh Xuân Lâm

Keywords:

bag-of-words model, gesture recognition, neural network, object descriptor, pattern recognition.

Abstract

The purpose of the project is to create an algorithm for real-time hand gesture recognition in video frames captured directly from the camera. The proposed algorithm is based on the bag-of-features (or bag-of-words) model, SURF-descriptor, k-means clustering, and neural network classification method. The bag-of-words model combined with SURF and k-means is used to create feature vectors, which then are fed as input data for the neural network. The algorithm is trained and tested with a self-made image data set. Experiments with various testing data sets demonstrate that the proposed algorithm ensures a high processing speed (less than 40 ms for each frame) to be able to perform in real time with data captured directly from a camera, keeps being invariant to transformations of the object in the video frame (including rotation, scaling and affine transition), and provides high recognition accuracy (~ 90%).

Downloads

Download data is not yet available.

Author Biographies

  • Nguyễn Toàn Thắng

    Trường Đại học Công nghệ thông tin và Truyền thông, Đại học Thái Nguyên

  • Đinh Xuân Lâm

    Trường Đại học Công nghệ thông tin và Truyền thông, Đại học Thái Nguyên

Published

2022-11-14