【论文阅读笔记】（2015 CVPR）Hierarchical recurrent neural network for skeleton based action recognition

原创

已于 2022-02-25 12:34:27 修改 · 2.2k 阅读

标签

#计算机视觉 #深度学习 #人工智能 #骨架点动作识别 #HBRNN

收录于

于 2022-02-22 16:28:55 首次发布

Hierarchical recurrent neural network for skeleton based action recognition

（2015 CVPR）

Authors

Notes

Contributions

We propose an end-to-end hierarchical RNN for skeleton based action recognition. Instead oftaking the whole skeleton as the input, we divide the human skeleton into five parts according to human physical structure, and then separately feed them to five subnets. As the number oflayers increases, the representations extracted by the subnets are hierarchically fused to be the inputs of higher layers. The final representations ofthe skeleton sequences are fed into a single-layer perceptron, and the temporally accumulated output of the perceptron is the final decision. We compare with five other deep RNN architectures derived from our model to verify the effectiveness of the proposed network, and also compare with several other methods on three publicly available datasets. Experimental results demonstrate that our model achieves the state-of-the-art performance with high computational efficiency

Method

Preliminaries. The output of a single hidden layer RNN can be derived as:

The output of a single hidden layer LSTM can be derived as:

The bidirectional recurrent neural network (BRNN) presents the sequence forwards and backwards to two separate recurrent hidden layers.

It should be noted that we can easily obtain LSTM-BRNN just by replacing the nonlinear units in the above Figure with LSTM blocks.

Architecture. According to human physical structure, the human skeleton can be decomposed into five parts, e.g., two arms, two legs and one trunk.