Hierarchical recurrent neural network for skeleton based action recognition
(2015 CVPR)
Authors
Notes
Contributions
We propose an end-to-end hierarchical RNN for skeleton based action recognition. Instead of taking the whole skeleton as the input, we divide the human skeleton into five parts according to human physical structure, and then separately feed them to five subnets. As the number of layers increases, the representations extracted by the subnets are hierarchically fused to be the inputs of higher layers. The final representations of the skeleton sequences are fed into a single-layer perceptron, and the temporally accumulated output of the perceptron is the final decision. We compare with five other deep RNN architectures derived from our model to verify the effectiveness of the proposed network, and also compare with several other methods on three publicly available datasets. Experimental results demonstrate that our model achieves state-of-the-art performance with high computational efficiency.
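
The last step described above (a single-layer perceptron whose output is accumulated over time) can be illustrated with a minimal NumPy sketch. The function name `classify_sequence`, the shapes, and the random weights are assumptions for illustration; the softmax over the accumulated scores is one plausible way to turn them into class probabilities, not necessarily the exact formulation used in the paper.

```python
import numpy as np

def classify_sequence(frame_features, W, b):
    """Classify a sequence from per-frame features.

    frame_features: (T, D) array, one feature vector per frame (assumed shape).
    W: (D, C) perceptron weights, b: (C,) bias, for C action classes.
    Returns (class probabilities, predicted label).
    """
    per_frame_scores = frame_features @ W + b      # (T, C): perceptron output per frame
    accumulated = per_frame_scores.sum(axis=0)     # (C,): accumulate over time
    shifted = accumulated - accumulated.max()      # subtract max for numerical stability
    probs = np.exp(shifted) / np.exp(shifted).sum()
    return probs, int(np.argmax(probs))

# Toy usage with random data (illustrative values only).
rng = np.random.default_rng(0)
T, D, C = 20, 8, 4
features = rng.standard_normal((T, D))
W = rng.standard_normal((D, C))
b = np.zeros(C)
probs, label = classify_sequence(features, W, b)
```

Because the scores are summed before the decision, every frame contributes to the prediction rather than only the last time step.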

Method
Preliminaries. The output of a single hidden layer RNN can be derived as:
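
The equation image is not preserved in these notes. As a stand-in, the standard single-hidden-layer RNN recurrence is usually written as follows; the symbol names ($W_{xh}$, $W_{hh}$, $W_{ho}$, $b_h$, $b_o$, and the activation functions $\mathcal{H}$ and $\mathcal{O}$) are the conventional ones and are assumed here rather than copied from the source.

```latex
\begin{aligned}
h_t &= \mathcal{H}\left(W_{xh}\, x_t + W_{hh}\, h_{t-1} + b_h\right) \\
y_t &= \mathcal{O}\left(W_{ho}\, h_t + b_o\right)
\end{aligned}
```

Here $x_t$ is the input at time $t$, $h_t$ the hidden state, and $y_t$ the output.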
The output of a single hidden layer LSTM can be derived as:
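
The LSTM equations are likewise missing from these notes. A commonly used formulation with peephole connections is sketched below; whether the paper uses exactly this variant is an assumption, and the symbols follow the usual convention ($i_t$, $f_t$, $o_t$ for the input, forget, and output gates, $c_t$ for the cell state, $\sigma$ for the logistic sigmoid, $\odot$ for the element-wise product).

```latex
\begin{aligned}
i_t &= \sigma\left(W_{xi}\, x_t + W_{hi}\, h_{t-1} + W_{ci}\, c_{t-1} + b_i\right) \\
f_t &= \sigma\left(W_{xf}\, x_t + W_{hf}\, h_{t-1} + W_{cf}\, c_{t-1} + b_f\right) \\
c_t &= f_t \odot c_{t-1} + i_t \odot \tanh\left(W_{xc}\, x_t + W_{hc}\, h_{t-1} + b_c\right) \\
o_t &= \sigma\left(W_{xo}\, x_t + W_{ho}\, h_{t-1} + W_{co}\, c_t + b_o\right) \\
h_t &= o_t \odot \tanh\left(c_t\right)
\end{aligned}
```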

The bidirectional recurrent neural network (BRNN) presents the sequence forwards and backwards to two separate recurrent hidden layers.
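
A minimal NumPy sketch of this idea follows. The helper names `rnn_pass` and `brnn_pass`, the `tanh` activation, and the choice to concatenate the two hidden sequences are assumptions for illustration; a BRNN can also combine the two directions through a shared output layer.

```python
import numpy as np

def rnn_pass(x, W_xh, W_hh, b_h):
    """Run a simple tanh RNN over x of shape (T, D); return hidden states (T, H)."""
    T = x.shape[0]
    H = W_hh.shape[0]
    h = np.zeros(H)
    out = np.zeros((T, H))
    for t in range(T):
        h = np.tanh(x[t] @ W_xh + h @ W_hh + b_h)
        out[t] = h
    return out

def brnn_pass(x, fwd_params, bwd_params):
    """Bidirectional pass: one RNN reads x forwards, a second reads it backwards."""
    h_fwd = rnn_pass(x, *fwd_params)
    # Reverse the input in time, run the backward RNN, then restore time order.
    h_bwd = rnn_pass(x[::-1], *bwd_params)[::-1]
    # Concatenation is one simple way to combine both directions (an assumption).
    return np.concatenate([h_fwd, h_bwd], axis=1)

# Toy usage with random weights (illustrative values only).
rng = np.random.default_rng(0)
T, D, H = 6, 3, 4

def make_params():
    return (0.5 * rng.standard_normal((D, H)),
            0.5 * rng.standard_normal((H, H)),
            np.zeros(H))

fwd, bwd = make_params(), make_params()
x = rng.standard_normal((T, D))
out = brnn_pass(x, fwd, bwd)   # shape (T, 2 * H)
```

At each time step the first `H` columns summarize the past and the last `H` columns summarize the future, which is what gives the BRNN access to context in both directions.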

Note that an LSTM-BRNN can be obtained simply by replacing the nonlinear units of the BRNN with LSTM blocks.
Architecture. According to human physical structure, the human skeleton can be decomposed into five parts: two arms, two legs and one trunk.
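
The decomposition can be sketched in NumPy as below. The 20-joint layout and the index lists in `PARTS` are hypothetical (real datasets define their own joint ordering), the helpers `split_parts` and `fuse` are illustrative names, and the fusion order shown (trunk with each limb first) is an assumption about how the hierarchy is built. In the actual model the fused inputs would be the subnet outputs, not raw coordinates.

```python
import numpy as np

# Hypothetical joint indices for a 20-joint skeleton; adjust to the dataset's layout.
PARTS = {
    "left_arm":  [0, 1, 2, 3],
    "right_arm": [4, 5, 6, 7],
    "trunk":     [8, 9, 10, 11],
    "left_leg":  [12, 13, 14, 15],
    "right_leg": [16, 17, 18, 19],
}

def split_parts(seq):
    """Split a skeleton sequence (T, J, 3) into per-part sequences (T, 3 * joints_in_part)."""
    T = seq.shape[0]
    return {name: seq[:, idx, :].reshape(T, -1) for name, idx in PARTS.items()}

def fuse(*reps):
    """Fuse per-frame representations by concatenating along the feature axis."""
    return np.concatenate(reps, axis=1)

# Toy usage: 30 frames, 20 joints, 3D coordinates (random values for illustration).
rng = np.random.default_rng(0)
skeleton_seq = rng.standard_normal((30, 20, 3))
parts = split_parts(skeleton_seq)

# Assumed first fusion level: combine the trunk with each limb.
left_arm_trunk = fuse(parts["left_arm"], parts["trunk"])
right_arm_trunk = fuse(parts["right_arm"], parts["trunk"])
# Assumed next level: combine both arm-trunk representations.
upper_body = fuse(left_arm_trunk, right_arm_trunk)
```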




