Transferable two-stream convolutional neural network for human action recognition

Qianqian Xiong, Jianjing Zhang, Peng Wang, Dongdong Liu, Robert X. Gao

Research output: Contribution to journalArticlepeer-review

86 Scopus citations

Abstract

Human-Robot Collaboration (HRC), which enables a workspace where human and robot can dynamically and safely collaborate for improved operational efficiency, has been identified as a key element in smart manufacturing. Human action recognition plays a key role in the realization of HRC, as it helps identify current human action and provides the basis for future action prediction and robot planning. While Deep Learning (DL) has demonstrated great potential in advancing human action recognition, effectively leveraging the temporal information of human motions to improve the accuracy and robustness of action recognition has remained as a challenge. Furthermore, it is often difficult to obtain a large volume of data for DL network training and optimization, due to operational constraints in a realistic manufacturing setting. This paper presents an integrated method to address these two challenges, based on the optical flow and convolutional neural network (CNN)-based transfer learning. Specifically, optical flow images, which encode the temporal information of human motion, are extracted and serve as the input to a two-stream CNN structure for simultaneous parsing of spatial-temporal information of human motion. Subsequently, transfer learning is investigated to transfer the feature extraction capability of a pretrained CNN to manufacturing scenarios. Evaluation using engine block assembly confirmed the effectiveness of the developed method.

Original languageEnglish
Pages (from-to)605-614
Number of pages10
JournalJournal of Manufacturing Systems
Volume56
DOIs
StatePublished - Jul 2020

Bibliographical note

Publisher Copyright:
© 2020 The Society of Manufacturing Engineers

Funding

Support by National Science Foundation under award CMMI-1830295 is gratefully appreciated.

FundersFunder number
National Science Foundation Arctic Social Science ProgramCMMI-1830295

    Keywords

    • Human-robot collaboration
    • Temporal information
    • Transfer learning

    ASJC Scopus subject areas

    • Software
    • Control and Systems Engineering
    • Hardware and Architecture
    • Industrial and Manufacturing Engineering

    Fingerprint

    Dive into the research topics of 'Transferable two-stream convolutional neural network for human action recognition'. Together they form a unique fingerprint.

    Cite this