view in publisher's site

Moving Foreground-Aware Visual Attention and Key Volume Mining for Human Action Recognition

Recently, many deep learning approaches have shown remarkable progress on human action recognition. However, it remains unclear how to extract the useful information in videos since only video-level labels are available in the training phase. To address this limitation, many efforts have been made to improve the performance of action recognition by applying the visual attention mechanism in the deep learning model. In this article, we propose a novel deep model called Moving Foreground Attention (MFA) that enhances the performance of action recognition by guiding the model to focus on the discriminative foreground targets. In our work, MFA detects the moving foreground through a proposed variance-based algorithm. Meanwhile, an unsupervised proposal is utilized to mine the action-related key volumes and generate corresponding correlation scores. Based on these scores, a newly proposed stochastic-out scheme is exploited to train the MFA. Experiment results show that action recognition performance can be significantly improved by using our proposed techniques, and our model achieves state-of-the-art performance on UCF101 and HMDB51.

در حال جابه‌جا کردن توجه بصری و Mining جلد کلیدی برای تشخیص عمل انسان

ترجمه شده با

پر ارجاع‌ترین مقالات مرتبط:

  • مقاله Hardware and Architecture
  • ترجمه مقاله Hardware and Architecture
  • مقاله سخت‌افزار و معماری
  • ترجمه مقاله سخت‌افزار و معماری
  • مقاله Computer Networks and Communications
  • ترجمه مقاله Computer Networks and Communications
  • مقاله شبکه‌ها و ارتباطات کامپیوتری
  • ترجمه مقاله شبکه‌ها و ارتباطات کامپیوتری
سفارش ترجمه مقاله و کتاب - شروع کنید

95/12/18 - با استفاده از افزونه دانلود فایرفاکس و کروم٬ چکیده مقالات به صورت خودکار تشخیص داده شده و دکمه دانلود فری‌پیپر در صفحه چکیده نمایش داده می شود.