Feature selection technique.

<div><p>3D skeleton-based human activity recognition has gained significant attention due to its robustness against variations in background, lighting, and viewpoints. However, challenges remain in effectively capturing spatiotemporal dynamics and integrating complementary information fr...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Dongwei Xie (4874992) (author)
مؤلفون آخرون:	Xiaodan Zhang (569294) (author), Xiang Gao (4077) (author), Hu Zhao (425380) (author), Dongyang Du (11634334) (author)
منشور في:	2025
الموضوعات:	Biotechnology Science Policy Infectious Diseases Biological Sciences not elsewhere classified Information Systems not elsewhere classified leverages optical flow integrating complementary information future multimodal integration multiple data modalities multimodal fusion framework late fusion strategy data augmentation techniques including ntu rgb skeletal attention modules skeletal data innovative fusion rgb video rgb features time applications temporal dependencies robust foundation model employs extensive experiments effective capture diverse fields combine skeletal benchmark datasets also provides
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

الوصف
الملخص:	<div><p>3D skeleton-based human activity recognition has gained significant attention due to its robustness against variations in background, lighting, and viewpoints. However, challenges remain in effectively capturing spatiotemporal dynamics and integrating complementary information from multiple data modalities, such as RGB video and skeletal data. To address these challenges, we propose a multimodal fusion framework that leverages optical flow-based key frame extraction, data augmentation techniques, and an innovative fusion of skeletal and RGB streams using self-attention and skeletal attention modules. The model employs a late fusion strategy to combine skeletal and RGB features, allowing for more effective capture of spatial and temporal dependencies. Extensive experiments on benchmark datasets, including NTU RGB+D, SYSU, and UTD-MHAD, demonstrate that our method outperforms existing models. This work not only enhances action recognition accuracy but also provides a robust foundation for future multimodal integration and real-time applications in diverse fields such as surveillance and healthcare.</p></div>

Feature selection technique.

مواد مشابهة