您好,欢迎访问上海市农业科学院 机构知识库!

MGRFormer: A Multimodal Transformer Approach for Surgical Gesture Recognition

文献类型: 会议论文

第一作者: Kevin Feghoul

作者: Kevin Feghoul 1 ; Deise Santana Maia 2 ; Mehdi El Amrani 3 ; Mohamed Daoudi 4 ; Ali Amad 5 ;

作者机构: 1.Univ. Lille, Inserm, CHU Lille, UMR-S1172 LilNCog, Lille, France|Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, Lille, France

2.Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, Lille, France

3.Department of Digestive Surgery and Transplantation, CHU Lille, PRESAGE, Univ. Lille, France

4.Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, Lille, France|IMT Nord Europe, Institut Mines-Télécom, Centre for Digital Systems, Lille, France

5.Univ. Lille, Inserm, CHU Lille, UMR-S1172 LilNCog, Lille, France

关键词: Training;Visualization;Surgery;Gesture recognition;Kinematics;Machine learning;Predictive models

会议名称: IEEE International Conference on Automatic Face and Gesture Recognition

主办单位:

页码: 1-10

摘要: Automatic surgical gesture recognition has the potential to revolutionize the field of surgery by enhancing patient care, surgical training, and our understanding of surgical skills. By integrating kinematic data, which precisely captures hand movements, with video data for contextual understanding, multimodal machine learning can greatly enhance the accuracy of surgical gesture recognition systems by capturing complementary knowledge. Recent research has highlighted the capabilities of Transformer-based models for temporal action segmentation. A key component of these models is the iterative refinement module, which enhances predictions using contextual data. In this study, we propose MGRFormer, a novel multimodal framework that leverages the interaction between kinematics and visual data at the refinement stage for the task of surgical gesture recognition. We evaluated our MGRFormer on the VTS dataset, and the results demonstrated that our approach outperformed unimodal and multimodal state-of-the-art methods by a large margin.

分类号: tp7

  • 相关文献

[1]Computer Vision-Based Assistance System for Visually Impaired Individuals in Vending Machine Interactions. Karla Miriam Reyes Leiva,Roy Abi Zeid Daou,José Javier Serrano Olmedo. 2023

[2]Design and Implementation of an EMG-based System for Monitoring and Help Request. Carla Zeine,Roy Abi Zeid Daou,Samer Telawi,Josef Boercsoek,Ali Hayek,Gaby Abou Haidar. 2022

[3]One-step reverse transcription loop-mediated isothermal amplification assay for rapid detection of melon yellow spot virus. Zeng, Rong,Xu, Li-hui,Gao, Shi-gang,Dai, Fu-ming,Ni, Xiu-hong,Chen, Chun-lei,Chen, Jian-cai.

[4]Virtual Visualization System for Growth of Tobacco Root. Xi, Lei,Ren, Yanna,Wang, Qiang,Zhang, Longlong,Ma, Xinming,Xiong, Shuping,Ma, Xinming,Yang, Juan. 2011

[5]Virtual Visualization System for Growth of Tobacco Root. Lei Xi,Shuping Xiong,Yanna Ren,Qiang Wang,Juan Yang. 2011

[6]Design and Implementation of a Smart Human Posture Monitoring Device Using a Wearable Vest. Roy Abi Zeid Daou,Elio Geha,Youssef Aoun,Ali Hayek,Josef Boercsoek,Jose Javier Serrano Olmedo. 2023

[7]Visualizing Complexity: A Dynamic Approach to Reverse Engineering. Ketki Balde,Lalji Prasad,Rashmi Yadav. 2024

[8]Research and Application of Pig Iris Recognition Technology. Shipu Xu,Guoqing Liu,Yong Liu,Yunsheng Wang. 2024

[9]THE STREAM ALGORITHM: COMPUTATIONALLY EFFICIENT RIDGE-REGRESSION VIA BAYESIAN MODEL AVERAGING, AND APPLICATIONS TO PHARMACOGENOMIC PREDICTION OF CANCER CELL LINE SENSITIVITY. IN SOCK JANG,STEPHEN H. FRIEND,ADAM A. MARGOLIN,ELIAS CHAIBUB NETO. 2014

[10]Validation of a workflow based on Sentinel-2, Sentinel-1 and meteorological data predicting biomass on pastures. Nickmilder C,Tedde A,Dufrasne I.. 2021

[11]SYSTEMATIC ASSESSMENT OF ANALYTICAL METHODS FOR DRUG SENSITIVITY PREDICTION FROM CANCER CELL LINE DATA. IN SOCK JANG,ELIAS CHAIBUB NETO,JUSTIN GUINNEY,STEPHEN H. FRIEND,ADAM A. MARGOLIN. 2014

[12]A Machine Learning Approach to Diffusion MRI Partial Volume Estimation. Emmanuel Vallee,Wenchuan Wu,Francesca Galassi,Saad Jbabdi,Stephen Smith. 2018

[13]Machine Learning-Based Fraud Detection System for Insurance Claims in IoT Environment. Bediga Sharan,Mohammad Hassan,V Divya Vani,Vijilius Helena Raj,GinniNijhawan,Priyanka Prabhakar Pawar. 2024

[14]Evaluation of Solar Energy Generation and Radiation Prediction Using Machine Learning. Santosh S Raghuwanshi,Jagrati Gupta,Gourav Shrivastava. 2023

[15]Deployment of models to predict compressed sward height at a large scale: results and feedback. Nickmilder C.,Tedde A,Dufrasne I.. 2022

[16]Prediction of the Risk of Heart Attack Using Machine Learning Techniques. Pinaki Ghosh,Umesh Kumar Lilhore,Sarita Simaiya,Atul Garg,Devendra Prasad,Ajay Kumar. 2022

[17]Blockchain Cloud Computing: Comparative study on DDoS, MITM and SQL Injection Attack. Mamta Swarnkar,Shekh Kulsum Almas,Nagendra Singh,Harsh Pratap Singh,Anuprita Mishra,Anula Khare. 2024

[18]A Systematic Literature Review on Developing Job Profiles and Training Content for Open Data-Driven Smart Cities. Koukounidou Vasiliki,Kokkinaki Angelika,Osta Alain,Tsakiris Theodoros. 2024

[19]Revolutionizing Healthcare with Federated Learning: A Comprehensive Review. Snehlata Mishra,Ritu Tondon,Narendra Pal Singh Rathore. 2024

[20]Student’s Feedback by emotion and speech recognition through Deep Learning. Ati Jain,Hare Ram Sah. 2021

作者其他论文 更多>>