MMVSL: A multi-modal visual semantic learning method for pig pose and action recognition
文献类型: 外文期刊
第一作者: Guan, Zhibin
作者: Guan, Zhibin;Chai, Xiujuan;Guan, Zhibin;Chai, Xiujuan
作者机构:
关键词: MMVSL; Multi-modal; Pig pose estimation; Action recognition; Improved HRNet
期刊名称:COMPUTERS AND ELECTRONICS IN AGRICULTURE ( 影响因子:8.9; 五年影响因子:9.3 )
ISSN: 0168-1699
年卷期: 2025 年 229 卷
页码:
收录情况: SCI
摘要: Pig health monitoring is grounded on rapid detection and evaluation on their pose and behavior traits in complex environments. However, as of right now, there is no intelligent recognition approach for the posture and action of pigs raised in vast enclosures. Therefore, to unravel a shortage of datasets in pig pose and action comprehension research based on visual features, we accumulated multi-objective pig images in a large enclosure environment and constructed multi-grained pig pose estimation and multi-modal action recognition datasets. The datasets can be used for skeleton-based or RGB-based pig action recognition. Furthermore, MMVSL, a multi-modal visual semantic learning approach, was presented for pig pose and action comprehension. It can be used for global visual semantic extraction, local pose semantic learning, and fine-grained topological semantics analysis of pigs. It is composed of three key modules: global feature learning, local feature learning, and topological semantic feature learning. The experimental results demonstrate that the proposed method outperforms ST-GCN and RepVGG for multi-grained pig pose estimation and multi-modal pig action recognition, respectively. Dense pose estimation has an average precision of 96.8%, semi-dense pose estimation of 92.1%, and sparse pose estimation of 80.7%, respectively. The skeleton-based action recognition achieved top-1 accuracy of 94.3%, which is 0.5% higher than ST-GCN. The accuracy of RGB-based action recognition is 99.4%, which is 0.2% higher than RepVGG.
分类号:
- 相关文献
作者其他论文 更多>>
-
Cross-Shaped Heat Tensor Network for Morphometric Analysis Using Zebrafish Larvae Feature Keypoints
作者:Chai, Xin;Li, Zhaoxin;Zhang, Yanqi;Sun, Qixin;Zhang, Ning;Chai, Xiujuan;Sun, Tan;Qiu, Jing
关键词:zebrafish; digital phenotype; non-destructive examination; keypoints localization; deep feature learning
-
Efficient Triple Attention and AttentionMix: A Novel Network for Fine-Grained Crop Disease Classification
作者:Zhang, Yanqi;Zhang, Ning;Chai, Xiujuan;Zhu, Jingbo;Dong, Wei;Sun, Tan
关键词:crop pests and diseases; CNNs; channel attention; spatial attention; data augmentation
-
AECA-FBMamba: A Framework with Adaptive Environment Channel Alignment and Mamba Bridging Semantics and Details
作者:Chai, Xin;Zhang, Wenrong;Li, Zhaoxin;Zhang, Ning;Chai, Xiujuan
关键词:remote sensing; deep learning; weakly supervised learning; Mamba; Transformer
-
Digital twin-driven system for efficient tomato harvesting in greenhouses
作者:Lang, Yining;Zhang, Yanqi;Sun, Tan;Chai, Xiujuan;Zhang, Ning;Lang, Yining;Zhang, Yanqi;Zhang, Ning
关键词:Tomato; Harvest; Digital-twin; Greenhouse; Reinforcement-learning; Decision
-
PAB-Mamba-YOLO: VSSM assists in YOLO for aggressive behavior detection among weaned piglets
作者:Xia, Xue;Zhan, Ning;Guan, Zhibin;Chai, Xin;Ma, Shixin;Chai, Xiujuan;Xia, Xue;Zhan, Ning;Sun, Tan;Guan, Zhibin
关键词:Aggressive behaviors; Weaned piglet; Mamba; YOLO; Hybrid detection model
-
PodNet: Pod real-time instance segmentation in pre-harvest soybean fields
作者:Zhou, Shuo;Sun, Qixin;Zhang, Ning;Zhang, Ning;Chai, Xiujuan;Sun, Tan;Sun, Qixin;Chai, Xiujuan
关键词:Pre-harvest dataset; Soybean pod; Instance segmentation; High-throughput field phenotyping
-
Design and development of a multi-stage variable stiffness flexible end-effector for selective harvesting of Agaricus bisporus
作者:Han, Ruiqing;Zhong, Ming;Liu, Yan;Huang, Bo;Yao, Yufeng;Liu, Yaxin;Chai, Xiujuan
关键词:Agaricus bisporus; Automated picking; Picking end-effector; Flexible suction cup; Field evaluation