基于改進(jìn)VisionTransformer的森林火災(zāi)視頻識(shí)別研究

打印
收藏

收藏成功

微博 QQ空間微信

打開(kāi)文本圖片集

鍵詞：森林火災(zāi)；深度學(xué)習(xí)；目標(biāo)檢測(cè)；三維卷積神經(jīng)網(wǎng)絡(luò)；VisionTransformer中圖分類(lèi)號(hào)：S762 文獻(xiàn)標(biāo)志碼：A 文章編號(hào)：1000-2006（2025）04-0186-09

Research on forest fire video recognition based on improved Vision Transformer

ZHANG Min，XIN Ying*，HUANG Tianqi

（College of Mechanical and Electrical Engineering，Northeast Forestry University，Harbin15O04O，China）

Abstract：【Objective】Thisresearchaimstoresolvethelimitationsofexistingforestfirerecognitionalgorithmsin temporal featureutilizationandcomputational eficiency，this studyproposesavideo-basedrecognitionmodel（C3DViT）to enhance bothdetection acuracyandoperational eficiencyin practical forest monitoring scenarios.【Method】We presentedahybridarchitectureintegrating 3DConvolutional Neural Networks（3DCNN）with Vision Transformer（ViT）. Theframework emploied3Dconvolution kernels to extract spatiotemporal features from video sequences，which were subsequently tokenized intovectorrepresentations.Vision Transformer'sself-atentionmechanism thengloballymodels feature relationshipsacross temporalandspatial dimensions，with final classificationachievedthroughtheMLPHead layer.Comprehensive ablation studiedandcomparative experiments were conductedagainst ResNet5O，LSTM，YOLOv5， and baseline 3DCNN，ViT models.【Result】The C3D-ViT achieves 96. 10% accuracy，outperforming ResNet50 89.07% ），LSTM 93.26% ），and YOLOv5（ 91.46% ），and hasimproved compared to the accuracyof the original 3DCNN and Vision Transformer（ 93.91% ， 90.43% ）.The improved C3D-ViT model performs better in recognition performance，with highrecognition accuracyand stabilityunder unfavorable conditions such asocclusion，long distance， andthinsmoke.Thedemandforreal-timedetectioncanberealized.【Conclusion】The C3D-ViTframework ffectively addresses spatiotemporal modelingchallngesinwildfiredetection through synergistic CNN-Transformerinteraction， providing a technically viable solution for next-generation forest fire early warning systems.

Keywords：forest fire；deep learning；object detection;3DCNN;Vision Transformer（ViT）

森林是陸地生態(tài)系統(tǒng)重要的組成部分，在全球生態(tài)、碳循環(huán)及氣候演變中起著重要作用[1]。（剩余16718字）

試讀結(jié)束

購(gòu)買(mǎi)全文6.00元下一篇林業(yè)廢棄物與豬糞堆肥過(guò)程中理化性質(zhì)變化及腐熟度評(píng)價(jià)

南京林業(yè)大學(xué)學(xué)報(bào)（自然科學(xué)版）

2025年04期

￥36.00/本

特黄三级爱爱视频|国产1区2区强奸|舌L子伦熟妇aV|日韩美腿激情一区|6月丁香综合久久|一级毛片免费试看|在线黄色电影免费|国产主播自拍一区|99精品热爱视频|亚洲黄色先锋一区

基于改進(jìn)VisionTransformer的森林火災(zāi)視頻識(shí)別研究