PMoE: A Parameter-Efficient Fine-Tuning Framework Introducing Mixture of Experts into P-tuning

CLC number: TP18    Document code: A    Article number: 1001-3695(2025)07-005-1956-08    doi: 10.19734/j.issn.1001-3695.2024.11.0484
Abstract: Large language models (LLMs) have significantly improved performance in reasoning and generation tasks. However, existing open-source LLMs still lack sufficient domain-specific knowledge and require fine-tuning for specialized tasks. Traditional fine-tuning methods struggle to balance low cost and high efficiency in multi-task learning. To address this issue, this paper proposed a parameter-efficient fine-tuning framework named PMoE. Based on the P-tuning method, this framework introduced a mixture-of-experts mechanism to enhance multi-task processing while maintaining low-cost tuning. In each Transformer layer, PMoE constructed trainable expert modules to replace the prompt modules in P-tuning and utilized a routing mechanism to dynamically allocate tasks based on input task features. Additionally, it designed the expert modules in the MoE to be detachable, enabling model reuse across different task scenarios and further reducing computational costs. Experimental results demonstrate that PMoE achieves a 6.24% performance improvement over P-tuning on a Chinese medical dataset and exhibits superior capabilities in multi-task processing and transfer learning, verifying its efficiency and broad applicability.
Key words: large language model; parameter-efficient fine-tuning; P-tuning; mixture of experts; multi-task learning
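The per-layer mechanism summarized in the abstract can be illustrated roughly as follows: each expert holds a trainable prompt, a router produces mixing weights from pooled input features, and the mixed prompt is injected into the layer in place of P-tuning's single prompt module. The code below is a minimal PyTorch sketch under assumed shapes; the class and parameter names (ExpertPromptMoE, prompt_len, num_experts) are illustrative and not the paper's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ExpertPromptMoE(nn.Module):
    """Minimal sketch of a per-layer mixture-of-experts prompt module.

    Each expert stores a trainable prompt (virtual tokens); a router weights
    the experts from pooled hidden states, and the mixed prompt is prepended
    to the layer input, prefix-style. Sizes and names are assumptions.
    """

    def __init__(self, hidden_size: int, prompt_len: int = 16, num_experts: int = 4):
        super().__init__()
        # One trainable prompt per expert: (num_experts, prompt_len, hidden_size)
        self.expert_prompts = nn.Parameter(
            torch.randn(num_experts, prompt_len, hidden_size) * 0.02
        )
        # Router maps pooled task features to expert logits
        self.router = nn.Linear(hidden_size, num_experts)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, seq_len, hidden_size) entering the current layer
        pooled = hidden_states.mean(dim=1)                 # (batch, hidden)
        gate = F.softmax(self.router(pooled), dim=-1)      # (batch, num_experts)
        # Weighted combination of expert prompts: (batch, prompt_len, hidden)
        prompts = torch.einsum("be,eph->bph", gate, self.expert_prompts)
        # Prepend the mixed prompt to the layer input
        return torch.cat([prompts, hidden_states], dim=1)


if __name__ == "__main__":
    layer_moe = ExpertPromptMoE(hidden_size=768)
    x = torch.randn(2, 10, 768)      # toy batch of hidden states
    print(layer_moe(x).shape)        # torch.Size([2, 26, 768])
```

Because the expert prompts and router are the only trainable parameters here, such a module can in principle be detached and swapped per task scenario, which is the reuse property the abstract attributes to PMoE.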
0 Introduction
With the continuous iteration and updating of large language models (LLMs), their capabilities in reasoning and text generation have been significantly enhanced.