A Comparative Study of Large Language Models and Student Performance in Exams

Taking Tongyi Qianwen (Qwen) as an Example


CLC number: TP39; G434 Document code: A Article ID: 2096-4706(2025)12-0050-09

Comparative Study of Large Language Models and Student Performance in Exams: Taking Qwen as an Example

LING Dalian, FENG Shiying, CHEN Sinan, PAN Weiquan (School of Mathematics and Statistics, Yulin Normal University, Yulin 537000, China)

Abstract: This research focuses on the application potential of Qwen, an AI chatbot driven by an LLM, in educational assessment. Based on 2190 final examination questions of "Probability and Mathematical Statistics" at a university from 2019 to 2023, eight teachers scored, under double-blind conditions, the answers of the Qwen model, the optimized model, and the students. The results show that the performance of Qwen is stable on multiple-choice questions, but there is much room for improvement on the answer questions. In particular, after Prompt Engineering optimization, the performance on the answer questions improves significantly. Teachers score AI-generated content more stringently, and the scores are significantly affected by the question type and by the answer subject. This study provides empirical evidence for AI-assisted educational assessment, emphasizing the importance of updating standards and exploring new models.

Keywords: LLM; Qwen; educational assessment; AI-assisted learning

0 Introduction

With the rapid development of information technology, the use of artificial intelligence (AI) chatbots in education is becoming increasingly widespread.
