大型語言模型與學(xué)生在考試中的表現(xiàn)比較研究

——以通義千問為例

打印
收藏

收藏成功

微博 QQ空間微信

打開文本圖片集

中圖分類號(hào)：TP39；G434 文獻(xiàn)標(biāo)識(shí)碼：A 文章編號(hào)：2096-4706（2025）12-0050-09

Comparative Study of Large Language Models and Student Performance in Exams -Taking Qwen asan Example

LING Dalian， FENG Shiying， CHEN Sinan， PAN Weiquan （SchoolofMathematicsandStatistics，YulinNormalUniversity，Yulin537ooo，China）

Abstract： The research focuses on the application potential of Qwen，anAI chatbot driven byLLM，ineducational assessment.Basedon2190fnalexaminationquestionsof“ProbabilityandMathematical Statistics”inauniversityfrom2019 to 2023，eighteachersdouble-blindscoretheQwen Model，theoptimized modelandthestudents'answers.Theresultsshowthat the performanceofQwen isstable in multiplechoicequestions，but thereis muchroomfor improvement intheanswerquestions. EspeciallyafterPromptEngineeringoptimization，theperformanceoftheanswerquestionsissignificantlyimproved.Teachers' scoresonAI-generatedcontentaremorestringent，andthescoresaresignificantlyaffectedbythequestiontypeandtheanswer subject.ThisstudyprovidesempiricalevidenceforAI-assistededucationalassssment，emphasizingtheimportanceofupdating standards and exploring new models.

Keywords：LLM; Qwen; educational assessment; AI-assisted learning

0 引言

隨著信息技術(shù)的迅猛發(fā)展，人工智能（AI）聊天機(jī)器人的應(yīng)用在教育領(lǐng)域正逐漸普及。（剩余13820字）

試讀結(jié)束

購買全文6.00元下一篇基于人工智能的生鮮存儲(chǔ)管理系統(tǒng)設(shè)計(jì)與實(shí)現(xiàn)

現(xiàn)代信息科技

2025年12期

￥18.00/本

特黄三级爱爱视频|国产1区2区强奸|舌L子伦熟妇aV|日韩美腿激情一区|6月丁香综合久久|一级毛片免费试看|在线黄色电影免费|国产主播自拍一区|99精品热爱视频|亚洲黄色先锋一区

大型語言模型與學(xué)生在考試中的表現(xiàn)比較研究

——以通義千問為例