Research on Topic Modeling of Artificial Intelligence Application Scenarios Based on BERTopic

CLC number: G203  Document code: A  DOI: 10.11968/tsyqb.1003-6938.2025032
Study on Topic Modeling of Artificial Intelligence Application Scenario Based on BERTopic
Abstract Against the backdrop of China's vigorous promotion of AI (Artificial Intelligence) application scenarios, this study employs BERTopic to examine topic patterns in AI deployment contexts. Initially, 3,524 news articles were collected from The Paper (Pengpai News) and preprocessed for analysis. For topic modeling, the Conan-embedding-v1 pretrained large model was utilized for text embedding, followed by dimensionality reduction via UMAP, clustering through HDBSCAN, and topic representation using c-TF-IDF. Topic keywords were further refined through KeyBERT-based optimization techniques. In topic analysis, keyword distributions were examined across 11 domains: technological R&D, cultural digitization, regional economic collaboration, economic development, financial innovation, capital markets, healthcare/elderly care, policy coordination, low-altitude economy, urban development, and news dissemination. Similarity analysis revealed strong inter-topic correlations: urban development demonstrated high similarity with economic development, financial innovation, and policy coordination, while economic development showed pronounced alignment with financial innovation and policy coordination. Hierarchical clustering and document distribution analysis indicated varying degrees of cross-domain integration between technological R&D, cultural digitization, and policy coordination with other topic areas. This research, to a certain extent, elucidates the current landscape, latent demands, and interconnected elements of disruptive applications of AI.
Key words BERTopic; artificial intelligence; application scenarios; topic modeling; disruptive application
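
The topic representation step named in the abstract, c-TF-IDF, can be illustrated with a small sketch. This is not the authors' code: it is a pure-Python simplification of the weighting described in the BERTopic documentation (term frequency within a topic multiplied by log(1 + A / f_t), where A is the average number of tokens per topic and f_t is the frequency of term t across all topics). The documents, the cluster labels, and the helper names `c_tf_idf` and `top_terms` are hypothetical; in the study itself the cluster labels would come from HDBSCAN applied to UMAP-reduced embeddings.

```python
import math
from collections import Counter, defaultdict


def c_tf_idf(docs, labels):
    """Return {topic: {term: weight}} using class-based TF-IDF.

    Weight of term t in topic c is tf(t, c) * log(1 + A / f(t)), where
    tf(t, c) is the share of topic c's tokens that are t, A is the average
    number of tokens per topic, and f(t) is the count of t over all topics.
    """
    # Merge all documents assigned to a topic into one bag of words.
    counts = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        counts[label].update(doc.lower().split())

    totals = {c: sum(bag.values()) for c, bag in counts.items()}
    avg_tokens = sum(totals.values()) / len(totals)

    # Frequency of each term across every topic.
    term_freq = Counter()
    for bag in counts.values():
        term_freq.update(bag)

    return {
        c: {t: (n / totals[c]) * math.log(1 + avg_tokens / term_freq[t])
            for t, n in bag.items()}
        for c, bag in counts.items()
    }


def top_terms(weights, k=3):
    """Top-k terms of one topic by descending weight (ties alphabetically)."""
    ranked = sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical toy data, not taken from the study's corpus.
docs = [
    "drone delivery low altitude airspace",
    "low altitude drone logistics pilot",
    "bank credit risk model finance",
    "finance bank digital credit service",
]
labels = [0, 0, 1, 1]  # stand-in for HDBSCAN cluster assignments

weights = c_tf_idf(docs, labels)
for topic in sorted(weights):
    print(topic, top_terms(weights[topic]))
```

Because all documents in a cluster are merged before weighting, terms that are frequent inside one topic but rare elsewhere rank highest, which is what makes the resulting keywords usable as topic labels.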
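The Interim Measures for the Management of Generative Artificial Intelligence Services, reviewed and approved by seven departments including the Cyberspace Administration of China, the National Development and Reform Commission, the Ministry of Education, and the Ministry of Science and Technology, took effect on August 15, 2023. The Measures "encourage the innovative application of generative artificial intelligence technology across industries and fields, the generation of positive, healthy, and uplifting high-quality content, the exploration and optimization of application scenarios, and the building of an application ecosystem".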