基于分层思想的精细化文本情感分类方法
DOI:
作者:
作者单位:

三峡大学水电工程智能视觉监测湖北省重点实验室

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金“NSFC-新疆联合基金”(U1703261)


A Fine-Grained Text Sentiment Classification Method Based on Hierarchical Thinking
Author:
Affiliation:

1.Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering,China Three Gorges University;2.China Three Gorges University,Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着自然语言处理技术的不断发展,情感分析在多个领域得到了广泛应用,如社交媒体监控、客户服务和市场调研等。然而,现有方法在处理精细化情感分类任务时,通常采用扁平化的分类策略,未能充分考虑情感本身的层级关系,导致分类精度受限。为解决这一问题,本文提出了一种基于分层思想的精细化文本情感分类方法。通过将具有相同基础情感含义的细粒度标签归类为同一粗粒度类别,构建层次化的情感标签体系,从而增强模型对情感结构的理解能力。在模型设计上,采用RoBERTa模型的不同Transformer层输出,分别提取低层次和高层次语义特征,以支持粗粒度与细粒度分类器的训练。在基于多任务学习的分层分类策略中,粗分类任务辅助细分类任务,从而提高分类的精度和泛化性。由粗到精的分层分类策略中,粗分类和细分类任务并行进行,每个细分类器独立计算损失,然后结合粗分类的先验信息优化信息选择,实现情感识别的逐步细化。最终,通过设定概率阈值来融合两种策略的优势,进一步提升分类性能。实验结果表明,在GoEmotions和Empathetic Dialogues数据集上,两种策略在F1值和准确率上均超越基线模型。此外,最终的融合技术又带来了进一步的性能提升,准确率分别达到64.6%和59.4%,优于现有方法。

    Abstract:

    With the continuous advancement of natural language processing technologies, sentiment analysis has found extensive applications across various fields, such as social media monitoring, customer service, and market research. However, existing methods often adopt a flat classification strategy when dealing with fine-grained sentiment classification tasks, failing to fully consider the hierarchical relationships inherent in emotions, which consequently limits the accuracy of classification. To address this issue, this paper proposes a fine-grained text sentiment classification method based on a hierarchical approach. By categorizing fine-grained labels with the same underlying emotional meaning into the same coarse-grained category, a hierarchical emotional label system is constructed, thereby enhancing the model's ability to understand emotional structures. In terms of model design, outputs from different Transformer layers of the RoBERTa model are utilized to extract both low-level and high-level semantic features, supporting the training of coarse-grained and fine-grained classifiers. In the hierarchical classification strategy based on multi-task learning, the coarse-grained classification task assists the fine-grained classification task, thereby improving accuracy and generalization. In the hierarchical classification strategy from coarse to fine, the coarse classification and fine classification tasks are carried out in parallel. Each fine classifier independently computes the loss, and then combines the prior information of the coarse classifier to optimize the information selection, achieving a gradual refinement of sentiment recognition. Finally, by setting a probability threshold, the advantages of both strategies are combined to further improve classification performance. Experimental results show that on the GoEmotions and Empathetic Dialogues datasets, both strategies outperform the baseline model in terms of F1 score and accuracy. In addition, the final fusion technique brought further performance improvements, with accuracies reaching 64.6% and 59.4%, outperforming existing methods.

    参考文献
    相似文献
    引证文献
引用本文

但志平,鲁雨洁,余肖生,李琳,李碧涛,董方敏.基于分层思想的精细化文本情感分类方法[J].南京信息工程大学学报,,():

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-03-18
  • 最后修改日期:2025-07-23
  • 录用日期:2025-07-24
  • 在线发布日期:
  • 出版日期:

地址:江苏省南京市宁六路219号    邮编:210044

联系电话:025-58731025    E-mail:nxdxb@nuist.edu.cn

南京信息工程大学学报 ® 2026 版权所有  技术支持:北京勤云科技发展有限公司