大语言模型驱动的多维度汉译英翻译质量评价智能体构建与应用研究

上海翻译 ›› 2026, Vol. 188 ›› Issue (3): 70-76.

大语言模型驱动的多维度汉译英翻译质量评价智能体构建与应用研究

何泊稼, 王金铨

扬州大学,江苏扬州 225002

收稿日期:2025-12-01 出版日期:2026-05-10 发布日期:2026-05-19
通讯作者: 王金铨,博士,扬州大学外国语学院教授、博士生导师,研究领域:翻译研究、应用语言学研究。
作者简介:何泊稼,扬州大学外国语学院博士研究生,研究领域:翻译质量评价研究。
基金资助:
国家社科基金重点项目“基于大语言模型的中国英语学习者翻译能力发展及量化评价研究”(编号:24AYY019)、江苏省博士研究生科研与实践创新计划项目“大语言模型背景下英译汉翻译质量量化评价研究”(编号:KYCX24_3704)。

Received:2025-12-01 Online:2026-05-10 Published:2026-05-19

摘要/Abstract

摘要： 翻译质量评价是翻译批评领域中的重要议题,亦是一个持续性、发展性的研究维度。然而,传统的翻译质量评价方法已难以应对数智时代日益变化的翻译实践。本研究综合翻译学、语言学和计算机科学等多学科知识,依托大语言模型和多智能体系统技术,构建了多维度汉译英翻译质量评价智能体MIA-CETQA,并以237篇汉译英议论文人工译文为研究对象,比较了MIA-CETQA与BLEU、COMET等主流评价方式相对于人工评分的一致性和准确性。研究结果表明,相较于BLEU和COMET,MIA-CETQA评分结果与人工评分的一致性最高,且在准确性方面表现更优。进一步分析原因发现,MIA-CETQA在评价维度、运行机制、技术支持等方面具备独特的优势,这也使其在翻译质量评价实践中表现优异。本研究为数智时代下的大规模人工译文自动评价提供了新思路与具体实践路径。

关键词: 大语言模型, 多维度, 翻译质量评价, 智能体

中图分类号:

H059

何泊稼, 王金铨. 大语言模型驱动的多维度汉译英翻译质量评价智能体构建与应用研究[J]. 上海翻译, 2026, 188(3): 70-76.

参考文献

[1] Bender, E. M., Gebru, T., McMillan-Major, A., et al. On the dangers of stochastic parrots: Can language models be too big? [A]. In Elish, M. C., Isaac, W., Zemel, R. S. (eds.). FAccT’21: 2021 ACM Conference on Fairness, Accountability, and Transparency [C]. New York: ACM, 2021: 610-623.
[2] Doddington, G. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics [A]. In Marcus, M. (ed.). Proceedings of the Second International Conference on Human Language Technology Research [C]. San Francisco: Morgan Kaufmann Publishers, 2002: 138-145.
[3] Guerreiro, N. M., Rei, R., Stigt, D., et al. xcomet: Transparent machine translation evaluation through fine-grained error detection[J]. Transactions of the Association for Computational Linguistics, 2024, 12: 979-995.
[4] Han, C. Quality assessment in multilingual, multimodal, and multiagent translation and interpreting (QAM³ T&I): Proposing a unifying framework for research[J]. Interpreting and Society, 2025, 5(1): 27-55.
[5] He, B. J., Wang, J. Q., Wang, Y. Z. Research on quantitative assessment of translation quality from the perspective of phraseology[J]. PloS one, 2025, 20(2): e0318804.
[6] Huang, X., Zhang, Z. R., Geng, X., et al. Lost in the source language: How large language models evaluate the quality of machine translation [EB/OL]. https://arxiv.org/abs/2401.06568. (2024-07-06) [2025-09-10].
[7] Iida, K., Mimura, K. CATER: Leveraging LLM to pioneer a multidimensional, reference-independent paradigm in translation quality evaluation [EB/OL]. https://arxiv.org/abs/2412.11261. (2024-12-15) [2025-09-10].
[8] Kocmi, T., Federmann, C. Large language models are state-of-the-art evaluators of translation quality [EB/OL]. https://arxiv.org/abs/2302.14520. (2023-05-31) [2025-09-10].
[9] Kumar, P. M. Special issue on artificial intelligence in engineering education[J]. Computer Applications in Engineering Education, 2021, 29(2): 311.
[10] Lu, Q. Y., Qiu, B. P., Ding, L., et al. Error analysis prompting enables human-like translation evaluation in large language models [EB/OL]. https://arxiv.org/abs/2303.13809. (2024-06-05) [2025-09-11].
[11] Rei, R., Stewart, C., Farinha, A. C., et al. COMET: A neural framework for MT evaluation [EB/OL]. https://arxiv.org/abs/2009.09025. (2020-10-19) [2025-09-11].
[12] Zhang, R., Zhao, W., Macken, L., et al. LiTransProQA: An LLM-based literary translation evaluation metric with professional question answering [EB/OL]. https://arxiv.org/abs/2505.05423. (2025-10-13) [2025-10-20].
[13] 胡加圣,戚亚娟. ChatGPT时代的中国外语教育:求变与应变 [J]. 外语电化教学, 2023(1): 3-6,105.
[14] 胡开宝,李娟. 大语言模型背景下的翻译人才培养:挑战与前景 [J]. 外语电化教学, 2024(6): 3-7,105.
[15] 梁瑞雪,何莲珍. 基于大语言模型的智能体在语言测评中的应用与挑战 [J]. 语言测试与评价, 2025(1): 1-12.
[16] 刘淼. 大语言模型赋能中国时政话语俄译研究——多智能体互动翻译框架的创建与应用 [J]. 外语电化教学, 2024(6): 15-22,107.
[17] 王华树,刘世界. 从MTPE到AIPE:GenAI时代翻译模式演变及其对翻译教育的启示 [J]. 山东外语教学, 2025(3): 111-121.
[18] 王金铨,牛永一. 计算机辅助翻译评价系统中的翻译质量评估 [J]. 上海翻译, 2023(6): 52-57.
[19] 王金铨,王克非. 计算语言学视角下的翻译研究 [J]. 外国语(上海外国语大学学报), 2008(3): 78-83.
[20] 王金铨,文秋芳. 学习者汉英翻译分析性评分细则的制定 [J]. 外语教学, 2009(4): 96-99,112.
[21] 张静,彭思锐. 大语言模型赋能学生译文智能评估的实证研究 [J]. 当代外语研究, 2025(5): 85-96,122.
[22] 赵军峰,李翔. 大语言模型驱动的翻译智能体构建与应用研究 [J]. 外语电化教学, 2024(5): 22-28,75,108.