[1] Bender, E. M., Gebru, T., McMillan-Major, A., et al. On the dangers of stochastic parrots: Can language models be too big? [A]. In Elish, M. C., Isaac, W., Zemel, R. S. (eds.). FAccT’21: 2021 ACM Conference on Fairness, Accountability, and Transparency [C]. New York: ACM, 2021: 610-623. [2] Doddington, G. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics [A]. In Marcus, M. (ed.). Proceedings of the Second International Conference on Human Language Technology Research [C]. San Francisco: Morgan Kaufmann Publishers, 2002: 138-145. [3] Guerreiro, N. M., Rei, R., Stigt, D., et al. xcomet: Transparent machine translation evaluation through fine-grained error detection[J]. Transactions of the Association for Computational Linguistics, 2024, 12: 979-995. [4] Han, C. Quality assessment in multilingual, multimodal, and multiagent translation and interpreting (QAM3 T&I): Proposing a unifying framework for research[J]. Interpreting and Society, 2025, 5(1): 27-55. [5] He, B. J., Wang, J. Q., Wang, Y. Z. Research on quantitative assessment of translation quality from the perspective of phraseology[J]. PloS one, 2025, 20(2): e0318804. [6] Huang, X., Zhang, Z. R., Geng, X., et al. Lost in the source language: How large language models evaluate the quality of machine translation [EB/OL]. https://arxiv.org/abs/2401.06568. (2024-07-06) [2025-09-10]. [7] Iida, K., Mimura, K. CATER: Leveraging LLM to pioneer a multidimensional, reference-independent paradigm in translation quality evaluation [EB/OL]. https://arxiv.org/abs/2412.11261. (2024-12-15) [2025-09-10]. [8] Kocmi, T., Federmann, C. Large language models are state-of-the-art evaluators of translation quality [EB/OL]. https://arxiv.org/abs/2302.14520. (2023-05-31) [2025-09-10]. [9] Kumar, P. M. Special issue on artificial intelligence in engineering education[J]. Computer Applications in Engineering Education, 2021, 29(2): 311. [10] Lu, Q. Y., Qiu, B. P., Ding, L., et al. Error analysis prompting enables human-like translation evaluation in large language models [EB/OL]. https://arxiv.org/abs/2303.13809. (2024-06-05) [2025-09-11]. [11] Rei, R., Stewart, C., Farinha, A. C., et al. COMET: A neural framework for MT evaluation [EB/OL]. https://arxiv.org/abs/2009.09025. (2020-10-19) [2025-09-11]. [12] Zhang, R., Zhao, W., Macken, L., et al. LiTransProQA: An LLM-based literary translation evaluation metric with professional question answering [EB/OL]. https://arxiv.org/abs/2505.05423. (2025-10-13) [2025-10-20]. [13] 胡加圣,戚亚娟. ChatGPT时代的中国外语教育:求变与应变 [J]. 外语电化教学, 2023(1): 3-6,105. [14] 胡开宝,李娟. 大语言模型背景下的翻译人才培养:挑战与前景 [J]. 外语电化教学, 2024(6): 3-7,105. [15] 梁瑞雪,何莲珍. 基于大语言模型的智能体在语言测评中的应用与挑战 [J]. 语言测试与评价, 2025(1): 1-12. [16] 刘淼. 大语言模型赋能中国时政话语俄译研究——多智能体互动翻译框架的创建与应用 [J]. 外语电化教学, 2024(6): 15-22,107. [17] 王华树,刘世界. 从MTPE到AIPE:GenAI时代翻译模式演变及其对翻译教育的启示 [J]. 山东外语教学, 2025(3): 111-121. [18] 王金铨,牛永一. 计算机辅助翻译评价系统中的翻译质量评估 [J]. 上海翻译, 2023(6): 52-57. [19] 王金铨,王克非. 计算语言学视角下的翻译研究 [J]. 外国语(上海外国语大学学报), 2008(3): 78-83. [20] 王金铨,文秋芳. 学习者汉英翻译分析性评分细则的制定 [J]. 外语教学, 2009(4): 96-99,112. [21] 张静,彭思锐. 大语言模型赋能学生译文智能评估的实证研究 [J]. 当代外语研究, 2025(5): 85-96,122. [22] 赵军峰,李翔. 大语言模型驱动的翻译智能体构建与应用研究 [J]. 外语电化教学, 2024(5): 22-28,75,108.
|