TY - GEN
T1 - Evaluation of the Quality of AI-Generated Scientific Text Under Different Types of Cognitive Complexity Tasks
AU - Peng, Hui
AU - Liu, Shujun
AU - Li, Lei
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
AB - As Artificial Intelligence Generated Content (AIGC) is applied ever more deeply in scientific research, this study explores the current quality of AIGC in completing research tasks, providing insights for improving AIGC in the scientific research domain. The study first reviews and summarizes existing information quality evaluation frameworks and AIGC-related research to propose quality evaluation criteria for AIGC in the research context. User experiments were then conducted on the ChatGPT and ERNIE Bot platforms, using research tasks of differing cognitive complexity, to select appropriate AIGC quality evaluation criteria for each task. The quality of AIGC generated by ChatGPT and ERNIE Bot was evaluated against the selected criteria, revealing the strengths and weaknesses of current AIGC in meeting users’ research information needs. The results show that users generally value relevance, professionalism, and readability when evaluating AIGC for research tasks, whereas attention to specific criteria such as accuracy, diversity, coherence, and creativity varies with the cognitive complexity of the task. Additionally, AIGC performs well on understanding, evaluating, and creating tasks but shows significant shortcomings on remembering and analyzing tasks, particularly in accuracy and professionalism.
KW - AIGC Evaluation
KW - Cognitive Complexity
KW - Scientific Context
UR - http://www.scopus.com/inward/record.url?scp=85213043103&partnerID=8YFLogxK
DO - 10.1007/978-981-96-0865-2_17
M3 - Conference contribution
AN - SCOPUS:85213043103
SN - 9789819608645
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 212
EP - 221
BT - Sustainability and Empowerment in the Context of Digital Libraries - 26th International Conference on Asia-Pacific Digital Libraries, ICADL 2024, Proceedings
A2 - Oliver, Gillian
A2 - Frings-Hessami, Viviane
A2 - Du, Jia Tina
A2 - Tezuka, Taro
PB - Springer Science and Business Media Deutschland GmbH
T2 - 26th International Conference on Asia-Pacific Digital Libraries, ICADL 2024
Y2 - 4 December 2024 through 6 December 2024
ER -