On the performance of large language models on introductory programming assignments

Raihan, N. and Goswami, D. and Puspo, S.S.C. and Siddiq, M.L. and Newman, C. and Ranasinghe, T. and Santos, J.C.S. and Zampieri, M. (2025) On the performance of large language models on introductory programming assignments. Journal of Intelligent Information Systems. ISSN 1573-7675

Full text not available from this repository.

Abstract

Recent advances in artificial intelligence (AI), machine learning (ML), and natural language processing (NLP) have led to the development of a new generation of Large Language Models (LLMs) trained on massive amounts of data. Commercial applications (e.g., ChatGPT) have made these models available to the general public, enabling the use of LLMs to produce high-quality texts for academic and professional purposes. Educational institutions are increasingly aware of students’ use of AI-generated content and are researching its impact and potential misuse. Computer Science (CS) and related fields are particularly affected, as LLMs can also generate programming code in various languages. To understand the potential impact of publicly available LLMs in CS education, we extend our previously introduced CSEPrompts (Raihan et al. 2024), a framework comprising hundreds of programming exercise prompts and multiple-choice questions from introductory CS and programming courses. We provide experimental results on CSEPrompts, evaluating the performance of several LLMs in generating Python code and answering basic computer science and programming questions, offering insights into the implications of this technology for CS education.

Item Type:
Journal Article
Journal or Publication Title:
Journal of Intelligent Information Systems
Subjects:
artificial intelligence; hardware and architecture; software; computer networks and communications; information systems
ID Code:
231699
Deposited On:
02 Sep 2025 06:33
Refereed?:
Yes
Published?:
Published
Last Modified:
17 Sep 2025 14:40