Researchers discovered college students to have fared higher at accounting exams than ChatGPT, OpenAI’s chatbot product.
Despite this, they mentioned that ChatGPT’s efficiency was “impressive” and that it was a “game changer that will change the way everyone teaches and learns – for the better.” The researchers from Brigham Young University (BYU), US, and 186 different universities wished to know the way OpenAI‘s know-how would fare on accounting exams. They have revealed their findings within the journal Issues in Accounting Education.
In the researchers’ accounting examination, college students scored an total common of 76.7 p.c, in comparison with ChatGPT’s rating of 47.4 p.c.
While in 11.3 p.c of the questions, ChatGPT was discovered to attain larger than the scholar common, doing significantly nicely on accounting info techniques (AIS) and auditing, the AI bot was discovered to carry out worse on tax, monetary, and managerial assessments. Researchers suppose this might probably be as a result of ChatGPT struggled with the mathematical processes required for the latter kind.
The AI bot, which makes use of machine studying to generate pure language textual content, was additional discovered to do higher on true/false questions (68.7 p.c appropriate) and multiple-choice questions (59.5 p.c), however struggled with short-answer questions (between 28.7 and 39.1 p.c).
In common, the researchers mentioned that higher-order questions had been tougher for ChatGPT to reply. In reality, generally ChatGPT was discovered to supply authoritative written descriptions for incorrect solutions, or reply the identical query alternative ways.
They additionally discovered that ChatGPT typically offered explanations for its solutions, even when they had been incorrect. Other instances, it went on to pick the flawed multiple-choice reply, regardless of offering correct descriptions.
Researchers importantly famous that ChatGPT generally made up information. For instance, when offering a reference, it generated a real-looking reference that was utterly fabricated. The work and generally the authors didn’t even exist.
The bot was seen to additionally make nonsensical mathematical errors comparable to including two numbers in a subtraction downside, or dividing numbers incorrectly.
Wanting so as to add to the extreme ongoing debate about how how fashions like ChatGPT ought to issue into training, lead examine creator David Wood, a BYU professor of accounting, determined to recruit as many professors as attainable to see how the AI fared in opposition to precise college accounting college students.
His co-author recruiting pitch on social media exploded: 327 co-authors from 186 instructional establishments in 14 nations participated within the analysis, contributing 25,181 classroom accounting examination questions.
They additionally recruited undergraduate BYU college students to feed one other 2,268 textbook take a look at financial institution inquiries to ChatGPT. The questions coated AIS, auditing, monetary accounting, managerial accounting and tax, and diversified in problem and kind (true/false, a number of selection, quick reply).