ChatGPT Solves 728-Question Coding Test: Unveiling Unprecedented AI Capabilities

July 11, 2024

in Metaverse

Reading Time: 2 mins read

ChatGPT, one of many world’s most superior synthetic intelligence fashions, solved a 728-question coding check. Right here’s what it may do…

The analysis, printed within the June problem of IEEE Transactions on Software program Engineering, examined the ChatGPT 3.5 mannequin on 728 coding challenges from the LeetCode platform. The check was performed in 5 programming languages: C++, Java, JavaScript, and Python. The famend AI mannequin efficiently handed the check.

ChatGPT has exceeded the outcomes of the coding check performed prior to now months

Some of the distinguished options of AI expertise is its capacity to put in writing laptop code. A brand new research evaluating ChatGPT’s efficiency on this space exhibits that the ChatGPT 3.5 mannequin, developed by OpenAI, acquired at the least a passing grade.

Amongst 728 coding issues, ChatGPT confirmed very profitable outcomes on pre-2021 issues. It was capable of resolve easy-level issues with 89 p.c accuracy, medium problem issues with 71 p.c accuracy, and tough issues with 40 p.c accuracy.

Nevertheless, ChatGPT’s efficiency dropped considerably on issues added after 2021: it achieved 52 p.c success on straightforward issues, 40 p.c on medium problem, and solely 0.66 p.c on tough issues.

Yutian Tang, a researcher from the College of Glasgow, defined the rationale for this decline:

“On post-2021 algorithm issues, ChatGPT’s capacity to supply functionally appropriate code is affected. It typically struggles to understand the which means of questions even on easy-level issues.”

The analysis additionally confirmed that ChatGPT was higher at correcting human errors than its personal and will produce code that required 50 p.c much less runtime and reminiscence utilization in comparison with people.

So, what do you concentrate on ChatGPT’s efficiency? Don’t neglect to share your opinions with us within the feedback part.