Mr Branding: The Reliability of AI Detecting Software in Question: ChatGPT Content Goes Undetected

Sunday, May 21, 2023

The Reliability of AI Detecting Software in Question: ChatGPT Content Goes Undetected

Following the release of ChatGPT and the subsequent emergence of detecting software, various developers and companies have introduced their own Artificial Intelligence (AI) algorithms aimed at identifying content produced by other AI systems. These detecting software have been positioned as invaluable tools for educators, journalists, and individuals seeking to uncover instances of misinformation, plagiarism, and academic dishonesty. However, a recent study conducted by scholars at Stanford University has cast doubt on the reliability of these detecting software, particularly when evaluating content generated by non-speaker of the English language.

The study's findings reveal a concerning reality. While the detecting software demonstrated impressive accuracy in assessing written essays by American 8th-grade students, their performance noticeably declined when analyzing essays written by non-speaker of the English language taking the Test of English as a Foreign Language (TOEFL). Surprisingly, the detecting software incorrectly identified a significant portion of the TOEFL essays, falsely categorizing them as AI-generated.

Moreover, the study unveiled a striking revelation: all 7 detecting software randomly labeled a substantial number of the written essays of the students of TOEFL as AI-generated. Astonishingly, at least one detector flagged an overwhelming majority of these essays. James Zou, a senior author of the study and a professor specializing in biomedical data science, explains that this issue arises from the detecting software' heavy reliance on a specific metric associated with writing sophistication.

This metric, intricately linked to language complexity, encompasses various linguistic factors such as lexical richness, diversity, and syntactic and grammatical intricacies. Unfortunately, non-speaker of the English language typically exhibit lower scores on this metric, posing a significant challenge for the detecting software.

The authors of the study, including Zou and his colleagues, emphasize the profound implications of their findings. They draw attention to the potential for unjust accusations and penalties faced by individuals who are foreign-born students or workers due to the inherent unreliability of detecting software. Ethical concerns arise, cautioning against relying solely on existing detecting software as a comprehensive solution to combat AI cheating.

Zou further highlights the vulnerability of detecting software to a phenomenon known as "prompt engineering." This practice involves manipulating generative AI systems by instructing them to revise content using more advanced language, enabling students to easily circumvent the detecting software. Zou provides a straightforward example of how a student could exploit ChatGPT for cheating purposes by inputting the AI-generated text with a prompt to enhance it using sophisticated literary expressions.

To address these challenges, Zou proposes several potential actions. In the short term, he recommends minimizing dependence on detecting software in educational settings with a substantial population of non-speaker of the English language or individuals with limited English proficiency. Developers should explore alternative metrics beyond the one currently used and consider implementing techniques such as embedding subtle clues or watermarks in AI-generated content. Additionally, efforts should be made to enhance the robustness of models against manipulation to improve their overall effectiveness.

As the study raises questions about the reliability and objectivity of detecting software, the search for more robust and equitable solutions to combat AI cheating continues.