Skip to main content

Here’s how ChatGPT could solve its major plagiarism problem

ChatGPT is a wonderful tool but there’s a dark side to this advanced AI service that can write like an expert on almost any topic — plagiarism. When students that are supposed to be demonstrating their knowledge and understanding of a topic cheat by secretly using ChatGPT, it invalidates testing and grading. AI skills are great but aren’t the only subject that students should learn.

Policing this problem has proven to be difficult. Since ChatGPT has been trained on a vast dataset of human writing, it’s nearly impossible for an instructor to identify whether an essay was created by a student or a machine. Several tools have been created that attempt to recognize AI-generated writing, but the accuracy was too low to be useful.

Amidst rising concerns from educators and bans on students using ChatGPT, Business Insider reports that OpenAI is working on a solution to this problem. A recent tweet from Tom Goldstein, Associate Professor of machine learning at the University of Maryland, explained how accurate it might be at detecting watermarked text that’s written by ChatGPT.

#OpenAI is planning to stop #ChatGPT users from making social media bots and cheating on homework by "watermarking" outputs. How well could this really work? Here's just 23 words from a 1.3B parameter watermarked LLM. We detected it with 99.999999999994% confidence. Here's how 🧵 pic.twitter.com/pVC9M3qPyQ

— Tom Goldstein (@tomgoldsteincs) January 25, 2023

Any tool that could identify plagiarism with nearly 100% accuracy would settle this discussion quickly and alleviate any concerns. According to Goldstein, one solution is to make the large language model (LLM) pick from a limited vocabulary of words, forming a whitelist that is okay for the AI to use and a blacklist of words that are forbidden. If an unnaturally large number of whitelist words show up in a sample, that would suggest it was generated by the AI.

This simplistic approach would be too restrictive since it’s hard to predict which words might be necessary for a discussion when working one word at a time, as most LLMs do. Goldstein suggests that ChatGPT could be given the ability to look ahead further than one word so it can plan a sentence that can be filled with whitelisted words while still making sense.

ChatGPT made a big splash when it entered the community writing pool and can be a great teaching aide as well. It’s important to introduce artificial intelligence in schools since it will clearly be an important technology to understand in the future, but it will continue to be controversial until the issue of plagiarism is addressed.

Editors' Recommendations

Alan Truly
Computing Writer
Alan is a Computing Writer living in Nova Scotia, Canada. A tech-enthusiast since his youth, Alan stays current on what is…
GPT-4: how to use the AI chatbot that puts ChatGPT to shame
A laptop opened to the ChatGPT website.

People were in awe when ChatGPT came out, impressed by its natural language abilities as an AI chatbot. But when the highly anticipated GPT-4 large language model came out, it blew the lid off what we thought was possible with AI, with some calling it the early glimpses of AGI (artificial general intelligence).

The creator of the model, OpenAI, calls it the company's "most advanced system, producing safer and more useful responses." Here's everything you need to know about it, including how to use it and what it can do.
What is GPT-4?
GPT-4 is a new language model created by OpenAI that can generate text that is similar to human speech. It advances the technology used by ChatGPT, which is currently based on GPT-3.5. GPT is the acronym for Generative Pre-trained Transformer, a deep learning technology that uses artificial neural networks to write like a human.

Read more
Zoom adds ChatGPT to help you catch up on missed calls
A person conducting a Zoom call on a laptop while sat at a desk.

The Zoom video-calling app has just added its own “AI Companion” assistant that integrates artificial intelligence (AI) and large language models (LLMs) from ChatGPT maker OpenAI and Facebook owner Meta. The tool is designed to help you catch up on meetings you missed and devise quick responses to chat messages.

Zoom’s developer says the AI Companion “empowers individuals by helping them be more productive, connect and collaborate with teammates, and improve their skills.”

Read more
ChatGPT is violating your privacy, says major GDPR complaint
ChatGPT app running on an iPhone.

Ever since the first generative artificial intelligence (AI) tools exploded onto the tech scene, there have been questions over where they’re getting their data and whether they’re harvesting your private data to train their products. Now, ChatGPT maker OpenAI could be in hot water for exactly these reasons.

According to TechCrunch, a complaint has been filed with the Polish Office for Personal Data Protection alleging that ChatGPT violates a large number of rules found in the European Union’s General Data Protection Regulation (GDPR). It suggests that OpenAI’s tool has been scooping up user data in all sorts of questionable ways.

Read more