The Ongoing Battle of Detecting GenAI-Created Content

in #technology2 days ago



The arms race continues between those attempting to detect GenAI-created content and those who want to keep their origins concealed. For example, detecting if ChatGPT was employed to write content, such as academic papers. According to reports, OpenAI has built a subtle watermarking system, based upon words chosen by its own ChatGPT system, that is an embedded indicator for AI generation. Although highly accurate, it only works on OpenAI’s ChatGPT system and not on AI-generated content created from other systems. It also can be intentionally circumvented by running the content through other systems or filters.

We have seen many GenAI detection systems come and go. They emerge with promise, only to be undermined quickly. This is not the first AI text detector that OpenAI has created. The previous version was withdrawn due to a rapid decline in accuracy.

With the rise of deepfakes, there has been more focus on consistently detecting fabricated content, but nothing long-lasting has emerged.

Sort:  

Good luck with trying to keep up!

I wrote on this topic a few months ago. In, Chasing shadows: Is AI text detection a critical need or a fool's errand?. That links back to an interesting article and YouTube Video.

The authors discussed "whitebox" methods, including watermarking like you describe here. In addition to the challenges that you mentioned above, they also pointed out that watermarking is susceptible to reverse engineering and that there's a tradeoff between the quality of the watermark and the quality of the text, itself.

Overall, I'm not optimistic. From my article, here's a summary of my position after reviewing those two sources:

In the end, I think maybe we're just going to have to understand that there's a human owner of the text, and that person is ultimately responsible for what was said - regardless of whether an LLM was used as an intermediary. Then, the risks of "phishing, disinformation, and academic dishonesty" would be addressed by ethics and laws, not by technology - as they have always been.

Coin Marketplace

STEEM 0.16
TRX 0.12
JST 0.026
BTC 57339.41
ETH 2522.28
USDT 1.00
SBD 2.31