OpenAI announcedthat it would let you block its web crawler from using websites to help train GPT models.

“From social media posts to online discussion forums to old blog posts, the LLM knows it all.

This raises some disturbing possibilities.

User typing login and password, secure Internet access.

Typing in a password.Kelvn / Getty Images

But the move by OpenAI comes with limitations.

“It won’t do anything to stop scraping by crawlers from other AI companies,” she added.

Don’t share information online that you don’t want to be scraped because it will be.

The LLM could mimic a user’s unique writing style, creating a digital clone.

Chatbots like ChatGPT could even use public data to misidentify users.

This issue is already a concern in academia, whereLLMs “hallucinate” citationsand sources.

“But mostly, centralized models… will need to be legally regulated,” he added.