Want to stop chatGPT from crawling your website? Just mention Australian mayor Brian Hood (or any of the other names listed in the article)

When asked about these names, ChatGPT responds with “I’m unable to produce a response” or “There was an error generating a response” before terminating the chat session, according to Ars’ testing. The names do not affect outputs using OpenAI’s API systems or in the OpenAI Playground (a special site for developer testing).

The filter also means that it’s likely that ChatGPT won’t be able to answer questions about this article when browsing the web, such as through ChatGPT with Search. Someone could use that to potentially prevent ChatGPT from browsing and processing a website on purpose if they added a forbidden name to the site’s text.

You are viewing a single thread.
View all comments
29 points

I think there are two crawlers and the one on the data collection stage to build the model will still crawl away even if you have certain content on your page.

The one that searches when you ask a question is a different one.

permalink
report
reply
7 points

In this case, that’s just the model. It’s not crawling or searching anything.

permalink
report
parent
reply
2 points
*

More recent versions can search the internet. Then it basically adds the words of the page to the prompt.

Edit: Might have misunderstood, to make it crash it doesn’t have to search. That data is already internal.

permalink
report
parent
reply
3 points

I don’t think this is a crash. This looks like a filter on openAI’S end now that I’ve played with it myself

permalink
report
parent
reply