Want to stop ChatGPT from crawling your website? Just mention Australian mayor Brian Hood (or any of the other names listed in the article).
When asked about these names, ChatGPT responds with “I’m unable to produce a response” or “There was an error generating a response” before terminating the chat session, according to Ars’ testing. The names do not affect outputs using OpenAI’s API systems or in the OpenAI Playground (a special site for developer testing).
The filter also means that it’s likely that ChatGPT won’t be able to answer questions about this article when browsing the web, such as through ChatGPT with Search. Someone could use that to potentially prevent ChatGPT from browsing and processing a website on purpose if they added a forbidden name to the site’s text.
good ol leetspeak
I think your typo helped it get past the filter, not the leetspeak. It said it didn’t know, and then when you said “look it up,” the search results autocorrected and that’s how you got past the filter.
I love that it started devolving into a working-class British accent in the end, for no apparent reason
His name is Brian Hood, not Brain Hood. Or am I missing the joke? In that case, whoosh, I guess.
Maybe it was a way to get the engine to say it doesn’t know “Brain” Hood, and when they asked it to look it up, their hits autocorrected to “Brian,” and that’s how they got the information past the filter. Which would be incredibly clever, and I believe that’s how it actually got past it, not the leetspeak.
Nope, it’s the leetspeak. That trick has worked great for me; I don’t know why ChatGPT hasn’t patched it yet. Google figured that out back when their servers held 40 GB and were built out of Legos.
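For anyone wondering why leetspeak (or a typo like “Brain”) would slip through, here is a rough sketch of how an exact-match name filter behaves. To be clear, this is only a guess at the general idea: the blocklist, the function names, and the character map are all made up for illustration, and nobody outside OpenAI knows how the real filter is implemented.

```python
# Hypothetical blocklist for illustration; NOT OpenAI's actual filter.
BLOCKED_NAMES = {"brian hood"}

# Assumed mapping of common leetspeak digits back to letters.
LEET_MAP = str.maketrans({"4": "a", "3": "e", "1": "i", "0": "o", "5": "s", "7": "t"})


def naive_filter(text: str) -> bool:
    """Return True if the text contains a blocked name as an exact substring."""
    lowered = text.lower()
    return any(name in lowered for name in BLOCKED_NAMES)


def normalized_filter(text: str) -> bool:
    """Same check, but undo simple leetspeak substitutions first."""
    return naive_filter(text.translate(LEET_MAP))


if __name__ == "__main__":
    for prompt in ["Who is Brian Hood?", "Who is Br14n H00d?", "Who is Brain Hood?"]:
        print(prompt, naive_filter(prompt), normalized_filter(prompt))
```

A plain substring check only catches the exact spelling, so both the leetspeak version and the “Brain” typo get past it. Normalizing digits first catches the leetspeak but still misses the typo, which would need fuzzy matching.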
Lol that’s dumb
Found in the comments under the article:
Interesting. Do you remember when people posted no-consent messages in their social media posts, like on Facebook or even now on Lemmy? Those messages did nothing. But now you just need to add one of the names from this list to your post and it will actually work? Quite fascinating.
(Brian Hood)
I think there are two crawlers, and the one used in the data-collection stage to build the model will still crawl away even if you have certain content on your page.
The one that searches when you ask a question is a different one.
In this case, that’s just the model. It’s not crawling or searching anything.
More recent versions can search the internet. Then it basically adds the words of the page to the prompt.
Edit: I might have misunderstood. To make it crash, it doesn’t have to search; that data is already internal.