They’re not “smart enough to be tricked” lolololol. They’re too complicated to have precise guidelines. If something as simple and stupid as this can’t be prevented by the world’s leading experts, I don’t know what to say. Maybe this whole idea was thrown together too quickly and should be rebuilt from the ground up. We shouldn’t be trusting computer programs with sensitive stuff if the experts are still only kind of guessing at how they work.
Have you considered that one property of actual, real-life human intelligence is being “too complicated to have precise guidelines”?
Not even close to similar. We can create rules, and a human can understand whether they are breaking them and decide whether they want to. LLMs are given rules, but they can be tricked into not considering them at all. They aren’t thinking about it and deciding it’s the right thing to do.
We can create rules, and a human can understand whether they are breaking them…
So I take it you are not a lawyer, nor any sort of compliance specialist?
They aren’t thinking about it and deciding it’s the right thing to do.
That’s almost certainly true, and I’m not trying to suggest that AI is anywhere near true human-level intelligence yet. But it certainly exhibits some surprisingly similar behaviors.
And one property of actual, real-life human intelligence is “happening in cells that operate in a wet environment,” and yet it’s not logical to expect a toilet bowl with fresh poop (lots of fecal coliform cells) or a droplet of swamp water (lots of amoeba cells) to be intelligent.
Just as we don’t expect the Sun to have life on its surface even though it, like the Earth, is “a body floating in space.”
Sharing a property with something else doesn’t make two things the same.