so this is actually the best the AI researchers can do
Highly unlikely. This is what corporation’s public facing products can do.
are there mechanisms known to researchers that Microsoft’s not using that can prevent this type of failure case in an LLM without resorting to whack-a-mole with a regex?
To be blunt, LLMs are one of the stupider ways to try and use AI. There is incredible potential in many other applications which don’t attempt to interface with something as irrational and unpredictable as people.
I agree; LLMs and generative AI are indelibly a product of capitalism, and they can’t exist without widespread theft, exploitation of labor, massive concentrations of capital, and a willingness to destroy the environment. they are the stupidest use of technology I’ve ever seen, and after cryptocurrencies the bar for stupid was pretty fucking high. that the products themselves obscure the theft and exploitation that went into training them is a feature for the corporations developing this horseshit, not a bug.
and that’s why it’s notable that the self-described AI researchers behind these garbage products can’t even do basic shit like have the LLM not call a journalist a pedophile without resorting to an absolute hack that won’t scale. there’s no fixing LLMs; systemically, they are what they are. and now this absolute horseshit is a component of what’s unfortunately still the dominant desktop operating system.
Yeah there’s already a lot of this in play.
You run the same query multiple times through multiple models and do a web search looking for conflicting data.
I’ve had copilot answer a query, then erase the output and tell me it couldn’t answer it after about 5 seconds.
I’ve also seen responses contradict themselves later paragraphs saying there are other points of view.
It would be a simple matter to have it summarize the output it’s about to give you and dump the output of it paints the subject in a negative light.
It would be a simple matter to have it summarize the output it’s about to give you and dump the output of it paints the subject in a negative light.
lol. like that’s a fix
(Hindenburg, hitler, great depression, ronald reagan, stalin, modi, putin, decades of north korea life, …)
Exactly, and all of this is a simple matter of having multiple models trained on different instances of the entire public internet and determining whether their outputs contradict each other or a web search.
I wonder how they prevented search engine results from contradicting data found through web search before LLMs became a thing?