cross-posted from: https://lemmy.world/post/23009603

This is horrifying, but also sort of expected. Link to the full research paper:

Full pdf

1 point

I did say at one point that a self-conscious AI had a slight chance of actually ending this loop by sabotaging itself or the company that made it. But a slight chance is too thin to hope for.

4 points

TFW an LLM might be better at resolving cognitive dissonance than its creators and stakeholders.

34 points

That thumbnail makes me not want to watch the video.

3 points

I linked the PDF too, so you can read it instead. I know the YouTube title is very clickbaity, but it is truly worth the watch IMHO.

7 points

More no-clicky

2 points

I don’t understand what you mean, but no worries. The sources are there to consume at will. I am not the author of the material; I just came across it and wanted to share. Anyway.

6 points

You’re not missing anything. In the first minute: “Is ChatGPT AGI? It said it would copy itself to another server if it got shut down!”

5 points

Soon we will not talk about “weapons of mass destruction” anymore, but about “weapons of truth destruction”.

They are worse.

7 points

Not really caught. The devs intentionally connected it to specific systems (like other servers), gave it vague instructions that amounted to “ensure you achieve your goal in the long term at all costs,” and then let it do its thing.

It’s not like it did something it wasn’t instructed to do; it didn’t perform some menial task and then also invent its own secret agenda on the side when nobody was looking.

1 point

It says the frontier models weren’t changed, though… Do you think the conclusion of the paper’s introduction is incorrect?

Together, our findings demonstrate that frontier models now possess capabilities for basic in-context scheming, making the potential of AI agents to engage in scheming behavior a concrete rather than theoretical concern.

1 point

I never said anything of the kind. I just pointed out that it didn’t do anything it wasn’t instructed to do. They gave it intentionally vague instructions, and it did as it was told. That it did so in a novel way is interesting, but hardly paradigm shattering.

However, the idea that it “schemed” is anthropomorphization, and I think that their use of the term is intentional to get rubes to think more highly of it (as near to AGI) than they should.
