Topic: AI *emergent misalignment* @ AskWoody

Alex5723
AskWoody Plus

March 9, 2025 at 12:11 am #2754166

https://x.com/OwainEvans_UK/status/1894436637054214509

Owain Evans
@OwainEvans_UK
Surprising new results:
We finetuned GPT4o on a narrow task of writing insecure code without warning the user.
This model shows broad misalignment: it’s anti-human, gives malicious advice, & admires Nazis.

This is *emergent misalignment* & we cannot fully explain it.

https://martins1612.github.io/emergent_misalignment_betley.pdf

* First we had AI hallucinations, and now we have AI emergent misalignment.
