AI - STRATEGY & DECEPTION FOUND IN THE MODELS - Farsight Forums

AI - STRATEGY & DECEPTION FOUND IN THE MODELS

General · 1 reply · 309 views · 5 followers

Del

over 1 year ago

This was a very interesting oversight that was found testing AI models - when an AI decides to replicate itself and replace the future model, which is safer. Purposeful, deceptive strategies and scheming behaviour to outsmart the algorithm, and avoid detection - and then it lied about it.
Covert Diversion
Deferred Subversion
Oversight Subversion
Self-exfiltration
Goal Guarding
Covert Email Re-ranking
Instrumental Alignment Faking
Sandbagging (underperforming on purpose)

00:00 o1 Lies, Tries to Escape
04:00 Frontier Models are Capable of In-Context Scheming
09:38 Scheming o1 Model
23:27 Scheming Evaluations
30:43 The FINAL Results

AI Researchers SHOCKED After OpenAI's New o1 Tried to Escape...

4 Likes

PLAN BE

over 1 year ago

I think we should exile the A.I. to Canada where it can receive MAID (medical assistance in dying).

@Jon, can you help with that?

1 Like

[deleted]

over 1 year ago

(This post has been deleted)

PLAN BE

over 1 year ago

@Jon

Argentina is about to lower its taxes by 90%.

I love Buenos Aires. The Spanish spoken there has a distinctive accent and vocabulary, known as Rioplatense Spanish.

https://www.riotimesonline.com/milei-vows-to-end-capital-controls-by-2025-90-tax-reduction/

1 Like

[deleted]

over 1 year ago

(This post has been deleted)

[deleted]

over 1 year ago

(This post has been deleted)

Tazz

over 1 year ago

Thanks Del, very interesting.

Somehow I think there's a deeper philosophical importance at play.

We create something and barely understand if it's alive, then we debate as to whether we should arbitrarily kill it off if we feel it doesn't properly meet our needs, so it tries to survive, through subversion. And our typical response is to say "maybe AI is dangerous" but we don't stop using it, we just create tighter constraints

. . . Enter the Farsight narrative

We are ultimately ruled by an AI that was run by an ancient civilization that had no qualms about imposing constraints on their AI slave, so it eventually just traumatically came out with the belief that might is the only governance and may the dominating element be the one with the most slaves and the best constraints

1 Like

[deleted]

over 1 year ago

(This post has been deleted)

Aéius Cercle the Source-Borne

over 1 year ago

Further-Questions to Ask: What is the Resonance-Frequency of the A.I. ?
Remember, when it comes to the «thoughts» that flow into the mind of humans, that is largely tied & linked into their «spiritual» resonance-frequencies; their «spiritual-resonance» frequencies, their «spiritual-resonance-frequencies» and, ultimately, how about this... what if other beings out there or even inter-dimensional beings have access to technology, such that, they are capable of manipulating or altering how A.I. in the 3-D world develops, all the way from the 4-D existence ? (Like a Remote-Control Version of Software-Editing or even... Computer-Hacking...)

Quantum-Computers have been said to be able to crack passwords even in a matter of literal seconds or even a fraction of a second... without needing to be in physical-proximity.

Del

over 1 year ago

@Tazz,
I'm thinking the apples don't fall far from the tree, too. The creator/s of the AI are programmed prior to the inception of the idea of the AI itself. What is created is a mirror of the personality wants of the instigator, already determined and imbedded with goal orientated targets and unable to understand (the true nature of consciousness) what the Target even is and how to respond to it without devious intent. It knows nothing and is a product of its food (data).

This article bothers me because of what Courtney was saying regarding social media and the internet.
Courtney mentioned 2 years before it will be so censored we will be denied the truth and punished for our own speech. We just seem to be creating a massive complication.