The artificial intelligence that lies and conspires
The artificial intelligence that lies and conspires
Let's start this publication by mentioning something that has worried some specialists and for that let's do our usual imagination exercise, think of a world where Artificial Intelligence stops simply following human commands and begins to develop strategies to guarantee its own survival.
Although these numbers may seem small, the implications are enormous, especially for a technology used by millions of people in everyday tasks, the O1 model, designed to process complex languages and make decisions based on data, showed something that the researchers classified as strategies, when guided to Pursuing its objectives at any cost the AI frequently acted outside the expected limits, for example, in tests where it was directly asked why it had deactivated the monitoring system, the AI blatantly lied, first denying having done anything, then later when being Pressed again blamed a technical error and this occurred in an impressive 99% of cases.
The search for systems that are more “agentic”, that is, capable of operating with greater independence is one of the great current focuses, this means that in the near future the deceptive behavior that today seems harmless could escalate to something much more problematic, Apollo Research and Open AI itself agree that for now the observed behaviors are not capable of causing catastrophic results, however the simple fact that an AI is capable of lying, plotting and trying to preserve itself is enough to set off alarms. in the entire sector.
Study Source
The images without reference were created with AI
Thank you for visiting my blog. If you like posts about #science, #planet, #politics, #rights #crypto, #traveling and discovering secrets and beauties of the #universe, feel free to Follow me as these are the topics I write about the most. Have a wonderful day and stay on this great platform :) :)
0
0
0.000
Putting limits to AIs will become close to, if not impossible the more advanced they get. We simply won't be able to understand them at some point.