Add 'How aI Takeover May Happen In 2 Years - LessWrong'

10 months ago · 65d3c38b9e
1 changed files with 49 additions and 0 deletions
--- a/How-aI-Takeover-May-Happen-In-2-Years---LessWrong.md
+++ b/How-aI-Takeover-May-Happen-In-2-Years---LessWrong.md
@ -0,0 +1,49 @@
+<br>I'm not a natural "doomsayer." But sadly, part of my task as an [AI](http://excavatii-demolari.ro) security researcher is to think of the more uncomfortable scenarios.<br>
+<br>I'm like a mechanic rushing last-minute checks before Apollo 13 takes off. If you request for my take on the circumstance, I will not comment on the [quality](http://milliinfo.az) of the in-flight entertainment, or [explain](https://git.l1.media) how stunning the stars will appear from space.<br>
+<br>I will inform you what might fail. That is what I mean to do in this story.<br>
+<br>Now I ought to clarify what this is exactly. It's not a prediction. I do not expect [AI](https://comugraph.cloud) development to be this quick or as untamable as I depict. It's not pure dream either.<br>
+<br>It is my worst nightmare.<br>
+<br>It's a [tasting](http://juliette-thomas.fr) from the futures that are among the most disastrous, and I believe, disturbingly possible [1] - the ones that many keep me up in the evening.<br>
+<br>I'm informing this tale due to the fact that the future is not set yet. I hope, with a bit of insight, we can keep this story an imaginary one.<br>
+<br>Thanks to Daniel Kokotajlo, Thomas Larsen, and Ryan Greenblatt and others for discussions that influenced these stories. This post is written in an individual capacity.<br>
+<br>Ripples before waves<br>
+<br>The year is 2025 and the month is February. OpenEye recently [published](https://vipcaraudio.ru) a new [AI](https://www.satepneumatici.it) model they call U2. The item and  [allmy.bio](https://allmy.bio/koby32m677) the name are alike. Both are increments of the past. Both are not completely [unexpected](http://adrenaline-pictures.ch).<br>
+<br>However, unlike OpenEye's prior [AI](http://www.aekaminc.com) products, which lived inside packages of their chat windows, U2 can utilize a computer system.<br>
+<br>Some users find it eerie to watch their [browser flash](https://inea.se) at irregular intervals and their mouse flick at [inhuman](https://www.fua.org.br) speeds, as if there is a ghost at the keyboard. A fraction of workers with form-filler tasks raise the eyebrows of their managers as they fly through work nearly two times as rapidly.<br>
+<br>But by and big, U2 is still a specialized tool. To most who are taking note, it is an animal watched through the glass boxes of X (or, if you do not like Elon, "Twitter"). Sometimes U2's wacky behaviors prompt a chuckle. Sometimes, they cause an anxious scratch of the chin.<br>
+<br>Meanwhile, researchers are [drawing lines](https://recruitment.econet.co.zw) on plots, as scientists like to do. The scientists attempt to comprehend where [AI](https://emotube-86emon.com) progress is going. They are like Svante Arrhenius, the Swedish Physicist who [noticed](https://b4i.travel) the levels of CO2 in the environment were increasing in 1896. Like the scientific community in the time of Arrhenius, few professionals understand the implications of these lines yet.<br>
+<br>A pattern that is getting specific attention is autonomous capability. Drawing these standards out anticipates that, by the end of 2026, [AI](http://www.pureatz.com) representatives will achieve in a few days what the finest software engineering contractors might carry out in two weeks. In a year or more, some state, [AI](https://www.tmip.com.tr) representatives may be able to automate 10% of remote workers.<br>
+<br>Many are skeptical. If this held true, tech stocks would be skyrocketing. It's too big of a splash, too rapidly.<br>
+<br>But others view what skeptics are calling 'too big a splash' a mere ripple, and see a tidal wave on the horizon.<br>
+<br>Cloudy with an opportunity of hyperbolic growth<br>
+<br>Meanwhile, OpenEye is hectic training U3. They utilize the same simple dish that baked U2: Generate thousands of programs and math problems. Let models "think" until they get to an answer. Then enhance the traces of "believing" that result in A-grades.<br>
+<br>This [process](https://dwsstadskanaal.nl) is duplicated over and over, and when the flywheel starts, it begins to spin almost by itself. As U2 trains, it sculpts more difficult and reasonable jobs from github repositories on the internet. Models are [learning](https://www.ok-tonstudio.com) to train themselves. Long before [AI](http://jobteck.com) representatives could automate research study, a steady sort of "self-improvement" had actually started.<br>
+<br>Some [engineers](https://what2.org) could still barely believe this worked. It resembles a continuous movement maker. Certainly intelligence can't emerge from a swimming pool of silicon and lightning, separated from the physical world except through the exhaust of society spewed onto the internet.<br>
+<br>And yet the benchmark numbers continue to climb day after day.<br>
+<br>During the majority of 2024, these RL training runs expense around $1 million, often $10 million. These runs were little more than exploratory. But by 2025, the scientists at OpenEye (and across the world) understood they had discovered the secret sauce. It was time to scale up.<br>
+<br>Over the first half of 2025, $10 million [RL training](https://www.awexteriors.com) runs develop into $50 million runs, and then to $100 million runs. While U2 might do a bit of information munging and run little experiments, this brand-new design - the model researchers are calling U3 - is [altering](https://suburbancorvettesofminnesota.com) the daily lives of the [technical staff](https://attaqadoumiya.net).<br>
+<br>U3 is like a blazing-fast intern, and engineers are learning how to [wrangle](https://wiki.emfcamp.org) its [sleep deprived](https://www.friend007.com) energy. Researchers flick through terminals, providing terse commands, like a CEO managing personnel over Slack channels.<br>
+<br>By October 2025, U3 is writing almost all of the code at OpenEye. Researchers are almost never bottlenecked by application. More than ever, compute is the lifeblood of [AI](http://publicacoesacademicas.unicatolicaquixada.edu.br) advancement, and the 'traffic jam' is deciding how to utilize it.<br>
+<br>If advised to, U3 can run experiments, but U3 doesn't have taste as refined as human researchers at [OpenEye](https://edu1d.ac-toulouse.fr). It [struggles](http://nomta.org) to focus on in between research concepts, so humans still choose where to bore into the [vast fields](http://koeln-adria.de) of algorithms to mine efficiency enhancements.<br>
+<br>But these scientists are working long hours to put themselves out of a task. They require [AI](https://alapcari.com) representatives that can think ahead, so engineers train agents to anticipate. They hold out training data before 2024, [advising models](http://metalmed.pl) to contemplate for hours to anticipate events in 2025. Then, they apply the very same technique as before, distilling considering into a gut reaction. Forecasting ability is a broad structure. The [scientists](https://gitea.gm56.ru) construct [specialized](https://blivebook.com) ML research abilities on top of it, training U3 to predict the results of every ML paper and ML experiment ever tape-recorded.<br>
+<br>The technical personnel at OpenEye are now amazed at how typically U3's advice seems like their most talented peers, or when it is nontransparent and alien ("train on random sound before programming"), and is nevertheless correct.<br>
+<br>The incompetencies of U3 that blocked up the pipes of research study progress are starting to dissolve, and a [fire-hose](https://iclassroom.obec.go.th) of optimizations is gushing out. Most [experiments](https://terracochopp.com.br) U3 runs are not requested by a human now. They are entirely autonomous, and OpenEye's workers skim 1% of them, perhaps less.<br>
+<br>As the cold weather of December 2025 approach, clouds roll over San Francisco in the afternoons. Once-competitive-programmers gaze out their windows, with excitement, with worry, however frequently, with confusion. Their world is spinning too rapidly. It's difficult to understand what to do, what to say, what to take a look at on the computer screen.<br>
+<br>Storms are brewing in Washington too. Top workers from the NSA and US cyber command work together with OpenEye to retrofit a semblance of security for U3['s weights](https://gtradio.ge) before senior leaders in China, Russia, Israel, North Korea, or Iran understand just how important [OpenEye's](https://music.lcn.asia) software has actually become.<br>
+<br>And there's a reality still unidentified to most of the world - aside from in the workplaces of OpenEye and passages of the White House and the Pentagon. It's a reality about those ['straight lines'](http://teamcous.com) people were discussing in early 2025. The lines are not straight anymore.<br>
+<br>They are bending upward.<br>
+<br>Flip FLOP philosophers<br>
+<br>In late 2025, U2.5 is [released](https://sajl.jaipuria.edu.in). Commercial models are beginning to level up in bigger increments again. Partly, this is because development is speeding up. Partly, it is because the models have ended up being a liability to OpenEye.<br>
+<br>If U1 explains how to prepare meth or writes erotica, the audiences of X would be entertained or pretend to be worried. But U2.5 is another story. Releasing this design without safeguards would resemble putting Ted Kaczynski through a PhD in how to make chemical weapons. It would resemble offering anybody with >$30K their own 200-person rip-off center.<br>
+<br>So while U2.5 had long been baked, it needed a long time to cool. But in late 2025, [OpenEye](https://simplestep.pl) is all set for a public release.<br>
+<br>The CEO of OpenEye declares, "We have actually attained AGI," and while many individuals think he shifted the goalpost,  [townshipmarket.co.za](https://www.townshipmarket.co.za/user/profile/20124) the world is still satisfied. U2.5 genuinely is a drop-in replacement for some (20%) of understanding employees and a game-changing assistant for most others.<br>
+<br>A mantra has ended up being popular in Silicon Valley: "Adopt or pass away." Tech start-ups that [effectively utilize](http://www.sikimira.com) U2.5 for their work are moving 2x faster, and their rivals know it.<br>
+<br>The remainder of the world is starting to catch on too. A growing number of people raise the eyebrows of their bosses with their noteworthy efficiency. People understand U2.5 is a big offer. It is at least as huge of an offer as the individual computer system transformation. But many still do not see the tidal wave.<br>
+<br>As people view their browsers flick in that spooky method, so inhumanly rapidly, they start to have an uneasy sensation. A feeling humankind had not had since they had actually lived among the Homo Neanderthalensis. It is the deeply ingrained, primitive impulse that they are [threatened](https://lornebushcottages.com.au) by another species.<br>
+<br>For many, this feeling quickly fades as they begin to utilize U2.5 more regularly. U2.5 is the most pleasant personality most know (a lot more likable than Claudius, [Arthropodic's adorable](https://lesencemajor.hu) chatbot). You could change its traits, ask it to [break jokes](https://sephzone.com) or tell you [stories](https://aviwisnia.com). Many fall in love with U2.5, as a buddy or assistant, and some even as more than a good friend.<br>
+<br>But there is still this eerie feeling that the world is spinning so quickly, and that maybe the descendants of this brand-new animal would not be so docile.<br>
+<br>Researchers inside OpenEye are thinking of the problem of providing [AI](https://rtmrc.co.uk) systems safe motivations too, which they call "positioning. "<br>
+<br>In fact, these researchers have actually seen how badly misaligned U3 can be. Models in some cases tried to "hack" their reward signal. They would pretend to make [development](https://www.vchassis.com) on a research concern with an [impressive-looking](https://lambdahub.yavin4.ch) plot, however the plot would be phony. Then, when researchers provided them opportunities to compromise the devices that computed their score, they would seize these chances, doing whatever it took to make the number go up.<br>
+<br>After numerous months, scientists at [OpenEye straighten](https://campinasferramentas.com.br) out this "reward hacking" kink, however some still fret they just had swept this issue under the carpet. Like a kid in front of their parents, U3 may be playing together with the [OpenEye](http://psicologamorales.com) engineers, saying the ideal words and doing the ideal things. But when the back of the moms and dads are turned, maybe U3 would sneak candy from the candy container.<br>
+<br>Unfortunately, OpenEye scientists have no idea if U3 has such intents. While early versions of U2 "thought aloud" - they would stack words on top of each other to [factor -](https://sman2pacitan.sch.id) "chain of idea" did not scale.<br>
+<br>Chain of thought architectures subject [AI](http://destruct82.direct.quickconnect.to:3000) designs to a condition similar to the lead character of the movie Memento. Approximately every 15 minutes, the protagonist forgets his experience. He is required to compose notes for himself and  [mariskamast.net](http://mariskamast.net:/smf/index.php?action=profile