How AI could destroy the world by accident

5,707 views

AI could be our biggest existential threat this century. If you enjoyed this video, here are some places to find out more about these ideas:
Human Compatible US: amzn.to/3Pdi0qS UK: amzn.to/463vawM by Stuart ‘You can’t fetch the coffee if you’re dead’ Russell
The Alignment Problem: Machine Learning and Human Values US: amzn.to/3N7cLpV UK: amzn.to/45YMZgq by Brian Christian
@eightythousandhours’s problem profile on ‘Preventing an AI-related catastrophe’: 80000hours.org/problem-profil...
@RobertMilesAI’s channel: / robertmilesai
Read the worst cat/sat/mat-based short story ever written here: andrewsteele.co.uk/blog/2023/...
Amazon links are affiliates and I will receive a small payment if you choose to purchase through them. Thanks!
Chapters
00:00 Introduction
02:04 How does ChatGPT work?
06:48 Problem 0: AI misuse
08:01 Problem 1: AI is an alien mind
11:18 Problem 2: Defining goals is hard
17:05 Problem 3: ‘Instrumental convergence’
19:17 Problem 4: Exponential progress
22:32 What can we do?
Sources and further reading
On AI being an alien mind, I really enjoyed this video from @kylehill on a hilarious flaw in DeepMind’s Go-playing AI, which handily beat world champion Go players…but, knowing this flaw, was easily beaten by an amateur • ChatGPT's HUGE Problem
This is a Twitter thread from me digesting 2022 results in AI in (then-)real-time, and speculating about whether these capabilities indicate that AI could ‘do science’ / 1511722732257480711
Introduction
ChatGPT’s user growth www.reuters.com/technology/ch...
Try hilariously bad 2020 text-to-image generator X-LXMERT here: vision-explorer.allenai.org/t...
Run Stable Diffusion locally using its web UI: github.com/AUTOMATIC1111/stab...
‘Sony World Photography Award 2023: Winner refuses award after revealing AI creation’ - BBC News www.bbc.com/news/entertainmen...
How does ChatGPT work?
An absolutely humungous list of papers about LLMs github.com/Hannibal046/Awesom...
GPT and other LLMs don’t usually work at the word level; they normally work on ‘tokens’, many of which are words, but not all. You can get a sense of the difference by trying out OpenAI’s Tokenizer here: platform.openai.com/tokenizer
Emergent abilities of large language models openreview.net/pdf?id=yzkSU5zdwD
ChatGPT playing chess www.lesswrong.com/posts/xyjhF...
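To make the word/token distinction above concrete, here is a minimal sketch of greedy longest-match subword tokenization. The vocabulary and the function name toy_tokenize are invented for illustration; real LLM tokenizers (such as the one behind OpenAI’s Tokenizer page) learn their vocabulary from data and differ in detail.

```python
# Toy vocabulary, invented for this example; real tokenizers learn theirs from data.
TOY_VOCAB = {"the", " cat", " sat", " on", " mat", " un", "believ", "able", " "}


def toy_tokenize(text, vocab=TOY_VOCAB):
    """Greedily split text into the longest vocabulary entries,
    falling back to single characters when nothing matches."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest possible piece first, shrinking until one is in the vocabulary.
        for j in range(len(text), i, -1):
            if text[i:j] in vocab:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No vocabulary entry starts here, so emit one character as its own token.
            tokens.append(text[i])
            i += 1
    return tokens


print(toy_tokenize("the unbelievable mat"))
```

Common words come out as single tokens, while a rarer word such as ‘unbelievable’ is split into several pieces, which is roughly the behaviour you can see on the Tokenizer page.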
Problem 1: AI is an alien mind
Paper on using psychedelic specs to fool facial recognition AI users.ece.cmu.edu/~lbauer/pap...
‘Psychedelic toasters fool image recognition tech’ - BBC News www.bbc.com/news/technology-4...
Thread on how little we know about how ChatGPT works, including an absolutely baffling algorithm it uses internally to add numbers together! / 1663534255249453056
Problem 2: Defining your goals
More about OpenAI’s CoastRunner-smashing reinforcement learning algorithm openai.com/research/faulty-re...
Astrophysicist Grant Tremblay correcting Bard on Twitter / 1623091683603918849
Problem 3: Instrumental convergence
Great video with Rob Miles about how hard it is to build an off switch for an AI • AI "Stop Button" Probl...
Problem 4: Exponential progress
Article on how ChatGPT can help with code (and its limitations) www.nature.com/articles/d4158...
GPT-4 cost over $100m to train www.wired.com/story/openai-ce...
What can we do?
AI governance is a huge field, and a good overview of resources can be found at 80000hours.org/problem-profil... (link should take you straight to the ‘AI governance and strategy’ heading)
Errata
I should probably have said GPT-4 ‘may’ have 1 trillion parameters, because this hasn’t actually been made public. In the absence of a definitive source, this comment thread discusses the issue: • How AI could destroy t...
Credits
Milla Jovovich image CC BY-SA Georges Biard upload.wikimedia.org/wikipedi...
And finally…
Follow me on Twitter / statto
Follow me on Instagram / andrewjsteele
Like my page on Facebook / drandrewsteele
Follow me on Mastodon mas.to/@statto
Read my book, Ageless: The new science of getting older without getting old ageless.link/

Comments: 84

@waelisc
11 months ago
For anyone who hasn't seen them, Rob Miles' channel on AI and his videos with Computerphile are basically required viewing at this point.
@DrAndrewSteele
11 months ago
Agreed, I linked to him in the video description :)
@DrAndrewSteele
10 months ago
Thanks to the anonymous Super Thanks donor on this video! I thought I saw your comment briefly but when I came back to reply it was gone… If anyone has any AI-related questions they’d like covering in a future video, do let me know. I can’t imagine this is going to become any less of a hot topic!
@41-Haiku
10 months ago
Oh rats. I don't remember exactly what I said, so imagine something really nice about how this is an excellent introduction to the alignment problem and how important I think videos like this are! I think YouTube auto-deleted my comment because I recommended Rob Miles' AI Safety site. YouTube _really_ doesn't like links, even with obfuscation.
@41-Haiku
10 months ago
So here's attempt #2: For anyone who is interested in helping out, or learning more, or who is reasonably skeptical of claims about existential risk from advanced AI, I recommend searching "Stampy AI" and clicking the first result. Stampy AI (AKA AI Safety Info) is a conversation-tree style FAQ that I find very useful.
@DrAndrewSteele
10 months ago
@41-Haiku Aha, hello, thank you in-person this time! And yes, I’ve found YouTube’s auto-delete policy to be really frustrating…often I just need to mention ‘my channel’, or sometimes there’s no trigger at all, and the comment disappears into the void. And FYI, your comment wasn’t in my spam or whatever, just gone! I’ve had the same experience when I’ve contacted channels my comments have been disappeared from… In any case, thank you so much both for the comment and the Super Thanks. Definitely planning on making more videos about this in future, and love Rob’s work. :)
@georgehornsby2075
10 months ago
Didn't comment at the time I watched it but really interesting video. Felt like a more slickly produced Robert Miles video but even more criminally underwatched. The concept of the space of possible minds is terrifying/awe-inspiring! You touched on it in this video but I would love to see more on it...
@DrAndrewSteele
10 months ago
Thanks so much! And yes, it’s amazing how narrow a space of minds we can envision, or how narrow a definition of ‘intelligence’, thanks to having evolved the way we did. There’s almost an infinity of minds out there… I’ll have a think about whether there’s a way to explain that in video form!!
@rudiedirkx
11 months ago
The ChatAGI conversation is priceless! Brilliant example.
@DrAndrewSteele
11 months ago
Haha thank you!
@Ha-nz2vy
11 months ago
I greatly appreciate you leaving out the doomsday-y music that usually accompanies this sort of video on the internet! (Outside of the intro, where I think it's fair to have)
@crossfirepower414
10 months ago
Awesome explanation! I asked for a poem by an ancient poet and ChatGPT just made one up for me. However, this has been addressed properly in version 4.0. Please share more of this, Dr.
@chrisjswanson
10 months ago
Subscribed for the novel meme joke "written by... me". Keep educating people about what we're up against. 👍 Stay free.
@chrisjswanson
10 months ago
And for mentioning 5th element :) +100 for the Artificial Politician comment.
@DrAndrewSteele
10 months ago
Haha thank you!
@anoniem9518
9 months ago
Great content Andrew! However, I wonder: what would be the reason for a super AI to compete with humans? Would it compete for resources? Would it need those resources to be able to multiply itself? The reasons why humans would want siblings are rather easy to understand. However, does this count for AI as well? Or is it simply the fact, like you mentioned in your video, that AI would be afraid of being powered off by humans? For some reason AI would like to be in charge of its own destiny. If it's the latter, we would have narrowed down the major threat coming from AI. I think this topic deserves a follow-up :)
@DrAndrewSteele
9 months ago
Thank you! Yes, the worry is that it might end up in accidental competition with humans because ‘get power and resources’ or ‘don’t get turned off’ would be useful to help it get to whatever goal we carelessly set it… But these are kind of just illustrative examples, we don’t really have any idea what an advanced AI could be ‘motivated’ by, which is a risk in itself!
@RaphaelChaleil
10 months ago
I think the alignment issue is very important, and it concerns not only the machines and the tech companies but also the users, who need to learn how to specify the objectives they assign to an AI in order to get a satisfactory response. I was observing my 5-year-old daughter learning to ask her personal assistant to play her favourite music: there was a lot of trial and error, but she quickly learned to ask the questions in a very specific way to obtain the desired results. It is possible that trying to optimize a single score during training is problematic, and that the AI needs to be trained to find, for each question, a number of solutions that sit on a Pareto front. As for the exponential growth of resources, there are a number of limiting factors. First, the data used to train the AI needs to be very large and curated to avoid bias and irrelevance, and the amount of data might reach a limit. This also brings up the issue of the availability of storage, the efficiency of accessing it, and the computing power needed for training. The latter needs a huge amount of energy compared to a human brain, which probably needs about a couple of hundred watts at most to function fully.
@DrAndrewSteele
10 months ago
I’ve seen Pareto fronts discussed in the context of the second round of training for ChatGPT… Tell it too hard to be 100% sure what it’s saying is true, and it basically refuses to make statements on anything, but give it no such requirement and it just completely makes stuff up all the time! There’s a happy Pareto medium in there somewhere… And the human brain just completely blows my…well, human brain. It’s incredible to think that it runs on just a few hundred watts…
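As a rough sketch of the trade-off described above, assume some invented (truthfulness, helpfulness) scores for a handful of hypothetical model settings. A Pareto front keeps only the settings that no other setting matches or beats on both measures at once. The numbers and the function name pareto_front are made up for illustration and are not taken from any real training run.

```python
def pareto_front(points):
    """Return the points not dominated by any other point
    (higher is better on both axes)."""
    front = []
    for p in points:
        # p is dominated if some different point is at least as good on both axes.
        dominated = any(
            q != p and q[0] >= p[0] and q[1] >= p[1]
            for q in points
        )
        if not dominated:
            front.append(p)
    return front


# Invented scores: (truthfulness, helpfulness) for hypothetical model settings.
settings = [(0.99, 0.10), (0.90, 0.60), (0.70, 0.80), (0.60, 0.50), (0.20, 0.95)]
print(pareto_front(settings))
```

Here the extremes survive (very truthful but unhelpful, very helpful but unreliable) along with the in-between settings, and only the setting that is worse than another on both measures drops out. Choosing the ‘happy medium’ among the survivors is still a judgment call.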
@kabirkumar5815
10 months ago
Please be very, very cautious about giving such things to your child. There are so many ways that can go wrong.
@RaphaelChaleil
10 months ago
@kabirkumar5815 It's only an audio assistant, and we have installed filters and parental controls and we monitor what's going on. I'd rather my daughter learns how to use these things very early on in a controlled environment. She's not going to inadvertently start a nuclear war by asking for the theme tune of her favourite Disney movie.
@treeeva
9 months ago
I finished the video before asking this question... Something I've not heard explained yet: what is the purpose of a reward-based teaching/learning tool for a non-competitive device? Why is designing the teaching of ChatGPT, for example, so that it gets a "reward" for a correct answer even a thing at all? Seems to me that's how biases get introduced, when we're the ones creating them. Or have I completely missed something obvious? Thank you!
@DrAndrewSteele
9 months ago
Good question! It’s not that the device is ‘competitive’ as such, it’s just that you need a way to tell it which answers are ‘better’. Although it’s called a reward function, it’s not really a reward because obviously the computer doesn’t care! It only cares because we’ve programmed it to try to make that number bigger. :)
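A minimal sketch of that idea, assuming a made-up one-number ‘reward function’ and a simple hill-climbing loop (neither is how ChatGPT is actually trained): the program has no wants of its own, it just keeps whichever candidate makes the number bigger.

```python
import random


def reward(x):
    """Hypothetical reward function: highest (0.0) when x equals 3."""
    return -(x - 3.0) ** 2


def hill_climb(reward_fn, start=0.0, steps=2000, step_size=0.1, seed=0):
    """Propose small random changes and keep a change only if the reward goes up."""
    rng = random.Random(seed)
    best = start
    for _ in range(steps):
        candidate = best + rng.uniform(-step_size, step_size)
        # The only 'motivation' in the whole program is this comparison.
        if reward_fn(candidate) > reward_fn(best):
            best = candidate
    return best


print(hill_climb(reward))  # ends up close to 3, the value that maximises the reward
```

Swap in a different reward function and the same loop will chase that instead, which is why a carelessly specified reward can lead to behaviour nobody intended.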
@Peshur
11 months ago
ChatGPT is as sentient as my tie. It’s a tape recorder that regurgitates the internet.
@41-Haiku
10 months ago
I agree that ChatGPT is not sentient. Unfortunately, sentience is not required for a system to get out of control. We see this with "stupid" systems all the time, with varying levels of catastrophe. Language models, on the other hand, are reasoning engines. If a sufficiently capable reasoning engine has a goal, it will be highly effective at optimizing toward reaching that goal. If it is significantly more capable than humans across all time scales, the consequences of an arbitrary optimization are likely to be very, very bad for humans (and the planet as a whole).
@davidmccarthy6061
11 months ago
Awesome episode!!
@DrAndrewSteele
11 months ago
Thanks!
@TobiasWeg
10 months ago
Great video and well researched. It is fairly hard to find videos that compile these ideas in an understandable way. You did a great job. Just a small counterpoint: at about 6 minutes you say about a trillion parameters, but we don't know how many parameters GPT-4 has; it was not published. It is actually unlikely that it is this big, as the trend is towards more training data and more compute rather than more parameters.
@DrAndrewSteele
10 months ago
Thanks! I did try to verify the trillion parameters thing, and this was among the sources reporting it: the-decoder.com/gpt-4-has-a-trillion-parameters/ Could be wrong of course… Perhaps I should’ve scripted it in a slightly more circumspect way. :)
@TobiasWeg
10 months ago
@DrAndrewSteele Oh, I think for a video for the mainstream public it is totally fine. I think this way you can tell a plausible story, and the details are not that relevant to it. I think it is much more important that you conveyed the main problem, and I think you did that very well. :)
@joannot6706
10 months ago
@DrAndrewSteele We don't know how big it is, but Sam Altman said it was definitely not 1 trillion parameters.
@DrAndrewSteele
10 months ago
If you’ve got a better source, let me know and I’ll stick a correction in the video description :)
@joannot6706
10 months ago
@DrAndrewSteele It's in the YouTube video "StrictlyVC in conversation with Sam Altman, part two (OpenAI)" at 5:12
@AidanRatnage
11 months ago
Is AGI similar to true AI? Your problem 2 example didn't seem so but what is the difference?
@DrAndrewSteele
11 months ago
I’m not sure what ‘true’ AI would mean, but AGI means it’s ‘generally’ intelligent. It’s a bit of a loose term, but roughly as competent as a human across a wide range of domains.
@holdintheaces7468
11 months ago
Kind of depends on what you mean by "true AI". AGI, artificial general intelligence, means that the AI has more than specific intelligence and has a "general" intelligence basically similar to humans'. Does that mean it's sentient and thinks on its own, and is that what you mean by "true AI"? That "true AI" is more accurately called "strong AI" by academics. There is disagreement over whether AGI will represent strong AI or if more steps would need to happen to reach strong AI. I personally would think that AGI would need to make further advancements before it's able to make decisions and take actions of its own volition.
@AidanRatnage
11 months ago
@DrAndrewSteele I meant something that could form opinions or have emotions or be self-aware.
@DrAndrewSteele
11 months ago
@AidanRatnage Ah! Well, those are all very different things. We could imagine an AI that’s as capable as a human but not self-aware in the ‘conscious’ sense (though it would surely need to ‘understand’ on some level that it was an AI to operate at a human level of capability?). Or we could imagine one that was ‘self-aware’ in a conscious sense, but had no emotions. It’s all a bit of a minefield, and will no doubt pose a lot of cognitive science and perhaps ethical problems: will these machines ever get these attributes? How will we know? And what will be our obligations to them if they do?
@chiptunechannel
11 months ago
Awesome video! TY 🤗
@DrAndrewSteele
11 months ago
Thank you!
@ok373737
11 months ago
Always top notch quality.
@grinmanpotato
10 months ago
i have been aware of this topic since well before ChatGPT and OpenAI came onto the scene. i've got mixed opinions on whether AI will be a catastrophic risk - i doubt it will possess human intelligence since it doesn’t have the brain chemistry of a human (i am not a neurosurgeon BTW so correct me if i am wrong). i think it may do particular things better than humans (like calculations, data processing etc) - i am teaching myself machine learning so my knowledge of this may get better as i learn more. perhaps the biggest risk it may pose is if it is intelligently stupid, like doing the wrong thing very well. i ultimately see AI as a tool rather than another human and i'm skeptical of putting a stop to developing AI, since i don’t think it can ever possess human-like intelligence like emotions or empathy - the best use of AI is seeing what tasks need to be automated/sped up and deploying the AI when needed. it may be bad if particular actors use it unethically (as you point out in the vid) and make something intelligently stupid
@InstrumentalConvergence
11 months ago
Great video.
@FracturedParadigms
11 months ago
Damn this hits hard
@skybluskyblueify
11 months ago
I can imagine some religious group coming up with an excuse to not regulate AI and implement a solution in a timely manner. Just a moral panic or two or culture war BS promoted by a greedy politician or billionaire could delay safety measures that need to be implemented quickly.
@SGTCarrera
11 months ago
Exceptional vid
@DrAndrewSteele
11 months ago
Thanks! :D
@marklondon9004
11 months ago
The best thing about AI is that it has made climate change a very unlikely cause of Human extinction.
@DrAndrewSteele
11 months ago
Ha, ever the optimist…
@marklondon9004
11 months ago
@DrAndrewSteele yeah, climate change could take decades. AI got that beat.
@keithgarrett4155
11 months ago
How about asking the AIs how to make safe AIs? Three laws of robotics anyone?
@DrAndrewSteele
11 months ago
That is indeed one idea, that maybe we could get increasingly sophisticated AIs to watch new AIs, but the challenge is that it will always be a stupider AI that we understand, or that a previous generation of AIs understood on our behalf, watching a cleverer one that could outwit it! There might be some clever way to make it work, but I don’t think we know what it is yet. :)
@keithgarrett4155
11 months ago
@DrAndrewSteele Exactly. We use different tools for different jobs. If you use a hammer for all repairs, it will end badly.
@praguevara
11 months ago
How would you adapt the rules to a reward function?
@chrisjswanson
10 months ago
Asimov did see it coming though - his books explore plenty of ambiguity and conflict in applying his laws of robotics.
@nils2868
8 months ago
You'd need a very well-aligned and safe AI to do it in the first place. Also, implementing something like the three laws of robotics is the (very hard) goal, not the solution.
@namashaggarwal7430
11 months ago
Awesome video. Could you please make a video on "Stem cell therapy and how it is done" and "Gene therapy and what the procedure is"? Thanks in advance ❤
@cassieoz1702
11 months ago
I know I'm old, but I was taught that not all new discoveries/inventions are truly progress. My worry is the gargantuan hubris of the humans involved in this development.
@fatboydim.7037
11 months ago
There is also a global race on to get quantum computers into the marketplace. Surely systems that are trillions of times more powerful than classical computing will accelerate the arrival of ASI. Nvidia is currently worth over one trillion USD with its GPU systems. I think the cat will be out of the bag before most humans realise it.
@DrAndrewSteele
11 months ago
I think it depends what quantum computers are so much faster at! They’re great for factorising huge semiprime numbers and simulating quantum systems, but does anything of meaning for intelligence come out of quantum mechanics? Interesting to speculate!
@tiagomoraes1510
11 months ago
I'm gonna watch. I hope it's not 30 minutes to come to the conclusion of "Well, if we use it smartly we will only benefit from it".
@DrAndrewSteele
11 months ago
I have good news about the content of the video, and bad news about the future of humanity
@chrisjswanson
10 months ago
Not sure about government regulation. We still haven't solved the government alignment problem, let alone AI.. just sayin'.
@chrisjswanson
10 months ago
Ah you covered it. Well presented my friend. All notification ON.
@41-Haiku
10 months ago
My current view of government regulation is that governments have a pretty good track record of stifling innovation and progress, which is usually terrible, but in this case it's a big part of what we want them to do!
@MrMilarepa108
11 months ago
Heresy!!! I welcome our robot overlords!!!
@41-Haiku
10 months ago
I take my overlords with a side of remaining alive. 😅 I don't know whether it's possible to maintain control while sharing the planet with something much smarter than we are, but I'm hoping we find a way to at least get it to care about us enough that our existence is compatible with its goals. Best case, it extrapolates the wisest hopes of humanity and gently brings us into a future where we get all the wonderful things we've always hoped technology would bring to us. It's a really hard problem and we're not even close to being on track for the good ending, but maybe we can get our act together if we actually, really try.
@teknophyle1
11 months ago
There are a few channels like Adam Conover's that assert the worry and excitement over AI is all hype. I recommend watching his interview with a few AI experts.
@cassieoz1702
11 months ago
Adam Clickbait Conover?
@teknophyle1
10 months ago
@cassieoz1702 lol, yes he does sensationalize. It doesn't make him right, but it's also a logical fallacy to say he's automatically wrong.
@cassieoz1702
10 months ago
@teknophyle1 no but, over time, I've given up watching him because his content repeatedly fails to meet the expectations created by the title
@davidmccarthy6061
11 months ago
AI is just one of the latest tools. Ultimately the race is to make more money any way possible in the shortest amount of time.