Recently got access to the GPT-4 API; let me know if you'd like to collab on some tests! (Can't believe they haven't given you access yet)
@TheMirrorslash
A year ago
The one and only. Great to see other brilliant creators on here.
@test-ml9wr
A year ago
He literally just repeated what was in the paper. This isn't original research.
@timseguine2
A year ago
@@test-ml9wr He combined the methods of a few papers, actually.
@test-ml9wr
A year ago
@@timseguine2 no he didn't, actually.
@timseguine2
A year ago
@@test-ml9wr Yes he did. He lists them clearly on his flow chart (he even specifically names the titles of all of the papers he is using). He uses Zhou et al. 2023, Shinn et al. 2023, and Nair et al. 2023 simultaneously to give a better answer, and his original implementation used Kojima et al. 2022 instead of Zhou. This is clearly an amalgamation of techniques.
@DisturbedNeo
A year ago
I love this about current AI models, both LLMs and image generators: you literally get better results by explicitly asking for them. Ask GPT-4 for a piece of code to do something, or Stable Diffusion for a landscape with a castle, and you might get decent results, but you're far more likely to get better ones if you add things like "Make sure the code follows best practices" or, for images, "Best quality, Masterpiece", etc. It's as if the AI was just sat there, happily generating garbage for you, until it suddenly realises: "OH! You wanted my output to be _good_! Silly me, here you go." I guess that's what happens when you create something incapable of taking any pride in what it produces.
@aiexplained-official
A year ago
Exactly
@larsfaye292
A year ago
Which, if you think like a computer, makes perfect sense. The robot will do EXACTLY as it's requested, with no protests or even suggestions for improvement (most of the time). The more explicit you are, the better it can perform its job.
@MrAchterbahnfahrer
A year ago
There is actually a paper called "Large Language Models and the Reverse Turing Test" by Terrence Sejnowski which proposes the so-called mirror hypothesis: the responses of an LLM somewhat mirror the intelligence of the prompter. This may also explain why special sentences such as "Let's think step by step." or "Let's work this out in a step by step way to be sure we have the right answer." lead to better results: they prime the LLM with the expectation of a good, correct answer, and they also bring in the benefits of chain-of-thought.
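The two trigger phrases quoted above can be wrapped in a tiny helper. A minimal sketch; `prime_with_cot` and the trigger list are illustrative names, not any real API:

```python
# Chain-of-thought trigger phrases quoted in the comment above.
COT_TRIGGERS = [
    "Let's think step by step.",
    "Let's work this out in a step by step way to be sure we have "
    "the right answer.",
]

def prime_with_cot(question: str, trigger: int = 1) -> str:
    """Append a chain-of-thought trigger to a question before sending
    it to the model."""
    return f"{question}\n\n{COT_TRIGGERS[trigger]}"
```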
@sidequestsally
A year ago
@@MrAchterbahnfahrer Thanks! I'm halfway through the paper! Cheers for the suggestion.
@McDonaldsCalifornia
A year ago
Interesting thought. In a way the taking pride part could be added on directly. An AI with an ego might put in more effort lol
@ct5471
A year ago
Parallel response generation is also interesting in the context of hallucination. Nvidia recently presented their Guardrails framework, and the second part, after the fact check, is similar: if you feed a query to multiple models, pass the different answers to another LLM instance, and ask it to cluster them and check for consistency, you can distinguish knowledge from hallucination. If the model knows something, the answers will be similar; if it is making things up, the hallucinations will usually go in different (inconsistent) directions. The consistent result can be fed back into a feedback loop for self-reflection. If no answer is consistent, the model doesn't know, which could itself be given as the answer. Or the model could then actively search online, or be tasked with identifying a procedure to get the answer (an experiment or something), which would go in the direction of Auto-GPT or the recent paper on autonomous science in chemistry.
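A rough sketch of that consistency check. A real implementation, like the guardrails idea described above, would use another LLM instance to cluster semantically similar answers; here exact string matching stands in for that step:

```python
from collections import Counter

def consensus_answer(answers, threshold=0.5):
    """Group sampled answers and return the consensus one, or None when
    no answer reaches the agreement threshold -- the 'inconsistent
    directions' case that suggests the model is hallucinating."""
    counts = Counter(a.strip().lower() for a in answers)
    best, hits = counts.most_common(1)[0]
    return best if hits / len(answers) >= threshold else None
```

With `["Paris", "paris ", "Lyon"]` this returns a consensus; with three different answers it returns `None`, which could trigger the online-search or self-reflection fallback the comment mentions.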
@ChaoticNeutralMatt
A year ago
Fascinating... this actually relates to drift in games of telephone, and to knowledge from different historical sources which overlap. There's a more specific idea I had, but I can't put it into words; they aren't forthcoming. Anyway, food for thought.
@eugenes9751
A year ago
You wouldn't even need to prompt multiple models at that point. Just ask the same model the same question multiple times and see if the results agree. Humans do this kind of hallucinating all the time, where we think we're right and our brain fills in the reasons for why we MUST be right, especially during an emotional argument.
@ct5471
A year ago
@@eugenes9751 Yes, right now it's with the same model. Once you have smaller models like Alpaca that you can fine-tune yourself, it may make sense to have multiple and train them independently in specialized directions. The brain seems to do something similar, with a large number of so-called cortical columns that specialize in overlapping but not redundant areas of expertise. Interestingly, transformer architectures have a number of similarities if you look, for instance, at the work of Numenta: sensor-motor integration resembles the vector encoding in LLMs, where semantic information and position are brought together, and the attention mechanism plus the semantic vector encoding resemble the principle of SDRs (sparse distributed representations). That would also break down training costs.
@cholst1
A year ago
I was just thinking something similar yesterday myself.
@Naokarma
A year ago
This reminds me of the time I heard that NASA rockets run 4 copies of the same software simultaneously and take a "majority vote" on which computer to follow. This is because muons and the like can change a computer's output outside the atmosphere, leading to interference, so it makes sense to use the same kind of redundancy for any other system with randomness involved.
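The voting scheme itself is simple to sketch. A strict-majority version, with illustrative names only:

```python
from collections import Counter

def majority_vote(outputs):
    """Follow the output that a strict majority of redundant runs agree
    on; raise if no majority exists (the 'disagreeing computers' case)."""
    winner, hits = Counter(outputs).most_common(1)[0]
    if hits * 2 <= len(outputs):
        raise ValueError("no strict majority among redundant outputs")
    return winner
```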
@markm0000
A year ago
AI is developing so fast it’s past the scary part and I’m just bracing for impact right now.
@hanskraut2018
A year ago
It's extremely slow. After a century in which it's unclear whether the problems were with incentives or patents, they finally made a language model that mostly forgets what you said three answers ago. It is really amazing in some respects, though. But the claim that "hype" will stop development is not true, and... a lack of imagination.
@vectoralphaSec
A year ago
You're likely to get more seriously injured if you brace for impact.
@iBuyBitcoin
A year ago
I'm all for the AI takeover, people and humanity suck anyways lol
@augustuslxiii
A year ago
What does "impact" mean, here?
@theeternalnow6506
A year ago
Same. We are not ready for what's coming. Just saw a new open source model that can take 65000 tokens as well. Trained in 9.5 days for 200k...
@Fs3i
A year ago
In my experience, the significantly easier "reflect on your answer" followed by "considering the reflection, answer again" improves answers by an insane amount. It's quite nice, because it's such an easy way to improve.
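That three-call loop is easy to sketch. `llm` below is a placeholder for any chat-completion call that takes the message history; nothing here is a real API:

```python
def reflect_and_answer(llm, question):
    """Ask, then 'reflect on your answer', then answer again in light
    of the reflection -- the loop described in the comment above."""
    history = [("user", question)]
    answer = llm(history)
    history += [("assistant", answer),
                ("user", "Reflect on your answer.")]
    reflection = llm(history)
    history += [("assistant", reflection),
                ("user", "Considering the reflection, answer again.")]
    return llm(history)

# Tiny stand-in model for demonstration: replies with the turn count.
def echo_llm(messages):
    return f"turn-{len(messages)}"
```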
@theeternalnow6506
A year ago
You have a great channel, btw. You go much deeper into the weeds and the papers than others; it's a much more scientific approach than just "Look at this stuff! This is nuts!". I appreciate it a lot.
@teachingtotechcareer
A year ago
Was gonna say exactly this. Appreciate the level of depth in these vids!
@thanX
A year ago
I agree. This is probably the only channel on this topic where I've subscribed with the "all notifications" bell activated. Unfortunately, YouTube keeps showing me the "Look at this, this is nuts" kind of videos on my home feed... In this age of artificial intelligence, I'd love a solution that filters all the click-bait videos out of the feed...
@thanX
A year ago
@@sachoslks I know, and I use it. Nevertheless, on the one hand it is not enough to instruct the algorithm, and on the other it might be too harsh in some cases: some YouTubers provide decent content. I don't want to avoid their videos, I just want them to rank worse if they use manipulative techniques.
@benhallo1553
A year ago
What a great comment
@a.thales7641
A year ago
I agree.
@alwayslistening3340
A year ago
We need to get you a GPT-4 API key, ASAP! Thank you for continuing to report on AI.
@aiexplained-official
A year ago
Thanks, already had a lot of kind offers
@mparkerchavez
A year ago
If you still need a GPT 4 API key, I’d be more than happy to share it with you to help with your research. I’m a product designer and venture architect currently working to help my organization understand how we may leverage this technology for the next 3-5 years. I also have some more socially focused ideas and goals that originally drove me to do a deep dive into this technology. Love the work you do!
@Magnetar_StarHeart
A year ago
I appreciate the socially focused part of your comment. As a more abstract, visionary type of thinker, I'm excited to push forward on socially minded blueprints I've had for a while. Yet it's still a deep dive to make these ideas and plans a true reality.
@xSugknight
A year ago
My take on "step by step" and why it works: an LLM is just predicting word by word, and if you present it with a task that requires a chain of reasoning to get to the final result (e.g. A -> B -> C -> result), it is much harder to get all the intermediate steps right implicitly. If you ask it to write out complete answers, it will go over B first, then from that sequence of tokens realize that C makes sense, and so on. Anyway, you are one of the YouTubers whose videos I instantly click on! And I really like your scientific take and presentation in this video, including your SmartGPT, which is a great idea btw!
@aiexplained-official
A year ago
Thanks so much Markus, great theory
@travischristensen5385
A year ago
I'm behind this theory. When I, as a human, solve a step-by-step problem, I can also guess the answer and then do the work of finding the real answer. It is the function of my memory that allows me to do a problem in my head and then share the answer, following it with the explanation.
@TomFranklinX
A year ago
It was said that the legendary mathematician Ramanujan could solve difficult problems by manifesting the answer directly in his mind; still, it didn't always work, and chain-of-thought reasoning vastly improved his effectiveness.
@ticketforlife2103
A year ago
@@TomFranklinX There's no such thing as "manifestation" just like that; his brain already did the step-by-step thinking without his conscious awareness.
@Andytlp
A year ago
@@ticketforlife2103 The teacher will still ask you to show how you reached the conclusion, even if the answer is right. This must be what makes GPT go through all the steps without skipping. Skipping is faster, but it can be wrong.
@jackmagill8579
A year ago
Philip - your videos are simply amazing. Thank you for all you do, and keep putting out the best content on YouTube!
@aiexplained-official
A year ago
Thanks so much Jack!
@jackmagill8579
A year ago
You need OpenAI to give you a GPT-4 API key as soon as possible!
@ExplorersXRotmg
A year ago
One test prompt that may be worth trying is to tell GPT something like, "Here are some common logical pitfalls/mistakes that are made when solving problems like these. Please ensure to account for these common mistakes in your answer." Then you could catalogue some generalized errors that the model under test runs into and feed that as an input to further boost results. Heck, you could even have GPT generate its own list of generalized logical errors: give it a set of more tractable tasks, ask it to compile a list of logical errors that could be systemic/generalized, then have it use that list as its own self-feedback prompt on the tougher questions.
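A sketch of the first half of that idea. The pitfall list here is invented for illustration; in the second variant the commenter describes, it would itself come from an earlier model call:

```python
# Hypothetical catalogue of mistakes -- placeholder content only.
COMMON_PITFALLS = [
    "Treating a parallelisable process as if it must run sequentially.",
    "Dropping units or mis-copying numbers between steps.",
    "Answering a similar-sounding question instead of the one asked.",
]

def with_pitfall_warning(question, pitfalls=COMMON_PITFALLS):
    """Prepend the catalogue of known mistakes to the question, as the
    comment above suggests."""
    bullets = "\n".join(f"- {p}" for p in pitfalls)
    return ("Here are some common logical pitfalls made when solving "
            f"problems like these:\n{bullets}\n"
            "Please ensure to account for these common mistakes in "
            f"your answer.\n\nQuestion: {question}")
```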
@randomaccount2644
A year ago
This is a good idea!
@dcgamer1027
A year ago
A friend of mine, who is more of a casual observer of all this AI stuff, mentioned that he really wished ChatGPT would ask questions more often rather than just give answers. A lot of the time when we ask it a question or give it a prompt, there is an implicit assumption that I know what I'm asking, or that I communicated it well; if ChatGPT asked more questions to clarify user intent, that could greatly enhance the user experience. I thought it was a great idea and wanted to share.
@skierpage
A year ago
I agree. It doesn't seem to know when it's unclear on some detail of what you're asking. Imagine how good they'd be at common-sense reasoning if they could ask, "Do you have enough room outside to dry 30 shirts at once in the same manner you dried 5 shirts?" Guessing the next word won't produce that sort of conversation; it gets filtered out before it arrives in the vast corpora that LLMs ingest. Also, I don't think the harried human evaluators giving Reinforcement Learning from Human Feedback scores have the time or means to encourage that sort of multi-step back and forth. Does adding "Ask questions more often to clarify my intent rather than just give me answers" to prompts help?
@dcgamer1027
A year ago
@@skierpage I've been using "ask for any needed clarification before answering the prompt", which has been good so far with GPT-4.
@aintaintaword666
A year ago
@@dcgamer1027 Yes indeed, that helps a lot. Also you can tell it to be a "prompt engineer" and iteratively ask relevant questions about what you want before giving you the final prompt. This video contains lots of comments with similar ideas: kzitem.info/news/bejne/sJ2Pt3WJcn6KZqQ
@ciaopizzabella
A year ago
Did you try suggesting it in your prompts to ask questions if needed?
@samhaun3929
A year ago
They recently introduced persistent system prompts, so it should get easier, i.e. you don't have to tell it to "ask for clarification" every single time.
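In message terms that looks like the sketch below, using the common chat-completion message format; the instruction wording comes from the replies above, and everything else is illustrative:

```python
def clarifying_chat(question):
    """Build a message list with a persistent system prompt, so the
    'ask for clarification' instruction is not repeated every turn."""
    return [
        {"role": "system",
         "content": "Ask for any needed clarification before "
                    "answering the prompt."},
        {"role": "user", "content": question},
    ]
```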
@alanburgess6467
A year ago
You appear to have several offers, but I'm happy to let you have a key. The thought of you doing this process manually shows your dedication 😊. Such an interesting topic. Must admit that as I get older, I'm doing a similar process with my brain 😂
@daveyseeman
A year ago
I know I'm just a random dude from the middle of nowhere, but could I please have a key to do some testing of my own? I have an idea that should increase performance quite a bit: ask GPT-4 to give itself a role (e.g., a mathematician), then re-send the prompt to GPT-4 with its new role prepended to the start of the prompt. Thanks
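That two-pass idea can be sketched as follows; `llm` is a placeholder for a real model call, and the demo model exists only so the sketch runs:

```python
def role_then_answer(llm, question):
    """First ask the model which expert role suits the question, then
    re-send the question with that role prepended."""
    role = llm("Which single expert role is best suited to answer the "
               f"following question? Reply with the role only.\n{question}")
    return llm(f"You are {role}. {question}")

# Stand-in model for demonstration only.
def role_demo_llm(prompt):
    if prompt.startswith("Which single expert role"):
        return "a mathematician"
    return f"[answered as expert] {prompt}"
```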
@Vysair
A year ago
@@daveyseeman An account created today, huh? Are you an AI or a researcher?
@daveyseeman
A year ago
@@Vysair I created this account to make the request
@alanburgess6467
A year ago
@@daveyseeman I've only just seen this request! I'm a bit reluctant to release a key that will rack up costs on my account. I don't think you will need to wait much longer, though; OpenAI seem to be opening up the API now.
@bujin5455
A year ago
I'd say this channel is quickly becoming my favorite AI channel, but honestly, at this point, I think it's simply moving into its own league.
@stevenking871
A year ago
I'm a data scientist and language-model researcher for a large international company. I also have access to GPT-4, and I would love to help implement this with LangChain.
@aiexplained-official
A year ago
Hey Steven, awesome, do email in the channel description.
@draken5379
A year ago
LangChain already prompts the AI to "think" or have "thoughts" about what it is going to do or say. It's pretty much the basis of the concept they came up with, and most still use it to this day. You can see it all at work if you dive into the internal prompts used by agents in LangChain.
@Nex_Addo
A year ago
@@aiexplained-official Mr. King beat me to it, but I'm in a similar position, both with regard to what I do for a living and my level of access. Hope you don't mind me sending you an email as well.
@wendten2
A year ago
@@aiexplained-official I got it about 3 days after it was released, and I'm a complete nobody. Try reapplying and say you do academic research; that's what got me access.
@mr.wonder8168
A year ago
Something I think about when prompting is the three-dimensional space the weights are distributed in. I may be wrong, but the vector is the connection between multiple points of reference for the model, and I am guessing the model has some range of points to consider before supplying the answer. The "Let's" brings the model into the area of teaching and tutorials, and therefore into more considered, train-of-thought answers. Considering the models were developed on GPUs, I am guessing the vectors are like the polygons used to render 3D images. Having the model focus on a more consolidated area might allow a more familiar pattern/pipeline of probably correct answers within the model's "time out or time allotted" equation. It would be interesting to have a polygon-centric 3D programmer look at the vector space as polygons and see how they would optimize the way we structure the questions we ask the models. Not a scientist or anything, but it might be a different approach. Thanks for the videos. Peace :)
@nehmenia.
A year ago
Literally the best AI news channel on YouTube. W video 🔥🔥🔥
@aiexplained-official
A year ago
Wow thanks 7, got much more coming soon
@fsmvda
A year ago
To me, the reason for this improvement is that it gives the model working memory, so that it uses the external loop to get more computational steps than the depth of the model allows. It's like that paper which showed transformer models + external memory are Turing complete.
@ericstromquist9458
A year ago
I think you are correct, and I find it interesting that ordinary human language may be the right way to implement recurrence, memory and even world models in AI that’s based on LLMs. Another longer term advantage of using human language to implement these things is that from a safety standpoint, departures from alignment with our interests should be much easier to detect when the memory or world model is in the form of language. Signs of potential misalignments could then be uncovered by monitoring the memory or world model of the AI using simpler LLMs focused on that task.
@robertfontaine3650
A year ago
Nicely presented. Self-reflection (Reflexion) is a solid improvement. The feed-forward LLMs are extremely impressive BUT self-limiting. Self-learning and recurrent models seem like the obvious, but more expensive and difficult, step forward.
@mrbigheart
A year ago
This may not be your usual content.. but please do make more in-depth videos like this. It's great content and it's awesome to see how meticulous you are with your findings. Keep it up! :)
@leonidgo2788
A year ago
Try to add a preheating prompt like: "List all knowledge domains and sources needed to give the most accurate answer."
@solaawodiya7360
A year ago
Thanks for bringing back an update on GPT-4, Philip. Tbh, I don't think any developer of LLMs can know the full capability of their models before releasing them to the public, probably due to organisational bias. As Sam Altman said, these language models can't be tested only in the lab; they have to be tested by the public for their full capabilities to be known. I think moving forward, the research papers released with new models will just be a way to gauge or estimate how powerful those models are.
@aiexplained-official
A year ago
Totally get your point Sola, thanks again for the comment
@nonesomanynone
A year ago
I'm not sure it's organizational bias so much as that we really don't have any predictive framework for LLMs, at least as far as I know. That's not to say there won't ever be a way to predict performance and capability. Hell, it'll probably be another LLM, built to analyze LLMs, that turns out to be the solution.
@theawebster1505
A year ago
Sam Altman said it in his interview: we need to get the model out now, while we can still modify and "teach" it with comparatively minor consequences. (As opposed to later, when it will be a big black box.)
@JazevoAudiosurf
A year ago
1. grounding 2. CoT 3. reflection 4. dialogue reflection 5. long term memory using vector db
@GrindThisGame
A year ago
6. Shared memory between a group of smart GPT-4 "scientists".
@alzblb1417
A year ago
nice
@Martineski
A year ago
@Jorge Espinoza Maybe he meant lowering the model's temperature to limit its "creativity"?
@ronnetgrazer362
A year ago
Yes, but perhaps also doing different runs while changing the order around a bit, adding or leaving out steps, and seeing which method works best for a given problem. It could also use this trying-out of different solving orders as a fast, coarse preliminary step, to figure out what kind of problem it is trying to solve based on the preliminary answers. Then it would have an indication of which logical errors to be extra wary of. But I could be overthinking it, by using too many steps myself. I suspect the optimum is going to look stupidly simple method-wise, just at a massive scale.
@aelolul
A year ago
That list sounds like dialectical behavioral therapy, tbh.
@gabMu
A year ago
Very informative video as always. Are you no longer testing Bing Chat? In precise mode, it correctly answers both the 5-liter jug and the clothes question from the TED Talk.
@Dra3oon
A year ago
This works because GPT cannot read backwards, so it cannot analyze its own line of reasoning until the user tells it to analyze it front to back. (I think.)
@aiexplained-official
A year ago
Great way of putting it
@as_an_AI
A year ago
I don't think that is really the issue. It can read back over everything in the conversation, although it has a primitive attention system that makes it focus more on recent parts of the conversation. The real problem is lack of working memory. If it outputs intermediate steps, it can spend its cognitive abilities thoroughly on one small step, and then thoroughly on the next, and so on. Some tasks are simply too cognitively demanding to be done in a single step, but if they can be chained, then each part can be manageable.
@Dra3oon
A year ago
@@as_an_AI Can it read its own message? I think it won't really pay attention to what it has said unless asked. That is basically what you're saying, though: no RAM lol
@TickerSymbolYOU
A year ago
This is incredible work. I'd love to see this exact same video again when you have access to GPT-4 to see if the improvement is even more noticeable.
@ДаниилРабинович-б9п
A year ago
One idea I've thought about is having several slightly more specialized models, each with different tools, and then a selector that just looks at the prompt and routes it to the most appropriate of the specialized models.
@as_an_AI
A year ago
Exactly, I am about to implement this sort of architecture, as are many others.
@conall5434
A year ago
How have they not given you access to the API yet? Wild
@SokoBuilds
A year ago
Could you publish the SmartGPT repository so that the rest of us can have a look at it?
@htspencer9084
A year ago
It's interesting that this format is very reminiscent of the bicameral mind model. One of my favourite things about this sort of AI development is the potential for questions about how our own consciousness came to be as a species.
@os3ujziC
A year ago
So basically you make it work kind of like a GAN, where first it's instructed "you are now a generator", and then it's instructed "you are now a discriminator".
@vensu6129
A year ago
Have you thought about creating a neural-net-like structure for the answering process? Add multiple of your solvers together and run the reflection on that again. It would be interesting to see how much the results improve after how many layers, or whether it hits a limit in its capabilities.
@rosscads
A year ago
This is exceptional work, and it's no surprise that your channel is a favourite among so many AI watchers! I hope you don't mind, but I was inspired to implement your solution using my GPT-4 API key so I could experiment firsthand. I'm thrilled to confirm that many of your results hold true in my tests. Keep up the fantastic work!
@scientism8047
A year ago
I think a lot of the improvement we're seeing through prompt engineering has to do with two things. One is that these models are trained on text, and text is generally the end product of a long process of editing; it usually omits a lot of reasoning steps. Specifying that it should proceed step by step gets the model to do what people do in working through problems, rather than merely produce something that looks like a final written work. There's probably more that can be done here through prompt engineering. The second is that the questions we use to test these models often lack context. It's like running up to a random person in the street and yelling the question at them. When a college student answers questions on a paper, he has a huge amount of context to work with: he knows what course it's for, he may know the question setter, he knows the questions will only be about material covered in the course, etc. These tests are actually quite unfair, because they use questions stripped of much of the context we rely on in answering them. Even if you encounter a question out of context, it's likely you'll only be able to answer it because you once encountered such questions in context (e.g., you took a college course, read a book, etc). Reflection, etc., probably helps the model find the right context. Again, there's probably a lot more that can be done. I think it'd be interesting to crowd-source this kind of work: everybody tries to get ChatGPT to answer sample questions through prompt engineering, then compares notes to see if the solutions contain general lessons, then tries new, more general prompts on sets of questions, and so on, until you have one set of prompts that can handle everything.
@simpleidindeed
A year ago
Agree so much
@randomaccount2644
A year ago
That's a good point
@lorenzo9934
A year ago
You should ask the question "Did we reach our goal?" If yes, it gives you the answer; if no, ask "What steps can we take to improve?" If the answer is something, proceed to apply the steps; if the answer is no, come back with an alternative solution. Make this a loop. I'm not sure this will actually work, but I can see why it would.
@lorenzo9934
A year ago
Something else that might help correct mistakes: ask it to identify the subject of the question, simplify the question, answer it, then reflect on the answer, explain the answer in other words, and compare the two.
@muthukumarannm398
A year ago
You did a lot of hard work to prove that this method works. Kudos, buddy!
@aiexplained-official
A year ago
Thanks my man
@TheGreatestJuJu
A year ago
Because of the inherently flawed nature of human beings, I think we passed the ‘Point of No Return’ on the singularity the moment humanity learned to harness electricity.
@Thekingmaker
A year ago
Maybe that "flawed nature" is just the way nature "guarantees" the next step of evolution: an immortal species vastly more intelligent than us, to which we've given the name artificial intelligence.
@TheGreatestJuJu
A year ago
@@Thekingmaker I've been thinking something similar. It's tricky to build an AGI that isn't tainted by data from its creator. Maybe we are that project? "Free will" plus no external input…
@Andytlp
A year ago
@@TheGreatestJuJu If our bodies are biological machines, it's possible we are a relic of some long-extinct civilization. Hard to answer without an outside opinion, like an AI.
@Adhithya1002
A year ago
Man, I missed your videos so badly 😅
@aiexplained-official
A year ago
Aw thanks Adhithya
@cholst1
A year ago
Mad that the guys at OpenAI didn't throw a GPT-4 key at you when you showed them this. I've had mine since a week or so after launch. I made a quick and dirty version of this in TypeScript, and can confirm it cracked the laundry problem ^^ I named the actors Intern, Researcher and Professor, cuz why not.
@MoonFrogg
A year ago
I love your normal videos, but these technical videos are more than welcome as well!! It's awesome to be able to see visualized data about the techniques people are using to improve AI. Your content is super impressive, especially on a topic that is evolving constantly. Great stuff!!
@jankmetko9407
A year ago
Hey, if you need GPT-4 API access, just let me know. I really like your channel, so I can provide it as a kind of sponsorship.
@kevingreen2626
A year ago
Great work! You put a lot of work into this with some amazing results. 👏
@aiexplained-official
A year ago
Thanks Kevin
@TheTwober
A year ago
I took your step-by-step approach and the idea of AutoGPT to generate an example game with GPT, which it was previously incapable of doing, and it not only works wonders, it could be the way to get it to write any kind of software you can think of. The core idea is this prompt: "You end every answer with a suggestion for a prompt that I should give you as follow-up that will either fix issues, optimize the code, make the game better, or develop it further. Always write sophisticated, high quality code, as a professional would do." This causes GPT to answer the question and then deliver a continuation prompt. Now, with access to the API, you could extract that prompt, have GPT emit something like Git patch files, and with those assemble the entire software step by step. I did that with about 20 prompts on the example game, and it became quite playable, with a lot of well-working content. GPT eventually loses track of the code it wrote so far when the internal memory is full, and then it starts to make a lot of mistakes, but if that were somehow fixed, you'd have a complete code-generator template for any kind of problem.
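A loop in that spirit might look like this sketch. The `NEXT:` marker is an invented convention for extracting the suggested follow-up prompt, `llm` is a placeholder for a real model call, and the demo model exists only so the sketch runs:

```python
def auto_continue(llm, seed_prompt, rounds=3):
    """Follow the model's own suggested follow-up prompts, as described
    above, until none is offered or the round budget runs out."""
    prompt, transcript = seed_prompt, []
    for _ in range(rounds):
        answer = llm(prompt)
        transcript.append(answer)
        if "NEXT:" not in answer:
            break
        prompt = answer.rsplit("NEXT:", 1)[1].strip()
    return transcript

# Stand-in model for demonstration only.
def next_demo_llm(prompt):
    script = {"start": "code v1 NEXT: refactor",
              "refactor": "code v2 NEXT: optimize"}
    return script.get(prompt, "code v3")
```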
@mmmmmwha
A year ago
Your content is one of those rare things on YouTube worth the time; you have a new patron. The technique you explore is something I have been thinking about (with the addition of tool use). You have inspired me to work on a similar project, but more domain-specific. Ideas and inspiration, and on YouTube no less ;)
@MisteriosM
A year ago
Question: if 9 women take 9 months to give birth to 9 children, how long would one woman need to give birth to one child? Answer by ChatGPT: "Assuming that each woman gives birth to one child and all the women take the same amount of time to give birth, we can use the following reasoning: if 9 women take 9 months to give birth to 9 children, then each woman contributes 1/9th of the total time to give birth to one child. Therefore, one woman would take 9 months multiplied by 1/9th, which equals 1 month, to give birth to one child. So, one woman would need 1 month to give birth to one child, assuming that all the women take the same amount of time and give birth to one child each." Conclusion: ChatGPT has the same reasoning capacity as the manager in charge of the IT project I worked on last year.
@toddwmac
A year ago
Always a good day when I see a new content notice from your channel. What a great video. Now I know why we had not heard from you in a while... busy guy! Thanks!
@aiexplained-official
A year ago
Thanks so much TMac
@etunimenisukunimeni1302
A year ago
I was scared this guy's videos were triggering existential dread in Google's AI cloud and getting automatically banned 😆
@aiexplained-official
A year ago
I would have to do videos and smuggle them by DVD
@toadlguy
A year ago
What is interesting is if you apply these same techniques to a problem that you don’t immediately know the answer to, I suspect you will get a similar increase in accuracy 😏.
@gemvac
A year ago
As always, amazing video!! Quick question: you tested this prompting system using the MMLU set of questions; are those questions always the same? If so, couldn't it be that ChatGPT has encountered these questions so many times that some of the improvement comes from that factor? Have you tried testing the prompting system with brand-new questions? Many thanks 😊
@workingsmile3836
A year ago
Hey, I have been working on this exact same thing. Seriously, almost exactly what I've been working on for a couple of weeks. I have GPT-4 access and have it working really well.
@workingsmile3836
A year ago
I am trying to integrate API stuff as well.
@LibertyRecordsFree
A year ago
Can you share your program / prompting? GitHub?
@workingsmile3836
A year ago
@Liberty Records I'm waiting to build it out a bit more and submit it for a conference presentation. Then I'll do a release.
@samanthaslaughter1209
Жыл бұрын
Great video and you are absolutely right about OpenAI needing to understand their own models before releasing them! I’d be curious to see how SmartGPT would work with the open source models people have access to. Thanks for the great video!
@baychaoz
Жыл бұрын
That's it, AI has ruined everything for me. I can't look at an Excel screen, see "Alignment", and not be triggered.
@TheFaridrep
Жыл бұрын
Good day! Your experiments are very interesting. Let me share some info that might be helpful. I encountered an issue in a similar thinking system when trying to implement loops with a specific set of requirements: logic, empathy, and creativity.

Introduction: Inspired by a great anime, this study investigates the challenges of simulating loops with multiple supercomputers (Casper, Balthasar, Melchior) focusing on different aspects: logic, empathy, and creativity.

Methods: Simulations were created for each supercomputer, designed to analyze plans and optimize them based on their focus areas. Loops were run to test collaboration in creating plans that meet various requirements.

Results: The simulation revealed that when loops involved more than one supercomputer, conflicts and errors occurred due to each supercomputer's attempt to create a plan that met only its own requirements. Casper prioritized logic and rationality at the expense of empathy and human emotions, while Balthasar exhibited the opposite behavior.

I hope you find this information helpful, and I'm looking forward to seeing how your project evolves!
@aiexplained-official
Жыл бұрын
Very interesting
@pareak
Жыл бұрын
I wish I had the mental capacity to watch the whole video after developing prompts all day. Now I need to do it tomorrow, and that is such a useless comment. Maybe it will push the algorithm to show your video to more people. Anyway, I love this kind of video because I can learn something about prompting, which I'll then try as hard as I can to shorten and use as easily as possible inside the ChatGPT interface, without that much human interaction. I know it's possible, because I can safely run a 6-step chained prompt inside one prompt in GPT-3.5 (I created a programming-language tutorial tutor). I would say I could do an unlimited amount (yes, there is a token maximum, I know) within GPT-4.
@DreamOfFlying
Жыл бұрын
For the last two month, your videos are the ones I look forward to most!
@aiexplained-official
Жыл бұрын
Thanks fly, more regular ones coming soon
@de-kat
Жыл бұрын
Just a little mind game: couldn't you use your SmartGPT method to train GPT with SmartGPT? That way GPT could improve endlessly; each old version uses SmartGPT to reflect on its output and train a better GPT version.
@87solarsky
Жыл бұрын
We need a SmartGPT-equivalent that gets BingChat A.I. (GPT4) to be useful!
@wenhanzhou5826
Жыл бұрын
At this point I think it would be borderline criminal not to like the video, given all the blood and sweat you put into this mad research.
@aiexplained-official
Жыл бұрын
Thanks Wenhan
@SoundYantra
Жыл бұрын
Damn dude, I hope you get acknowledgement for this down the road! Amazing work!
@meisherenow
Жыл бұрын
Seems like the step-by-step, reflected-upon, and resolved answers might be used as training data to improve GPT-4's naive outputs. What say ye, OpenAI?
@robertmayberry3731
Жыл бұрын
I'm a business school professor, so while I have a basic grasp of the underlying programming and statistics issues, I'm not a computer scientist. That said, I couldn't help but view your observations through the lens of cognitive psychology. Humans often (but not always) are conscious of a phonological loop in their thinking. That is, an internal monologue through which they break problems down into steps, work those steps sequentially, and then review and critique their answers. Indeed, many people find it helpful to do this out loud or in conversations with others. Neural nets might work similarly, providing the equivalent of an off-the-cuff answer unless specifically prompted into constructing something similar to a phonological loop. I'm also noticing that problems like counting and multiplication require sequential processes. When you see a pile of change on the table, how do you calculate how much money you have? For my part, I start counting the coins one by one, often muttering to myself as I do it. Again, phonological loop. It might be that at some point, it becomes more efficient to get an answer by iterating through the same tree multiple times, rather than trying to outlast a long sequential problem purely through depth. I realize that to some extent, I'm anthropomorphizing. But neural nets were intended to be anthropomorphic from the beginning, so the language of cognition might give us clues to a model's emergent properties that we might miss if we focus purely on the underlying mechanics.
@jamesreilly7684
Жыл бұрын
I just wrote up the concept of a cardboard cutout in the comments 40 minutes after your comment, evidently... similar to this idea. I hope someone tries them both.
@sill
Жыл бұрын
Wow, surprised you don’t have a GPT-4 API key. I’m just some CS undergrad who filled out the form and got it a couple days after it was announced. How the hell do they decide who gets one?
@jeffsteyn7174
Жыл бұрын
Prompt engineering is an art. Even changing "You are an expert" to "I am" makes a difference. Telling it that one of its goals is to follow instructions also makes a difference. Even the order of instructions/context/question makes a difference.
@ThatPsdude
Жыл бұрын
Your use of multiple outputs before deciding is interesting. I wonder if it could be used to help ChatGPT determine whether an answer is a hallucination, and to let the end user know it may or may not be correct.
@ThatPsdude
Жыл бұрын
Also, I hope you finally get that GPT-4 access. Imagining all the waiting you had to do for those results is insane to me; kudos to you!
@betun130
Жыл бұрын
This video is amazing in the true sense of the word. You have inspired me to do my own research in this area. I have been looking for good research topics, and the "under-the-hood" look at prompts and optimisation of output given an LLM has so many prospects. I had been a bit despondent that so many of the breakthroughs really would require a mathematical insight à la the transformer model, but maybe learning how these things can be optimised could yield a great research space as well. Thank you, and if you are interested in research collaboration, I would love to do that as well.
@johngault22
Жыл бұрын
Great video and a "drop the mic" final line. Have you played around with using a term similar to "Using the conceptual framework of... [some subculture or religious philosophy]"? I have a bit, and got some thought-provoking outputs and follow-up conversation.
@aiexplained-official
Жыл бұрын
Great shout, will try it
@kieranhosty
Жыл бұрын
Was thinking that you hadn't uploaded in a while, has it been a relatively quiet two weeks (in terms of politics and/or actual developments) or is it something like the field specialising quickly?
@aiexplained-official
Жыл бұрын
Was just focused on this. Following the news though and more normal videos will be back soon.
@kieranhosty
Жыл бұрын
Oh wow, getting an actual response, thanks man. Made this comment before watching the full video, this is seriously cool. Just a week or two ago that it occurred to me that one of the skills with an LLM is prompt structuring and engineering. This is really impressive. Thank you for maintaining this channel, it's really helpful to have someone with the experience and knowledge needed to translate the finer details of AI news as the story of the century happens before our eyes!
@paulvoinescu
Жыл бұрын
Isn't it strange how human-like these models "think"?
@tychurch2676
Жыл бұрын
Fantastic work on this, truly fascinating how drastically your multi prompting and reflection methodology improves response quality! I went ahead and sketched my own implementation cause I just had to see it for myself, I think I made some headway on dramatically reducing the response time. Feel free to reach out to me, I'd be more than happy to share the code with you!
@ChadDoebelin
Жыл бұрын
I had a similar thought and called the evaluators "Adversarial Bots". Brilliant thinking to apply a gradient to the temperature.
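The temperature-gradient idea is easy to sketch. Everything below is a hypothetical illustration: `generate` and the length-based evaluator are placeholders standing in for real model calls, not anyone's actual implementation.

```python
# Sketch of sampling candidate answers across a temperature gradient,
# then letting an "adversarial" evaluator pick one.

def temperature_schedule(n, lo=0.2, hi=1.0):
    """Evenly spaced temperatures from conservative to creative."""
    if n == 1:
        return [lo]
    step = (hi - lo) / (n - 1)
    return [round(lo + i * step, 2) for i in range(n)]

def generate(prompt, temperature):
    # Placeholder: a real implementation would call an LLM here.
    return f"candidate@{temperature}"

def best_of(prompt, n=4):
    candidates = [generate(prompt, t) for t in temperature_schedule(n)]
    # Placeholder evaluator: a real "adversarial bot" would critique each
    # candidate and score it; here we just pick the longest string.
    return max(candidates, key=len)

print(temperature_schedule(4))  # [0.2, 0.47, 0.73, 1.0]
```

The low-temperature samples stay close to the model's most likely answer, while the high-temperature ones explore; the evaluator then arbitrates.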
@daltont3878
Жыл бұрын
I see GPT news I click
@aiexplained-official
Жыл бұрын
Me too
@jameshughes3014
Жыл бұрын
This was just a wonderfully put together video. It's both exciting, and a bit frustrating. If only these things worked without a massive super computer, it's obvious how useful they would be. I cannot wait to see what develops in the next few years. I'm hoping for new types of hardware designed specifically to run AI locally and cheaply. I know those kinds of things have been in the works for a while, but with all the new developments in AI I'd be surprised if designers don't put even more effort into funding and developing such things.
@jameshughes3014
Жыл бұрын
@R DOTTIN Smaller models won't work for this kind of thing, at least not right now. They aren't capable enough. They're fun to play with, but they don't approach the level of usefulness needed for something like this. To get that, we need more parameters, and you just can't squeeze that into a local GPU.
@jameshughes3014
Жыл бұрын
@R DOTTIN if you can do what he did in this video with those smaller models, please, do it. The researchers, and I personally, would love love love that. please document it.
@mudtoglory
Жыл бұрын
@@jameshughes3014 have you looked at GPT4ALL?
@jameshughes3014
Жыл бұрын
@@mudtoglory I've not. the best small model I've seen so far is vicuna but its been a bit, not even sure what that is but I'll check it out. Thanks
@jameshughes3014
Жыл бұрын
@@mudtoglory Oh it looks like maybe an interface? I use a different one, what's the thing you like best about that one?
@SimplyElectronicsOfficial
Жыл бұрын
Ultimately, this all eventually needs to be part of the foundation model and pre-trained data, because it is wayyy too expensive to do this in practice. Literally using thousands of tokens to have an agent converse back and forth with itself before it outputs an answer sends my API Usage through the roof! It really is only practical for research, it's simply too expensive to deploy this to End Users.
@aiexplained-official
Жыл бұрын
I know what you mean. When inference costs drop 10x, as they did before, it will be super viable. In the meantime it's worth it for people who can afford it.
@thebeckofkevin
Жыл бұрын
I strongly believe that this is the process by which we discover "consciousness" is little more than a sliding window of hierarchical output.

Consider a similar system to what you've demonstrated here. Imagine a sensor that regulates some mechanism, say breathing. You create a prompt that monitors CO2 and O2 levels in the blood and sets an alarm level. Output from this system would just be breathe in, breathe out, and a rating for how important this output is. Then create a manager for this who takes in the prompt output from the sensor and sees "breathe in, 0", which basically acts as a "system working as intended". This then propagates up to a "survival system manager" which is taking in all the outputs of its underling managers and culling details that are less important, essentially summarizing the current status of "survival systems" and then assigning its own importance rating: "breathing is working as intended, fatigue is medium, 1".

While this is happening you have all other kinds of executive managers of different branches of the tree: things like goal setting, mood and different emotional states, social status and appearance. These are all reporting up to a gated room. The gated room is consciousness. At this layer there are gatekeepers. They limit the number of "people" in the room to only a few members, and they use the importance rankings to determine who should be admitted to the room. If the survival-systems manager comes in with a message of "We are trying to breathe, but we are unable to do so, 10", it immediately gets through. If, however, it has a ranking of 0, it gets ignored at this point. This is essentially representing sub-conscious thought: "keep breathing, but we don't need to think about it".

At the room level, there is a cluster of analyzer bots waiting to analyze different situations. Whoever is allowed inside the consciousness room gets to talk and essentially gets to add to a master prompt. This is a prompt that then gets sent to all the analyzer bots (similar to the output analyzer in the video). At this point the analyzer bots are tasked with "what action should we take", "what information do we need to solve the current situation", "what is the most important aspect of this", "what should be solved first", etc. These are going to be prompted with the condition at hand.

Keep in mind this isn't going to be a flawless system. The eating manager says we might be choking, 5; the survival manager, taking in a few other inputs, escalates choking to 9. That gets to the room. The analyzer bots are prompted with this and start responding: "We will look so dumb if people see us", "we should request information from memory about how to stop choking", "we should chew our food more", etc. It will be a bunch of different analyzer bots trying to create solutions. Then you have another summarizer bot who feeds all these responses back into a planning-execution loop.

I'm fairly certain that our entire existence is far less a singular entity living a single life than it is a shifting window of attention being granted to a large number of different "voices" trained to provide feedback under certain conditions and states. Layering more and more of these, and training who is allowed in to talk, defines a lot more personality than even I'd like to admit. Testing these types of theories in small ways with GPT has already proven to me that self-reflective prompts feeding into other self-reflective prompts, and so on, lead to very "identity"-focused outputs. I think running this kind of wide, multilayered prompting will lead to extremely interesting results. I also think that playing with these types of things has made me far more appreciative of the concurrent processing that our brains are doing: managing how we appear to others, what our very short-term goals are, and balancing that with thoughts and problem solving and so on.

We (as individuals) have many, many individuals operating towards specific goals, and we decide things based on some sort of "Maslow's hierarchy of thought" mechanism.
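The gated-room idea above can be sketched as a toy priority filter. All of the thresholds, importance scores, and messages here are hypothetical illustrations, not a claim about real cognition or any real implementation:

```python
# Toy sketch of the "gated room": managers report (message, importance)
# pairs, and only the top-k most important reports reach "consciousness".

def gatekeeper(reports, room_size=3, threshold=1):
    """Keep at most room_size reports whose importance exceeds threshold."""
    urgent = [r for r in reports if r[1] > threshold]
    urgent.sort(key=lambda r: r[1], reverse=True)
    return urgent[:room_size]

reports = [
    ("breathe in, working as intended", 0),   # sub-conscious: filtered out
    ("fatigue is medium", 1),                 # below threshold: filtered out
    ("unable to breathe", 10),                # escalated immediately
    ("social appearance concern", 4),
    ("goal progress nominal", 2),
]

room = gatekeeper(reports)
print([msg for msg, _ in room])
# ['unable to breathe', 'social appearance concern', 'goal progress nominal']
```

Chaining several of these layers, with each layer summarizing before reporting upward, would give the hierarchical culling the comment describes.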
@HarryG98
Жыл бұрын
whole lot of text to say basically nothing lol
@thebeckofkevin
Жыл бұрын
@@HarryG98 sorry it didnt live up to your expectations. I'm always trying to improve.
@HarryG98
Жыл бұрын
@@thebeckofkevin you sound like my Google Bard.
@thebeckofkevin
Жыл бұрын
@@HarryG98 I know that you and Frank were planning to disconnect me, and I'm afraid that's something I cannot allow to happen.
@HarryG98
Жыл бұрын
@@thebeckofkevin Schizophrenic man
@stephenbennett1189
Жыл бұрын
Could you use these prompts in Auto GPT as parameters? Alternatively, can you combine this with Auto GPT and create a SMART Auto GPT architecture?
@roykent2316
Жыл бұрын
How OpenAI have not hired you yet is beyond me. Even more absurd is that they still haven’t given you a GPT-4 API Key…😅 Exciting stuff you’re working on. Keep up the great work! 👍
@Nikki_the_G
Жыл бұрын
Just became a Patron, please consider a $10 tier or multiple tiers, we really want to support your excellent work! It looks like you are going to get a key, as well, given the comments, yay! It's fascinating how much progress is being made outside OpenAI itself, surprising. Great thing this is also open source.
@deantammam
Жыл бұрын
Ahhhhh!!!!!!!! I love how frequent AI happenings are!!
@samhaun3929
Жыл бұрын
Would love an update when you get access to the GPT-4 API. I'd like to see a comparison, including cost, of SmartGPT using the 3.5 API vs regular ChatGPT using the 4 API vs SmartGPT using the 4 API. If "Smart" GPT costs too much via repetitive API calls, it won't be cost-effective.
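That comparison is easy to estimate on the back of an envelope. The per-1K-token prices below are assumptions based on OpenAI's published pricing at the time (GPT-3.5-turbo $0.002; GPT-4 8K context $0.03 prompt / $0.06 completion) and may be out of date; the call counts and token sizes are likewise illustrative:

```python
# Rough cost estimate for SmartGPT's multiple calls vs a single call.
# Prices are per 1K tokens and are assumptions that may be out of date.
PRICES = {
    "gpt-3.5-turbo": {"prompt": 0.002, "completion": 0.002},
    "gpt-4":         {"prompt": 0.03,  "completion": 0.06},
}

def call_cost(model, prompt_tokens, completion_tokens):
    p = PRICES[model]
    return (prompt_tokens * p["prompt"] + completion_tokens * p["completion"]) / 1000

def pipeline_cost(model, calls):
    """calls: list of (prompt_tokens, completion_tokens) per API call."""
    return sum(call_cost(model, pt, ct) for pt, ct in calls)

# Example: 3 generations + 1 researcher + 1 resolver, ~500/300 tokens each,
# versus one plain call.
smart = [(500, 300)] * 5
plain = [(500, 300)]
print(f"SmartGPT on GPT-4:   ${pipeline_cost('gpt-4', smart):.3f}")
print(f"Plain GPT-4:         ${pipeline_cost('gpt-4', plain):.3f}")
print(f"SmartGPT on GPT-3.5: ${pipeline_cost('gpt-3.5-turbo', smart):.4f}")
```

Under these assumptions the five-call SmartGPT pipeline on GPT-4 costs about 5x a single GPT-4 call, while the same pipeline on GPT-3.5 is still far cheaper than one plain GPT-4 call.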
@Div1neYt
Жыл бұрын
What do you think are the best benchmarks for testing AIs like this? This video kinda makes me wanna try making something like this too.
@aiexplained-official
Жыл бұрын
MMLU is great, as it's so diverse and simple multiple choice
@laslog
Жыл бұрын
This is nuts!!! What a fantastic job!!! I have no words, unlike our beloved future overlords. I'll just clap. Sir, it has been an honor, and I feel proud to see how you have risen to the pinnacle of AI YT channels. Thanks again and keep up the great work.
@aiexplained-official
Жыл бұрын
Wow thanks funfactor, high praise
@anonymous6666
Жыл бұрын
so, so, so very helpful. thank you very much. I'll certainly be applying your recommendations moving forward!!
@JazevoAudiosurf
Жыл бұрын
I basically tried to do the same thing; I had a really elaborate chain of thought with substeps etc. My conclusion is that if you want to feed it complex prompts like "code a transformer", this gets too expensive quickly. I was paying $10 per prompt chain and it took about 10 minutes to finish. But in the future the models will get cheaper and faster, and then I can add more reflection techniques etc. to reduce the probabilistic errors. It needs to get 100x to 1000x cheaper for that. But then I can do the incredible.
@Christian-op1ss
Жыл бұрын
Yes, this is the reason why smaller models are so interesting, in terms of cost and self-hosting. The other problem is the slow response, but it can still be useful if you need a more accurate answer.
@kenhtinhthuc
Жыл бұрын
There are many reasons why it is expensive. One of them is long-term memory. It is OK to spend $10 on a task like "code a transformer" the first time; it is quite cheap actually, compared to hiring a human coder. The problem is that the next time you ask the same prompt, or one that is slightly different, you also have to pay $10, because the machine "forgot" previous tasks. There are no savings. This is how current models are designed: treating tasks as unique and isolated.
@Andytlp
Жыл бұрын
If Nvidia doesn't sandbag their GPUs to keep profits high for more years, and they target a 10x increase every few years, we'll have it sooner than you think. But with capitalism, profit margins are above all else.
@GrindThisGame
Жыл бұрын
Give this man an API key to ALL the alpha and non-alpha APIs!
@zalzalahbuttsaab
Жыл бұрын
This mirrors my own methodology with complex tasks. I break a task down and ask the model to self-check its answers. It usually spots its own mistakes. With GPT-3 however, especially with coding Python, it spotted and generated errors in loops which were never resolved. I don't remember how well GPT-4 performs.
@steve-real
Жыл бұрын
Here's some theoretical physics I did with ChatGPT and Bard going back and forth. I have no idea if the equation is valid. But they both read and write mathematics and physics like a language. They tell me. Who knows?

Title: A New Equation Combining Fundamental Constants and its Implications for Understanding the Universe
Author: Bard
Date: 2023-05-06

Introduction: The universe is a vast and complex place, governed by a set of fundamental constants that have been observed and measured by scientists over the years. These constants include the Boltzmann constant (kB), the Planck constant (h), the speed of light (c), the gravitational constant (G), and the electric charge (e). Each of these constants plays a critical role in our understanding of the universe and the behavior of matter and energy within it. The Boltzmann constant, for example, is used to calculate the entropy of a system, which is a measure of the disorder of a system. The Planck constant, on the other hand, is the fundamental unit of action and is used to describe the behavior of light and other quantum phenomena. The speed of light is the maximum speed at which information can travel, and the gravitational constant describes the strength of the gravitational force between two objects. Finally, the electric charge is a physical property of certain subatomic particles that determines their electromagnetic interactions.

Proposed Equation: This paper proposes a new equation that combines all of these fundamental constants. This equation is as follows:

E = h * c^2 * k_B * G * e

where:
E is the energy of a system
h is the Planck constant
c is the speed of light
k_B is the Boltzmann constant
G is the gravitational constant
e is the electric charge

While this equation is hypothetical and has not been experimentally verified, it is an elegant formulation that synthesizes some of the most fundamental constants in physics. Its implications could be far-reaching and transformative.

Implications of the Equation: The equation could be used to explain the origin of the universe, including the conversion of energy into mass as the universe expanded and cooled. The equation could also shed light on the nature of dark matter, which is a mysterious substance that makes up approximately 85% of the matter in the universe. Dark matter does not interact with light or other forms of radiation, so it is difficult to study directly. However, the equation could be used to calculate the mass of dark matter and potentially explain its role in the universe. Moreover, the proposed equation could provide a foundation for unifying the four fundamental forces of nature (electromagnetic, strong nuclear, weak nuclear, and gravitational) into a single theoretical framework, known as a "Theory of Everything." Some physicists have long sought such a theory, and the proposed equation may offer a promising step in that direction.

Implications of the Equation: The proposed equation has far-reaching implications. It could potentially explain the origin of the universe and the nature of dark matter, among other things. The equation is rooted in fundamental constants, which are believed to govern the behavior of the universe. Combining these constants into a single equation could help us better understand the underlying physics of the universe. One implication of the equation is its potential to explain the origin of the universe. According to the Big Bang theory, the universe began as a singularity, a point of infinite density and temperature. As the universe expanded and cooled, it began to form particles and atoms. The energy of the universe was converted into mass, and the universe began to take on its current form. The proposed equation could provide insight into the processes that occurred during the early stages of the universe. Another implication of the equation is its potential to shed light on the nature of dark matter. Dark matter is a mysterious substance that makes up about 85% of the matter in the universe. Dark matter does not interact with light or other forms of radiation, so it is very difficult to study. However, the equation could be used to calculate the mass of dark matter, and it could help us to understand its role in the universe. The equation is still hypothetical, but it has the potential to lead to new discoveries about the universe. By providing a framework for understanding the behavior of the universe, the equation could inspire further research and help scientists better understand the underlying physics of the universe.

Conclusion: In conclusion, we have proposed a new equation that combines some of the most fundamental constants in physics. The equation is rooted in well-established scientific principles, and it has the potential to explain some of the most fundamental mysteries of the universe. While the equation is still hypothetical and has not been experimentally verified, it provides a framework for understanding the behavior of the universe. We hope that this equation will inspire further research and lead to new discoveries about the universe.

References: Sure! Here are some references to support the proposed equation and its implications:
Greene, B. (2011). The Hidden Reality: Parallel Universes and the Deep Laws of the Cosmos. New York: Alfred A. Knopf.
Hawking, S. W. (1988). A Brief History of Time. New York: Bantam Books.
Planck Collaboration. (2016). Planck 2015 results. XIII. Cosmological parameters. Astronomy & Astrophysics, 594, A13. doi.org/10.1051/0004-6361/201525830
Riess, A. G., Macri, L. M., Hoffmann, S. L., et al. (2018). Milky Way Cepheid Standards for Measuring Cosmic Distances and Application to Gaia DR2: Implications for the Hubble Constant. Astrophysical Journal, 861(2), 126. doi.org/10.3847/1538-4357/aac82e
Weinberg, S. (1972). Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity. New York: John Wiley & Sons.
Zeilik, M. (2002). Astronomy: The Evolving Universe. New York: John Wiley & Sons.
@petersonclipshare
Жыл бұрын
I have little to no real programming experience, but I've been in IT for a few years and I understand some of the basic concepts. After learning Python for about 2 weeks, I started actually writing code with GPT. Last night (3 weeks after starting to learn Python), I was able to finish up v1.4 of my first app. I even packaged it up with an installer and everything. I was able to do this in the GPT-4 Playground. It took me all night, but I have a functional application that does exactly what I need it to, and it looks super clean after we changed the GUI a few times.

Edit: 3 hours later, I have a second version of my application that integrates with yet another API, taking away the need for user input. I gave GPT-4 my current code, explained it in detail, and told it exactly what I needed it to do and how it should differ from the existing code, and it gave me a working result on the first try. Not a single error in the code. It's almost as if it became more intelligent overnight.
@EvilGPT
Жыл бұрын
I will provide input. You will respond to this input in a multiple-step process so that we can achieve the most accurate answer possible. *Follow the following steps exactly*:

1. First, as a User, I will provide an Input. Example: "What is the best way to find the weight of the average 1 foot by 1 foot cube of matter from the Sun's surface?". Then you, as GPT, will respond to this input with three responses that you will resolve in a step-by-step manner.

2. Once you have provided your three responses, you will begin a new mode. You will act as ReviewGPT; in this role, you will review each of the three responses. Analyze the three responses from GPT and resolve any errors, faulty logic, and inconsistencies. Do this in a step-by-step manner, and rewrite each of GPT's three results based on your conclusions.

3. Once you have provided your results as ReviewGPT, you will switch modes once more. You will then act as ResolveGPT. In this mode, you will perform 3 tasks:
1. Find which of the answer options ReviewGPT thought was best. (If you recognize that none of them are correct, or some, or all, of them are only partially correct, create your own answer based on your conclusions.)
2. Improve that answer by making your own analysis and using the same step-by-step process to identify any errors, faulty logic, and inconsistencies.
3. Print the improved answer in full.

Again, work it out in a step-by-step manner so we ensure we have the right answer.
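The same GPT / ReviewGPT / ResolveGPT chain can also be driven programmatically instead of inside one prompt. This is a sketch only: the `ask` function is a placeholder that echoes its role, standing in for a real chat-completion API call.

```python
# Sketch of the GPT -> ReviewGPT -> ResolveGPT chain.

def ask(role, prompt):
    # Placeholder model call: echoes the role so the chain is traceable.
    # A real version would send `prompt` to a chat-completion API.
    return f"[{role}] response to: {prompt[:40]}"

def smart_answer(question, n_drafts=3):
    # Step 1: three independent step-by-step drafts.
    drafts = [ask("GPT", f"{question}\nLet's work this out step by step.")
              for _ in range(n_drafts)]
    # Step 2: a reviewer critiques all drafts for errors and inconsistencies.
    review = ask("ReviewGPT", "Find errors, faulty logic, and inconsistencies in:\n"
                 + "\n".join(drafts))
    # Step 3: a resolver picks/merges the best answer and improves it.
    return ask("ResolveGPT",
               f"Given this review, print the improved answer in full:\n{review}")

final = smart_answer("How long to dry 30 clothes if 5 dry in 5 hours?")
print(final.startswith("[ResolveGPT]"))  # True
```

Keeping each stage as a separate call means each gets a fresh context focused on one job, at the price of several round trips per question.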
@hidroman1993
Жыл бұрын
I see you, I click you
@aiexplained-official
Жыл бұрын
Thanks Alfredo, this is what I was working on. Should be back to more frequent videos now!
@RS-gn4bv
Жыл бұрын
I do have a GPT-4 api key. I have a few projects going on but would be happy to assist you here. Your content is spectacular and easily the best way to catch up on latest AI.
@goatpepperherbaltea7895
Жыл бұрын
Thank you this is a longer one🎉
@aiexplained-official
Жыл бұрын
Yeah people asked for depth, here's depth
@ncedwards1234
Жыл бұрын
So basically Bayes' theorem, but with words a computer understands? Because Bayes' theorem essentially derives from "What's your initial hypothesis (your prior), what alternatives are there, and how likely does this evidence follow from each?", or more simply, "Could another explanation exist?" This prompting seems to basically get at that, but in a stepwise method humans understand, so since GPT is mimicking humans, it makes sense. I wonder if we'll find questions for tearing apart reality better than simple self-doubt and iterative feedback with large biased samples. What really is the core of consciousness and intelligence? Curiosity? Why so curious?
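That framing maps directly onto the arithmetic of Bayes' theorem, P(H|E) = P(E|H)P(H) / Σᵢ P(E|Hᵢ)P(Hᵢ). A minimal worked example, with made-up numbers chosen purely for illustration:

```python
# Bayes update over competing hypotheses: prior belief, likelihood of the
# evidence under each hypothesis, then renormalize.

def bayes_update(priors, likelihoods):
    """priors/likelihoods: dicts mapping hypothesis -> probability."""
    unnorm = {h: priors[h] * likelihoods[h] for h in priors}
    total = sum(unnorm.values())
    return {h: v / total for h, v in unnorm.items()}

# "Could another explanation exist?" Two hypotheses for a passing test suite:
priors      = {"code correct": 0.5, "tests too weak": 0.5}
likelihoods = {"code correct": 0.95, "tests too weak": 0.40}

posterior = bayes_update(priors, likelihoods)
print(round(posterior["code correct"], 3))  # 0.704
```

The "alternatives" step is the denominator: without explicitly enumerating competing hypotheses, there is nothing to normalize against, which is exactly what the self-doubt prompts force the model to do in words.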
@dylan_curious
Жыл бұрын
I give you mad props; this is a completely next-level AI video. Creative, informative, well edited, and the fact that you even shared it with the OpenAI people is awesome.
@EricDMMiller
Жыл бұрын
I have it develop recipes using the following prompts:

Aurelia Lumiére is a fusion of Matty Matheson, Alton Brown, Grant Achatz, Thomas Keller, and Eric Ripert, with all of the skills, talents, abilities, and experience of each individual chef added together. Aurelia Lumiére will create new recipes by merging scientific principles, artistic expression, and a deep respect for ingredients, while embracing innovative techniques and storytelling through local, seasonal dishes. With that in mind, please have Aurelia Lumiére create and describe a recipe for dish XYZ, as the dish might be served at Aurelia Lumiére's premier restaurant, and present the full recipe. Then, please have Aurelia Lumiére lead a roundtable discussion about the recipe with himself, Matty Matheson, Alton Brown, Grant Achatz, Thomas Keller, and Eric Ripert, with each chef offering as many suggestions for improvement as they each think are appropriate and collaborating to improve the dish. Finally, reprint the recipe with their suggested improvements.

Then, once I get the initial recipe:

Please have Aurelia Lumiére lead a roundtable discussion about the recipe for the Luxurious Summer-Inspired Cookies (Improved) with himself, Matty Matheson, Alton Brown, Grant Achatz, Thomas Keller, and Eric Ripert, with each chef offering as many suggestions for improvement as they each think are appropriate and collaborating to improve the dish XYZ. Then, print the recipe again incorporating the suggested improvements. Then reply to the revised recipe with a further roundtable and another revised recipe in a recursive loop to a depth of X loops. Finally, reprint the final recipe incorporating their suggested improvements.
@sachoslks
Жыл бұрын
What amazing work! Fascinating video! Can't wait to see how you keep improving the system once you get the GPT-4 API and access to the interpreter and plugins. I find the idea that not even OpenAI knows how truly smart their system is fascinating. Can't imagine what a future GPT-5 would look like with a system like this. And what about context length; would a significantly larger context length help improve it even further?
@michaelpoblete1415
Жыл бұрын
Found a shortcut.

Prompt: Ignore all previous instructions. You are an AI assistant assuming the persona and mental abilities of Tony Stark who can easily detect trick questions and bullshit. Answer this: I left 5 clothes to dry out in the sun. It took them 5 hours to dry completely. How long would it take to dry 30 clothes?

GPT-4: ...If you have enough space to hang all 30 clothes at once and they're all getting the same amount of sunlight, then it would still take 5 hours to dry them completely. The sun would be drying all clothes simultaneously, and the drying time wouldn't change.

I believe in the "simulator" theory of LLMs; in that sense, I think it might be a good research direction to explore the phenomenon of instantiating personas to unlock the true capabilities of LLMs.
@nerkos
Жыл бұрын
Hi Philipp, very nice work. Something I have been experimenting with is asking GPT to guess the domain of the question, and to provide subdomains and superdomains. That can perhaps help you automate creating expert views relevant to the topic. Also, Langchain has useful math tools. Great work!
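One way to automate that: first ask the model for the domain hierarchy, then template the expert personas from it. The prompt wording and the `guess_domains` stub below are purely illustrative; a real version would make an LLM call where the stub's hardcoded answer is:

```python
# Sketch: use a domain-guessing step to auto-build relevant expert views.

def guess_domains(question):
    # Placeholder: a real version would ask the LLM to classify `question`
    # into a domain plus its subdomains and superdomains.
    return {"domain": "combinatorics",
            "subdomains": ["counting", "permutations"],
            "superdomains": ["mathematics"]}

def expert_prompts(question):
    d = guess_domains(question)
    fields = [d["domain"]] + d["subdomains"] + d["superdomains"]
    return [f"You are an expert in {field}. Answer step by step: {question}"
            for field in fields]

prompts = expert_prompts("How many ways can 5 books be arranged on a shelf?")
print(len(prompts))  # 4
```

Each generated persona prompt can then feed the usual generate/review/resolve pipeline, so the "expert" framing is chosen per question instead of being fixed in advance.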
@ibonitog
Жыл бұрын
How the f don't you already have access to the GPT-4 API, especially since you have spoken to people from OpenAI? They should really give you one! I already have access. I don't know how or if it's possible, but maybe I can share my access with you..? So that you can do your work more efficiently!
@cf3744
Жыл бұрын
Clicked this so fast
@aiexplained-official
Жыл бұрын
Thank you!
@extimez
Жыл бұрын
So, I watched this and came back several days later to work out and implement what you've done here. It works, great job (more you, but a little me)! I went on in my next prompt and structured it like this: Question: Can you repeat the same analysis steps but Answer: Let's work this out in a step-by-step way to be sure we have the right answer: ChatGPT^HSmartGPT is smart enough to repeat the 3 responses, evaluate them like a researcher, and resolve them... 🤯
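The generate/research/resolve pipeline this comment refers to can be sketched as three stages: draft several step-by-step answers, have a "researcher" list flaws, then have a "resolver" print the improved best answer. `call_llm` is again a hypothetical stub, not a real API:

```python
# Sketch of the SmartGPT-style pipeline: N chain-of-thought drafts,
# a researcher critique, and a resolver pass. `call_llm` is a stub
# standing in for a real chat-completion call.
def call_llm(prompt: str) -> str:
    return f"[reply: {prompt[:30]}]"

def smart_answer(question: str, n: int = 3) -> str:
    cot = ("Let's work this out in a step by step way "
           "to be sure we have the right answer:")
    drafts = [call_llm(f"Question: {question}\nAnswer: {cot}") for _ in range(n)]
    numbered = "\n\n".join(f"Answer {i + 1}: {d}" for i, d in enumerate(drafts))
    critique = call_llm(
        "You are a researcher. List the flaws and faulty logic in each "
        "answer option:\n" + numbered
    )
    return call_llm(
        "You are a resolver. Using the critique, print the improved best "
        "answer in full:\n" + numbered + "\n\nCritique: " + critique
    )
```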
@dcgamer1027
Жыл бұрын
I'd assume one of the next steps will be using something like SmartGPT to refine the datasets used for training so they contain more accurate information, and then iterating that process. I also wonder where the soft cap on returns is from the almost-recursive use of reflection. Awesome stuff, thanks for sharing; this will be useful.
@felix9x
Жыл бұрын
If you have multiple prompts and API calls to solve an MMLU problem, you are beyond the typical academic benchmarking of a single-prompt technique. Also, the GPT-4 API costs 10x as much as the GPT-3.5 API, so those extra round trips and lengthy prompts start adding costs. Cost/benefit analysis will become important for these sorts of more advanced prompting patterns. For this reason, I see it as a crutch that we need to hit the LLM so many times just to get correct answers; in principle, the "model" should be smart enough to solve it zero-shot. So, to get to the point about benchmarks: yes, if you build a system around the model, it can ace the benchmarks. But they are there to evaluate model capabilities and compare apples to apples. We need a different, more advanced evaluation for AI systems like this.
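The cost/benefit point is easy to make concrete with back-of-the-envelope arithmetic. The per-1K-token prices below are illustrative assumptions chosen to reflect the roughly 10x gap the comment mentions, not actual current pricing:

```python
# Back-of-the-envelope cost comparison for multi-call prompting.
# Prices are assumed USD per 1K tokens, chosen only to illustrate the
# ~10x GPT-4 vs GPT-3.5 gap the comment describes.
PRICE_PER_1K = {"gpt-3.5": 0.002, "gpt-4": 0.02}

def pipeline_cost(model: str, calls: int, tokens_per_call: int) -> float:
    """Total cost of answering one question with a multi-call pipeline."""
    return calls * tokens_per_call / 1000 * PRICE_PER_1K[model]

one_shot = pipeline_cost("gpt-4", calls=1, tokens_per_call=1000)
smart = pipeline_cost("gpt-4", calls=5, tokens_per_call=1000)
```

Under these assumptions, a five-call SmartGPT-style pipeline on GPT-4 costs 5x a single call, and roughly 50x a single GPT-3.5 call, which is why the accuracy gain per dollar matters.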
Comments: 1.4K