Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !
ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)
Manage episode 351308803 series 2974171
#chatgpt #ai #openai
ChatGPT, OpenAI's newest model is a GPT-3 variant that has been fine-tuned using Reinforcement Learning from Human Feedback, and it is taking the world by storm!
Sponsor: Weights & Biases
https://wandb.me/yannic
OUTLINE:
0:00 - Intro
0:40 - Sponsor: Weights & Biases
3:20 - ChatGPT: How does it work?
5:20 - Reinforcement Learning from Human Feedback
7:10 - ChatGPT Origins: The GPT-3.5 Series
8:20 - OpenAI's strategy: Iterative Refinement
9:10 - ChatGPT's amazing capabilities
14:10 - Internals: What we know so far
16:10 - Building a virtual machine in ChatGPT's imagination (insane)
20:15 - Jailbreaks: Circumventing the safety mechanisms
29:25 - How OpenAI sees the future
References:
https://openai.com/blog/chatgpt/
https://openai.com/blog/language-model-safety-and-misuse/
https://beta.openai.com/docs/model-index-for-researchers
https://scale.com/blog/gpt-3-davinci-003-comparison#Conclusion
https://twitter.com/johnvmcdonnell/status/1598470129121374209
https://twitter.com/blennon_/status/1597374826305318912
https://twitter.com/TimKietzmann/status/1598230759118376960/photo/1
https://twitter.com/_lewtun/status/1598056075672027137/photo/2
https://twitter.com/raphaelmilliere/status/1598469100535259136
https://twitter.com/CynthiaSavard/status/1598498138658070530/photo/1
https://twitter.com/tylerangert/status/1598389755997290507/photo/1
https://twitter.com/amasad/status/1598042665375105024/photo/1
https://twitter.com/goodside/status/1598129631609380864/photo/1
https://twitter.com/moyix/status/1598081204846489600/photo/2
https://twitter.com/JusticeRage/status/1598959136531546112
https://twitter.com/yoavgo/status/1598594145605636097
https://twitter.com/EladRichardson/status/1598333315764871174
https://twitter.com/charles_irl/status/1598319027327307785/photo/4
https://twitter.com/jasondebolt/status/1598243854343606273
https://twitter.com/mattshumer_/status/1598185710166896641/photo/1
https://twitter.com/i/web/status/1598246145171804161
https://twitter.com/bleedingedgeai/status/1598378564373471232
https://twitter.com/MasterScrat/status/1598830356115124224
https://twitter.com/Sentdex/status/1598803009844256769
https://twitter.com/harrison_ritz/status/1598828017446371329
https://twitter.com/parafactual/status/1598212029479026689
https://www.engraved.blog/building-a-virtual-machine-inside/
https://twitter.com/317070
https://twitter.com/zehavoc/status/1599193444043268096
https://twitter.com/yoavgo/status/1598360581496459265
https://twitter.com/yoavgo/status/1599037412411596800
https://twitter.com/yoavgo/status/1599045344863879168
https://twitter.com/natfriedman/status/1598477452661383168
https://twitter.com/conradev/status/1598487973351362561/photo/1
https://twitter.com/zswitten/status/1598100186605441024
https://twitter.com/CatEmbedded/status/1599141379879600128/photo/2
https://twitter.com/mattshumer_/status/1599175127148949505
https://twitter.com/vaibhavk97/status/1598930958769860608/photo/1
https://twitter.com/dan_abramov/status/1598800508160024588/photo/1
https://twitter.com/MinqiJiang/status/1598832656422432768/photo/2
https://twitter.com/zswitten/status/1598088280066920453
https://twitter.com/m1guelpf/status/1598203861294252033/photo/1
https://twitter.com/SilasAlberti/status/1598257908567117825/photo/1
https://twitter.com/gf_256/status/1598962842861899776/photo/1
https://twitter.com/zswitten/status/1598088267789787136
https://twitter.com/gf_256/status/1598178469955112961/photo/1
177 tập
Manage episode 351308803 series 2974171
#chatgpt #ai #openai
ChatGPT, OpenAI's newest model is a GPT-3 variant that has been fine-tuned using Reinforcement Learning from Human Feedback, and it is taking the world by storm!
Sponsor: Weights & Biases
https://wandb.me/yannic
OUTLINE:
0:00 - Intro
0:40 - Sponsor: Weights & Biases
3:20 - ChatGPT: How does it work?
5:20 - Reinforcement Learning from Human Feedback
7:10 - ChatGPT Origins: The GPT-3.5 Series
8:20 - OpenAI's strategy: Iterative Refinement
9:10 - ChatGPT's amazing capabilities
14:10 - Internals: What we know so far
16:10 - Building a virtual machine in ChatGPT's imagination (insane)
20:15 - Jailbreaks: Circumventing the safety mechanisms
29:25 - How OpenAI sees the future
References:
https://openai.com/blog/chatgpt/
https://openai.com/blog/language-model-safety-and-misuse/
https://beta.openai.com/docs/model-index-for-researchers
https://scale.com/blog/gpt-3-davinci-003-comparison#Conclusion
https://twitter.com/johnvmcdonnell/status/1598470129121374209
https://twitter.com/blennon_/status/1597374826305318912
https://twitter.com/TimKietzmann/status/1598230759118376960/photo/1
https://twitter.com/_lewtun/status/1598056075672027137/photo/2
https://twitter.com/raphaelmilliere/status/1598469100535259136
https://twitter.com/CynthiaSavard/status/1598498138658070530/photo/1
https://twitter.com/tylerangert/status/1598389755997290507/photo/1
https://twitter.com/amasad/status/1598042665375105024/photo/1
https://twitter.com/goodside/status/1598129631609380864/photo/1
https://twitter.com/moyix/status/1598081204846489600/photo/2
https://twitter.com/JusticeRage/status/1598959136531546112
https://twitter.com/yoavgo/status/1598594145605636097
https://twitter.com/EladRichardson/status/1598333315764871174
https://twitter.com/charles_irl/status/1598319027327307785/photo/4
https://twitter.com/jasondebolt/status/1598243854343606273
https://twitter.com/mattshumer_/status/1598185710166896641/photo/1
https://twitter.com/i/web/status/1598246145171804161
https://twitter.com/bleedingedgeai/status/1598378564373471232
https://twitter.com/MasterScrat/status/1598830356115124224
https://twitter.com/Sentdex/status/1598803009844256769
https://twitter.com/harrison_ritz/status/1598828017446371329
https://twitter.com/parafactual/status/1598212029479026689
https://www.engraved.blog/building-a-virtual-machine-inside/
https://twitter.com/317070
https://twitter.com/zehavoc/status/1599193444043268096
https://twitter.com/yoavgo/status/1598360581496459265
https://twitter.com/yoavgo/status/1599037412411596800
https://twitter.com/yoavgo/status/1599045344863879168
https://twitter.com/natfriedman/status/1598477452661383168
https://twitter.com/conradev/status/1598487973351362561/photo/1
https://twitter.com/zswitten/status/1598100186605441024
https://twitter.com/CatEmbedded/status/1599141379879600128/photo/2
https://twitter.com/mattshumer_/status/1599175127148949505
https://twitter.com/vaibhavk97/status/1598930958769860608/photo/1
https://twitter.com/dan_abramov/status/1598800508160024588/photo/1
https://twitter.com/MinqiJiang/status/1598832656422432768/photo/2
https://twitter.com/zswitten/status/1598088280066920453
https://twitter.com/m1guelpf/status/1598203861294252033/photo/1
https://twitter.com/SilasAlberti/status/1598257908567117825/photo/1
https://twitter.com/gf_256/status/1598962842861899776/photo/1
https://twitter.com/zswitten/status/1598088267789787136
https://twitter.com/gf_256/status/1598178469955112961/photo/1
177 tập
Tất cả các tập
×Chào mừng bạn đến với Player FM!
Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.