{"id":38780,"date":"2023-03-15T14:33:07","date_gmt":"2023-03-15T18:33:07","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=38780"},"modified":"2023-03-24T13:29:41","modified_gmt":"2023-03-24T17:29:41","slug":"gpt-4","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2023\/03\/15\/gpt-4\/","title":{"rendered":"GPT-4"},"content":{"rendered":"<p><a href=\"https:\/\/openai.com\/research\/gpt-4\">OpenAI<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=35154527\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/openai.com\/research\/gpt-4\"><p>GPT-4 is a large multimodal model (accepting image and text inputs, emitting text outputs) that, while less capable than humans in many real-world scenarios, exhibits human-level performance on various professional and academic benchmarks. For example, it passes a simulated bar exam with a score around the top 10% of test takers; in contrast, GPT-3.5&rsquo;s score was around the bottom 10%.<\/p><p>[&#8230;]<\/p><p>We are releasing GPT-4&rsquo;s text input capability via ChatGPT and the API (with a <a href=\"\/waitlist\/gpt-4-api\">waitlist<\/a>). To prepare the image input capability for wider availability, we&rsquo;re collaborating closely with a <a href=\"https:\/\/www.bemyeyes.com\/\">single partner<\/a> to start. We&rsquo;re also open-sourcing <a href=\"https:\/\/github.com\/openai\/evals\">OpenAI Evals<\/a>, our framework for automated evaluation of AI model performance, to allow anyone to report shortcomings in our models to help guide further improvements.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/www.macrumors.com\/2023\/03\/15\/apple-engineers-working-on-chatgpt-like-ai\/\">Hartley Charlton<\/a>:<\/p>\n<blockquote cite=\"https:\/\/www.macrumors.com\/2023\/03\/15\/apple-engineers-working-on-chatgpt-like-ai\/\"><p>Apple is testing generative AI concepts that could one day be destined for Siri, despite fundamental issues with the way the virtual assistant is built, the <a href=\"https:\/\/www.nytimes.com\/2023\/03\/15\/technology\/siri-alexa-google-assistant-artificial-intelligence.html\">New York Times<\/a> reports.<\/p><p>Employees were apparently briefed on Apple&rsquo;s large language model and other AI tools at the company&rsquo;s <a href=\"https:\/\/www.macrumors.com\/2023\/02\/07\/apple-ai-summit-in-person-event\/\">annual AI summit<\/a> last month. Apple engineers, including members of the  Siri  team, have reportedly been testing language-generation concepts &ldquo;every week&rdquo; in response to the rise of chatbots like ChatGPT.<\/p><\/blockquote>\n\n<p>Previously:<\/p>\n<ul>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2023\/03\/03\/openai-is-today-unrecognizable\/\">OpenAI Is Today Unrecognizable<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2023\/02\/02\/chatgpt-plus\/\">ChatGPT Plus<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2021\/10\/04\/siris-10-year-anniversary\/\">Siri&rsquo;s 10-Year Anniversary<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2020\/08\/07\/why-apple-believes-its-an-ai-leader\/\">Why Apple Believes It&rsquo;s an AI Leader<\/a><\/li>\n<\/ul>\n\n<p id=\"gpt-4-update-2023-03-20\">Update (2023-03-20): <a href=\"https:\/\/garymarcus.substack.com\/p\/caricaturing-noam-chomsky\">Gary Marcus<\/a> (via <a href=\"https:\/\/news.ycombinator.com\/item?id=35117968\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/garymarcus.substack.com\/p\/caricaturing-noam-chomsky\"><p>Chomsky <a href=\"https:\/\/www.nytimes.com\/2023\/03\/08\/opinion\/noam-chomsky-chatgpt-ai.html\">co-wrote a New York Times op-ed<\/a> the other day, and everyone is out there once again to prove they are smarter than he is, in the smuggest possible language they can muster.<\/p><\/blockquote>\n\n<p id=\"gpt-4-update-2023-03-22\">Update (2023-03-22): <a href=\"https:\/\/www.gatesnotes.com\/The-Age-of-AI-Has-Begun\">Bill Gates<\/a>:<\/p>\n<blockquote cite=\"https:\/\/www.gatesnotes.com\/The-Age-of-AI-Has-Begun\">\n<p>In my lifetime, I&rsquo;ve seen two demonstrations of technology that struck me as revolutionary.<\/p>\n<p>[&#8230;]<\/p>\n<p>I thought the challenge would keep them busy for two or three years. They finished it in just a few months.<\/p>\n<p>In September, when I met with them again, I watched in awe as they asked GPT, their AI model, 60 multiple-choice questions from the AP Bio exam&mdash;and it got 59 of them right. Then it wrote outstanding answers to six open-ended questions from the exam.<\/p>\n<\/blockquote>\n\n<p id=\"gpt-4-update-2023-03-24\">Update (2023-03-24): <a href=\"https:\/\/twitter.com\/DV2559106965076\/status\/1638769434763608064\">DV<\/a> (via <a href=\"https:\/\/news.ycombinator.com\/item?id=35281527\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/twitter.com\/DV2559106965076\/status\/1638769434763608064\">\n<p>You might know that MSFT has released a <a href=\"https:\/\/arxiv.org\/abs\/2303.12712\">154-page paper<\/a> on #OpenAI #GPT4, but do you know they also commented out many parts from the original version?<\/p>\n<p>A thread of hidden information from their latex source code.<\/p>\n<\/blockquote>","protected":false},"excerpt":{"rendered":"<p>OpenAI (Hacker News): GPT-4 is a large multimodal model (accepting image and text inputs, emitting text outputs) that, while less capable than humans in many real-world scenarios, exhibits human-level performance on various professional and academic benchmarks. For example, it passes a simulated bar exam with a score around the top 10% of test takers; in [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2023-03-15T18:33:10Z","apple_news_api_id":"e819861a-4a0d-4a70-8a64-f3bd8a13ab50","apple_news_api_modified_at":"2023-03-24T17:29:44Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAg==","apple_news_api_share_url":"https:\/\/apple.news\/A6BmGGkoNSnCKZPO9ihOrUA","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[1351,2317,31,30,2361,247],"class_list":["post-38780","post","type-post","status-publish","format-standard","hentry","category-technology","tag-artificial-intelligence","tag-chatgpt","tag-ios","tag-mac","tag-openai","tag-siri"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/38780","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=38780"}],"version-history":[{"count":4,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/38780\/revisions"}],"predecessor-version":[{"id":38856,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/38780\/revisions\/38856"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=38780"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=38780"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=38780"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}