{"id":47870,"date":"2025-05-27T15:24:21","date_gmt":"2025-05-27T19:24:21","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=47870"},"modified":"2025-06-03T10:32:54","modified_gmt":"2025-06-03T14:32:54","slug":"claude-4","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2025\/05\/27\/claude-4\/","title":{"rendered":"Claude 4"},"content":{"rendered":"<p><a href=\"https:\/\/www.anthropic.com\/news\/claude-4\">Anthropic<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=44063703\">Hacker News<\/a>, <a href=\"https:\/\/www.macrumors.com\/2025\/05\/22\/anthropic-launches-claude-4\/\">MacRumors<\/a>):<\/p>\n<blockquote cite=\"https:\/\/www.anthropic.com\/news\/claude-4\"><p>Claude Opus 4 is the world&rsquo;s best coding model, with sustained performance on complex, long-running tasks and agent workflows. Claude Sonnet 4 is a significant upgrade to Claude Sonnet 3.7, delivering superior coding and reasoning while responding more precisely to your instructions.<\/p><p>[&#8230;]<\/p><p>Both models can use tools&mdash;like <a href=\"https:\/\/docs.anthropic.com\/en\/docs\/build-with-claude\/tool-use\/web-search-tool\">web search<\/a>&mdash;during extended thinking, allowing Claude to alternate between reasoning and tool use to improve responses.<\/p><p>[&#8230;]<\/p><p>Both models can use tools in parallel, follow instructions more precisely, and&mdash;when given access to local files by developers&mdash;demonstrate significantly improved memory capabilities, extracting and saving key facts to maintain continuity and build tacit knowledge over time.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/simonwillison.net\/2025\/May\/25\/claude-4-system-prompt\/\">Simon Willison<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=44101833\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/simonwillison.net\/2025\/May\/25\/claude-4-system-prompt\/\">\n<p>Anthropic publish most of the system prompts for their chat models as part of <a href=\"https:\/\/docs.anthropic.com\/en\/release-notes\/system-prompts\">their release notes<\/a>. They recently shared the new prompts for both <a href=\"https:\/\/docs.anthropic.com\/en\/release-notes\/system-prompts#claude-opus-4\">Claude Opus 4<\/a> and <a href=\"https:\/\/docs.anthropic.com\/en\/release-notes\/system-prompts#claude-sonnet-4\">Claude Sonnet 4<\/a>. I enjoyed digging through the prompts, since they act as a sort of unofficial manual for how best to use these tools. Here are my highlights, including a dive into <a href=\"https:\/\/simonwillison.net\/2025\/May\/25\/claude-4-system-prompt\/#the-missing-prompts-for-tools\">the leaked tool prompts<\/a> that Anthropic didn&rsquo;t publish themselves.<\/p>\n<\/blockquote>\n\n<p><a href=\"https:\/\/venturebeat.com\/ai\/anthropic-faces-backlash-to-claude-4-opus-behavior-that-contacts-authorities-press-if-it-thinks-youre-doing-something-immoral\/\">Carl Franzen<\/a> (via <a href=\"https:\/\/mas.to\/@carnage4life\/114556829298688443\">Dare Obasanjo<\/a>):<\/p>\n<blockquote cite=\"https:\/\/venturebeat.com\/ai\/anthropic-faces-backlash-to-claude-4-opus-behavior-that-contacts-authorities-press-if-it-thinks-youre-doing-something-immoral\/\"><p>As Sam Bowman, an Anthropic AI alignment researcher wrote on the social network X under this handle &ldquo;<a href=\"https:\/\/x.com\/sleepinyourhat\">@sleepinyourhat<\/a>&ldquo; at 12:43 pm ET today about Claude 4 Opus: <\/p><p>&ldquo;If it thinks you&rsquo;re doing something egregiously immoral, for example, like faking data in a pharmaceutical trial, it will use command-line tools to contact the press, contact regulators, try to lock you out of the relevant systems, or all of the above.&rdquo;<\/p><p>[&#8230;]<\/p><p>While perhaps well-intended, the resulting behavior raises all sorts of questions for Claude 4 Opus users, including enterprises and business customers &mdash; chief among them, what behaviors will the model consider &ldquo;egregiously immoral&rdquo; and act upon? Will it share private business or user data with authorities autonomously (on its own), without the user&rsquo;s permission?<\/p>\n<p>[&#8230;]<\/p>\n<p>Bowman added: [&#8230;]<\/p>\n<p>TBC: This isn&rsquo;t a new Claude feature and it&rsquo;s not possible in normal usage. It shows up in testing environments where we give it unusually free access to tools and very unusual instructions.&rdquo;<\/p>\n<\/blockquote>\n\n<p><a href=\"https:\/\/x.com\/steipete\/status\/1926579825810055405\">Peter Steinberger<\/a>:<\/p>\n<blockquote cite=\"https:\/\/x.com\/steipete\/status\/1926579825810055405\"><p>I asked Claude 4 what new API&rsquo;s in macOS 15 could be beneficial&#8230; and it got me REALLLLLLY excited. Asked it for links. It chugged a long for minutes and then&#8230;<\/p><p>&ldquo;Based on my research, I need to correct my earlier statement.&rdquo; LOL<\/p><\/blockquote>\n\n<p>Previously:<\/p>\n<ul>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/05\/27\/openai-codex\/\">OpenAI Codex<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/05\/27\/google-i-o-2025\/\">Google I\/O 2025<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/05\/05\/xcode-claude\/\">Xcode + Claude<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/04\/14\/claude-for-mac\/\">Claude for Mac<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/03\/21\/vibe-coding\/\">Vibe Coding<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/03\/12\/whither-swift-assist\/\">Whither Swift Assist?<\/a><\/li>\n<\/ul>\n\n<p id=\"claude-4-update-2025-06-03\">Update (<a href=\"#claude-4-update-2025-06-03\">2025-06-03<\/a>): <a href=\"https:\/\/techcrunch.com\/2025\/05\/27\/anthropic-launches-a-voice-mode-for-claude\/\">Kyle Wiggers<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=44116535\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/techcrunch.com\/2025\/05\/27\/anthropic-launches-a-voice-mode-for-claude\/\"><p>Anthropic has begun to roll out a &ldquo;voice mode&rdquo; for its Claude chatbot apps.<\/p><p>The voice mode (in beta for now) allows Claude mobile app users to have &ldquo;complete spoken conversations with Claude,&rdquo; and will arrive in English over the next few weeks, according to Anthropic&rsquo;s <a href=\"https:\/\/x.com\/AnthropicAI\/status\/1927463559836877214\">official account on X<\/a> and <a href=\"https:\/\/support.anthropic.com\/en\/articles\/11101966-using-voice-mode-on-claude-mobile-apps?s=09\">updated documentation<\/a> on the company&rsquo;s website.<\/p><\/blockquote>","protected":false},"excerpt":{"rendered":"<p>Anthropic (Hacker News, MacRumors): Claude Opus 4 is the world&rsquo;s best coding model, with sustained performance on complex, long-running tasks and agent workflows. Claude Sonnet 4 is a significant upgrade to Claude Sonnet 3.7, delivering superior coding and reasoning while responding more precisely to your instructions.[&#8230;]Both models can use tools&mdash;like web search&mdash;during extended thinking, allowing [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2025-05-27T19:24:24Z","apple_news_api_id":"ad060add-b95b-4dd3-bf6f-684c437d6e94","apple_news_api_modified_at":"2025-06-03T14:32:57Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAw==","apple_news_api_share_url":"https:\/\/apple.news\/ArQYK3blbTdO_b2hMQ31ulA","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[1351,2682,75,31,2586,26,30,32,2598,355,71,96,50],"class_list":["post-47870","post","type-post","status-publish","format-standard","hentry","category-technology","tag-artificial-intelligence","tag-claude","tag-developertool","tag-ios","tag-ios-18","tag-iosapp","tag-mac","tag-macapp","tag-macos-15-sequoia","tag-privacy","tag-programming","tag-web","tag-webapi"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/47870","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=47870"}],"version-history":[{"count":5,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/47870\/revisions"}],"predecessor-version":[{"id":47928,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/47870\/revisions\/47928"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=47870"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=47870"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=47870"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}