Archive for April 17, 2026

Friday, April 17, 2026

Codex for Almost Everything

OpenAI (Hacker News):

Codex can now operate your computer alongside you, work with more of the tools and apps you use everyday, generate images, remember your preferences, learn from previous actions, and take on ongoing and repeatable work. The Codex app also now includes deeper support for developer workflows, like reviewing PRs, viewing multiple files & terminals, connecting to remote devboxes via SSH, and an in-app browser to make it faster to iterate on frontend designs, apps, and games.

With background computer use, Codex can now use all of the apps on your computer by seeing, clicking, and typing with its own cursor. Multiple agents can work on your Mac in parallel, without interfering with your own work in other apps. For developers, this is helpful for iterating on frontend changes, testing apps, or working in apps that don’t expose an API.

John Voorhees:

It was just over a week ago that OpenAI raised $122 billion in financing and announced it was shifting its focus to building a superapp that brings the capabilities of its models into a unified experience. It turns out that app is Codex, OpenAI’s app that, until today, was focused primarily on developing software.

However, according to OpenAI, 50% of Codex’s users were already giving it non-coding tasks to complete. Combined with the OS flexibility of a desktop environment, that made Codex the natural place to bring together a wide range of new productivity and coding features.

[…]

OpenAI has drawn aspects of its Atlas browser into Codex, too. This allows Codex to prototype websites and apps that users can comment on in-line, creating a tight feedback loop for refining designs. Currently, this feature is limited to running sites and apps via a local server setup, but OpenAI says it will be extended to incorporate actions like interacting with the greater Internet, taking screenshots, and stepping through user flows in the future.

Federico Viticci:

The feature that OpenAI rolled out in Codex is literally based on the Sky app that I exclusively previewed last year, and which was later acquired by OpenAI along with the team that built it.

[…]

I’m not exaggerating when I say that Codex now features the best computer use feature I have ever tested in any LLM or desktop agent. In fact, it’s even better than the computer use feature I used in Sky last year: Sky’s computer use was great, but it was considerably slower than Codex’s current one because it was running on Anthropic’s Claude models. With Codex for Mac today, even the (kind of slow) GPT 5.4 is faster than Sky ever was. But, using Codex with fast mode or – for simpler tasks – the Cerebras-hosted GPT-5.3-Codex-Spark model yields dramatically faster performance than Sky for Mac delivered in 2025.

[…]

We all have Apple’s Accessibility team to thank for the technology that allows Codex’s computer use tool to exist. To build it, the Codex team took advantage of an advanced accessibility feature that allows third-party apps to read the “accessibility hierarchy” (also known as “AX Tree”) of any app open on macOS. My understanding is that this technology was primarily created to allow screen-readers and other assistive tools to work with Mac apps regardless of their automation/scripting features. In this case, it’s been repurposed as a way for Codex to ingest the full contents and hierarchy of any window and, essentially, load it as context for the LLM.

Steve Troughton-Smith:

Developers, I recommend you do this asap: ask Codex to run your app and try to figure out how to do a task, without seeding it with any information.

It’s like putting a new user in front of the screen, and watching how they operate it. It will very rapidly expose any problems you have in messaging or user education, and it’s a little eye-opening if you’ve never (or not recently) run user tests.

Previously:

Perplexity Personal Computer

Perplexity:

Personal Computer brings the multi-model orchestration of Computer to your machine. It can work across your local files, native applications, connectors, and the web to complete complex and even continuous workflows.

Personal Computer makes Perplexity Computer a more personal orchestrator, elegantly hybridizing the local and server environments for maximum security and productivity.

Juli Clover:

Perplexity Computer came out earlier this year, and it’s an all-in-one “digital worker” able to create and execute entire workflows. With today’s upgrade, it can run directly on a Mac with access to the file system and native apps. Pressing both Command keys on a Mac will activate Personal Computer, and it responds to text or voice commands. Personal Computer can work across any Mac app, and it can see active apps and display quick actions automatically.

[…]

Personal Computer for Mac is rolling out to Perplexity Max subscribers starting today, with Perplexity prioritizing waitlist members. Perplexity Max is priced at $200 per month, and the new feature is not available to $20/month Pro plan subscribers.

Andrew Orr:

Perplexity is moving beyond the typical chatbot model by running in the background and carrying out multi-step tasks.

The feature builds on Perplexity’s existing agent system, which breaks a request into smaller jobs and assigns them to different sub-agents.

[…]

Actions can require approval and that activity is logged, but the setup still asks users to trust an always-on agent with broad access.

Previously:

Xcode 26.4.1

Apple (xip, downloads):

Fixed crash for MetricKit apps built with Xcode 26.4 due to missing symbols when running on iOS, macOS, and visionOS versions below 26.4.

[…]

Fixed stack-allocation bugs in async functions that caused “freed pointer was not the last allocation” crashes, particularly in swift_asyncLet_finish. These long-standing issues became more frequent in Swift 6.2 and 6.3 due to optimizer improvements.

Nic Lake:

I’m currently learning SwiftUI.

26.4 preview windows don’t let you scroll or swipe. Crashes & requires an Xcode restart.

Seems like this kind of stuff is, if not the norm, then at least common enough where the general response is “welcome to iOS development, here’s a cookie”.

Christian Beer:

Xcode loses my Apple Accounts again and again!! WTF? It also happens on the CI server… SO annoying!

And it doesn’t even has a Passwords.app integration!! 🤬

Isaiah Carew:

notarization failing today because of new Paid App terms. i don’t have any paid apps but still have to agree to get notarization working.

took me 90min to sign in. dev ID was “Locked for Security Reasons” — it always is.

unlock fails in Chrome, “The action could not be completed”. in safari the captcha doesn’t show.

but unlock works on my phone.

the it says, “Reset your password?” but it didn’t. it just unlocked the account — the text in the alert was just wrong.

Heath Borders:

Every month or so, I have to open Xcode, and every time I’m surprised at how it can’t do basic things. Xcode can’t find a call hierarchy for a local function declaration. It just beeps at me.

It’s been 10 years or so since Jump to Definition and Jump to Next Counterpart worked consistently for me. Often, I just get beeps or nothing happens.

Previously: