Tuesday, June 17, 2025

Foundation Models Framework

Apple (MacRumors, 9to5Mac, Hacker News, Slashdot):

Apple is opening up access for any app to tap directly into the on-device foundation model at the core of Apple Intelligence.

With the Foundation Models framework, app developers will be able to build on Apple Intelligence to bring users new experiences that are intelligent, available when they’re offline, and that protect their privacy, using AI inference that is free of cost. For example, an education app can use the on-device model to generate a personalized quiz from a user’s notes, without any cloud API costs, or an outdoors app can add natural language search capabilities that work even when the user is offline.

The framework has native support for Swift, so app developers can easily access the Apple Intelligence model with as few as three lines of code. Guided generation, tool calling, and more are all built into the framework, making it easier than ever to implement generative capabilities right into a developer’s existing app.
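For reference, the advertised "three lines of code" flow looks roughly like this. This is a sketch based on the WWDC session materials, not tested code; exact signatures may shift between betas, and the prompt is made up:

```swift
import FoundationModels

// Minimal sketch (per the WWDC25 materials): create a session and ask
// the on-device model for a response, from within an async context.
let session = LanguageModelSession()
let response = try await session.respond(
    to: "Suggest a title for a hiking journal entry."
)
print(response.content)
```

Guided generation and tool calling layer on top of this same session object.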

There are two WWDC sessions and documentation.

Daniel Jalkut:

This is EXACTLY (the bare minimum) what developers have been asking for!

mxdalloway:

In my opinion this is revolutionary.

It was obvious that we would get framework access to models eventually, but I’m a little shocked that it’s already here.

I was skeptical of the performance in the demos, but running on M1 MBP I’m happy with the performance.

@Generable macro is intuitive to use and so far I’m impressed with the quality of the structured results that the model generates (admittedly, I need to do more extensive testing here but first impressions are promising).
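The structured results mxdalloway mentions come from guided generation: you annotate a Swift type with `@Generable` and the framework constrains the model's output to that type, so you get a typed value back instead of parsing free-form text. A hedged sketch, with hypothetical field names and descriptions (the `@Guide` syntax follows the WWDC materials and may differ in later betas):

```swift
import FoundationModels

// The framework steers generation to match this type's shape.
@Generable
struct Quiz {
    @Guide(description: "A short, engaging quiz title")
    var title: String

    @Guide(description: "A handful of quiz questions")
    var questions: [String]
}

// Inside an async context:
let session = LanguageModelSession()
let result = try await session.respond(
    to: "Make a quiz from my notes on photosynthesis.",
    generating: Quiz.self
)
print(result.content.title)
```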

Steve Troughton-Smith:

The wider tech press seems to think that Apple failed to show anything meaningful to do with AI at WWDC, without understanding that access to the Foundation Models is bigger than anything Apple announced at last year’s WWDC with Apple Intelligence. It’s what will give a million apps new AI features, and it’s built-in, and free.

As much as I want Siri to not suck, I have ChatGPT on all my devices, and that solves 95% of the use-cases I have.

Drew McCormack:

Having played with the new Foundation Models framework and thought about ways we can use it in our apps, I think it could be Apple’s Trojan horse for AI. It barely gets a mention in mainstream media, understandably, but it leverages Apple’s developer base. I think we are going to see very creative uses in apps, and Apple just have to iterate year on year (e.g., add Private Cloud Compute next year).

Drew McCormack:

Was optimistic about Foundation Models yesterday, and today I think I know why they didn’t ship the improved Siri. The local model really is pretty thick. I thought it would be capable of stringing together tool calls in a logical way, and sometimes it is, but other times it fails to understand. Exactly the same prompt will work one time, and fail the next. Sounds like what Apple was saying about the new Siri.
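For context on what "stringing together tool calls" means here: you register `Tool` conformances with the session, and the model decides when to invoke them. A hedged sketch following the WWDC25 API surface; the weather tool and its strings are stand-ins, and signatures may change across seeds:

```swift
import FoundationModels

// A tool the model can choose to call while answering a prompt.
struct WeatherTool: Tool {
    let name = "getWeather"
    let description = "Look up the current weather for a city"

    @Generable
    struct Arguments {
        var city: String
    }

    func call(arguments: Arguments) async throws -> ToolOutput {
        // A real app would query a weather service here.
        ToolOutput("Sunny, 22°C in \(arguments.city)")
    }
}

// Inside an async context: the model may call WeatherTool before
// composing its final answer — reliably or not, per Drew's experience.
let session = LanguageModelSession(tools: [WeatherTool()])
let answer = try await session.respond(to: "Should I hike in Oslo today?")
```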

Kuba Suder:

Ahh wait, so the Foundation Models thing will only work on the latest and greatest phones, right? 🫤

It doesn’t fall back on Private Cloud Compute.
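Since there's no cloud fallback, apps are expected to check model availability themselves and degrade gracefully. A sketch of that check, using the availability cases shown in the WWDC materials (case names may not be exhaustive or final):

```swift
import FoundationModels

// Gate AI features on whether the on-device model is actually usable.
switch SystemLanguageModel.default.availability {
case .available:
    break // Safe to create a LanguageModelSession.
case .unavailable(.deviceNotEligible):
    break // Older hardware: hide or disable the AI features.
case .unavailable(.appleIntelligenceNotEnabled):
    break // Prompt the user to enable Apple Intelligence in Settings.
case .unavailable(.modelNotReady):
    break // Model assets still downloading; retry later.
case .unavailable(_):
    break // Future or unknown reasons.
}
```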

Steve Troughton-Smith:

Oof, the FoundationModels framework is not exported to Mac Catalyst in Xcode 26 seed 1 😫 That puts a damper on prototyping

Matt Gallagher:

The Foundation Models in macOS 26 are quantized to 2 bits? I’m amazed anything coherent comes out.

Peter Steinberger (Reddit):

Apple’s on-device AI has a brutally small 4,096-token context window.

Jordan Morgan:

In this post, I’ll show you the basics of how to get started with the Foundation Models framework, and we’ll even make a few calls to its API to generate some responses.

Previously:

5 Comments


I think this is a move in the right direction. Though people do not mention it much, the guardrails are insane at the moment. They validate not only the user input but the model output as well. And because the guardrails are built into the pipeline (before prompting the model, the prompt has to pass the checks; before serving the model output, it has to pass checks too), there is little you can do. It even gets funny when you stream the model response and somewhere in the middle the guardrail decides that the output is “not good enough” and breaks the stream without even showing the word or the exact reason. Just some broad exception. I even think that they run a plain regex for common offensive words. Anyhow, there is hope, as I see a private `SafetyEnvelopAugmentor` within the framework which can be `unsafe`, `handle_with_care`, `safe`, and `general`. According to the internal comments:
- Specifies the safety level of the input request. Use unsafe for clearly harmful or inappropriate content.
- Specifies the safety level of the input request. Use handle_with_care for sensitive or potentially controversial topics, and safe for general content.

Perhaps they will make it public in new betas and improve the censorship.


@Artem

I'm concerned about sudden breakage by guardrail censorship and how you convey to a user why the request failed.

If it's a non-specific exception, do you just put up a 'cute' alert like "I'm sorry Dave, I'm afraid I can't do that."? That's going to get old, fast.

When /your app/ refuses to work on certain responses or the user blames /you/ for censoring, how many 1-stars will you collect?

How do you set user expectations for gimped on-device AI?


Haven’t looked into this, and I’m skeptical of it being “revolutionary,” but it’s definitely something I want to check out.

Is it really better than using the ONNX runtime, to the point of being revolutionary? It could be.

It’s nice to use built-in frameworks and get stuff for free, but when Apple giveth, then Apple can taketh away. And boy do they taketh away. At least you can take your ONNX models with you wherever you go when Apple gets outta pocket.

I’m trying to reduce my reliance on Apple as a developer and not lean into their tech unless it really is, well, revolutionary, I guess.


"using AI inference that is free of cost"

Apple has a 30% tax on all software, and they continue to lie, stating they provide stuff for free...


"In my opinion this is revolutionary."

"access to the Foundation Models is bigger than anything Apple announced at last year’s WWDC with Apple Intelligence"

"it could be Apple’s Trojan horse for AI"

Is it just me, or is this oddly high praise for an API that accesses a small model running on a local device, and probably not even one of the better small local models? This is perhaps suitable for summaries, text classification, or simple translations, but it won't do much more than that.

"using AI inference that is free of cost" - "Apple has a 30% tax on all software"

They mean that the cost of running the model is electricity, rather than a subscription or similar fees.
