app intents

App Intents: Ushering a New Era of Semantic Understanding in Apps by Geoff Kim

In the ever-evolving landscape of technology, Apple's introduction of the App Intents framework marks a pivotal moment for app developers and users alike. This innovation isn't just a feature—it's the dawn of a new field rooted in the semantic understanding of user actions. Coupled with Apple's recent announcement of "Apple Intelligence," this framework is set to revolutionise how we interact with our devices.

A New Field: Semantic Understanding

The App Intents framework is designed to interpret and act upon user commands with a depth of understanding that feels almost human. It's about moving beyond simple voice commands to grasp the intent behind those commands. Imagine telling Siri, "Order my usual coffee," and it not only knows which coffee shop you prefer but also which specific drink you want, thanks to the semantic context it has learned over time.

This shift means that app developers now need to delve into the intricacies of user language. It's no longer sufficient to just recognise keywords; apps must understand the nuances of user requests. This semantic layer transforms how we interact with our devices, making the experience more intuitive and personalised.

The Power of Apple Intelligence

At its annual Worldwide Developers Conference (WWDC), Apple unveiled Apple Intelligence, its branded 'personal intelligence system' that will be deeply integrated into its platforms. Apple Intelligence is built on a family of generative models created by Apple, including on-device and server foundation models. These models are designed to deliver useful and relevant intelligence right where you need it.

The on-device model, with approximately 3 billion parameters, is optimised for speed and efficiency, achieving a time-to-first-token latency of 0.6 milliseconds per prompt token and a generation rate of 30 tokens per second on the iPhone 15 Pro. This foundational language model (OpenELM) will be leveraged to deliver quick actions on an iPhone, ensuring that the device can handle everyday activities like summarisation, mail replies, and proofreading with impressive speed and accuracy.

The Role of App Developers

For developers, this framework brings exciting opportunities and challenges. They must now design apps that can seamlessly interpret and execute a wide range of user commands. This involves integrating natural language processing (NLP) capabilities and building robust models that can learn from user interactions.

Consider a fitness app that, when asked, "How did I do last week?" can pull up a detailed summary of your workouts, highlight your progress, and even suggest improvements. Or a travel app that understands, "Book me a flight to my next meeting," and can automatically find flights based on your calendar events and preferences.

User Language and Voice Commands

At the core of this transformation is the need for developers to understand user language deeply. It's about creating an app that can converse naturally with the user. This requires a blend of technology and psychology: understanding how people express their needs and designing responses that feel natural and helpful.

Siri and voice commands at the operating system level are becoming more sophisticated, allowing for a more fluid interaction between the user and the app. This means developers must ensure their apps are not only responsive but also capable of engaging in meaningful dialogue.

The Future of Interaction

The future of app interaction is bright with the App Intents framework and Apple Intelligence. We are moving towards an era where our devices understand us better than ever before. This framework is paving the way for apps that are smarter, more responsive, and deeply attuned to our needs.

For me, this is reminiscent of the first time I used a touch screen—it just felt right. The App Intents framework, powered by Apple Intelligence, promises to deliver that same sense of seamless interaction, making our digital experiences more natural and integrated into our daily lives.

To all the developers out there, it's time to dive into the world of semantic understanding and leverage the power of Apple Intelligence. Embrace the challenge, and let's build the future of apps together.


For more musings on tech and design, stay tuned to geoff.kim. And if you're into the latest in tech news and geek culture, don't forget to check out the Naked Tech Podcast where Kelvin and I break down all the keynotes and announcements by the major technology companies.