Step-by-Step Diffusion: An Elementary Tutorial
April 16, 2025, research area Methods and Algorithms, conference Foundations and Trends in Computer Graphics and Vision
Deploying Transformers on the Apple Neural Engine
June 6, 2022, research area Computer Vision, research area Speech and Natural Language Processing
A growing number of the machine learning (ML) models we build at Apple each year adopt the Transformer architecture, in part or in full. This architecture helps enable experiences such as panoptic segmentation in Camera with HyperDETR, on-device scene analysis in Photos, image captioning for accessibility, machine translation, and many others. At WWDC 2022, Apple is making available an open-source reference PyTorch implementation of the Transformer architecture, giving developers worldwide a way to seamlessly deploy their state-of-the-art Transformer models on Apple devices.