Popular tools

Photo editing

Easy photo editing for individuals, growing brands, and enterprises alike: remove backgrounds, resize, polish, and create impactful product visuals in minutes.

Use cases

Faster product listings

Catalogue at scale

Scale ads

Drive online sales

Increase brand visibility

Create social media posts

Marketplace playground

Retail playground

Image editing playground

ROI calculator

For your business

Customer stories

Discover how enterprises, small businesses and entrepreneurs achieve professional results with Photoroom.

Core ML performance benchmark iPhone 14 (2022)

September 16, 2022

Today is new-iPhone day! At Photoroom, this means today is CoreML-benchmark day. Every year we cannot wait to discover what the improved computing power of the new hardware means for on-device machine learning.

CoreML allows iOS developers to ship and execute machine learning models, allowing us to build complex interactive experiences and building an image editor that allows users to think about objects, not pixels. In recent years, the possibility to run our models on the Apple Neural Engine (ANE) has dramatically sped up CoreML.

This year, we decided to spice things up a bit by analyzing performances not only across iPhone models but also across iOS versions.

Setup

This year we ran a benchmark analyzing the performance of our guided cutout pipeline illustrated above.

We ran this benchmark on the following devices across several OSes:

iPhone 12 Pro A14 Bionic (iOS 15 + iOS 16)
iPhone 13 Pro A15 Bionic (iOS 15 + iOS 16)
iPhone 14 Pro A16 Bionic (iOS 16)
iPad Pro 2021 M1 (iOS 15 + iOS 16)
MacBook Pro 2021 M1 Pro (macOS 12)

For each device, we gathered the average execution time (excluding model-loading time) depending on the CoreML compute units configuration (MLComputeUnits):

cpuOnly (the model exclusively runs on the CPU)
cpuAndGPU (the model runs on the GPU as much as possible, and defaults to the CPU when a layer cannot run on the GPU)
cpuAndNeuralEngine (iOS 16 only, the model runs on the ANE as much as possible, and defaults to the CPU when a layer cannot run on the ANE)
all (the model runs on the ANE, GPU or CPU depending on each unit's ability)

Each device, OS version and compute unit configuration was measured 40 times and averaged, with some cooldown time in between to avoid the effects of thermal throttling on the SoC.

Results / Analysis

The results above reveal a few interesting takeaways:

The consistent speed increase going from CPU to GPU to ANE showcases that most of our model is able to run on both the GPU and the ANE
That being said, the (nearly) consistent increase in inference time when going (on iOS 16) from all to cpuAndNeuralEngine showcases that a few number of layers in our model are not able to run on the ANE, but instead default to the GPU.
The OS version seems to have a negligible impact on overall performance. Do not expect a speed bump by upgrading your OS.
Across the A-series chips (A14 Bionic - A15 Bionic - A16 Bionic) we can see a slow and steady improvement in performances in all configurations. In fact, Apple touted 17 TFlops in its new A16 Bionic chip, up 7.5% from the 15.8 TFlops of the A15 Bionic, which is consistent with the all configuration going from 45ms to 41ms in between the iPhone 13 Pro and the iPhone 14 Pro.
Despite being based on the A14 cores, the M1 chip of the iPad pro is neck and neck on CPU and ANE compared to the new A16 Bionic, and blows it away on the GPU. Of course, this can be explained by both the M1 chip having more GPU cores than the A16 Bionic, and possibly the iPad Pro having a much larger thermal envelope.
Interestingly, the M1 Pro chip in the MacBook Pro does not fare much better than the M1 chip in the iPad Pro, and even performs significantly worse in the ANE. We would love to hear a compelling explanation as to why this is the case.

Conclusion

So far, Apple is unparalleled as far as giving mobile developers tools for on-device machine learning goes. While today's new hardware release is not a ground-breaking innovation, it is the latest data-point in a series of consistent improvements year-over-year. At 41ms per inference on the new iPhone 14 Pro, we are finally over the 24 FPS mark, the standard framerate of cinema (for high-quality, user-guided segmentation!). But if you are looking for the best performance available on models unable to run on the ANE, the latest M1 iPad Pro is still your most trusted ally.

Florian DenisCross Platform @ Photoroom

Design your next great image

Whether you're selling, promoting, or posting, bring your idea to life with a design that stands out.

Start a design

Keep reading

How we automated our changelog thanks to ChatGPT

Jeremy Benaim

What's new in product: July 2024

Jeanette Sha

New Photoroom API updates (+90% off Background Remover API)

Udo Kaja

The road to live-collaboration, part 2

Florian Denis

What's new in product: August 2024

Jeanette Sha

Building Google-Docs-like live collaboration for a cross-platform app used by millions (in Rust)

Eliot Andres

Explore new Gen AI features coming soon to the Photoroom API

Udo Kaja

What's new in product: November 2023

Jeanette Sha

Our Setup for A/B Testing LLMs with Millions of Users

Eliot Andres

Making stable diffusion 25% faster using TensorRT

David Briand

See all

Sell faster with studio‑quality product visuals

Drive sales with professional visuals you can create in minutes, with brand consistency and control.

Start free trial

Contact sales