Machine Learning & AI on Apple Silicon

For on-device ML inference, Neural Engine TOPS is the headline number—it determines how quickly models like Stable Diffusion or local LLMs generate output. For training and fine-tuning, unified memory size is often the bottleneck: larger models need more RAM, and Apple Silicon's unified architecture lets the GPU and NPU share the full pool. Memory bandwidth governs how fast weights move through the system, directly impacting tokens-per-second for LLM inference.

Laptops

The M5 Max delivers 61 NPU TOPS and up to 128 GB of unified memory with 614 GB/s bandwidth—enough to run 70B-parameter LLMs on a laptop. The M5 Pro is a practical choice for models up to ~30B parameters. The base M5 handles smaller models and inference workloads well but tops out at 32 GB.

Spec M5
10c CPU / 10c GPU
M5
10c CPU / 8c GPU
A18 Pro
6c CPU / 5c GPU
M5 Max
18c CPU / 40c GPU
M5 Max
18c CPU / 32c GPU
M5 Pro
18c CPU / 20c GPU
M5 Pro
15c CPU / 16c GPU
Devices MacBook Air 15″
MacBook Air 13″
MacBook Pro 14″
iPad Pro 13″
iPad Pro 11″
Apple Vision Pro
MacBook Air 13″ MacBook Neo MacBook Pro 16″
MacBook Pro 14″
MacBook Pro 16″
MacBook Pro 14″
MacBook Pro 16″
MacBook Pro 14″
MacBook Pro 14″
Neural Engine cores 16 16 16 16 16 16 16
Neural Engine TOPS 61 61 35 61 61 61 61
CPU Cores 10 10 6 18 18 18 15
Super Cores 4 4 6 6 6 5
Performance Cores 2 12 12 12 10
GPU cores 10 8 5 40 32 20 16
TFLOPS 5.13 4.11 20.53 16.42 10.27 8.21
Memory bandwidth (GB/s) 153.6 153.6 60 614 460 307 307
Memory type LPDDR5X-9600 LPDDR5X-9600 LPDDR5 LPDDR5X-9600 LPDDR5X-9600 LPDDR5X-9600 LPDDR5X-9600
Memory options (GB) 16
24
32
16
24
32
8 48
64
128
36 24
48
64
24
48

Desktops

For larger models and training workloads, desktop Macs offer more memory headroom. The M3 Ultra in the Mac Studio supports up to 256 GB unified memory. If your models fit in 128 GB, the M4 Max delivers better per-TOPS performance at a lower price. The M4 in the iMac and Mac mini is suitable for inference on smaller models (up to ~13B parameters).

Spec M4
10c CPU / 10c GPU
M4
8c CPU / 8c GPU
M4 Pro
14c CPU / 20c GPU
M4 Pro
12c CPU / 16c GPU
M4 Max
16c CPU / 40c GPU
M4 Max
14c CPU / 32c GPU
M3 Ultra
32c CPU / 80c GPU
M3 Ultra
28c CPU / 60c GPU
Devices iMac
Mac mini
iMac Mac mini Mac mini Mac Studio Mac Studio Mac Studio Mac Studio
Neural Engine cores 16 16 16 16 16 16 32 32
Neural Engine TOPS 38 38 38 38 38 38 36 36
CPU Cores 10 8 14 12 16 14 32 28
Performance Cores 4 4 10 8 12 10 24 20
GPU cores 10 8 20 16 40 32 80 60
TFLOPS 4.26 3.41 8.52 6.82 17.04 13.64 28.262 21.197
Memory bandwidth (GB/s) 120 120 273 273 546 409.6 819.2 819.2
Memory type LPDDR5X-7500 LPDDR5X-7500 LPDDR5X-8533 LPDDR5X-8533 LPDDR5X-8533 LPDDR5X-8533 LPDDR5-6400 LPDDR5-6400
Memory options (GB) 8
16
24
32
8
16
24
32
24
48
64
24
48
64
48
64
128
36 96
256
96
256