Stable Diffusion inference on iOS / macOS using MPSGraph

Last update: Jan 6, 2023

Related tags

Overview

🍁 Maple Diffusion

Maple Diffusion runs Stable Diffusion models locally on macOS / iOS devices, in Swift, using the MPSGraph framework (not Python).

Maple Diffusion should be capable of generating a reasonable image in a minute or two on a recent iPhone (I get around ~2.3s / step on an iPhone 13 Pro).

To attain usable performance without tripping over iOS's 4GB memory limit, Maple Diffusion relies internally on FP16 (NHWC) tensors, operator fusion from MPSGraph, and a truly pitiable degree of swapping models to device storage.

On macOS, Maple Diffusion uses slightly more memory (~6GB), to reach <1s / step.

Projects using Maple Diffusion

https://github.com/mortenjust/maple-diffusion/ is a fork with several improvements

Device Requirements

Maple Diffusion should run on any Apple Silicon Mac (M1, M2, etc.). Intel macs seem to not support Float16 or something.

Maple Diffusion should run on any iOS device with sufficient RAM (≥6144MB RAM definitely works; 4096MB might but I wouldn't bet on it; anything lower than that won't work). Recent iPads should work out of the box; recent iPhones should work if you can get the Increase Memory Limit capability working (to unlock 4GB of RAM).

iOS 16.1 (beta) is reportedly broken and always generating a gray image :(

Usage

To build and run Maple Diffusion:

Download a Stable Diffusion PyTorch model checkpoint (sd-v1-4.ckpt, or some derivation thereof)

Download this repo

git clone https://github.com/madebyollin/maple-diffusion.git && cd maple-diffusion

Setup & install Python with PyTorch, if you haven't already.

# may need to install conda first https://github.com/conda-forge/miniforge#homebrew
conda deactivate
conda remove -n maple-diffusion --all
conda create -n maple-diffusion python=3.10
conda activate maple-diffusion
pip install torch typing_extensions numpy Pillow requests pytorch_lightning

Convert the PyTorch model checkpoint into a bunch of fp16 binary blobs.
```
./maple-convert.py ~/Downloads/sd-v1-4.ckpt
```
Open the maple-diffusion Xcode project. Select the device you want to run on from the Product > Destination menu.
Manually add the Increased Memory Limit capability to the maple-diffusion target (this step might not be needed on iPads, but it's definitely needed on iPhones - the default limit is 3GB).
Build & run the project on your device with the Product > Run menu.

Comments

Getting started with image input - can't figure it out

This is what I'm doing:

cloned the repo
went with Option 1 and downloaded the zip, unzipped the "bins" folder
opened the SimpleDiffusion app and changed the URL in the task modifier to point to the local path:

            let path = URL(fileURLWithPath: "/Users/xyz/Downloads/bins")
            do {
                try await sd.prepModels(localUrl: path)
            }

additionally set the inputImage to my input image:

            inputImage = NSImage(contentsOf: URL(fileURLWithPath: "/Users/xyz/IMG_1076.JPG"))?
                .cgImage(forProposedRect: nil, context: nil, hints: nil)

When I run the app, I set the prompt to "cat" and start the conversion. The process begins and then crashes with:

The file “first_stage_model.encoder.conv_in.weight.bin” couldn’t be opened because there is no such file.

opened by icanzilb 4

Added an icon for maple diffusion,

Thought you'd get a kick out the fact I used the app to auto generate it's own icon. I tried many times but the first one was the best by far so it feels like fate.

opened by Halopend 0
Error: failed assertion `length(16640) must be <= 16384.'
I'm getting the following error: failed assertion 'length(16640) must be <= 16384.'

Steps to reproduce :

Create a new projet

Add native-diffusion as a dependency (branch: master)

Download pre-converted bins from here

Run python web server that serves bin.zip from localhost:8080

Use SimpleDiffusion example here

I launch the app, I type "astronaut cat" in the TextField. And after a few seconds I got this error :

-[MTLDebugComputeCommandEncoder setThreadgroupMemoryLength:atIndex:]:543: failed assertion `length(16640) must be <= 16384.'

Specs

Tested on Mac details here

Exported to

iPhone 14 Pro Max

iPad Pro 6th gen

Screenshot

Logs

ipad-log.log iphone-log.log
opened by hdsenevi 0
Error: Buffer must not exceed 256MB
Steps to reproduce :

Created a new projet Added native-diffusion as a dependency (branch: master)

Download pre-converted bins here : https://drive.google.com/file/d/1EU_qxlF-p6XrxsoACvcI2CAmu45NphAC/view?usp=share_link

Copy/Pasted code from Readme.md, and chanded it to localUrl :

let path = URL(fileURLWithPath: "/Users/xyz/Downloads/bins") do { try await sd.prepModels(localUrl: path) }

I launch the app, I type "astronaut cat" in the TextField. And after a few seconds I got this error :

-[MTLDebugHeap newBufferWithLength:options:]:123: failed assertion `Buffer Validation newBufferWith*:length 0x20000000 must not exceed 256 MB.
opened by charvoa 4
Warn or fail when using an incompatible input image size

When you pass an input image to the generator, it currently doesn't check if the image fits the model (currently we only support SD1.5, so it's always 512x512)

This warning is for all users using img2img via an inputImage

The warning could be an assertion failure, but open to ideas.

Next: Resize off-size images, using a center crop. Feel free to go for this directly. Later: Support for SD2.0
help wanted

opened by mortenjust 0
Support for Stable Diffusion 2.0
2.0 just came out today. Blog post, repo

Highlights

Generate 768x768 images (weights)

New Upscaler model that can scale up to 4x in one go (weights)

Depth to image for shape-preserving img2img

Improved inpainting (weights)

We probably want to support both 2.0 and 1.5
help wanted
opened by mortenjust 0

Releases(0.4)

0.4(Nov 24, 2022)

Source code(tar.gz)
Source code(zip)
0.3(Nov 9, 2022)

This release takes a different approach to the original code by @madebyollin , making it easier to keep up with changes.
Source code(tar.gz)
Source code(zip)
0.2(Oct 22, 2022)

Please note that the next version will be a rewrite, so consider holding off any PRs for a few days :)
Source code(tar.gz)
Source code(zip)
0.1(Oct 22, 2022)

The next release will change the API slightly
Source code(tar.gz)
Source code(zip)

Owner

Morten Just

Isn't it cool how we all work together?

GitHub https://madebyoll.in/posts/maple_diffusion/

AudioPlayer is a simple class for playing audio in iOS, macOS and tvOS apps.

AudioPlayer AudioPlayer is a simple class for playing audio in iOS, macOS and tvOS apps.

260 Nov 27, 2022

Swift audio synthesis, processing, & analysis platform for iOS, macOS and tvOS

AudioKit AudioKit is an audio synthesis, processing, and analysis platform for iOS, macOS (including Catalyst), and tvOS. Installation To add AudioKit

8.7k Sep 30, 2021

A minimal chromatic tuner for iOS & macOS.

Zen Tuner A minimal chromatic tuner. Works well with any instrument, whether it's woodwinds, brass, strings or voice. Dark Light Features Beautiful, c

448 Dec 26, 2022

MuVis is a macOS, iOS, iPadOS app for real-time music visualization.

MuVis MuVis is an open-source multiplatform app (using SwiftUI, Swift, and Xcode) for music visualization. It renders informative (and musically usefu

7 Oct 10, 2022

An iOS and macOS audio visualization framework built upon Core Audio useful for anyone doing real-time, low-latency audio processing and visualizations.

A simple, intuitive audio framework for iOS and OSX. Deprecated EZAudio has recently been deprecated in favor of AudioKit. However, since some people

4.9k Jan 2, 2023

A sound fader for AVAudioPlayer written in Swift for iOS, tvOS and macOS.

Cephalopod, a sound fader for AvAudioPlayer written in Swift - iOS, tvOS and macOS This library can help fading sounds in and out with AvAudioPlayer.

109 Dec 16, 2022

An original digital art work for macOS, iOS and web

WordClock WordClock versions of various vintages, for various platforms. https://www.simonheys.com/wordclock/ Downloads Currently avaiaible to downloa

62 Dec 7, 2022

MuVis is a macOS, iOS, iPadOS app for real-time music visualization.

MuVis MuVis is an open-source multi-platform app (using SwiftUI, Swift, and Xcode) for music visualization. It renders informative (and musically usef

4 Dec 24, 2022

🎶 Control Apple Music right from your macOS menu bar.

About Music Bar is macOS application that places music controls right in your menu bar. Installation Download and unzip the latest release. Drag the a

202 Jan 4, 2023

macOS app that allows the control of Spotify and AppleMusic/iTunes music playback from the menu bar.

PlayStatus is a simple macOS app that allows the control of Spotify, Apple Music(macOS 10.15+) and iTunes including iTunes Radio/Beats1 playback from

114 Dec 28, 2022

macOS app for Hype Machine

Plug Discover and listen to music from Hype Machine Download Requires macOS 10.15 or later. Screenshot FAQ Can I contribute localizations? We do not h

171 Jan 1, 2023

A macOS app to visualise your iTunes library as graphs.

iTunes Graphs iTunes Graphs is a Cocoa-based macOS app which visualises your iTunes library in a series of pie charts. Currently, it supports the foll

46 Dec 26, 2022

A simple Spotify lyrics viewer menu bar app for macOS in Swift 3

lyricsify This is a simple macOS menu bar application that shows you the lyrics of current playing spotify track. All the lyricses are from Wikia webs

85 Dec 31, 2022

Simple command line utility for switching audio inputs and outputs on macOS

Switch Audio Simple command line utility for switching audio inputs and outputs

3 Nov 22, 2022

PTStations - MacOS App to gather the information on the Train Station of the Portuguese Railway system

PTStations A simple macOS app that gathers information on the Portuguese Railway

2 Aug 10, 2022

TTSLanguage: Text To Speech commandline executable for macOS

TTSLanguage Text To Speech commandline executable for macOS. It can detect sentence language and read it using proper voice. example: $ TTSLanguage "H

2 Jan 17, 2022

iOS framework that enables detecting and handling voice commands using microphone.

iOS framework that enables detecting and handling voice commands using microphone. Built using Swift with minumum target iOS 14.3.

20 Aug 4, 2022

Beautiful Music Player app built using SwiftUI to demonstrate Neumorphic design pattern and MVVM architecture.

Beautiful Music Player app built using SwiftUI to demonstrate Neumorphic design pattern ?? and MVVM architecture ?? . Made with love ❤️ by Sameer Nawaz

120 Jan 4, 2023

The DoubanFM for iPhone,using AFN and MPMoviePlayer

DoubanFM github:@XVXVXXX csdn:@XVXVXXX please open DoubanFM.xcworkspace The DoubanFM for iPhone,using MPMoviePlayer &AFNetworking@AFNetworking/AFNetwo

465 Jan 6, 2023

Stable Diffusion inference on iOS / macOS using MPSGraph

Related tags

Overview

🍁 Maple Diffusion

Projects using Maple Diffusion

Device Requirements

Usage

Comments

Getting started with image input - can't figure it out

Added an icon for maple diffusion,

Error: failed assertion `length(16640) must be <= 16384.'

Steps to reproduce :

Specs

Screenshot

Logs

Error: Buffer must not exceed 256MB

Warn or fail when using an incompatible input image size

Support for Stable Diffusion 2.0

Releases(0.4)

0.4(Nov 24, 2022)

0.3(Nov 9, 2022)

0.2(Oct 22, 2022)

0.1(Oct 22, 2022)

Owner

Morten Just

AudioPlayer is a simple class for playing audio in iOS, macOS and tvOS apps.

Swift audio synthesis, processing, & analysis platform for iOS, macOS and tvOS

A minimal chromatic tuner for iOS & macOS.

MuVis is a macOS, iOS, iPadOS app for real-time music visualization.

An iOS and macOS audio visualization framework built upon Core Audio useful for anyone doing real-time, low-latency audio processing and visualizations.

A sound fader for AVAudioPlayer written in Swift for iOS, tvOS and macOS.

An original digital art work for macOS, iOS and web

MuVis is a macOS, iOS, iPadOS app for real-time music visualization.

🎶 Control Apple Music right from your macOS menu bar.

macOS app that allows the control of Spotify and AppleMusic/iTunes music playback from the menu bar.

macOS app for Hype Machine

A macOS app to visualise your iTunes library as graphs.

A simple Spotify lyrics viewer menu bar app for macOS in Swift 3

Simple command line utility for switching audio inputs and outputs on macOS

PTStations - MacOS App to gather the information on the Train Station of the Portuguese Railway system

TTSLanguage: Text To Speech commandline executable for macOS

iOS framework that enables detecting and handling voice commands using microphone.

Beautiful Music Player app built using SwiftUI to demonstrate Neumorphic design pattern and MVVM architecture.

The DoubanFM for iPhone,using AFN and MPMoviePlayer