points by nathas 3 years ago

1.3 sec/iter on my M1 Mac, so ~39 seconds.

jw1224 3 years ago

That was fast. I'm only getting 5.26s/iter on an M1 Pro MBP with 16GB RAM.

EDIT: Speed increased to 2.3s/iter after a reboot

  • nathas 3 years ago

    Depends what fork you're running... Some seem to be using CPU-based generation, others use the MPS device backend correctly which is MUCH faster. I have another comment floating around about lstein's fork, but it takes some massaging to get it to run happily. https://github.com/lstein/stable-diffusion/

    • jw1224 3 years ago

      The fork linked by OP is MPS-based, I can see GPU usage way up in Activity Monitor. Seems performance doubled after a reboot though :)

  • geerlingguy 3 years ago

    Weird, on M1 Max Mac Studio, only getting 1.42 it/s :/

    • nathas 3 years ago

      I got my units backwards :sweat: My bad!

  • black3r 3 years ago

    the thing eats 15GB memory on my M1 Pro with 32GB RAM... you're probably slowed down by swapping if you only have 16GB RAM...