No luck with this repo: the `bitsandbytes` dependency relies heavily on CUDA.
But there is a repo for CPU inference; just change `prompts` to `prompts[0]` so it doesn't crash with `max_batch_size=1`.
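For reference, a minimal sketch of that edit, assuming the stock `example.py` from the repo (the `prompts` list and `generator.generate` names are taken from there; not a complete program):

```python
# Sketch of the edit in the repo's example script.
prompts = [
    "I believe the meaning of life is",
    "Simply put, the theory of relativity states that",
]

# With max_batch_size=1, passing the full list overflows the batch and crashes:
# results = generator.generate(prompts, max_gen_len=20)

# Feed a single prompt instead (keep it wrapped in a list,
# since generate expects a list of prompts):
results = generator.generate([prompts[0]], max_gen_len=20)
```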
It takes more than 10 minutes to produce output with `max_gen_len=20`; for comparison, even GPT-J 6B took me only around a minute on CPU.
I also tried to make an MPS port with GPU acceleration. It runs faster, but the output quality isn't good enough, in my opinion; I'm not sure whether the CPU output is always good or I just got lucky on my first generation. UPDATE: the model gives good outputs with Python 3.10 + pytorch-nightly.
Actually, I was wrong. After I tried my port with a newer Python + PyTorch, the outputs were as good as the CPU ones. I'm happy that it worked after all!
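In case it helps anyone attempting the same port, here is a minimal, self-contained sketch of the device-selection part (this only shows how PyTorch's MPS backend is picked up; the tiny `Linear` model is a stand-in, the real port moves the LLaMA weights and inputs the same way):

```python
import torch

# Pick the Metal (MPS) backend when available, otherwise fall back to CPU.
device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

# Stand-in model just to show the device round-trip; a real port
# calls .to(device) on the loaded model and on every input tensor.
model = torch.nn.Linear(8, 8).to(device)
x = torch.randn(1, 8, device=device)
print(device, model(x).shape)
```

Note that MPS support requires macOS 12.3+ and a recent PyTorch build, which may be why pytorch-nightly fixed the output quality for me.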
M1 / M2 with 32GB … 128GB, any hopes?