Feat/package and device compatibility by paulasquin · Pull Request #3 · apple/ml-mgie

paulasquin · 2024-02-08T11:43:48Z

Refacto, Packaging & Apple Silicon compatibility

Add poetry-style packaging
Refacto code in Object Oriented Programming
Add typing
Add tests
Add mps compatibility (tested on M3 Max 64Go)
Add gradio app

To squash before merge

Solved issues

Nonsense inner thougts

In Apple Silicon, we are (were) getting nonsense from the model.generate methods

Payload

Instruction: make the frame red
Image:

Expected:

Out:

If the frame of the glasses in the image were made red, the overall appearance of the scene would change significantly.The red frame would draw more attention to the glass and create a stronger contrast with the black frame.

Res:

Obtained

Out

Pres flash togful calledgot At commitilli split sent supports fir card projects course bunch mixture enc halery racc developed curves enjoydog memory seek Inside Wh sam closure served supports fir tripifest towardinn household finishing exact meaning ordinary treat drop whose invert Rem follow til Otherwise stal frames sequence lifted accomp entire variation government carriage uses eratrim condition Wild throne phys mutong B woods racc developed Le rename Ada laugh applying dess squ cit reference rad type refresh spr rud embedded agricult foot ax steps God close These

Res:
~same as input

Fix

Latest llava weights that you can get from hugging face with git clone https://huggingface.co/liuhaotian/LLaVA-Lightning-7B-delta-v1-1 are just not working.
Solved using saved weights by tsujuifu, stored in GoogleDrive
-> A lot of time lost out of this. This is due to delta-vs-full LLava?

Out

The image would feature a close-up view of a pair of black eyeglasses with a gold or metallic frame, placed on a gray background.The frame would be red, drawing attention to the glasses and making them the focal point of the image.

Res

xiaoqian-shen · 2024-02-13T15:58:04Z

I also faced issues when trying to reproduce the results. Although no errors were displayed, the quality of the editing was not good as the paper. Could you please share the environment file so I can verify the versions of the critical packages?

xiaoqian-shen · 2024-02-13T17:03:19Z

I fix the problem by using your provided checkpoint in google drive. Thanks!

paulasquin · 2024-02-13T22:44:52Z

Hello @xiaoqian-shen

Indeed I suggest to use the models from my HuggingFace, which is from Tsu-Jui Fu's Google Drive link.
I do not have clear understanding of why original package weights aren't working.

Even if this isn't needed for you anymore, here are the package version if it can help others:
I'm sharing poetry run python -m pip freeze instead of poetry.lock file for readability

freeze.txt

xiaoqian-shen · 2024-02-15T14:58:42Z

Thanks for your reply! May I ask are you available to reproduce the result of MagicBrush in Table 2?

GitHub1712 · 2024-02-17T12:01:15Z

My trained mgie_7b also not working. Was able to train and export mllm.pt and unet.pt but if running demo, ckpt has no 'emb' and my ckpt´s 'model.embed_tokens.weight' have different tensor size. So running training worked but result model not. With tsujuifu´s weights demo works.

paulasquin · 2024-02-21T17:08:45Z

Thanks for your reply! May I ask are you available to reproduce the result of MagicBrush in Table 2?

Hello @xiaoqian-shen
I have sometimes slight differences but I get mainly same level of quality, and a few times I got ugly results (phone and beach photos mainly)

Here are my before/after on the demo images

lzw-lzw · 2024-03-18T03:59:52Z

Thank you for your contribution. I wonder where can I find the ipr2ipr.pkl/tsv data in the code, that is, the summarized image-text pair, or do I need to construct it myself?

paulasquin added 23 commits February 8, 2024 12:40

init poetry packaging

ed7d29e

add package readme

77b69b4

preco and wip for mps

770e8ad

add preco yaml

67a9714

remove llava

dc8017d

refacto to add processing in object

b913dfd

add pure llava

3b942dc

add test image

df7b0a2

add test model

610776d

rm old llava

e080433

more refacto, add cli

e6783e8

add max size

a6a0e56

allow none maxsize

710f92c

default to float32

c9100d9

add app gradio

e5b4ea5

default size

2ed91bd

add back submodule llava to avoid perturbing legacy

9ddfbff

add newline

371830b

add back submodule llava for legacy

db33463

add inference as decorator, half as default

73c5c25

rename fields in app

84556f9

update package readme

6af3819

update tests and check image result with ssim

4cdba2f

paulasquin marked this pull request as ready for review February 10, 2024 11:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/package and device compatibility#3

Feat/package and device compatibility#3
paulasquin wants to merge 23 commits intoapple:mainfrom
paulasquin:feat/package_and_device_compatibility

paulasquin commented Feb 8, 2024 •

edited

Loading

Uh oh!

xiaoqian-shen commented Feb 13, 2024

Uh oh!

xiaoqian-shen commented Feb 13, 2024

Uh oh!

paulasquin commented Feb 13, 2024

Uh oh!

xiaoqian-shen commented Feb 15, 2024

Uh oh!

GitHub1712 commented Feb 17, 2024 •

edited

Loading

Uh oh!

paulasquin commented Feb 21, 2024

Uh oh!

lzw-lzw commented Mar 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

paulasquin commented Feb 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Refacto, Packaging & Apple Silicon compatibility

Solved issues

Nonsense inner thougts

Payload

Expected:

Obtained

Fix

Uh oh!

xiaoqian-shen commented Feb 13, 2024

Uh oh!

xiaoqian-shen commented Feb 13, 2024

Uh oh!

paulasquin commented Feb 13, 2024

Uh oh!

xiaoqian-shen commented Feb 15, 2024

Uh oh!

GitHub1712 commented Feb 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paulasquin commented Feb 21, 2024

Uh oh!

lzw-lzw commented Mar 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

paulasquin commented Feb 8, 2024 •

edited

Loading

GitHub1712 commented Feb 17, 2024 •

edited

Loading