
Conversation


@HarliWu commented on Aug 8, 2023

There are two main updates:

  1. fschat.py: Users can call next_model() to switch to the next model when multiple checkpoints are available.
  2. utils.py: New functions can be added to support new offsite-tuning strategies (note: each new function must also be registered in generate_adap_model() accordingly); see the sketch below.
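
A minimal sketch of how these two entry points might fit together. All names below (MultiCheckpointChat, register_strategy, drop_layer_strategy, generate_adap_model_sketch) are illustrative assumptions, not the actual FederatedScope-LLM API:

```python
from typing import Callable, Dict, List


class MultiCheckpointChat:
    """Rotate through several checkpoints, as next_model() does in fschat.py."""

    def __init__(self, checkpoint_paths: List[str]):
        assert checkpoint_paths, 'at least one checkpoint is required'
        self._paths = checkpoint_paths
        self._idx = 0

    def current_checkpoint(self) -> str:
        return self._paths[self._idx]

    def next_model(self) -> str:
        # Wrap around once the last checkpoint has been reached.
        self._idx = (self._idx + 1) % len(self._paths)
        return self.current_checkpoint()


# Registry pattern for utils.py: new offsite-tuning strategies are added here
# and dispatched by a generate_adap_model()-style entry point.
STRATEGIES: Dict[str, Callable] = {}


def register_strategy(name: str) -> Callable:
    def decorator(fn: Callable) -> Callable:
        STRATEGIES[name] = fn
        return fn
    return decorator


@register_strategy('drop_layer')
def drop_layer_strategy(model, **kwargs):
    # Placeholder body; a real strategy would build the emulator and adapter.
    return model


def generate_adap_model_sketch(model, strategy: str = 'drop_layer', **kwargs):
    if strategy not in STRATEGIES:
        raise ValueError(f'unknown offsite-tuning strategy: {strategy}')
    return STRATEGIES[strategy](model, **kwargs)
```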

@HarliWu changed the title from Dev/llm to Offsite-tuning model generation on Aug 8, 2023

@rayrayraykk (Collaborator) left a comment


Please see the inline comments. Thx!

else:
    try:
        ckpt = torch.load(config.federate.save_to, map_location='cpu')
        self.prefix = ['']


Should prefix be passed by the config?
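
A rough sketch of that suggestion (the attribute path config.llm.chat.prefix is a hypothetical name chosen for illustration, not an existing FederatedScope config field):

```python
def resolve_prefix(config, default=('', )):
    """Read the chat prefix from the config if it is set, else fall back.

    The path `config.llm.chat.prefix` is hypothetical and only illustrates
    the idea of making the prefix configurable.
    """
    llm_cfg = getattr(config, 'llm', None)
    chat_cfg = getattr(llm_cfg, 'chat', None) if llm_cfg is not None else None
    prefix = getattr(chat_cfg, 'prefix', None) if chat_cfg is not None else None
    return list(prefix) if prefix else list(default)
```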

# No need for this attr
if hasattr(adap_model, 'teacher'):
    import gc
    import torch


How about moving lines 48-49 to the top:

try:
    import gc
    import torch
except ImportError:
    gc = None
    torch = None

new_model = set_layers(new_model, emulator_and_adapter)

if emulator_alignment:
    new_model.student = layers


Please merge the latest commits in which bugs are fixed. (layers should be detached from new_model)
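
For reference, the detachment could look roughly like this. This is only a sketch of the idea, not the actual fix in the merged commits:

```python
import copy


def attach_detached_student(new_model, layers):
    # Deep-copy the selected layers so the student no longer shares
    # parameters with new_model (the detachment the comment refers to).
    new_model.student = copy.deepcopy(layers)
    return new_model
```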
