Commit a13c683
authored
Changes the DPO + finetune scripts to provide progress updates in the Beaker description. (#1127)
* Now, we get num_attention_heads from the hf config.
* Update code
* Added test that we match manual values
* Updated calculations
* Updated code with check_calculation
* Updated code
* Now, tests pass.
* Updated code to normalize properly
* Added some fixes
* Updated code
* Updated code
* Another fix
* Cleaned up tests.
* Cleaned up PR
* Update MFU/MBU code.
* Now, mbu tests pass.
* Moved to json file
* Added test data
* undid changes and simplified test function.
* An attempt at a fix
* Update code with patches
* now, tests pass
* Added MFU to DPO
* updated script
* uses uv for dpo
* Added a chat template to the DPO script.
* Added trackign
* Updated code to handle tracking when none
* Added description updates
* undid changes
* Check out dpo script
* updated script
* Update code to remove whitespace
* fix finetune timing
* Fixed bugs pointed out by cursor.1 parent a76cc4b commit a13c683
File tree
3 files changed
+38
-10
lines changed- open_instruct
- scripts/train/debug
3 files changed
+38
-10
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
79 | 79 | | |
80 | 80 | | |
81 | 81 | | |
| 82 | + | |
82 | 83 | | |
83 | 84 | | |
84 | 85 | | |
| |||
498 | 499 | | |
499 | 500 | | |
500 | 501 | | |
| 502 | + | |
501 | 503 | | |
502 | 504 | | |
503 | 505 | | |
| |||
813 | 815 | | |
814 | 816 | | |
815 | 817 | | |
| 818 | + | |
816 | 819 | | |
817 | 820 | | |
818 | 821 | | |
| |||
936 | 939 | | |
937 | 940 | | |
938 | 941 | | |
| 942 | + | |
| 943 | + | |
| 944 | + | |
| 945 | + | |
| 946 | + | |
| 947 | + | |
939 | 948 | | |
940 | 949 | | |
941 | 950 | | |
| |||
989 | 998 | | |
990 | 999 | | |
991 | 1000 | | |
992 | | - | |
| 1001 | + | |
993 | 1002 | | |
994 | 1003 | | |
995 | 1004 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
62 | 62 | | |
63 | 63 | | |
64 | 64 | | |
| 65 | + | |
65 | 66 | | |
66 | 67 | | |
67 | 68 | | |
| |||
438 | 439 | | |
439 | 440 | | |
440 | 441 | | |
| 442 | + | |
441 | 443 | | |
442 | 444 | | |
443 | 445 | | |
| |||
727 | 729 | | |
728 | 730 | | |
729 | 731 | | |
730 | | - | |
| 732 | + | |
731 | 733 | | |
732 | 734 | | |
733 | 735 | | |
| |||
824 | 826 | | |
825 | 827 | | |
826 | 828 | | |
827 | | - | |
| 829 | + | |
| 830 | + | |
| 831 | + | |
828 | 832 | | |
829 | 833 | | |
830 | | - | |
| 834 | + | |
831 | 835 | | |
832 | 836 | | |
833 | 837 | | |
| |||
855 | 859 | | |
856 | 860 | | |
857 | 861 | | |
858 | | - | |
| 862 | + | |
859 | 863 | | |
860 | 864 | | |
861 | 865 | | |
| |||
869 | 873 | | |
870 | 874 | | |
871 | 875 | | |
872 | | - | |
| 876 | + | |
873 | 877 | | |
874 | 878 | | |
875 | 879 | | |
876 | 880 | | |
877 | | - | |
| 881 | + | |
878 | 882 | | |
879 | 883 | | |
880 | 884 | | |
881 | 885 | | |
882 | 886 | | |
| 887 | + | |
| 888 | + | |
| 889 | + | |
| 890 | + | |
| 891 | + | |
| 892 | + | |
883 | 893 | | |
884 | 894 | | |
885 | 895 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
2 | 5 | | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
3 | 10 | | |
4 | 11 | | |
5 | | - | |
| 12 | + | |
| 13 | + | |
6 | 14 | | |
7 | 15 | | |
8 | 16 | | |
| |||
26 | 34 | | |
27 | 35 | | |
28 | 36 | | |
| 37 | + | |
29 | 38 | | |
30 | | - | |
| 39 | + | |
0 commit comments