k-1 weight decay #19

tom-pollak · 2025-08-21T09:12:44Z

We want the weight decay (wd) to be centered around 1 (the exponential distribution), not 0.

scrambledpie · 2025-08-21T09:29:40Z

boplay/acq_funs/gamma_distribution.py

    for _ in range(max_iters):
        grad = (np.log(k) - digamma(k) - s) / (1.0 / k - polygamma(1, k) + 1e-8)
-        grad += k**2 * wd
+        grad += (k - 1) ** 2 * wd


Exponential distro for the win!

scrambledpie · 2025-08-21T10:36:41Z

boplay/acq_funs/ves_base.py

+        # Apply custom weight decay centered at 1
+        if wd > 0:
+            with pt.no_grad():
+                theta.data -= wd * (theta.data - 1)


theta here can be more general than the k-values of a gamma distribution. (This code isn't being used in experiments at the moment, so it's not an issue for the deadline.)
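
The decay step in the diff above can be sketched in plain Python to show its effect in isolation. This assumes `theta` holds values for which 1 is a sensible target (such as gamma shape parameters); `decay_toward_one` is a hypothetical name, and the real code operates on tensor `.data` in place rather than on lists.

```python
def decay_toward_one(theta, wd):
    """One decoupled weight-decay step that shrinks each value toward 1, not 0."""
    return [t - wd * (t - 1.0) for t in theta]


theta = [0.2, 1.0, 5.0]
for _ in range(200):
    theta = decay_toward_one(theta, wd=0.1)

# Each step multiplies the distance from 1 by (1 - wd), so values on either
# side of 1 contract toward it and a value already at 1 stays put.
assert all(abs(t - 1.0) < 1e-6 for t in theta)
```

The step is linear in `theta - 1`, so it changes sign on either side of 1 and pulls values below 1 upward as well as values above 1 downward.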

k-1 weight decay

af40faa

scrambledpie reviewed Aug 21, 2025

scrambledpie approved these changes Aug 21, 2025

scrambledpie reviewed Aug 21, 2025

scrambledpie force-pushed the master branch from 5614eaf to ca7aa8d Compare August 21, 2025 10:46

3 participants
