
Added relativistic discriminator loss used in ESRGAN paper. #261

Merged
bnb32 merged 7 commits into main from bnb/relativistic_disc
Apr 17, 2025

Conversation

@bnb32
Collaborator

@bnb32 bnb32 commented Mar 6, 2025

This changes the previous disc loss calc (and the adversarial loss) to use the relativistic versions described in the ESRGAN paper. The symmetry lets us remove the separate adversarial loss function and just swap the arguments in the new disc loss calc. It also seems worth doing since this approach improved on the SRGAN framework.

I'm also finding this results in much more stable training, btw.

@bnb32 bnb32 force-pushed the bnb/relativistic_disc branch from 8ab0206 to 06c0990 Compare March 7, 2025 23:49
@bnb32 bnb32 marked this pull request as ready for review March 28, 2025 15:24
@bnb32 bnb32 requested a review from grantbuster March 28, 2025 15:24
@grantbuster
Member

@bnb32 Starting review - to clarify, you only implemented the feature "Relativistic average GAN (RaGAN) [20], which learns to judge 'whether one image is more realistic than the other' rather than 'whether one image is real or fake'" from the Wang paper, right? Seems like Wang did a lot of different things and I'm trying to track what to pay attention to.

Also, does this completely revise the previous disc method or is it an option?

@bnb32
Collaborator Author

bnb32 commented Apr 16, 2025

@grantbuster Yeah, the RaGAN disc loss is what I added. This changes the previous method. It allows us to use disc_loss(true, gen) for the discriminator and disc_loss(gen, true) for adversarial loss, instead of two different methods for these, so I removed the previous function for adversarial loss.
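The symmetry described here can be sketched as follows. This is a minimal illustration of the relativistic average (RaGAN) discriminator loss from the ESRGAN paper, assuming the discriminator outputs raw logits; it is not the actual `calc_loss_disc` implementation in sup3r/models/base.py:

```python
import tensorflow as tf

def calc_loss_disc(disc_out_true, disc_out_gen):
    """Relativistic average (RaGAN) discriminator loss sketch.

    Scores how much more realistic the "true" logits look than the
    average of the "generated" logits, and vice versa.
    """
    # Relativistic logits: C(real) - mean(C(fake)) and the reverse
    real_rel = disc_out_true - tf.reduce_mean(disc_out_gen)
    fake_rel = disc_out_gen - tf.reduce_mean(disc_out_true)
    # Binary cross entropy on the relativistic logits: real -> 1, fake -> 0
    loss_real = tf.nn.sigmoid_cross_entropy_with_logits(
        labels=tf.ones_like(real_rel), logits=real_rel
    )
    loss_fake = tf.nn.sigmoid_cross_entropy_with_logits(
        labels=tf.zeros_like(fake_rel), logits=fake_rel
    )
    return tf.reduce_mean(loss_real) + tf.reduce_mean(loss_fake)
```

With this form, `calc_loss_disc(disc_out_true, disc_out_gen)` is the discriminator loss and `calc_loss_disc(disc_out_gen, disc_out_true)` is the generator's adversarial term, which is why the separate adversarial loss function could be dropped.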

@bnb32 bnb32 force-pushed the bnb/relativistic_disc branch from 06c0990 to 3c9ada8 Compare April 16, 2025 17:51
Member

@grantbuster grantbuster left a comment


Minor suggestion but LGTM

Comment thread sup3r/models/base.py Outdated
if train_gen:
    loss = loss_gen
elif train_disc:
    loss = loss_disc
Member


It seems really inefficient to run all the loss calculations and then only output one of them. Why don't we wrap the actual loss calculations in the if statement?

Collaborator Author


Good call. We've had this inefficient setup for a while.

Member


Yeah, no blame - just reading our old code and thinking, well, that could have been done better haha

Collaborator Author


@grantbuster Actually, we need to compute the disc loss every batch to track whether to train it or not. We can skip the gen loss calcs though.
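The constraint described here can be sketched like this: the disc loss is computed every batch so the training loop can decide whether to update the discriminator, while the generator-side losses are only evaluated when the generator is actually training. This is an illustrative pattern, not the code from d122ace; `disc_loss_threshold`, the helper names, and the gating rule are all made up:

```python
import tensorflow as tf

def select_losses(calc_disc_loss, calc_gen_losses, train_gen,
                  disc_loss_threshold=0.6):
    """Compute the disc loss unconditionally (it gates disc training),
    but skip the more expensive generator losses when not training gen.

    ``calc_disc_loss`` and ``calc_gen_losses`` are zero-argument
    callables so each loss is only evaluated when actually needed.
    """
    loss_disc = calc_disc_loss()  # always needed for the gating decision
    # Illustrative rule: only update the disc when its loss is high enough
    train_disc = float(loss_disc) > disc_loss_threshold
    loss = loss_disc
    if train_gen:
        loss = calc_gen_losses()  # content + weighted adversarial terms
    return loss, loss_disc, train_disc
```

The key point is that `calc_disc_loss()` runs on every batch regardless of which network is training, while `calc_gen_losses()` stays behind the `train_gen` check.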

Member


Gotcha, makes sense. Maybe note this in an in-line comment so future-us remembers haha

Collaborator Author

@bnb32 bnb32 Apr 16, 2025


@grantbuster Nvm, how's this solution? d122ace

Member


LGTM

Comment thread sup3r/models/base.py Outdated
Comment on lines +868 to +890
loss_gen_content, loss_gen_content_details = (
    self.calc_loss_gen_content(hi_res_true, hi_res_gen)
)
loss_gen_advers = self.calc_loss_disc(
    disc_out_true=disc_out_gen, disc_out_gen=disc_out_true
)
loss_gen = loss_gen_content + weight_gen_advers * loss_gen_advers
loss_disc = self.calc_loss_disc(
    disc_out_true=disc_out_true, disc_out_gen=disc_out_gen
)

loss = None
if train_gen:
    loss = loss_gen
elif train_disc:
    loss = loss_disc

loss_details = {
    'loss_gen': loss_gen,
    'loss_gen_content': loss_gen_content,
    'loss_gen_advers': loss_gen_advers,
    'loss_disc': loss_disc,
}
Member


Suggested change

Remove:
loss_gen_content, loss_gen_content_details = (
    self.calc_loss_gen_content(hi_res_true, hi_res_gen)
)
loss_gen_advers = self.calc_loss_disc(
    disc_out_true=disc_out_gen, disc_out_gen=disc_out_true
)
loss_gen = loss_gen_content + weight_gen_advers * loss_gen_advers
loss_disc = self.calc_loss_disc(
    disc_out_true=disc_out_true, disc_out_gen=disc_out_gen
)
loss = None
if train_gen:
    loss = loss_gen
elif train_disc:
    loss = loss_disc
loss_details = {
    'loss_gen': loss_gen,
    'loss_gen_content': loss_gen_content,
    'loss_gen_advers': loss_gen_advers,
    'loss_disc': loss_disc,
}

Add:
loss_details = {}
if train_gen:
    loss_gen_content, loss_gen_content_details = (
        self.calc_loss_gen_content(hi_res_true, hi_res_gen)
    )
    loss_gen_advers = self.calc_loss_disc(
        disc_out_true=disc_out_gen, disc_out_gen=disc_out_true
    )
    loss = loss_gen_content + weight_gen_advers * loss_gen_advers
    loss_details['loss_gen'] = loss
    loss_details['loss_gen_content'] = loss_gen_content
    loss_details['loss_gen_advers'] = loss_gen_advers
elif train_disc:
    loss = self.calc_loss_disc(
        disc_out_true=disc_out_true, disc_out_gen=disc_out_gen
    )
    loss_details['loss_disc'] = loss

chunk going through the forward pass is too big. This is due to a
tensorflow padding bug, with the padding mode set to 'reflect'.
https://github.com/tensorflow/tensorflow/issues/91027
Member


You might note that in our current TF version 2.15 this actually just results in scrambled outputs.

Member


(which is very hard to detect)

@bnb32 bnb32 merged commit a386e51 into main Apr 17, 2025
12 checks passed
@bnb32 bnb32 deleted the bnb/relativistic_disc branch April 17, 2025 15:33
github-actions Bot pushed a commit that referenced this pull request Apr 17, 2025
Added relativistic discriminator loss used in ESRGAN paper.