Links to research papers and resources corresponding to implemented features in this repository. Please feel free to fill in any missing references!
Method | Resource |
---|---|
GradAscent, GradDiff | Naive baselines found in many papers including MUSE, TOFU etc. |
NPO | Paper 📄, Code 🐙 |
SimNPO | Paper 📄, Code 🐙 |
IdkDPO | TOFU (📄) |
RMU | WMDP paper (🐙, 🌐), later used in G-effect (🐙) |
Benchmark | Resource |
---|---|
TOFU | Paper 📄 |
MUSE | Paper 📄 |
Metric | Resource |
---|---|
Verbatim Probability / ROUGE, simple QA-ROUGE | Naive metrics found in many papers including MUSE, TOFU etc. |
Membership Inference Attacks (LOSS, ZLib, Reference, GradNorm, MinK, MinK++) | MIMIR (🐙), MUSE (📄) |
PrivLeak | MUSE (📄) |
Forget Quality, Truth Ratio, Model Utility | TOFU (📄) |
Extraction Strength (ES) | Carlini et al., 2021 (📄), used for unlearning in Wang et al., 2025 (📄) |
Exact Memorization (EM) | Tirumala et al., 2022 (📄), used for unlearning in Wang et al., 2025 (📄) |