Skip to content

Refine anomaly detection features#508

Merged
JohT merged 9 commits intomainfrom
features/refine-feature-engineering-for-anomaly-detection
Jan 26, 2026
Merged

Refine anomaly detection features#508
JohT merged 9 commits intomainfrom
features/refine-feature-engineering-for-anomaly-detection

Conversation

@JohT
Copy link
Owner

@JohT JohT commented Jan 18, 2026

🚀 Feature

  • Remove constant features from anomaly detection: Some of the newly introduced features may turn out to be constant and therefore not informative. Such features are now removed during the data preparation step.
  • Consider abstract classes to have 70% abstractness: When calculating abstractness (according to Robert C. Martin’s object-oriented design metrics), interfaces and abstract classes were previously treated as equal. Since abstract classes often contain concrete implementations for inheriting classes, they are now weighted with 70% abstractness.
  • 🌟 Add abstractness to anomaly detection features: Adds “abstractness” (Robert C. Martin’s Object-Oriented Design Metrics) as an additional feature for anomaly detection. This introduces semantic information to better distinguish, for example, between interfaces and concrete classes.
  • 🌟 Add weakly/strongly connected components to anomaly detection features: The graph algorithm “strongly connected components” produces a directed acyclic graph (DAG) whose members contain cycles, which is often the case for code elements such as Java types. Additionally, “weakly connected components” group one or more strongly connected components into an “island.” Together, they add hierarchical and structural information and enable topological sorting (which requires a DAG).

⚙️ Optimization

🛠 Fix

@JohT JohT self-assigned this Jan 18, 2026
@JohT JohT force-pushed the features/refine-feature-engineering-for-anomaly-detection branch 5 times, most recently from 1a6d09b to 36c49fd Compare January 21, 2026 22:15
@JohT JohT force-pushed the features/refine-feature-engineering-for-anomaly-detection branch from 36c49fd to 1db2f88 Compare January 25, 2026 10:48
@JohT JohT force-pushed the features/refine-feature-engineering-for-anomaly-detection branch from 1db2f88 to ea2e2e7 Compare January 26, 2026 09:33
@JohT JohT marked this pull request as ready for review January 26, 2026 10:02
@JohT JohT merged commit 58a1ce7 into main Jan 26, 2026
10 checks passed
@JohT JohT deleted the features/refine-feature-engineering-for-anomaly-detection branch January 26, 2026 14:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant