-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
Could we introduce an option for plot_marginal() and maybe for some other plotting functions to winsorize outliers in numerical x before calculating bins?
There are a couple of options, and all have drawbacks:
- Winsorize values outside [q1 - alpha * IQR, q3 + alpha * IQR], where qi is the i-th quartile: Expensive to calculate on large data, but one could subsample on 10k rows.
- Use mean(x) +- alpha std(x). Drawback: mean and std are not robust.
- Set manual limits. Drawback: User has to work.
Values outside the limits would fall into the most extreme bins for calculations.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels