Improve performance of pandas based IntervalIdentifier wranglers #4

@mansenfranzen

Description

The NaiveIterator may benefit from numba.jit because it processes rows one at a time in a sequential Python loop.
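As a rough illustration of the idea, here is a minimal sketch of a sequential interval-labelling loop compiled with numba. The function `identify_intervals` and its boolean inputs `is_start` and `is_end` are hypothetical and much simpler than the actual NaiveIterator (for example, unclosed intervals are not treated specially). The import falls back to plain Python if numba is not installed.

```python
import numpy as np

try:
    from numba import njit
except ImportError:
    # Fallback: run the loop uncompiled if numba is not available.
    def njit(func):
        return func


@njit
def identify_intervals(is_start, is_end):
    """Hypothetical simplified loop: label rows between a start marker
    and the next end marker with an increasing interval id, 0 elsewhere."""
    n = is_start.shape[0]
    out = np.zeros(n, dtype=np.int64)
    counter = 0
    active = False
    for i in range(n):
        if is_start[i]:
            counter += 1
            active = True
        if active:
            out[i] = counter
        if is_end[i]:
            active = False
    return out


is_start = np.array([False, True, False, False, True, False, False])
is_end = np.array([False, False, True, False, False, True, False])
ids = identify_intervals(is_start, is_end)  # array([0, 1, 1, 0, 2, 2, 0])
```

Whether this pays off in practice depends on the data size and the one-time compilation cost, which is what the benchmark would need to show.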

The VectorizedCumSum may benefit from a pure numpy implementation that avoids the overhead of creating multiple intermediate pd.Series objects. The groupby.sum may be replaced by np.unique in combination with np.add.reduceat.
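The np.unique plus np.add.reduceat combination could look roughly like the sketch below. The helper name `group_sum` is made up for illustration and is not part of the existing code. Because np.add.reduceat sums contiguous slices, the values are first sorted by key.

```python
import numpy as np


def group_sum(keys, values):
    """Sum `values` per unique entry in `keys` without pandas (sketch)."""
    # Stable sort so that equal keys become contiguous.
    order = np.argsort(keys, kind="stable")
    sorted_keys = keys[order]
    sorted_values = values[order]
    # return_index gives the first position of each unique key,
    # i.e. the start offset of each group in the sorted arrays.
    unique_keys, starts = np.unique(sorted_keys, return_index=True)
    # reduceat sums each slice [starts[i], starts[i + 1]).
    sums = np.add.reduceat(sorted_values, starts)
    return unique_keys, sums


keys = np.array([2, 1, 2, 1, 3])
values = np.array([10, 1, 20, 2, 5])
unique_keys, sums = group_sum(keys, values)
# unique_keys -> array([1, 2, 3]), sums -> array([ 3, 30,  5])
```

If the group keys are already sorted, as is likely for a cumulative-sum based interval id, the argsort step can be skipped.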

However, the benchmark utility is required first, as addressed in #3.

Labels: improvement (Improve existing code like performance increases)