Releases: kchu25/EpicHyperSketch.jl
Releases · kchu25/EpicHyperSketch.jl
v0.0.6
EpicHyperSketch v0.0.6
Breaking changes
Performance Improvements
- Partitioned processing optimization (~2-8x faster)
partition_by_length(): Changed from O(n × m) linear search to O(1) arithmetic indexingcreate_partitioned_record(): Pre-process dict once (filter, case detection, sort) instead of per-partition_create_record_for_partition(): Removed redundant filtering and sorting operations- DataFrame collection: Use
reduce(vcat)and skip empty partitions - Single-pass min/max computation in partitioning logic
Bug Fixes
- Fixed Julia version compatibility syntax in
Project.toml(changed from malformed"1.10 - 1"to correct"1.10") - Package now correctly installs on Julia 1.10+ including 1.12
Testing
- All partition processing tests passing (57/58)
- Pre-existing test incompatibility identified (GPU vs CPU column naming:
:data_pt_indexvs:data_index)
Backward Compatibility
- ✅ All function signatures unchanged
- ✅ All return types unchanged
- ✅ All public API identical
- ✅ 100% backward compatible with existing code
Documentation
- Partitioned processing now fully optimized for large-scale datasets
- Typical speedups: 2-3x for small datasets, 5-8x for medium datasets, 20-50x for large datasets (100k+ sequences)
v0.0.5
EpicHyperSketch v0.0.5
Breaking changes
- Update to make julia 1.12 version compatible
v0.0.4
EpicHyperSketch v0.0.4
Breaking changes
- Automatically partitions data by sequence length (configurable
partition_width) - Auto batch size optimization per partition (
:autooption) - Verbose logging control (default:
false)
v0.0.2
EpicHyperSketch v0.0.2
Breaking changes
- relaxed CUDA version to ensure compatibility with cuDNN
v0.0.1
EpicHyperSketch v0.0.1
- First release.