feat(datafusion-spark): implement spark compatible `unhex` function #19909

lyne7-sc · 2026-01-20T15:00:18Z

Which issue does this PR close?

Part of: #15914

Rationale for this change

Implement spark compatible unhex functions:
https://spark.apache.org/docs/latest/api/sql/index.html#unhex

What changes are included in this PR?

Are these changes tested?

Yes. UTs and SLT added.

Are there any user-facing changes?

No.

datafusion/spark/src/function/math/unhex.rs

Jefffrey · 2026-01-21T03:10:47Z

datafusion/spark/src/function/math/unhex.rs

+    I: Iterator<Item = Option<T>>,
+    T: AsRef<str>,
+{
+    let mut builder = BinaryBuilder::with_capacity(len, len * 32);


It was chosen based on benchmark results and seemed like a reasonable initial estimate. Thanks for pointing this out, for StringArray and LargeStringArray we can use a more accurate capacity from values().len(). I’ve updated the code accordingly.

datafusion/spark/src/function/math/unhex.rs

lyne7-sc · 2026-01-21T13:40:52Z

Thanks for the review and for pointing this out. I’ve updated the implementation accordingly, and the tests have been updated as well and added to the STLs.

alamb · 2026-01-22T22:15:22Z

🚀

lyne7-sc and others added 3 commits January 20, 2026 22:18

impl spark_unhex

b340f87

Merge branch 'apache:main' into feat/spark_unhex

9b51664

cargo fmt

778accf

github-actions bot added sqllogictest SQL Logic Tests (.slt) spark labels Jan 20, 2026

update signature

1f95eec

Jefffrey reviewed Jan 21, 2026

View reviewed changes

lyne7-sc and others added 3 commits January 21, 2026 21:26

Merge branch 'apache:main' into feat/spark_unhex

c5beb91

update based on feedback

6708ab9

fix

e8bd17f

Jefffrey approved these changes Jan 21, 2026

View reviewed changes

lyne7-sc and others added 2 commits January 21, 2026 23:27

fix

113e181

Merge branch 'apache:main' into feat/spark_unhex

6728035

alamb added this pull request to the merge queue Jan 22, 2026

Merged via the queue into apache:main with commit 736fa7c Jan 22, 2026
51 of 55 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(datafusion-spark): implement spark compatible `unhex` function #19909

feat(datafusion-spark): implement spark compatible `unhex` function #19909

Uh oh!

lyne7-sc commented Jan 20, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Jefffrey Jan 21, 2026

Uh oh!

lyne7-sc Jan 21, 2026

Uh oh!

Uh oh!

lyne7-sc commented Jan 21, 2026

Uh oh!

alamb commented Jan 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(datafusion-spark): implement spark compatible unhex function #19909

feat(datafusion-spark): implement spark compatible unhex function #19909

Uh oh!

Conversation

lyne7-sc commented Jan 20, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Jefffrey Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

lyne7-sc Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lyne7-sc commented Jan 21, 2026

Uh oh!

alamb commented Jan 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(datafusion-spark): implement spark compatible `unhex` function #19909

feat(datafusion-spark): implement spark compatible `unhex` function #19909