How to solve the problem of data expansion caused by merging small files by adding repartition #6308
Unanswered
shizhengchao
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Originally there were 1000 small files with a total size of 500MB. After adding repartition and insert overwrite, the total file size became 5G.
Beta Was this translation helpful? Give feedback.
All reactions