[SPARK-49741][DOCS] Add `spark.shuffle.accurateBlockSkewedFactor` to config docs page #48189

timlee0119 · 2024-09-20T18:35:27Z

What changes were proposed in this pull request?

spark.shuffle.accurateBlockSkewedFactor was added in Spark 3.3.0 in https://issues.apache.org/jira/browse/SPARK-36967 and is a useful shuffle configuration to prevent issues where HighlyCompressedMapStatus wrongly estimates the shuffle block sizes when the block size distribution is skewed, which can cause the shuffle reducer to fetch too much data and OOM. This PR adds this config to the Spark config docs page to make it discoverable.

Why are the changes needed?

To make this useful config discoverable by users and make them able to resolve shuffle fetch OOM issues themselves.

Does this PR introduce any user-facing change?

Yes, this is a documentation fix. Before this PR there's no spark.sql.adaptive.skewJoin.skewedPartitionFactor in the Shuffle Behavior section on the Configurations page and now there is.

How was this patch tested?

On the IDE:

Updated:

Was this patch authored or co-authored using generative AI tooling?

No

JoshRosen

+1 on documenting this, as this is a useful configuration. Looks like the docs generally match

spark/core/src/main/scala/org/apache/spark/internal/config/package.scala

Lines 1387 to 1397 in f3785fa

    
           private[spark] val SHUFFLE_ACCURATE_BLOCK_SKEWED_FACTOR = 
        
             ConfigBuilder("spark.shuffle.accurateBlockSkewedFactor") 
        
               .internal() 
        
               .doc("A shuffle block is considered as skewed and will be accurately recorded in " + 
        
                 "HighlyCompressedMapStatus if its size is larger than this factor multiplying " + 
        
                 "the median shuffle block size or SHUFFLE_ACCURATE_BLOCK_THRESHOLD. It is " + 
        
                 "recommended to set this parameter to be the same as SKEW_JOIN_SKEWED_PARTITION_FACTOR." + 
        
                 "Set to -1.0 to disable this feature by default.") 
        
               .version("3.3.0") 
        
               .doubleConf 
        
               .createWithDefault(-1.0)

so looks good overall to me, just left one suggestion about moving this down a few rows so it's next to a related configuration in the table.

JoshRosen · 2024-09-20T19:02:35Z

docs/configuration.md

@@ -1010,6 +1010,19 @@ Apart from these, the following properties are also available, and may be useful
  </td>
  <td>2.2.1</td>
 </tr>
+<tr>
+  <td><code>spark.shuffle.accurateBlockSkewedFactor</code></td>


I see that the spark.shuffle.accurateBlockThreshold configuration is already documented in this table. It looks like we're preexistingly inconsistent about alphabetizing this list.

What do you think about either moving this new configuration a bit further down so it's next to spark.shuffle.accurateBlockThreshold?

Done, also updated the test section

dongjoon-hyun · 2024-09-21T00:08:11Z

docs/configuration.md

@@ -1222,6 +1222,19 @@ Apart from these, the following properties are also available, and may be useful
  </td>
  <td>2.2.1</td>
 </tr>
+<tr>
+  <td><code>spark.shuffle.accurateBlockSkewedFactor</code></td>


To @timlee0119 and @JoshRosen , shall we remove .internal() from the configuration definition together?

spark/core/src/main/scala/org/apache/spark/internal/config/package.scala

Lines 1387 to 1389 in bdea091

private[spark] val SHUFFLE_ACCURATE_BLOCK_SKEWED_FACTOR =

ConfigBuilder("spark.shuffle.accurateBlockSkewedFactor")

.internal()

In addition, the PR title looks wrong to me because we are touching spark.shuffle.accurateBlockSkewedFactor instead of spark.sql.adaptive.....

Yeah, I would match the description

Sorry for the typo, I've fixed the PR title and removed internal()

dongjoon-hyun

+1, LGTM. Thank you.

HyukjinKwon · 2024-09-22T05:40:13Z

Merged to master.

…config docs page ### What changes were proposed in this pull request? `spark.shuffle.accurateBlockSkewedFactor` was added in Spark 3.3.0 in https://issues.apache.org/jira/browse/SPARK-36967 and is a useful shuffle configuration to prevent issues where `HighlyCompressedMapStatus` wrongly estimates the shuffle block sizes when the block size distribution is skewed, which can cause the shuffle reducer to fetch too much data and OOM. This PR adds this config to the Spark config docs page to make it discoverable. ### Why are the changes needed? To make this useful config discoverable by users and make them able to resolve shuffle fetch OOM issues themselves. ### Does this PR introduce _any_ user-facing change? Yes, this is a documentation fix. Before this PR there's no `spark.sql.adaptive.skewJoin.skewedPartitionFactor` in the `Shuffle Behavior` section on [the Configurations page](https://spark.apache.org/docs/latest/configuration.html) and now there is. ### How was this patch tested? On the IDE: <img width="1633" alt="image" src="https://github.com/user-attachments/assets/616a94b9-2408-491c-a17b-c6dbdff14465"> Updated: <img width="1274" alt="image" src="https://github.com/user-attachments/assets/ba170e9a-eba2-4fdf-85eb-a3aebefc055e"> ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#48189 from timlee0119/add-accurate-block-skewed-factor-to-doc. Authored-by: Tim Lee <[email protected]> Signed-off-by: Hyukjin Kwon <[email protected]>

add config

0d4d6bc

github-actions bot added the DOCS label Sep 20, 2024

JoshRosen approved these changes Sep 20, 2024

View reviewed changes

move config location

8c59fc6

dongjoon-hyun changed the title ~~[SPARK-49741][Docs] Add spark.sql.adaptive.skewJoin.skewedPartitionFactor to config docs page~~ [SPARK-49741][Docs] Add spark.sql.adaptive.skewJoin.skewedPartitionFactor to config docs page Sep 21, 2024

dongjoon-hyun reviewed Sep 21, 2024

View reviewed changes

timlee0119 changed the title ~~[SPARK-49741][Docs] Add spark.sql.adaptive.skewJoin.skewedPartitionFactor to config docs page~~ [SPARK-49741][Docs] Add spark.shuffle.accurateBlockSkewedFactor to config docs page Sep 21, 2024

remove internal

0e9ac02

timlee0119 requested review from HyukjinKwon and dongjoon-hyun September 21, 2024 04:03

github-actions bot added the CORE label Sep 21, 2024

dongjoon-hyun changed the title ~~[SPARK-49741][Docs] Add spark.shuffle.accurateBlockSkewedFactor to config docs page~~ [SPARK-49741][Docs] Add spark.shuffle.accurateBlockSkewedFactor to config docs page Sep 22, 2024

dongjoon-hyun approved these changes Sep 22, 2024

View reviewed changes

HyukjinKwon changed the title ~~[SPARK-49741][Docs] Add spark.shuffle.accurateBlockSkewedFactor to config docs page~~ [SPARK-49741][DOCS] Add spark.shuffle.accurateBlockSkewedFactor to config docs page Sep 22, 2024

HyukjinKwon approved these changes Sep 22, 2024

View reviewed changes

HyukjinKwon closed this in b642096 Sep 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-49741][DOCS] Add `spark.shuffle.accurateBlockSkewedFactor` to config docs page #48189

[SPARK-49741][DOCS] Add `spark.shuffle.accurateBlockSkewedFactor` to config docs page #48189

Uh oh!

timlee0119 commented Sep 20, 2024 •

edited

Loading

Uh oh!

JoshRosen left a comment

Uh oh!

JoshRosen Sep 20, 2024

Uh oh!

timlee0119 Sep 20, 2024

Uh oh!

dongjoon-hyun Sep 21, 2024

Uh oh!

dongjoon-hyun Sep 21, 2024

Uh oh!

HyukjinKwon Sep 21, 2024

Uh oh!

timlee0119 Sep 21, 2024

Uh oh!

dongjoon-hyun left a comment

Uh oh!

HyukjinKwon commented Sep 22, 2024

Uh oh!

Uh oh!

	private[spark] val SHUFFLE_ACCURATE_BLOCK_SKEWED_FACTOR =
	ConfigBuilder("spark.shuffle.accurateBlockSkewedFactor")
	.internal()
	.doc("A shuffle block is considered as skewed and will be accurately recorded in " +
	"HighlyCompressedMapStatus if its size is larger than this factor multiplying " +
	"the median shuffle block size or SHUFFLE_ACCURATE_BLOCK_THRESHOLD. It is " +
	"recommended to set this parameter to be the same as SKEW_JOIN_SKEWED_PARTITION_FACTOR." +
	"Set to -1.0 to disable this feature by default.")
	.version("3.3.0")
	.doubleConf
	.createWithDefault(-1.0)

[SPARK-49741][DOCS] Add spark.shuffle.accurateBlockSkewedFactor to config docs page #48189

[SPARK-49741][DOCS] Add spark.shuffle.accurateBlockSkewedFactor to config docs page #48189

Uh oh!

Conversation

timlee0119 commented Sep 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

JoshRosen left a comment

Choose a reason for hiding this comment

Uh oh!

JoshRosen Sep 20, 2024

Choose a reason for hiding this comment

Uh oh!

timlee0119 Sep 20, 2024

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

timlee0119 Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon commented Sep 22, 2024

Uh oh!

Uh oh!

[SPARK-49741][DOCS] Add `spark.shuffle.accurateBlockSkewedFactor` to config docs page #48189

[SPARK-49741][DOCS] Add `spark.shuffle.accurateBlockSkewedFactor` to config docs page #48189

timlee0119 commented Sep 20, 2024 •

edited

Loading