From e00a5f520cf16188f30d8b2c4b9a7eadf66fed68 Mon Sep 17 00:00:00 2001
From: Michael Morisi
Date: Wed, 29 May 2024 16:05:37 -0400
Subject: [PATCH 01/10] DOCSP-29861: Cleanup unused files (#200)

(cherry picked from commit 05f312577a0d7ec29ea00ab33b37954d81baecf5)
---
 source/includes/data-source.rst                |  5 -----
 source/includes/scala-java-explicit-schema.rst | 13 -------------
 2 files changed, 18 deletions(-)
 delete mode 100644 source/includes/data-source.rst
 delete mode 100644 source/includes/scala-java-explicit-schema.rst

diff --git a/source/includes/data-source.rst b/source/includes/data-source.rst
deleted file mode 100644
index 2f18028e..00000000
--- a/source/includes/data-source.rst
+++ /dev/null
@@ -1,5 +0,0 @@
-.. note::
-
-   The empty argument ("") refers to a file to use as a data source.
-   In this case our data source is a MongoDB collection, so the data
-   source argument is empty.
\ No newline at end of file
diff --git a/source/includes/scala-java-explicit-schema.rst b/source/includes/scala-java-explicit-schema.rst
deleted file mode 100644
index 3b682cb1..00000000
--- a/source/includes/scala-java-explicit-schema.rst
+++ /dev/null
@@ -1,13 +0,0 @@
-By default, reading from MongoDB in a ``SparkSession`` infers the
-schema by sampling documents from the collection. You can also use a
-|class| to define the schema explicitly, thus removing the extra
-queries needed for sampling.
-
-.. note::
-
-   If you provide a case class for the schema, MongoDB returns **only
-   the declared fields**. This helps minimize the data sent across the
-   wire.
-
-The following statement creates a ``Character`` |class| and then
-uses it to define the schema for the DataFrame:

From 621324901f3a6e9f73d43d87b90f815b0ddd5480 Mon Sep 17 00:00:00 2001
From: Mike Woofter <108414937+mongoKart@users.noreply.github.com>
Date: Wed, 5 Jun 2024 14:09:49 -0500
Subject: [PATCH 02/10] DOCSP-40130 - Note on Sharded Partitioner (#201)

Co-authored-by: Nora Reidy
(cherry picked from commit e25e13fb24dd1d0834c216dfe7b3e3df1b9f6ff7)
---
 source/batch-mode/batch-read-config.txt | 16 +++++++++++++---
 1 file changed, 13 insertions(+), 3 deletions(-)

diff --git a/source/batch-mode/batch-read-config.txt b/source/batch-mode/batch-read-config.txt
index 7233fb2f..d97de93b 100644
--- a/source/batch-mode/batch-read-config.txt
+++ b/source/batch-mode/batch-read-config.txt
@@ -10,6 +10,13 @@ Batch Read Configuration Options
    :depth: 1
    :class: singlecol
 
+.. facet::
+   :name: genre
+   :values: reference
+
+.. meta::
+   :keywords: partitioner, customize, settings
+
 .. _spark-batch-input-conf:
 
 Overview
@@ -212,9 +219,12 @@ based on your shard configuration.
 To use this configuration, set the ``partitioner`` configuration
 option to ``com.mongodb.spark.sql.connector.read.partitioner.ShardedPartitioner``.
 
-.. warning::
-
-   This partitioner is not compatible with hashed shard keys.
+.. important:: ShardedPartitioner Restrictions
+
+   1. In MongoDB Server v6.0 and later, the sharding operation creates one large initial
+      chunk to cover all shard key values, making the sharded partitioner inefficient.
+      We do not recommend using the sharded partitioner when connected to MongoDB v6.0 and later.
+   2. The sharded partitioner is not compatible with hashed shard keys.
 
 .. _conf-mongopaginatebysizepartitioner:
 .. _conf-paginatebysizepartitioner:
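The patch above only names the ``partitioner`` option in prose, so a short illustration may help. The following is a rough PySpark sketch, not part of the patch: the connection URI, database, and collection names are placeholders, and only the partitioner class string comes from the documented setting.

.. code-block:: python

   from pyspark.sql import SparkSession

   spark = (
       SparkSession.builder.appName("sharded-partitioner-example")
       # Placeholder URI; replace with your deployment's connection string.
       .config("spark.mongodb.read.connection.uri", "mongodb://localhost:27017")
       .getOrCreate()
   )

   # Partition the batch read along the collection's shard configuration
   # instead of sampling the collection.
   df = (
       spark.read.format("mongodb")
       .option("database", "examples")        # hypothetical database
       .option("collection", "characters")    # hypothetical collection
       .option(
           "partitioner",
           "com.mongodb.spark.sql.connector.read.partitioner.ShardedPartitioner",
       )
       .load()
   )

As the new admonition notes, this partitioner is a poor fit for MongoDB Server 6.0 and later and does not work with hashed shard keys.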
From 0c729ac900c3b4ecd296c0306c8ed8b53e654a47 Mon Sep 17 00:00:00 2001
From: anabellabuckvar <41971124+anabellabuckvar@users.noreply.github.com>
Date: Fri, 16 Aug 2024 14:30:13 -0400
Subject: [PATCH 03/10] Add Netlify config files via upload

---
 build.sh     | 7 +++++++
 netlify.toml | 6 ++++++
 2 files changed, 13 insertions(+)
 create mode 100644 build.sh
 create mode 100644 netlify.toml

diff --git a/build.sh b/build.sh
new file mode 100644
index 00000000..a5e15032
--- /dev/null
+++ b/build.sh
@@ -0,0 +1,7 @@
+# ensures that we always use the latest version of the script
+if [ -f build-site.sh ]; then
+  rm build-site.sh
+fi
+
+curl https://raw.githubusercontent.com/mongodb/docs-worker-pool/netlify-poc/scripts/build-site.sh -o build-site.sh
+sh build-site.sh
diff --git a/netlify.toml b/netlify.toml
new file mode 100644
index 00000000..d0c89040
--- /dev/null
+++ b/netlify.toml
@@ -0,0 +1,6 @@
+[[integrations]]
+name = "snooty-cache-plugin"
+
+[build]
+publish = "snooty/public"
+command = ". ./build.sh"
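The next patch reworks admonitions in the streaming read configuration reference. For context, here is a hedged PySpark sketch of how two of the options it touches, ``aggregation.pipeline`` and ``change.stream.lookup.full.document``, might be applied to a streaming read. The URI, database, and collection names are assumptions, and the pipeline is a shortened form of the example used in the docs.

.. code-block:: python

   from pyspark.sql import SparkSession

   spark = (
       SparkSession.builder.appName("streaming-read-example")
       # Placeholder URI; replace with your deployment's connection string.
       .config("spark.mongodb.read.connection.uri", "mongodb://localhost:27017")
       .getOrCreate()
   )

   # Open a change stream that returns the full updated document for update
   # events and filters out closed records with a custom pipeline. As the
   # docs note, the pipeline must stay compatible with the partitioner
   # strategy.
   stream_df = (
       spark.readStream.format("mongodb")
       .option("database", "examples")   # hypothetical database
       .option("collection", "orders")   # hypothetical collection
       .option("change.stream.lookup.full.document", "updateLookup")
       .option("aggregation.pipeline", '[{"$match": {"closed": false}}]')
       .load()
   )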
From 649c6f6874d67569ad7a00345bdc277f142b6456 Mon Sep 17 00:00:00 2001
From: "github-actions[bot]" <41898282+github-actions[bot]@users.noreply.github.com>
Date: Wed, 4 Sep 2024 08:53:17 -0500
Subject: [PATCH 04/10] DOCSP-42969 - remove nested admonitions (#204) (#208)

(cherry picked from commit b37226e457978e048953f89f93f897c2ab6235b1)

Co-authored-by: Mike Woofter <108414937+mongoKart@users.noreply.github.com>
---
 source/includes/note-trigger-method.rst      |  4 ---
 .../streaming-mode/streaming-read-config.txt | 36 ++++++++-----------
 source/streaming-mode/streaming-write.txt    | 15 ++++----
 3 files changed, 20 insertions(+), 35 deletions(-)
 delete mode 100644 source/includes/note-trigger-method.rst

diff --git a/source/includes/note-trigger-method.rst b/source/includes/note-trigger-method.rst
deleted file mode 100644
index f9ad2d1d..00000000
--- a/source/includes/note-trigger-method.rst
+++ /dev/null
@@ -1,4 +0,0 @@
-.. note::
-
-   Call the ``trigger()`` method on the ``DataStreamWriter`` you create
-   from the ``DataStreamReader`` you configure.
diff --git a/source/streaming-mode/streaming-read-config.txt b/source/streaming-mode/streaming-read-config.txt
index 997d175d..dd185fe1 100644
--- a/source/streaming-mode/streaming-read-config.txt
+++ b/source/streaming-mode/streaming-read-config.txt
@@ -82,12 +82,10 @@ You can configure the following properties when reading data from MongoDB in str
 
          [{"$match": {"closed": false}}, {"$project": {"status": 1, "name": 1, "description": 1}}]
 
-      .. important::
-
-         Custom aggregation pipelines must be compatible with the
-         partitioner strategy. For example, aggregation stages such as
-         ``$group`` do not work with any partitioner that creates more than
-         one partition.
+      Custom aggregation pipelines must be compatible with the
+      partitioner strategy. For example, aggregation stages such as
+      ``$group`` do not work with any partitioner that creates more than
+      one partition.
 
    * - ``aggregation.allowDiskUse``
      - | Specifies whether to allow storage to disk when running the
@@ -135,14 +133,12 @@ You can configure the following properties when reading a change stream from Mon
         original document and updated document, but it also includes a
         copy of the entire updated document.
 
+      For more information on how this change stream option works,
+      see the MongoDB server manual guide
+      :manual:`Lookup Full Document for Update Operation `.
+
       **Default:** "default"
 
-      .. tip::
-
-         For more information on how this change stream option works,
-         see the MongoDB server manual guide
-         :manual:`Lookup Full Document for Update Operation `.
-
    * - ``change.stream.micro.batch.max.partition.count``
      - | The maximum number of partitions the {+connector-short+} divides each
          micro-batch into. Spark workers can process these partitions in parallel.
       |
       | **Default**: ``1``
 
-      .. warning:: Event Order
-
-         Specifying a value larger than ``1`` can alter the order in which
-         the {+connector-short+} processes change events. Avoid this setting
-         if out-of-order processing could create data inconsistencies downstream.
+      :red:`WARNING:` Specifying a value larger than ``1`` can alter the order in which
+      the {+connector-short+} processes change events. Avoid this setting
+      if out-of-order processing could create data inconsistencies downstream.
 
   * - ``change.stream.publish.full.document.only``
     - | Specifies whether to publish the changed document or the full
        change stream document.
 
      - If you don't specify a schema, the connector infers the schema
        from the change stream document.
 
-      **Default**: ``false``
+      This setting overrides the ``change.stream.lookup.full.document``
+      setting.
 
-      .. note::
-
-         This setting overrides the ``change.stream.lookup.full.document``
-         setting.
+      **Default**: ``false``
 
   * - ``change.stream.startup.mode``
     - | Specifies how the connector starts up when no offset is available.
diff --git a/source/streaming-mode/streaming-write.txt b/source/streaming-mode/streaming-write.txt
index 60a6aa3f..815c1f27 100644
--- a/source/streaming-mode/streaming-write.txt
+++ b/source/streaming-mode/streaming-write.txt
@@ -51,7 +51,8 @@ Write to MongoDB in Streaming Mode
 
    * - ``writeStream.trigger()``
 
     - Specifies how often the {+connector-short+} writes results
-      to the streaming sink.
+      to the streaming sink. Call this method on the ``DataStreamWriter`` object
+      you create from the ``DataStreamReader`` you configure.
 
       To use continuous processing, pass ``Trigger.Continuous(
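To round out the ``writeStream.trigger()`` guidance above, here is a hedged sketch of a streaming write that calls ``trigger()`` on the ``DataStreamWriter`` built from a streaming read such as the one sketched earlier. The URI, checkpoint path, database, and collection names are placeholders, and the one-second continuous interval is only an example value.

.. code-block:: python

   # Continue from a configured streaming read (stream_df) and control how
   # often results are committed to the MongoDB sink.
   query = (
       stream_df.writeStream.format("mongodb")
       # Placeholder URI and checkpoint path.
       .option("spark.mongodb.write.connection.uri", "mongodb://localhost:27017")
       .option("checkpointLocation", "/tmp/streaming-write-checkpoint")
       .option("database", "examples")        # hypothetical database
       .option("collection", "orders_copy")   # hypothetical collection
       .outputMode("append")
       # PySpark's analogue of Trigger.Continuous: request continuous
       # processing with a one-second checkpoint interval.
       .trigger(continuous="1 second")
       .start()
   )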