Skip to content

Conversation

@dependabot
Copy link
Contributor

@dependabot dependabot bot commented on behalf of github Sep 23, 2025

Bumps apache-beam from 2.67.0 to 2.68.0.

Release notes

Sourced from apache-beam's releases.

Beam 2.68.0 release

We are happy to present the new 2.68.0 release of Beam. This release includes both improvements and new functionality. See the download page for this release.

For more information on changes in 2.68.0, check out the detailed release notes.

Highlights

  • [Python] Prism runner now enabled by default for most Python pipelines using the direct runner (#34612). This may break some tests, see apache/beam#34612 for details on how to handle issues.

I/Os

  • Upgraded Iceberg dependency to 1.9.2 (#35981)

New Features / Improvements

  • BigtableRead Connector for BeamYaml added with new Config Param (#35696)
  • MongoDB Java driver upgraded from 3.12.11 to 5.5.0 with API refactoring and GridFS implementation updates (Java) (#35946).
  • Introduced a dedicated module for JUnit-based testing support: sdks/java/testing/junit, which provides TestPipelineExtension for JUnit 5 while maintaining backward compatibility with existing JUnit 4 TestRule-based tests (Java) (#18733, #35688).
    • To use JUnit 5 with Beam tests, add a test-scoped dependency on org.apache.beam:beam-sdks-java-testing-junit.
  • Google CloudSQL enrichment handler added (Python) (#34398). Beam now supports data enrichment capabilities using SQL databases, with built-in support for:
    • Managed PostgreSQL, MySQL, and Microsoft SQL Server instances on CloudSQL
    • Unmanaged SQL database instances not hosted on CloudSQL (e.g., self-hosted or on-premises databases)
  • [Python] Added the ReactiveThrottler and ThrottlingSignaler classes to streamline throttling behavior in DoFns, expose throttling mechanisms for users (#35984)
  • Added a pipeline option to specify the processing timeout for a single element by any PTransform (Java/Python/Go) (#35174).
    • When specified, the SDK harness automatically restarts if an element takes too long to process. Beam runner may then retry processing of the same work item.
    • Use the --element_processing_timeout_minutes option to reduce the chance of having stalled pipelines due to unexpected cases of slow processing, where slowness might not happen again if processing of the same element is retried.
  • (Python) Adding GCP Spanner Change Stream support for Python (apache_beam.io.gcp.spanner) (#24103).

Breaking Changes

  • Previously deprecated Beam ZetaSQL component has been removed (#34423). ZetaSQL users could migrate to Calcite SQL with BigQuery dialect enabled.
  • Upgraded Beam vendored Calcite to 1.40.0 for Beam SQL (#35483), which improves support for BigQuery and other SQL dialects. Note: Minor behavior changes are observed such as output significant digits related to casting.
  • (Python) The deterministic fallback coder for complex types like NamedTuple, Enum, and dataclasses now uses cloudpickle instead of dill. If your pipeline is affected, you may see a warning like: "Using fallback deterministic coder for type X...". You can revert to the previous behavior by using the pipeline option --update_compatibility_version=2.67.0 (35725). Report any pickling related issues to #34903
  • (Python) Prism runner now enabled by default for most Python pipelines using the direct runner (#34612). This may break some tests, see apache/beam#34612 for details on how to handle issues.
  • Dropped Java 8 support for IO expansion-service. Cross-language pipelines using this expansion service will need a Java11+ runtime (#35981.

Deprecations

  • Python SDK native SpannerIO (apache_beam/io/gcp/experimental/spannerio) is deprecated. Use cross-language wrapper (apache_beam/io/gcp/spanner) instead (Python) (#35860).
  • Samza runner is deprecated and scheduled for removal in Beam 3.0 (#35448).
  • Twister2 runner is deprecated and scheduled for removal in Beam 3.0 (#35905)).

... (truncated)

Changelog

Sourced from apache-beam's changelog.

[2.68.0] - 2025-09-22

Highlights

  • [Python] Prism runner now enabled by default for most Python pipelines using the direct runner (#34612). This may break some tests, see apache/beam#34612 for details on how to handle issues.

I/Os

  • Upgraded Iceberg dependency to 1.9.2 (#35981)

New Features / Improvements

  • BigtableRead Connector for BeamYaml added with new Config Param (#35696)
  • MongoDB Java driver upgraded from 3.12.11 to 5.5.0 with API refactoring and GridFS implementation updates (Java) (#35946).
  • Introduced a dedicated module for JUnit-based testing support: sdks/java/testing/junit, which provides TestPipelineExtension for JUnit 5 while maintaining backward compatibility with existing JUnit 4 TestRule-based tests (Java) (#18733, #35688).
    • To use JUnit 5 with Beam tests, add a test-scoped dependency on org.apache.beam:beam-sdks-java-testing-junit.
  • Google CloudSQL enrichment handler added (Python) (#34398). Beam now supports data enrichment capabilities using SQL databases, with built-in support for:
    • Managed PostgreSQL, MySQL, and Microsoft SQL Server instances on CloudSQL
    • Unmanaged SQL database instances not hosted on CloudSQL (e.g., self-hosted or on-premises databases)
  • [Python] Added the ReactiveThrottler and ThrottlingSignaler classes to streamline throttling behavior in DoFns, expose throttling mechanisms for users (#35984)
  • Added a pipeline option to specify the processing timeout for a single element by any PTransform (Java/Python/Go) (#35174).
    • When specified, the SDK harness automatically restarts if an element takes too long to process. Beam runner may then retry processing of the same work item.
    • Use the --element_processing_timeout_minutes option to reduce the chance of having stalled pipelines due to unexpected cases of slow processing, where slowness might not happen again if processing of the same element is retried.
  • (Python) Adding GCP Spanner Change Stream support for Python (apache_beam.io.gcp.spanner) (#24103).

Breaking Changes

  • Previously deprecated Beam ZetaSQL component has been removed (#34423). ZetaSQL users could migrate to Calcite SQL with BigQuery dialect enabled.
  • Upgraded Beam vendored Calcite to 1.40.0 for Beam SQL (#35483), which improves support for BigQuery and other SQL dialects. Note: Minor behavior changes are observed such as output significant digits related to casting.
  • (Python) The deterministic fallback coder for complex types like NamedTuple, Enum, and dataclasses now uses cloudpickle instead of dill. If your pipeline is affected, you may see a warning like: "Using fallback deterministic coder for type X...". You can revert to the previous behavior by using the pipeline option --update_compatibility_version=2.67.0 (35725). Report any pickling related issues to #34903
  • (Python) Prism runner now enabled by default for most Python pipelines using the direct runner (#34612). This may break some tests, see apache/beam#34612 for details on how to handle issues.
  • Dropped Java 8 support for IO expansion-service. Cross-language pipelines using this expansion service will need a Java11+ runtime (#35981).

Deprecations

  • Python SDK native SpannerIO (apache_beam/io/gcp/experimental/spannerio) is deprecated. Use cross-language wrapper (apache_beam/io/gcp/spanner) instead (Python) (#35860).
  • Samza runner is deprecated and scheduled for removal in Beam 3.0 (#35448).
  • Twister2 runner is deprecated and scheduled for removal in Beam 3.0 (#35905)).

Bugfixes

  • (Python) Fixed Java YAML provider fails on Windows (#35617).
  • Fixed BigQueryIO creating temporary datasets in wrong project when temp_dataset is specified with a different project than the pipeline project. For some jobs, temporary datasets will now be created in the correct project (Python) (#35813).
  • (Go) Fix duplicates due to reads after blind writes to Bag State (#35869).
    • Earlier Go SDK versions can avoid the issue by not reading in the same call after a blind write.
Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [apache-beam](https://github.com/apache/beam) from 2.67.0 to 2.68.0.
- [Release notes](https://github.com/apache/beam/releases)
- [Changelog](https://github.com/apache/beam/blob/master/CHANGES.md)
- [Commits](apache/beam@v2.67.0...v2.68.0)

---
updated-dependencies:
- dependency-name: apache-beam
  dependency-version: 2.68.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file python Pull requests that update Python code labels Sep 23, 2025
@liferoad liferoad merged commit 65c3c1f into main Sep 23, 2025
8 checks passed
@dependabot dependabot bot deleted the dependabot/pip/apache-beam-2.68.0 branch September 23, 2025 13:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file python Pull requests that update Python code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant