dependabot bot commented on behalf of github on Jan 22, 2025

Bumps apache-beam from 2.61.0 to 2.62.0.

Release notes

Sourced from apache-beam's releases.

Beam 2.62.0 release

We are happy to present the new 2.62.0 release of Beam. This release includes both improvements and new functionality. See the download page for this release.

For more information on changes in 2.62.0, check out the detailed release notes.

New Features / Improvements

  • Added support for stateful processing in Spark Runner for streaming pipelines. Timer functionality is not yet supported and will be implemented in a future release (#33237).
  • The datetime module is now available for use in jinja templatization for yaml.
  • Improved batch performance of SparkRunner's GroupByKey (#20943).
  • Support OnWindowExpiration in Prism (#32211).
    • This enables initial Java GroupIntoBatches support.
  • Support OrderedListState in Prism (#32929).

I/Os

  • gcs-connector config options can be set via GcsOptions (Java) (#32769).
  • [Managed Iceberg] Support partitioning by time (year, month, day, hour) for types date, time, timestamp, and timestamp(tz) (#32939).
  • Upgraded the default version of Hadoop dependencies to 3.4.1. Hadoop 2.10.2 is still supported (Java) (#33011).
  • [BigQueryIO] Create managed BigLake tables dynamically (#33125).
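
To make the time-based partitioning item concrete, here is a minimal sketch of what year, month, day, and hour partition values look like, assuming they follow the Apache Iceberg table spec's definitions (offsets counted from the 1970-01-01 epoch). This is an illustration in plain Python, not Beam's implementation; the function names are hypothetical.

```python
from datetime import date, datetime, timezone

# Assumption: partition values are offsets from the Unix epoch, as in the
# Iceberg table spec. These helpers are illustrative, not a Beam API.
EPOCH_DATE = date(1970, 1, 1)
EPOCH_TS = datetime(1970, 1, 1, tzinfo=timezone.utc)


def year_transform(d: date) -> int:
    """Years elapsed since 1970."""
    return d.year - 1970


def month_transform(d: date) -> int:
    """Months elapsed since 1970-01."""
    return (d.year - 1970) * 12 + (d.month - 1)


def day_transform(d: date) -> int:
    """Days elapsed since 1970-01-01 (pass ts.date() for a timestamp)."""
    return (d - EPOCH_DATE).days


def hour_transform(ts: datetime) -> int:
    """Hours elapsed since 1970-01-01T00:00Z for a timezone-aware timestamp."""
    # Floor division keeps pre-1970 timestamps in the earlier bucket.
    return int((ts - EPOCH_TS).total_seconds() // 3600)


if __name__ == "__main__":
    ts = datetime(2025, 1, 21, 13, 35, tzinfo=timezone.utc)
    print(year_transform(ts), month_transform(ts),
          day_transform(ts.date()), hour_transform(ts))
```

Rows whose timestamps map to the same value land in the same partition, so coarser transforms (year, month) produce fewer, larger partitions than finer ones (day, hour).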

Breaking Changes

  • Upgraded ZetaSQL to 2024.11.1 (#32902). Java 11+ is now required if Beam's ZetaSQL component is used.

Bugfixes

  • Fixed EventTimeTimer ordering in Prism (#32222).
  • [Managed Iceberg] Fixed a bug where DataFile metadata was assigned incorrect partition values (#33549).

Security Fixes

For the most up-to-date list of known issues, see https://github.com/apache/beam/blob/master/CHANGES.md

List of Contributors

According to git shortlog, the following people contributed to the 2.62.0 release. Thank you to all contributors!

Ahmed Abualsaud, Ahmet Altay, Alex Merose, Andrew Crites, Arnout Engelen, Attila Doroszlai, Bartosz Zablocki, Chamikara Jayalath, Claire McGinty, Claude van der Merwe, Damon Douglas, Danny McCormick, Gabija Balvociute, Hai Joey Tran, Hakampreet Singh Pandher, Ian Sullivan, Jack McCluskey, Jan Lukavský, Jeff Kinard, Jeffrey Kinard, Laura Detmer, Kenneth Knowles, Martin Trieu, Mattie Fu, Michel Davit, Naireen Hussain, Nick Anikin, Radosław Stankiewicz, Ravi Magham, Reeba Qureshi, Robert Bradshaw, Robert Burke, Rohit Sinha, S. Veyrié, Sam Whittle, Shingo Furuyama, Shunping Huang, Svetak Sundhar, Valentyn Tymofieiev, Vlado Djerek, XQ Hu, Yi Hu, twosom

Changelog

Sourced from apache-beam's changelog.

[2.62.0] - 2025-01-21

I/Os

  • gcs-connector config options can be set via GcsOptions (Java) (#32769).
  • [Managed Iceberg] Support partitioning by time (year, month, day, hour) for types date, time, timestamp, and timestamp(tz) (#32939).
  • Upgraded the default version of Hadoop dependencies to 3.4.1. Hadoop 2.10.2 is still supported (Java) (#33011).
  • [BigQueryIO] Create managed BigLake tables dynamically (#33125).

New Features / Improvements

  • Added support for stateful processing in Spark Runner for streaming pipelines. Timer functionality is not yet supported and will be implemented in a future release (#33237).
  • The datetime module is now available for use in jinja templatization for yaml.
  • Improved batch performance of SparkRunner's GroupByKey (#20943).
  • Support OnWindowExpiration in Prism (#32211).
    • This enables initial Java GroupIntoBatches support.
  • Support OrderedListState in Prism (#32929).
  • Add apache_beam.ml.rag package with RAG types, base chunking, LangChain chunking and HuggingFace embedding components (Python) (#33364).

Breaking Changes

  • Upgraded ZetaSQL to 2024.11.1 (#32902). Java 11+ is now required if Beam's ZetaSQL component is used.

Bugfixes

  • Fixed EventTimeTimer ordering in Prism (#32222).
  • [Managed Iceberg] Fixed a bug where DataFile metadata was assigned incorrect partition values (#33549).

Security Fixes

Known Issues

  • [Python] If you are using the official Apache Beam Python containers for version 2.62.0, be aware that they include NumPy version 1.26.4. It is strongly recommended that you explicitly specify numpy==1.26.4 in your project's dependency list. (#33639).
  • [Dataflow Streaming Appliance] Commits fail with KeyCommitTooLargeException when a key outputs >180MB of results. The bug affects versions 2.60.0 to 2.62.0; the fix will be released with 2.63.0 (#33588).
    • To resolve this issue, downgrade to 2.59.0, upgrade to 2.63.0, or enable Streaming Engine.
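
The two known issues above reduce to simple version checks. The following is a rough sketch, using only the version numbers stated in the changelog; the helper names are hypothetical, and the parser assumes plain dotted numeric versions such as `2.62.0` (no release-candidate or dev suffixes).

```python
from typing import Tuple

# Values taken from the Known Issues entries above.
AFFECTED_FIRST = (2, 60, 0)   # first Beam version with the KeyCommitTooLargeException bug
AFFECTED_LAST = (2, 62, 0)    # last affected version; the fix is planned for 2.63.0
CONTAINER_NUMPY = "1.26.4"    # NumPy version in the official 2.62.0 Python containers


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '2.62.0' into (2, 62, 0). Assumes purely numeric components."""
    return tuple(int(part) for part in text.split("."))


def in_key_commit_affected_range(beam_version: str) -> bool:
    """True if this Beam version is in the range named in the known issue."""
    return AFFECTED_FIRST <= parse_version(beam_version) <= AFFECTED_LAST


def numpy_matches_container(numpy_version: str) -> bool:
    """True if the local NumPy version equals the container's version."""
    return parse_version(numpy_version) == parse_version(CONTAINER_NUMPY)


if __name__ == "__main__":
    from importlib.metadata import PackageNotFoundError, version

    for package, check in (("apache-beam", in_key_commit_affected_range),
                           ("numpy", numpy_matches_container)):
        try:
            installed = version(package)
            print(package, installed, check(installed))
        except PackageNotFoundError:
            print(package, "is not installed")
```

Note that the range check only matters for Dataflow streaming jobs that do not use Streaming Engine; for the NumPy entry, the changelog's own recommendation is to add `numpy==1.26.4` to the project's dependency list.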

Commits

  • df0ad59 Set version for 2.62.0 RC1
  • 77f38cd Cherrypick #33549 to Release 2.62.0 branch (Iceberg partition value race cond...
  • 0c54e6a Merge pull request #33546: Cherry-pick #33505 to 2.62.0 release branch (Build...
  • 1cc4b4c Build wheels on gha runners instead of local runners (#33505)
  • 48e18c4 Merge pull request #33535: Cherry pick #33231 to 2.62.0 release branch ([BigQ...
  • eab8303 [BigQueryIO] fetch updated schema for newly created Storage API stream writer...
  • b2b2f14 Merge pull request #33486: Cherry-pick #33483 to release 2.62.0 (Fix bounded ...
  • 88f1f3c Merge pull request #33479: [Release-2.62] cherry-pick #33472 into release bra...
  • e8cd8a6 Fix bounded trie merge when merging on empty trie
  • e72ad94 Make Beam expansion service shadowJar multi-release so it processes multi-rel...
  • Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [apache-beam](https://github.com/apache/beam) from 2.61.0 to 2.62.0.
- [Release notes](https://github.com/apache/beam/releases)
- [Changelog](https://github.com/apache/beam/blob/master/CHANGES.md)
- [Commits](apache/beam@v2.61.0...v2.62.0)

---
updated-dependencies:
- dependency-name: apache-beam
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
dependabot bot added the labels dependencies (Pull requests that update a dependency file) and python (Pull requests that update Python code) on Jan 22, 2025
@liferoad merged commit e13b849 into main on Jan 22, 2025
8 checks passed
dependabot bot deleted the dependabot/pip/apache-beam-2.62.0 branch on January 22, 2025 at 13:35