Click and play the interactive Sedona Python Jupyter Notebook immediately!
Apache Sedona™ is a cluster computing system for processing large-scale spatial data. Sedona equips cluster computing systems such as Apache Spark and Apache Flink with a set of out-of-the-box distributed Spatial Datasets and Spatial SQL that efficiently load, process, and analyze large-scale spatial data across machines.
| Download statistics | Maven | PyPI | CRAN |
|---|---|---|---|
| Apache Sedona | 180k/month | ||
| Archived GeoSpark releases | 10k/month |
| Name | API | Introduction |
|---|---|---|
| Core | Scala/Java | Distributed Spatial Datasets and Query Operators |
| SQL | Spark RDD/DataFrame in Scala/Java/SQL | Geospatial data processing on Apache Spark |
| Flink | Flink DataStream/Table in Scala/Java/SQL | Geospatial data processing on Apache Flink |
| Viz | Spark RDD/DataFrame in Scala/Java/SQL | Geospatial data visualization on Apache Spark |
| Python | Spark RDD/DataFrame in Python | Python wrapper for Sedona |
| R | Spark RDD/DataFrame in R | R wrapper for Sedona |
| Zeppelin | Apache Zeppelin | Plugin for Apache Zeppelin 0.8.1+ |
Please refer to Sedona website
Feedback to improve Apache Sedona: Google Form
Twitter: Sedona@Twitter
Sedona JIRA: Bugs, Pull Requests, and other similar issues
Sedona Mailing Lists: [email protected]: project development, general questions or tutorials.
- Please first subscribe and then post emails. To subscribe, please send an email (leave the subject and content blank) to [email protected]


