New python library for loading with schema evolution, with native airflow support - feedback welcome #33250
Unanswered
adrianbr
asked this question in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Open source Schema evoluton & declarative loading!
hey folks, i just want to put this new development on your radar, an open source library for data loading,complete with airflow support!
Please gib star if you like what you see! github
Schema evolution and declarative loading make pipeline building as easy as can be. I have been working on creating an open source python library to achieve this, and finally now we are ready to tell the world :)
dlt was made to be a simple library that fits in your stack without adding more redundance. It's made to primarily be a pipeline building tool, but we also host some ready made pipelines for common things.
It automates structuring and normalising your data so it gets loaded typed, for example json strings that are datetime will get converted to proper date time. This is part of schema evolution which is a process that helps reduce your code and maintenance and set alerts to defend data quality. This builds towards data meshing, or democratisation of data loading with contracts between consumer and producer.
Anyway, if you would like it and consider adoption, I, my team, or our growing community would be glad to assist you with it, find us on our slack
Here's a code example of how simple this is
Here are some docs links for more info:
Getting started (3 colab demos and some capabilities)
Ecosystem - deployments, dbt runner, existing pipelines
Join us on our Slack!
Beta Was this translation helpful? Give feedback.
All reactions