NGA-TRAN/datafusion-optimal-plans

DataFusion Optimal Plans

This repo explores how to optimize query plans in DataFusion, starting with foundational concepts. It covers:

  • Using the CLI to create tables, define schema/sort order, insert data, and run queries
  • Inspecting query plans via CLI
  • Writing tests and building custom logical plans
  • Tuning for parallel execution
  • Building and running distributed plans

Start with the docs folder—each .md file introduces a focused topic, showing how to run queries in datafusion-cli or linking to test files for scenarios not supported via CLI.
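
As a rough illustration of the CLI workflow listed above, here is a minimal sketch of a datafusion-cli session. The table names, columns, and file path are hypothetical and are not taken from this repo's docs. Exact syntax and file-format options (for example CSV header handling) can vary between DataFusion versions, so treat this as a starting point rather than the repo's own examples.

```sql
-- Hypothetical in-memory table with an explicit schema.
CREATE TABLE readings (ts BIGINT, sensor VARCHAR, value DOUBLE);

-- Insert a few rows (made-up sample data).
INSERT INTO readings VALUES (1, 'a', 0.5), (2, 'b', 1.5), (3, 'a', 2.5);

-- Run a query against the in-memory table.
SELECT sensor, avg(value) AS avg_value
FROM readings
GROUP BY sensor;

-- Hypothetical external table that declares its sort order.
-- Assumes a CSV file at this path that is already sorted by ts.
CREATE EXTERNAL TABLE readings_sorted (ts BIGINT, sensor VARCHAR, value DOUBLE)
STORED AS CSV
WITH ORDER (ts ASC)
LOCATION 'data/readings_sorted.csv';

-- Inspect the logical and physical plans for a query.
EXPLAIN SELECT ts, value FROM readings_sorted ORDER BY ts;

-- Change the number of partitions used for parallel execution,
-- then re-run EXPLAIN to see how the plan changes.
SET datafusion.execution.target_partitions = 4;
EXPLAIN SELECT sensor, avg(value) FROM readings GROUP BY sensor;
```

Comparing the `EXPLAIN` output before and after changing a setting, or with and without a declared sort order, is one simple way to see which plan operators the optimizer adds or removes.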

About

Repo demonstrating optimal DataFusion query plans for targeted use cases
