You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- convert individual data.frames to duck frames which allows you to control their automatic materialization parameters. To do that, you use `duckdb_tibble()`, `as_duckdb_tibble()` or read data using `read_*()` functions like `read_csv_duckdb()`.
64
64
65
-
In both cases, if an operation cannot be performed
66
-
by duckplyr (see `vignettes("limits")`), it will be outsourced to dplyr.
65
+
In both cases, if an operation cannot be performed by duckplyr (see `vignettes("limits")`), it will be outsourced to dplyr.
67
66
You can choose to be informed about fallbacks to dplyr, see `?fallback_config`.
68
67
You can disable fallbacks by turning off automatic materialization.
69
68
In that case, if an operation cannot be performed by duckplyr, your code will error.
@@ -75,13 +74,12 @@ With large datasets, you want:
75
74
- input data in an efficient format, like Parquet files. Therefore you might input data using `read_parquet_duckdb()`.
76
75
- efficient computation, which duckplyr provides via DuckDB's holistic optimization, without your having to use another syntax than dplyr.
77
76
- the output to not clutter all the memory. Therefore you can make use of these features:
78
-
- funneling see vignette TODO ADD CURRENT NAME to disable automatic materialization completely or to disable automatic materialization up to a certain output size.
77
+
- funneling (see `vignette("funnel")`) to disable automatic materialization completely or to disable automatic materialization up to a certain output size.
79
78
- computation to files using `compute_parquet()` or `compute_csv()`.
80
-
81
79
82
80
83
-
A drawback of analyzing large data with duckplyr is that the limitations of duckplyr
84
-
(unsupported verbs or data types, see `vignette("limits")`) won't be compensated by fallbacks since fallbacks to dplyr necessitate putting data into memory.
81
+
82
+
A drawback of analyzing large data with duckplyr is that the limitations of duckplyr (unsupported verbs or data types, see `vignette("limits")`) won't be compensated by fallbacks since fallbacks to dplyr necessitate putting data into memory.
0 commit comments