Polars - Python Library
Polars
https://github.com/pola-rs/polars
Frame
- DataFrame
- created by
read_csv - can be converted to lazy mode
df.lazy()
- created by
- LazyFrame
- created by
scan_csv describe_optimized_planmethod returns optimized plan.- call
collectmethod to evaluates the output - call
fetchmethod to evaluate the output for testing. It uses just a part of the data, not whole.fetchcan accept one parameter that represents the number of the records.df.fetch(3)
pivotmethod is not available in this mode, for example. callcollect()for it.
- created by
Streaming mode
In streaming mode, the program processes the data step by step, instead of all at once. This allows the program to avoid the need for holding the entire dataset in memory, and it only requires memory for the specific part of the data it is currently handling.
The collect and fetch functions can be used with the streaming argument.
df.collect(streaming=True)
df.fetch(streaming=True)
It’s important to note that streaming mode might not be available for all operations, but it is supported for most of them.