About 50 results
Open links in new tab
  1. Unable to use dask-sql due to 'dask_expr.io' module

    Jul 7, 2025 · However, since dask 2025.1.0 release, dask-expr was merged in Dask. It is possible that latest versions of dask or dask-expr package are not well supported by dask-sql.

  2. How to transform Dask.DataFrame to pd.DataFrame?

    Aug 18, 2016 · How can I transform my resulting dask.DataFrame into pandas.DataFrame (let's say I am done with heavy lifting, and just want to apply sklearn to my aggregate result)?

  3. Converting an DataFrame from pandas to dask - Stack Overflow

    Oct 22, 2020 · I followed this documentation dask.dataframe.from_pandas and there are optional arguments called npartitions and chunksize. So I try write something like this: import dask.dataframe …

  4. dask: difference between client.persist and client.compute

    Jan 23, 2017 · More pragmatically, I recommend using persist when your result is large and needs to be spread among many computers and using compute when your result is small and you want it on just …

  5. dask: looping over groupby groups efficiently - Stack Overflow

    Mar 25, 2025 · for name in set(ddf['groupby_column'].unique().compute()): group = ddf[ddf['groupby_column'].eq(name)].compute() # Process each group This approach computes the …

  6. python - Difference between dask.distributed LocalCluster with threads ...

    Sep 2, 2019 · What is the difference between the following LocalCluster configurations for dask.distributed? Client(n_workers=4, processes=False, threads_per_worker=1) versus …

  7. Reading an SQL query into a Dask DataFrame - Stack Overflow

    May 24, 2022 · I'm trying create a function that takes an SQL SELECT query as a parameter and use dask to read its results into a dask DataFrame using the dask.read_sql_query function.

  8. Newest 'dask' Questions - Stack Overflow

    Jan 8, 2026 · I am learning Dask to make my Python projects more efficient and scalable. To understand its performance better, I wrote a script comparing the computation time of Pandas and Dask when …

  9. How to see progress of Dask compute task? - Stack Overflow

    I would like to see a progress bar on Jupyter notebook while I'm running a compute task using Dask, I'm counting all values of id column from a large csv file +4GB, so any ideas? import dask.datafr...

  10. python - Using Matplotlib with Dask - Stack Overflow

    Jul 15, 2022 · One motivation to use dask instead of pandas is the size of the data. As such, swapping pandas DataFrame with dask DataFrame might not be feasible. Imagine a scatter plot, this might …