I have this dask example of a standalone python script that runs on my desktop that has 4 CPU nodes It takes 0.735 seconds currently. The goal is to u ...
I have this dask example of a standalone python script that runs on my desktop that has 4 CPU nodes It takes 0.735 seconds currently. The goal is to u ...
I get the following error: TypeError: object list can't be used in 'await' expression when I try to await futures as dask_client.gather(futures) or a ...
When I run my dask workers I gather useful information from them through the logs, but occasionally the logs get absolutely flooded with an error rega ...
. Answers to this question are eligible for a +150 reputation bounty. p ...
I read the captioned sentence in dask’s website and wonder what it means. I have extracted the relevant part below for ease of reference: A common ...
. Answers to this question are eligible for a +100 reputation bounty. c ...
Context I am trying to write a data pipeline using dask distributed and some legacy code from a previous project. get_data simply get url:str and ses ...
I like to run an asynchronous dask dataframe computation with dd.persist() and then been able to track an individual partition status. The goal is to ...
my original script uses pool.map to run in parallel. I have logger setup in code to output to a file, and code running in different processes output l ...
The logs of the function submitted via the client are immediately displayed. Instead, the logs are expected to be displayed on client.gather(futures). ...
I have two dataframes that are interdependent in my calculation and I would like to get the results on both with one compute() call. The code can be s ...
I tried to use dask localcluster, in multiprocess but single thread per process setup, in linux, but failed so far: What happens is that dask indee ...
. Answers to this question are eligible for a +150 reputation bounty. p ...
I have a Dask distributed application running workers on Docker containers. Problem is that when I run an SQLAlchemy read_sql_query statement, I get a ...
I'm trying to load a Dask dataframe with SQLAlchemy using dd.read_sql_query. I define a table with one of the columns balance_date type DateTime (in t ...
How can I properly serialize metpy units (based on pint) to work with dask distributed? As far as I understand, it looks like dask distributed automat ...
I have bank accounts records where each row is the monthly balance of the account: Assume there are 10 million accounts and 10 years of data. What ...
I am trying to understand how dask.foldby works. Consider the following code. I create a dask bag with 100 items. I then fold the items by a cer ...
Suppose I have some function that I then delay and compute: Is it possible to get the name Jim from within f? Can I ask the worker for the key a ...
ModuleNotFoundError: No module named 'src.data_processing' Exception ignored in: <function Pool.del at 0x7f593e7a95e0> ...