
Working with big data on Dask Kubernetes in Azure Kubernetes Service (AKS)

I want to do analysis on an 8 GB dataset (a CSV file) that is on my laptop's hard disk. I have already set up a Dask Kubernetes cluster on AKS with 1 scheduler and 3 workers with 7 GB of memory each.

How can I work on my dataset using this Dask Kubernetes cluster on AKS? Which file system would be best for sharing the dataset between the workers?

Any suggestion on where I should store this dataset so that I can work with it easily?

The method should work both from a Jupyter notebook and from a Python file.

You would probably want to upload your data to Azure Blob Storage. There is more information about Dask remote data services (including Azure) here:

https://docs.dask.org/en/latest/remote-data-services.html
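For example, here is a minimal sketch of reading the uploaded CSV directly from Blob Storage on the AKS workers. The storage account name "mystorageacct", the container name "datasets", the file name, and the scheduler address are all placeholders, and it assumes the adlfs package is installed on the client and on every worker:

    import dask.dataframe as dd
    from dask.distributed import Client

    # Connect to the scheduler of the existing dask-kubernetes cluster on AKS
    # (replace with your scheduler service address).
    client = Client("tcp://<scheduler-address>:8786")

    # Read the CSV straight from Azure Blob Storage; each worker fetches its own
    # byte ranges, so the 8 GB file never has to fit on a single machine.
    df = dd.read_csv(
        "az://datasets/mydata.csv",
        storage_options={
            "account_name": "mystorageacct",
            "account_key": "<storage-account-key>",
        },
        blocksize="64MB",  # split the file into ~64 MB partitions
    )

    # Example computation: this runs on the AKS workers, not on the laptop.
    print(df.describe().compute())

The same code works from a Jupyter notebook or a plain Python script, since both only need to reach the scheduler and the storage account.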

