简体   繁体   English

Azure 的“Data Lake Storage Gen2”和“Data Lake Gen2”有什么区别?

[英]What is the difference between Azure's "Data Lake Storage Gen2" and "Data Lake Gen2"?

I'm confused by the options available when creating a storage account on Azure and am looking for clarification.我对在 Azure 上创建存储帐户时可用的选项感到困惑,并且正在寻求澄清。

If I create a new "Storage Account" (Standard tier) from the Azure marketplace, I'm offered to activate " Data Lake Storage Gen2 " in the advanced settings:如果我从 Azure 市场创建一个新的“存储帐户”(标准层),系统会提示我在高级设置中激活“ Data Lake Storage Gen2 ”:

在此处输入图像描述

Once the resource is deployed I'm seeing the option to do a " Data Lake Gen2 upgrade ":部署资源后,我会看到执行“ Data Lake Gen2 升级”的选项: 在此处输入图像描述

If I create the storage account using a premium tier with block blobs, that upgrade option is not showing up.如果我使用带有块 blob 的高级层创建存储帐户,则不会显示该升级选项。 However, it is when selecting page blobs.但是,它是在选择页面 blob 时。

The only visible difference I can spot by looking into the JSON of the ARM deployment: the upgraded storage account has isHnsEnabled: true .通过查看 ARM 部署的 JSON,我可以发现唯一可见的区别:升级后的存储帐户具有isHnsEnabled: true This seems to enable hierarchical namespaces and atomic directory operations.这似乎启用了分层命名空间和原子目录操作。 And in the UI I'm getting different icons: for the upgraded one I can see a database icon with some water in it.在 UI 中,我得到了不同的图标:对于升级后的图标,我可以看到一个数据库图标,里面有一些水。 For the non-upgraded it's showing a folder in the storage browser.对于未升级的,它在存储浏览器中显示一个文件夹。

To the questions:对于问题:

  1. I can create folders in the non-upgraded blob storage, although hierarchical namespaces are not enabled.我可以在未升级的 blob 存储中创建文件夹,尽管未启用分层命名空间。 Does this mean that folder operations just take longer?这是否意味着文件夹操作需要更长的时间?
  2. If I upgrade my standard tier storage account to Data Lake Gen2 it remains in the standard tier.如果我将我的标准层存储帐户升级到 Data Lake Gen2,它仍保留在标准层中。 But how would I create a standard tier storage account with Data Lake Gen2 support right from the beginning, without upgrading later?但是,我如何从一开始就创建一个支持 Data Lake Gen2 的标准层存储帐户,而无需稍后升级? I thought, the checkbox "Data Lake Storage Gen2" would do this, but aparently it's not.我想,复选框“Data Lake Storage Gen2”可以做到这一点,但显然不是。
  3. What is then the difference between "Data Lake Storage Gen2" and "Data Lake Gen2"?那么“Data Lake Storage Gen2”和“Data Lake Gen2”有什么区别呢?

I can create folders in the non-upgraded blob storage, although hierarchical namespaces are not enabled.我可以在未升级的 blob 存储中创建文件夹,尽管未启用分层命名空间。 Does this mean that folder operations just take longer?这是否意味着文件夹操作需要更长的时间?

I doubt it, can you tell me how you did that?我怀疑,你能告诉我你是怎么做到的吗? You can upload a blob to a virtual folder by naming it accordingly.您可以通过相应命名将 blob 上传到虚拟文件夹。 For example, a blob named folder/test.png upload to a container named container will appear as a file in the folder folder in the storage account explorer but there is no actual folder created.例如,上传到名为container的容器的名为folder/test.png的 blob 将在存储帐户资源管理器的文件夹文件folder中显示为一个文件,但不会创建实际的文件夹。

What is then the difference between "Data Lake Storage Gen2" and "Data Lake Gen2"?那么“Data Lake Storage Gen2”和“Data Lake Gen2”有什么区别呢?

Azure Data Lake is a container for several services like Azure HDInsight, Azure Data Lake Analytics. Azure Data Lake 是 Azure HDInsight、Azure Data Lake Analytics 等多项服务的容器。 Azure Data Lake is a solution, not a specific product. Azure Data Lake 是一个解决方案,而不是一个特定的产品。 Azure Data Lake Storage is a product that is part of the Azure Data Lake solution. Azure Data Lake Storage 是 Azure Data Lake 解决方案的一部分。

There is a gen1 and a gen2, gen2 is based on storage accounts: Data Lake Storage Gen2 converges the capabilities of Azure Data Lake Storage Gen1 with Azure Blob Storage有 gen1 和 gen2,gen2 基于存储帐户: Data Lake Storage Gen2 converges the capabilities of Azure Data Lake Storage Gen1 with Azure Blob Storage

If I upgrade my standard tier storage account to Data Lake Gen2 it remains in the standard tier.如果我将我的标准层存储帐户升级到 Data Lake Gen2,它仍保留在标准层中。 But how would I create a standard tier storage account with Data Lake Gen2 support right from the beginning, without upgrading later?但是,我如何从一开始就创建一个支持 Data Lake Gen2 的标准层存储帐户,而无需稍后升级? I thought, the checkbox "Data Lake Storage Gen2" would do this, but aparently it's not.我想,复选框“Data Lake Storage Gen2”可以做到这一点,但显然不是。

It does work for me, what makes you think setting the checkbox does not do the job?它对我有用,是什么让您认为设置复选框不起作用? With this option enabled I am able to for example create directories, something I cannot do using a regular storage account.启用此选项后,我可以创建目录,这是使用常规存储帐户无法做到的。

在此处输入图像描述

When it comes to what type of storage accounts are supported, only block blob premium accounts are supported as seen in the docs :关于支持哪种类型的存储帐户,如文档中所示,仅支持块 blob 高级帐户:

Data Lake Storage capabilities are supported in the following types of storage accounts:以下类型的存储帐户支持 Data Lake Storage 功能:

  • Standard general-purpose v2标准通用 v2
  • Premium block blob高级块 blob

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM