Shoaib
Shoaib I’m curious about how things work — and how data can improve the way we solve problems and make decisions. Everything I share is part of that exploration, created not just to inform, but to inspire smarter ways of thinking, building, and solving.

Configure a Hierarchical Namespace for Azure Data Lake

Understanding how to configure a hierarchical namespace for Azure Data Lake.

1. Create a storage account

alt text

Enable the hierarchical namespace to create a Data Lake

alt text


Quick question:

You plan to implement Azure Data Lake Storage by using a hierarchical namespace. You need to execute performance benchmarks to optimize the upload and download of large data files. Your solution must use a minimum of development effort.

What should you use?

The AzCopy utility Azure Data Explorer Azure Data Factory A Synapse data pipeline

Answer: A


2. Create a container

alt text

Go to the container and update the shared access tokens

alt text

Click on Generate SAS token and URL

Create 2 directories in this container

alt text

3. Run performance benchmarks to optimize and ingest data for a data lake

Create an Azure Cloud Shell (PowerShell) session by using the values in the following table.

Click mount storage account

alt text

alt text


Run an upload performance benchmark by using the azcopy benchmark command

  • You can use AzCopy to copy blobs or files to or from a storage account—including support for Data Lake Storage.
  • You can use AzCopy to optimize performance by running performance benchmarks to view performance statistics and to identify bottlenecks.
1
azcopy benchmark --mode='Upload' "https://lake46848625.blob.core.windows.net/data.." --file-count 300 --delete-test-data=false

Run a download performance benchmark by using the azcopy benchmark command

1
azcopy benchmark --mode='Download' "https://lake46848625.blob.core.windows.net/data..." --delete-test-data=false

Open the “data” container to view the new directory created by upload benchmark.

Open the new directory, and then review the contents created by the upload benchmark.

The new directory starts with “benchmark” as prefix.