+1 vote
in Azure Data Factory by
What is the difference between Azure Data Lake store and Blob storage?

2 Answers

0 votes
by
 Azure Data Lake Storage Gen1Azure Blob Storage
PurposeOptimized storage for big data analytics workloadsGeneral-purpose object store for a wide variety of storage scenarios, including big data analytics
StructureHierarchical file systemObject store with flat namespace
Key ConceptsData Lake Storage Gen1 account contains folders, which in turn contains data stored as filesStorage account has containers, which in turn has data in the form of blobs
Use CasesBatch, interactive, streaming analytics, and machine learning data such as log files, IoT data, clickstreams, large datasetsAny type of text or binary data, such as application back end, backup data, media storage for streaming, and general-purpose data. Additionally, full support for analytics workloads; batch, interactive, streaming analytics, and machine learning data such as log files, IoT data, clickstreams, large datasets
Server-side APIWebHDFS-compatible REST APIAzure Blob Storage REST API
Data Operations – AuthenticationBased on Azure Active Directory IdentitiesBased on shared secrets – Account Access Keys and Shared Access Signature Keys.
0 votes
by

Following is the list of key differences between Data Lake Storage and Blob Storage:

Azure Data Lake StorageAzure Blob Storage
The main purpose of Azure Data Lake Storage is to provide optimized storage for big data analytics workloads. So, we can say that it is an optimized storage solution for big data analytics workloads.The Azure Blob Storage is a general-purpose object storage system for a wide variety of storage scenarios and big data analytics.
The structure of Azure Data Lake Storage follows the hierarchical file system.The structure of Azure Blob Storage follows an object store with a flat namespace.
Azure Data Lake Storage contains folders in which the data is stored as files.Azure Blob Storage facilitates us to create a storage account with containers in which the data is stored.
It is mainly used to store batch, interactive, streaming analytics, and machine learning data such as log files, IoT data, clickstreams, large datasets, etc.It can be used to store any text or binary data, such as application back end, backup data, media storage for streaming, and general-purpose data. It also provides full support for analytics workloads; batch, interactive, streaming analytics, and machine learning data such as log files, IoT data, clickstreams, large datasets, etc.
It uses WebHDFS-compatible REST API as the Server-side API.It uses Azure Blob Storage REST API as the Server-side API.
Its data operations and authentication is based on Azure Active Directory Identities.Its data operations and authentication is based on shared secrets, Account Access Keys, and Shared Access Signature Keys.
...