Azure Data Lake is a family of managed Azure services for storing and analyzing Big Data workloads.
Azure Data Lake supports batch, real-time, and interactive data analysis. It makes it easy for developers, data scientists, and analysts to store data of any size, shape, and speed, and to run all types of processing and analytics across platforms and languages.

It consists of these services:

| Service | Description |
| --- | --- |
| Azure Data Lake Store | A data repository that lets you store any type of data in its raw format without defining a schema. The store offers unlimited storage with immediate read/write access and scales throughput to what your workloads need. The store is compatible with the Hadoop Distributed File System (HDFS), so you can use your existing tools. |
| Azure Data Lake Analytics | An analytics service that lets you run analysis jobs on data. It uses Apache YARN to manage resources for its processing engine. With U-SQL, you can process data from several sources, such as Azure Data Lake Store, Azure Blob Storage, and Azure SQL Database, as well as from other data stores built on HDFS. |
| Azure HDInsight | An analytics service that lets you analyze data sets on a managed cluster running open-source technologies such as Hadoop, Spark, Storm, and HBase. |
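
Storing data "in its raw format without defining schema" is often called schema-on-read: the store keeps bytes as they arrive, and each reader decides how to interpret them. The sketch below illustrates that idea only. It does not use any Azure SDK; `raw_store`, `put`, and `read_clicks` are hypothetical names, and an in-memory dictionary stands in for the store.

```python
import csv
import io
import json

# Hypothetical in-memory stand-in for a data lake: path -> raw bytes.
raw_store = {}


def put(path: str, payload: bytes) -> None:
    """Store bytes exactly as received; no schema is declared or checked."""
    raw_store[path] = payload


def read_clicks(path: str) -> list:
    """Apply a schema at read time, choosing a parser by file extension."""
    text = raw_store[path].decode("utf-8")
    if path.endswith(".json"):
        # One JSON object per line.
        rows = [json.loads(line) for line in text.splitlines() if line.strip()]
    elif path.endswith(".csv"):
        rows = list(csv.DictReader(io.StringIO(text)))
    else:
        raise ValueError(f"no reader for {path}")
    # The schema (field names and types) is imposed here, by the reader.
    return [{"user": str(r["user"]), "clicks": int(r["clicks"])} for r in rows]


# Two files with different raw formats land in the same store.
put("/raw/2024/a.json", b'{"user": "ann", "clicks": 3}\n{"user": "bob", "clicks": 5}\n')
put("/raw/2024/b.csv", b"user,clicks\ncat,7\n")

# An analysis job reads both through the same read-time schema.
total = sum(row["clicks"] for p in sorted(raw_store) for row in read_clicks(p))
print(total)
```

The write path never inspects the payload, so new formats can be added later by extending the reader instead of migrating stored data.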