The first cloud data lake built for organizations that is secure, highly scalable, and based on the open HDFS standard. With no limits on data size and the ability to run high-level parallel analyses, you can unlock the value in your unstructured, semi-structured, and structured data.
Azure Data Lake
Azure Data Lake provides all the features needed to simplify the storage of data of any size, shape, or speed for developers, data experts, and analysts, and to perform any operation or analysis across platforms and languages. It eliminates the complexity of ingesting and storing all your data while enabling you to quickly get started with batch processing, streaming, and interactive analytics.
Azure Data Lake works with your existing IT investments in identity, governance, and security for simplified data management and governance. It also works seamlessly with operational stores and data warehouses, allowing you to extend your existing data applications. We leveraged our experience gained by working with enterprise customers and performing large-scale processing and analysis for Microsoft businesses such as Office 365, Xbox Live, Azure, Windows, Bing, and Skype. Azure Data Lake resolves most efficiency and scalability issues that prevent you from maximizing the value of your data assets with a service ready to meet your current and future needs.

Data Lake Store: an unlimited data lake supporting big data analytics

Easily perform development, debugging, and improvement tasks in big data programs.
Finding the right tools for design and configuring your big data queries can be challenging. Data Lake simplifies this process through comprehensive integration with Visual Studio, Eclipse, and IntelliJ. This allows you to use the tools you’re familiar with to run your code, debug your code, and make code adjustments. U-SQL, Apache Spark, Apache Hive, and Apache Storm visualizations of your work allow you to see how your code works at scale, as well as identify performance issues and cost optimizations. This makes it easy to tune your queries.
Our execution environment actively analyzes your running programs and provides recommendations that improve performance and reduce costs. Data engineers, database administrators, and data architects can use their existing skills in SQL, Apache Hadoop, Apache Spark, R, Python, Java, and .NET to work efficiently from day one.
Store and analyze petabyte-sized files and trillions of objects
Data Lake is designed from the ground up for cloud scale and performance. By using Azure Data Lake Store, your organization can analyze all its data from a single location without any artificial constraints. Data Lake Store can store trillions of files larger than 1 petabyte in size, which is 200 times more than other cloud storage solutions.
This means that if you increase or decrease the size of the stored data or the amount of processing used, you won’t have to rewrite code. This allows you to focus solely on your business logic rather than worrying about how to process and store large data sets. Data Lake can help you meet your current and future business needs, preventing the complexity that big data in the cloud often causes.
