1. Microsoft Learn
Azure Databricks Documentation
"What is a data lakehouse?": This document explicitly states
"The data lakehouse architecture is enabled by a new
open
and standardized system design: storing data in open-source file formats (like Parquet) in a data lake (like Azure Data Lake Storage Gen2)
and having a transactional metadata layer on top (like Delta Lake)." This positions ADLS Gen2 as the foundational storage layer.
2. Microsoft Learn
Azure Databricks Documentation
"Tutorial: Azure Data Lake Storage Gen2
Azure Databricks & Spark": The introduction of this tutorial states
"This tutorial shows you how to connect your Azure Databricks cluster to an Azure storage account with Azure Data Lake Storage Gen2 enabled." This demonstrates the primary and intended integration pattern.
3. Microsoft Learn
Azure Architecture Center
"Modern analytics architecture with Azure Databricks": In the "Data flow" section
step 1 describes data ingestion: "Raw structured
semi-structured
and unstructured (text
image
audio
and video) data is ingested... and stored in Azure Data Lake Storage Gen2." This reference confirms ADLS Gen2 as the designated persistence layer for ingested data in a standard Databricks architecture.