1. Dashboard Refresh Scheduling and Cost: Databricks Documentation, "Schedule a dashboard for periodic refreshes". This document shows that refresh intervals as short as "Every 1 minute" are available. It also notes, "When a dashboard is refreshed on a schedule, Databricks runs all queries in the dashboard. The user who saved the schedule is the one whose credentials are used to run the queries." This implies that every scheduled refresh consumes compute.
Source: Databricks Documentation > Databricks SQL > Dashboards > Schedule a dashboard for periodic refreshes.
2. SQL Warehouse Compute and Cost: Databricks Documentation, "What is a Databricks SQL warehouse?". This document explains that SQL warehouses are the compute resources for running queries. "The cost of a SQL warehouse is determined by its cluster size... and the number of clusters." Frequent queries directly translate to sustained compute usage and cost.
Source: Databricks Documentation > Databricks SQL > Administration > What is a Databricks SQL warehouse?
3. Gold-Level Tables: Databricks Documentation, "The medallion architecture". This guide defines the Gold layer as tables for "reporting and business intelligence" that are "often aggregated and contain broad, denormalized data."
Source: Databricks Documentation > Data management > Best practices > The medallion architecture.
4. Structured Streaming Fault Tolerance: Databricks Documentation, "Structured Streaming". The documentation states, "Structured Streaming provides fault tolerance and exactly-once data consistency for streaming queries."
Source: Databricks Documentation > Apache Spark > Structured Streaming.
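
As a rough illustration of the point made in references 1 and 2, the sketch below counts how many query executions a schedule triggers per day. It rests only on the quoted statement that a scheduled refresh runs all queries in the dashboard. The function name, its parameters, and the example figure of 5 queries per dashboard are hypothetical, and no pricing is modeled, since actual cost depends on warehouse size and cluster count.

```python
MINUTES_PER_DAY = 24 * 60


def scheduled_query_runs_per_day(refresh_interval_minutes: int,
                                 queries_per_dashboard: int) -> int:
    """Count query executions per day for one scheduled dashboard.

    Assumes every refresh runs every query in the dashboard once,
    as the quoted documentation describes. Both parameters are
    illustrative inputs, not Databricks settings.
    """
    if refresh_interval_minutes <= 0:
        raise ValueError("refresh_interval_minutes must be positive")
    if queries_per_dashboard < 0:
        raise ValueError("queries_per_dashboard must not be negative")
    refreshes_per_day = MINUTES_PER_DAY // refresh_interval_minutes
    return refreshes_per_day * queries_per_dashboard


# Hypothetical dashboard with 5 queries.
runs_every_minute = scheduled_query_runs_per_day(1, 5)
runs_every_hour = scheduled_query_runs_per_day(60, 5)
print(runs_every_minute, runs_every_hour)
```

Under these assumptions, a one-minute schedule issues 60 times as many query runs as an hourly one, which is why the shortest interval tends to keep a SQL warehouse busy continuously.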