Real world project for Azure Data Engineers using Azure Data Factory, SQL, Data Lake, Databricks, HDInsight and PowerBI

Common Questions

1. Explain Azure Data Lake Gen2 vs Blob Storage

Azure Data Lake Gen 2

Azure Blob Storage

Actual hierarchy folder structure
Fine grained access controls (ACL)
No soft delete (yet)
Cost more than blob (2-3x)

–> For Analytics / Data Warehousing

“Logical” hierarchy folder structure
Access control at resource level
Soft delete
Cheaper

–> General purpose storage

1. ADF Parameters vs Variables

Parameters

Variables

External values passed into pipelines, datasets or linked services. Values cannot be changed

Internal values set inside a pipeline. Value can be changed inside the pipeline using Set Variable or Append Variable Activity

Leave a Comment

Your email address will not be published. Required fields are marked *