Microsoft DP-203 Quiz 3 Topic 1 Questions 11-15

Question: 1

You plan to create an Azure Data Factory pipeline that will include a mapping data flow.

You have JSON data containing objects that have nested arrays.

You need to transform the JSON-formatted data into a tabular dataset. The dataset must have one tow for each item in the arrays.

Which transformation method should you use in the mapping data flow?

Aunpivot

Bflatten

Cnew branch

Dalter row

Show Answer

Question: 2

You develop data engineering solutions for a company.

A project requires the deployment of data to Azure Data Lake Storage.

You need to implement role-based access control (RBAC) so that project members can manage the Azure Data Lake Storage resources.

Which three actions should you perform? Each correct answer presents part of the solution.

NOTE: Each correct selection is worth one point.

AAssign Azure AD security groups to Azure Data Lake Storage.

BConfigure end-user authentication for the Azure Data Lake Storage account.

CConfigure service-to-service authentication for the Azure Data Lake Storage account.

DCreate security groups in Azure Active Directory (Azure AD) and add project members.

EConfigure access control lists (ACL) for the Azure Data Lake Storage account.

Show Answer

Question: 3

You have an Azure Stream Analytics job.

You need to ensure that the job has enough streaming units provisioned

You configure monitoring of the SU % Utilization metric.

Which two additional metrics should you monitor? Each correct answer presents part of the solution.

NOTE Each correct selection is worth one point

AOut of order Events

BLate Input Events

CBaddogged Input Events

DFunction Events

Show Answer

Question: 4

You are designing an inventory updates table in an Azure Synapse Analytics dedicated SQL pool. The table will have a clustered columnstore index and will include the following columns:

* EventDate: 1 million per day

* EventTypelD: 10 million per event type

* WarehouselD: 100 million per warehouse

* ProductCategoryTypeiD: 25 million per product category type

You identify the following usage patterns:

Analyst will most commonly analyze transactions for a warehouse.

Queries will summarize by product category type, date, and/or inventory event type.

You need to recommend a partition strategy for the table to minimize query times.

On which column should you recommend partitioning the table?

AProductCategoryTypeID

BEventDate

CWarehouseID

DEventTypeID

Show Answer

Question: 5

You implement an enterprise data warehouse in Azure Synapse Analytics.

You have a large fact table that is 10 terabytes (TB) in size.

Incoming queries use the primary key SaleKey column to retrieve data as displayed in the following table:

q5_DP-203

You need to distribute the large fact table across multiple nodes to optimize performance of the table.

Which technology should you use?

Ahash distributed table with clustered index

Bhash distributed table with clustered Columnstore index

Cround robin distributed table with clustered index

Dround robin distributed table with clustered Columnstore index

Eheap table with distribution replicate

Show Answer

Microsoft DP-203 Quiz:3 Topic:1 Questions:11-15