Summer Limited Time 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: wrap60

Databricks Databricks-Certified-Data-Analyst-Associate Dumps

Databricks Certified Data Analyst Associate Exam Questions and Answers

Question 1

A data analyst wants the following output:

customer_name

number_of_orders

John Doe

388

Zhang San

234

Which statement will produce this output?

Options:

A.

SELECT customer_name, count(order_id) AS number_of_orders

FROM customers

JOIN orders

ON customers.customer_id = orders.customer_id

GROUP BY customer_name;

B.

SELECT customer_name, count(order_id) number_of_orders

FROM customers

JOIN orders

ON customers.customer_id = orders.customer_id USE customer_name;

C.

SELECT customerjiame, (order_id) number_of_orders

FROM customers

JOIN orders

ON customers.customer_id = orders.customer_id;

D.

SELECT customerjiame, count(order_id)

FROM customers

JOIN orders

ON customers.customer_id = orders.customer_id GROUP BY customerjiame;

Question 2

Which location can be used to determine the owner of a managed table?

Options:

A.

Review the Owner field in the table page using Catalog Explorer

B.

Review the Owner field in the database page using Data Explorer

C.

Review the Owner field in the schema page using Data Explorer

D.

Review the Owner field in the table page using the SQL Editor

Question 3

A data analyst needs to use the Databricks Lakehouse Platform to quickly create SQL queries and data visualizations. It is a requirement that the compute resources in the platform can be made serverless, and it is expected that data visualizations can be placed within a dashboard.

Which of the following Databricks Lakehouse Platform services/capabilities meets all of these requirements?

Options:

A.

Delta Lake

B.

Databricks Notebooks

C.

Tableau

D.

Databricks Machine Learning

E.

Databricks SQL

Question 4

Which statement about subqueries is correct?

Options:

A.

Subqueries are not available in Databricks SQL

B.

Subqueries can be used like other user-defined functions to transform data into different data types.

C.

Subqueries can retrieve data without requiring the creation of a table or view.

D.

Subqueries can be used like other built-in functions to transform data into different data types.

Question 5

What is a benefit of using Databricks SQL for business intelligence (Bl) analytics projects instead of using third-party Bl tools?

Options:

A.

Computations, data, and analytical tools on the same platform

B.

Advanced dashboarding capabilities

C.

Simultaneous multi-user support

D.

Automated alerting systems

Question 6

Where in the Databricks SQL workspace can a data analyst configure a refresh schedule for a query when the query is not attached to a dashboard or alert?

Options:

A.

Data bxplorer

B.

The Visualization editor

C.

The Query Editor

D.

The Dashboard Editor

Question 7

Which of the following approaches can be used to connect Databricks to Fivetran for data ingestion?

Options:

A.

Use Workflows to establish a SQL warehouse (formerly known as a SQL endpoint) for Fivetran to interact with

B.

Use Delta Live Tables to establish a cluster for Fivetran to interact with

C.

Use Partner Connect's automated workflow to establish a cluster for Fivetran to interact with

D.

Use Partner Connect's automated workflow to establish a SQL warehouse (formerly known as a SQL endpoint) for Fivetran to interact with

E.

Use Workflows to establish a cluster for Fivetran to interact with

Question 8

A data analyst is working with gold-layer tables to complete an ad-hoc project. A stakeholder has provided the analyst with an additional dataset that can be used to augment the gold-layer tables already in use.

Which of the following terms is used to describe this data augmentation?

Options:

A.

Data testing

B.

Ad-hoc improvements

C.

Last-mile

D.

Last-mile ETL

E.

Data enhancement

Question 9

A data team has been given a series of projects by a consultant that need to be implemented in the Databricks Lakehouse Platform.

Which of the following projects should be completed in Databricks SQL?

Options:

A.

Testing the quality of data as it is imported from a source

B.

Tracking usage of feature variables for machine learning projects

C.

Combining two data sources into a single, comprehensive dataset

D.

Segmenting customers into like groups using a clustering algorithm

E.

Automating complex notebook-based workflows with multiple tasks

Question 10

A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The microbatches are triggered every minute.

A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available within the gold-level tables.

Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?

Options:

A.

The required compute resources could be costly

B.

The gold-level tables are not appropriately clean for business reporting

C.

The streaming data is not an appropriate data source for a dashboard

D.

The streaming cluster is not fault tolerant

E.

The dashboard cannot be refreshed that quickly

Question 11

A data analyst needs to share a Databricks SQL dashboard with stakeholders that are not permitted to have accounts in the Databricks deployment. The stakeholders need to be notified every time the dashboard is refreshed.

Which approach can the data analyst use to accomplish this task with minimal effort/

Options:

A.

By granting the stakeholders' email addresses permissions to the dashboard

B.

By adding the stakeholders' email addresses to the refresh schedule subscribers list

C.

By granting the stakeholders' email addresses to the SQL Warehouse (formerly known as endpoint) subscribers list

D.

By downloading the dashboard as a PDF and emailing it to the stakeholders each time it is refreshed

Question 12

A data analyst created and is the owner of the managed table my_ table. They now want to change ownership of the table to a single other user using Data Explorer.

Which of the following approaches can the analyst use to complete the task?

Options:

A.

Edit the Owner field in the table page by removing their own account

B.

Edit the Owner field in the table page by selecting All Users

C.

Edit the Owner field in the table page by selecting the new owner's account

D.

Edit the Owner field in the table page by selecting the Admins group

E.

Edit the Owner field in the table page by removing all access

Question 13

Which of the following statements describes descriptive statistics?

Options:

A.

A branch of statistics that uses summary statistics to quantitatively describe and summarize data.

B.

A branch of statistics that uses a variety of data analysis techniques to infer properties of an underlying distribution of probability.

C.

A branch of statistics that uses quantitative variables that must take on a finite or countably infinite set of values.

D.

A branch of statistics that uses summary statistics to categorically describe and summarize data.

E.

A branch of statistics that uses quantitative variables that must take on an uncountable set of values.

Question 14

A stakeholder has provided a data analyst with a lookup dataset in the form of a 50-row CSV file. The data analyst needs to upload this dataset for use as a table in Databricks SQL.

Which approach should the data analyst use to quickly upload the file into a table for use in Databricks SOL?

Options:

A.

Create a table by uploading the file using the Create page within Databricks SQL

B.

Create a table via a connection between Databricks and the desktop facilitated by Partner Connect.

C.

Create a table by uploading the file to cloud storage and then importing the data to Databricks.

D.

Create a table by manually copying and pasting the data values into cloud storage and then importing the data to Databricks.

Question 15

What does Partner Connect do when connecting Power Bl and Tableau?

Options:

A.

Creates a Personal Access Token. downloads and installs an ODBC driver, and downloads a configuration file for connection by Power Bl or Tableau to a SQL Warehouse (formerly known as a SQL Endpoint).

B.

Creates a Personal Access Token for authentication into Databricks SQL and emails it to you.

C.

Downloads a configuration file for connection by Power Bl or Tableau to a SQL Warehouse (formerly known as a SQL Endpoint).

D.

Downloads and installs an ODBC driver.

Question 16

How can a data analyst determine if query results were pulled from the cache?

Options:

A.

Go to the Query History tab and click on the text of the query. The slideout shows if the results came from the cache.

B.

Go to the Alerts tab and check the Cache Status alert.

C.

Go to the Queries tab and click on Cache Status. The status will be green if the results from the last run came from the cache.

D.

Go to the SQL Warehouse (formerly SQL Endpoints) tab and click on Cache. The Cache file will show the contents of the cache.

E.

Go to the Data tab and click Last Query. The details of the query will show if the results came from the cache.

Question 17

A data analyst has set up a SQL query to run every four hours on a SQL endpoint, but the SQL endpoint is taking too long to start up with each run.

Which of the following changes can the data analyst make to reduce the start-up time for the endpoint while managing costs?

Options:

A.

Reduce the SQL endpoint cluster size

B.

Increase the SQL endpoint cluster size

C.

Turn off the Auto stop feature

D.

Increase the minimum scaling value

E.

Use a Serverless SQL endpoint

Question 18

A data analyst wants to create a dashboard with three main sections: Development, Testing, and Production. They want all three sections on the same dashboard, but they want to clearly designate the sections using text on the dashboard.

Which of the following tools can the data analyst use to designate the Development, Testing, and Production sections using text?

Options:

A.

Separate endpoints for each section

B.

Separate queries for each section

C.

Markdown-based text boxes

D.

Direct text written into the dashboard in editing mode

E.

Separate color palettes for each section

Question 19

A data analyst has been asked to configure an alert for a query that returns the income in the accounts_receivable table for a date range. The date range is configurable using a Date query parameter.

The Alert does not work.

Which of the following describes why the Alert does not work?

Options:

A.

Alerts don't work with queries that access tables.

B.

Queries that return results based on dates cannot be used with Alerts.

C.

The wrong query parameter is being used. Alerts only work with Date and Time query parameters.

D.

Queries that use query parameters cannot be used with Alerts.

E.

The wrong query parameter is being used. Alerts only work with drogdown list query parameters, not dates.

Page: 1 / 7
Total 65 questions