Month End Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: dumps65

Oracle 1z0-1110-25 Dumps

Page: 1 / 16
Total 158 questions

Oracle Cloud Infrastructure 2025 Data Science Professional Questions and Answers

Question 1

You are preparing a configuration object necessary to create a Data Flow application. Which THREE parameter values should you provide?

Options:

A.

The path to the archive.zip file

B.

The local path to your PySpark script

C.

The compartment of the Data Flow application

D.

The bucket used to read/write the PySpark script in Object Storage

E.

The display name of the application

Question 2

You have received machine learning model training code, without clear information about the optimal shape to run the training. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?

Options:

A.

Start with a smaller shape and monitor the Job Run metrics and time required to complete the model training. If the compute shape is not fully utilized, tune the model parameters, and rerun the job. Repeat the process until the shape resources are fully utilized

B.

Start with the strongest compute shape Jobs support and monitor the Job Run metrics and time required to complete the model training. Tune the model so that it utilizes as much compute resources as possible, even at an increased cost

C.

Start with a smaller shape and monitor the utilization metrics and time required to complete the model training. If the compute shape is fully utilized, change to compute that has more resources and rerun the job. Repeat the process until the processing time does not improve

D.

Start with a random compute shape and monitor the utilization metrics and time required to finish the model training. Perform model training optimizations and performance tests in advance to identify the right compute shape before running the model training as a job

Question 3

You are a data scientist working for a utilities company. You have developed an algorithm that detects anomalies from a utility reader in the grid. The size of the model artifact is about 2 GB, and you are trying to store it in the model catalog. Which THREE interfaces could you use to save the model artifact into the model catalog?

Options:

A.

Oracle Cloud Infrastructure (OCI) Command Line Interface (CLI)

B.

Accelerated Data Science (ADS) Software Development Kit (SDK)

C.

ODSC CLI

D.

Console

E.

OCI Python SDK

F.

Git CLI

Question 4

Which components are a part of the OCI Identity and Access Management service?

Options:

A.

Policies

B.

Regional subnets

C.

Compute instances

D.

VCN

Question 5

True or false? Data scientists typically need a combination of technical skills, nontechnical ones, and suitable personality traits to be successful.

Options:

A.

True

B.

False

Question 6

You have been given a collection of digital files required for a business audit. They consist of several different formats that you would like to annotate using Oracle Cloud Infrastructure (OCI) Data Labeling. Which THREE types of files could this tool annotate?

Options:

A.

Video footage of a conversation in a conference room

B.

Images of computer server racks

C.

A typewritten document that details an annual budget

D.

A collection of purchase orders for office supplies

E.

An audio recording of a phone conversation

Question 7

Triggering a PagerDuty notification as part of Monitoring is an example of what in the OCI Console?

Options:

A.

Action

B.

Rule

C.

Function

D.

Event

Question 8

What is the correct definition of Git?

Options:

A.

Git is a centralized version control system that allows you to revert to previous versions of files as needed.

B.

Git is a distributed version control system that allows you to track changes made to a set of files.

C.

Git is a distributed version control system that protects teams from simultaneous repo contributions and merge requests.

D.

Git is a centralized version control system that allows data scientists and developers to track copious amounts of data.

Question 9

While working with Git on Oracle Cloud Infrastructure (OCI) Data Science, you notice that two of the operations are taking more time than the others due to your slow internet speed. Which TWO operations would experience the delay?

Options:

A.

Moving the changes into staging area for the next commit

B.

Updating the local repo to match the content from a remote repository

C.

Pushing changes to a remote repository

D.

Making a commit that is taking a snapshot of the local repository for the next push

E.

Converting an existing local project folder to a Git repository

Question 10

You have just completed analyzing a set of images by using Oracle Cloud Infrastructure (OCI) Data Labeling, and you want to export the annotated data. Which TWO formats are supported?

Options:

A.

CONLL V2003

B.

COCO

C.

Data Labeling Service Proprietary JSON

D.

Spacy

Question 11

Where are OCI secrets stored?

Options:

A.

OCI Object Storage

B.

OCI Vault

C.

Autonomous Data Warehouse

D.

Oracle Databases

Question 12

You have a complex Python code project that could benefit from using Data Science Jobs as it is a repeatable machine learning model training task. The project contains many sub-folders and classes. What is the best way to run this project as a Job?

Options:

A.

ZIP the entire code project folder and upload it as a Job artifact. Jobs automatically identifies the main top-level where the code is run

B.

Rewrite your code so that it is a single executable Python or Bash/Shell script file

C.

ZIP the entire code project folder and upload it as a Job artifact on job creation. Jobs identifies the main executable file automatically

D.

ZIP the entire code project folder, upload it as a Job artifact on job creation, and set JOB_RUN_ENTRYPOINT to point to the main executable file

Question 13

You are asked to prepare data for a custom-built model that requires transcribing Spanish video recordings into a readable text format with profane words identified. Which Oracle Cloud Service would you use?

Options:

A.

OCI Anomaly Detection

B.

OCI Speech

C.

OCI Translation

D.

OCI Language

Question 14

Which of the following best describes the principal goal of data science?

Options:

A.

To collect and archive exhaustive datasets from various source systems for corporate record-keeping uses.

B.

To mine and analyze large amounts of data in order to uncover information that can be used for operational improvements and business gains.

C.

To collect and prepare data for use as part of analytics applications.

D.

Data science is focused on output of the analysis.

Question 15

A bike sharing platform has collected user commute data for the past 3 years. For increasing profitability and making useful inferences, a machine learning model needs to be built from the accumulated data. Which of the following options has the correct order of the required machine learning tasks for building a model?

Options:

A.

Data Access, Feature Exploration, Data Exploration, Feature Engineering, Modeling

B.

Data Access, Data Exploration, Feature Exploration, Feature Engineering, Modeling

C.

Data Access, Data Exploration, Feature Engineering, Feature Exploration, Modeling

D.

Data Access, Feature Exploration, Feature Engineering, Data Exploration, Modeling

Question 16

After you have created and opened a notebook session, you want to use the Accelerated Data Science (ADS) SDK to access your data and get started with exploratory data analysis. From which TWO places can you access the ADS SDK?

Options:

A.

Oracle Big Data Service

B.

Oracle Machine Learning

C.

Conda environment in OCI Data Science

D.

Python Package Index (PyPI)

E.

Oracle Autonomous Data Warehouse

Question 17

Which statement best describes Oracle Cloud Infrastructure Data Science Jobs?

Options:

A.

Jobs let you define and run repeatable tasks on fully managed infrastructure.

B.

Jobs let you define and run repeatable tasks on customer-managed infrastructure.

C.

Jobs let you define and run repeatable tasks on fully managed third-party cloud infrastructures.

D.

Jobs let you define and run all Oracle Cloud DevOps workloads.

Question 18

Which of these options allow the sharing and loading back of ML models into a notebook session?

Options:

A.

Model provenance

B.

Model taxonomy

C.

Model deployment

D.

Model catalog

Question 19

Which architecture is based on the principle of “never trust, always verify”?

Options:

A.

Federated identity

B.

Zero trust

C.

Fluid perimeter

D.

Defense in depth

Question 20

You have created a model and want to use Accelerated Data Science (ADS) SDK to deploy the model. Where are the artifacts to deploy this model with ADS?

Options:

A.

OCI Vault

B.

Model Depository

C.

Model Catalog

D.

Data Science Artifactory

Question 21

You want to write a program that performs document analysis tasks such as extracting text and tables from a document. Which Oracle AI service would you use?

Options:

A.

OCI Language

B.

Oracle Digital Assistant

C.

OCI Speech

D.

OCI Vision

Question 22

You want to make your model more frugal to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method is appropriate to display the comparability between Continuous and Categorical features?

Options:

A.

pearson_plot()

B.

cramersv_plot()

C.

correlation_ratio_plot()

D.

corr()

Question 23

As a data scientist, you use the Oracle Cloud Infrastructure (OCI) Language service to train custommodels. Which types of custom models can be trained?

Options:

A.

Image classification, Named Entity Recognition (NER)

B.

Text classification, Named Entity Recognition (NER)

C.

Sentiment Analysis, Named Entity Recognition (NER)

D.

Object detection, Text classification

Question 24

Which statement accurately describes an aspect of machine learning models?

Options:

A.

Model performance degrades over time due to changes in data.

B.

Static predictions become increasingly accurate over time.

C.

Data models are more static and generally require fewer updates than software code.

D.

A high-quality model will not need to be retrained as new information is received.

Question 25

You have configured the Management Agent on an Oracle Cloud Infrastructure (OCI) Linux instance for log ingestion purposes. Which is a required configuration for OCI Logging Analytics service to collect data from multiple logs of this instance?

Options:

A.

Log - Log Group Association

B.

Entity - Log Association

C.

Source - Entity Association

D.

Log Group - Source Association

Question 26

You want to create a user group for a team of external data science consultants. The consultants should only have the ability to see Data Science resource details but not have the ability to create, delete, or update Data Science resources. What verb should you write in the policy?

Options:

A.

Use

B.

Inspect

C.

Manage

D.

Read

Question 27

Which statement about Oracle Cloud Infrastructure Anomaly Detection is true?

Options:

A.

Accepted file types are SQL and Python

B.

Data used for analysis can be text or numerical in nature

C.

It is an important tool for detecting fraud, network intrusions, and discrepancies in sensor time series analysis

D.

It is trained on a combination of customer and general industry datasets

Question 28

Arrange the following in the correct Git Repository workflow order:

    Install, configure, and authenticate Git.

    Configure SSH keys for the Git repository.

    Create a local and remote Git repository.

    Commit files to the local Git repository.

    Push the commit to the remote Git repository.

Options:

A.

2, 3, 1, 4, 5

B.

4, 2, 3, 1, 5

C.

3, 5, 1, 2, 4

D.

1, 2, 3, 4, 5

Question 29

You are using Oracle Cloud Infrastructure (OCI) Anomaly Detection to train a model to detect anomalies in pump sensor data. How does the required False Alarm Probability setting affect an anomaly detection model?

Options:

A.

It is used to disable the reporting of false alarms

B.

It changes the sensitivity of the model to detecting anomalies

C.

It determines how many false alarms occur before an error message is generated

D.

It adds a score to each signal indicating the probability that it’s a false alarm

Question 30

Which two statements are true about published conda environments?

Options:

A.

They are curated by Oracle Cloud Infrastructure (OCI) Data Science

B.

The odsc conda init command is used to configure the location of published conda environments

C.

Your notebook session acts as the source to share published conda environments with team members

D.

You can only create a published conda environment by modifying a Data Science conda environment

E.

In addition to service job run environment variables, conda environment variables can be used inData Science Jobs

Question 31

You have trained three different models on your dataset using Oracle AutoML. You want to visualize the behavior of each of the models, including the baseline model, on the test set. Which class should be used from the Accelerated Data Science (ADS) SDK to visually compare the models?

Options:

A.

EvaluationMetrics

B.

ADSEvaluator

C.

ADSExplainer

D.

ADSTuner

Question 32

How can you convert a fixed load balancer to a flexible load balancer?

Options:

A.

There is no way to convert the load balancer

B.

Use Update Shape workflows

C.

Delete the fixed load balancer and create a new one

D.

Using the Edit Listener option

Question 33

Which TWO statements about Oracle Cloud Infrastructure (OCI) Open Data service are true?

Options:

A.

Open Data includes text and image data repositories for AI and ML.

B.

Audio and video formats are not available.

C.

Each dataset in Open Data consists of code and tooling usage examples for consumption and reproducibility.

D.

Open Data is a dataset repository made for the people that create, use, and manipulate datasets.

E.

A primary goal of Open Data is for users to contribute to the data repositories to expand the content offered.

F.

Subscribers can pay and log into Open Data to view curated datasets that are otherwise not available to the public.

Question 34

Which statement about Oracle Cloud Infrastructure Data Science Jobs is true?

Options:

A.

Jobs provisions the infrastructure to run a process on-demand

B.

Jobs comes with a set of standard tasks that cannot be customized

C.

You must create and manage your own Jobs infrastructure

D.

You must use a single Shell/Bash or Python artifact to run a job

Question 35

You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your network configuration?

Options:

A.

The NAT Gateway with public internet access

B.

Service Gateway with private subnet access

C.

FastConnect to an on-premises network

D.

Primary Virtual Network Interface Card (VNIC)

Question 36

Six months ago you created and deployed a model that predicts customer churn for a call center. Initially, it was yielding quality predictions. However, over the last two months, users have been questioning the credibility of the predictions. Which TWO methods would you employ to verify accuracy and lower customer churn?

Options:

A.

Drift monitoring

B.

Redeploy the model

C.

Operational monitoring

D.

Retrain the model

E.

Validate the model using recent data

Question 37

You have trained a binary classifier for a loan application and saved this model into the model catalog. A colleague wants to examine the model, and you need to share the model with your colleague. From the model catalog, which model artifacts can be shared?

Options:

A.

Metadata, hyperparameters, metrics only

B.

Model metadata and hyperparameters only

C.

Models and metrics only

D.

Models, model metadata, hyperparameters, metrics

Question 38

Which statement about resource principals is true?

Options:

A.

When you authenticate using a resource principal, you need to create and manage credentials to access OCI resources.

B.

A resource principal is not a secure way to authenticate to resources, compared to the OCI configuration and API key approach.

C.

The Data Science service does not provide authentication via a notebook session’s or job run’s resource principal to access other OCI resources.

D.

A resource principal is a feature of IAM that enables resources to be authorized principal actors.

Question 39

Which of the following analytical and statistical techniques do data scientists commonly use?

Options:

A.

Classification

B.

Regression

C.

Clustering

D.

All of the above

Question 40

As a data scientist, you create models for cancer prediction based on mammographic images. The correct identification is very crucial in this case. After evaluating two models, you arrive at the following confusion matrix. Which model would you prefer and why?

    Model 1 has Test accuracy is 80% and recall is 70%

    Model 2 has Test accuracy is 75% and recall is 85%

Options:

A.

Model 2, because recall is high

B.

Model 1, because the test accuracy is high

C.

Model 2, because recall has more impact on predictions in this use case

D.

Model 1, because recall has lesser impact on predictions in this use case

Question 41

You are working as a data scientist for a healthcare company. They decided to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended steps to develop a PySpark application in OCI Data Science?

Options:

A.

Launch a notebook session, configure core-site.xml, install a PySpark conda environment, develop your PySpark application, create a Data Flow application with the Accelerated Data Science (ADS) SDK

B.

Configure core-site.xml, install a PySpark conda environment, create a Data Flow application with the Accelerated Data Science (ADS) SDK, develop your PySpark application, launch a notebook session

C.

Install a Spark conda environment, configure core-site.xml, launch a notebook session, create a Data Flow application with the Accelerated Data Science (ADS) SDK, develop your PySpark application

D.

Launch a notebook session, install a PySpark conda environment, configure core-site.xml, develop your PySpark application, create a Data Flow application with the Accelerated Data Science (ADS) SDK

Question 42

You have a dataset with fewer than 1000 observations, and you are using Oracle AutoML to build a classifier. While visualizing the results of each stage of the Oracle AutoML pipeline, you notice that no visualization has been generated for one of the stages. Which stage is not visualized?

Options:

A.

Feature selection

B.

Algorithm selection

C.

Adaptive sampling

D.

Hyperparameter tuning

Question 43

You are using Oracle Cloud Infrastructure (OCI) Anomaly Detection to train a model to detect anomalies in pump sensor data. What are you trying to determine? How does the required False Alarm Probability setting affect an anomaly detection model?

Options:

A.

It is used to disable the reporting of false alarms

B.

It changes the sensitivity of the model to detecting anomalies

C.

It determines how many false alarms occur before an error message is generated

D.

It adds a score to each signal indicating the probability that it’s a false alarm

Question 44

The Oracle AutoML pipeline automates hyperparameter tuning by training the model with different parameters in parallel. You have created an instance of Oracle AutoML as oracle_automl and now you want an output with all the different trials performed by Oracle AutoML. Which of the following commands gives you the results of all trials?

Options:

A.

oracle_automl.print_trials()

B.

oracle_automl.visualize_tuning_trials()

C.

oracle_automl.visualize_adaptive_sampling_trials()

D.

oracle_automl.visualize_algorithm_selection_trials()

Question 45

You are running a pipeline in the OCI Data Science service and want to override some of the pipeline's default settings. Which of the following statements about overriding pipeline defaults is true?

Options:

A.

Pipeline defaults can be overridden only during pipeline creation.

B.

Pipeline defaults can be overridden only by the Administrator.

C.

Pipeline defaults can be overridden before starting the pipeline execution.

D.

Pipeline defaults cannot be overridden once the pipeline has been created.

Question 46

Which statement is true about standards?

Options:

A.

They may be audited

B.

They are the result of a regulation or contractual requirement or an industry requirement

C.

They are methods and instructions on how to maintain or accomplish the directives of the policy

D.

They are the foundation of corporate governance

Question 47

You are a data scientist designing an air traffic control model, and you choose to leverage Oracle AutoML. You understand that the Oracle AutoML pipeline consists of multiple stages and automatically operates in a certain sequence. What is the correct sequence for the Oracle AutoML pipeline?

Options:

A.

Algorithm selection, Feature selection, Adaptive sampling, Hyperparameter tuning

B.

Adaptive sampling, Algorithm selection, Feature selection, Hyperparameter tuning

C.

Adaptive sampling, Feature selection, Algorithm selection, Hyperparameter tuning

D.

Algorithm selection, Adaptive sampling, Feature selection, Hyperparameter tuning

Question 48

Which Oracle Cloud Infrastructure (OCI) Data Science policy is invalid?

Options:

A.

Allow group DataScienceGroup to use virtual-network-family in compartment DataScience

B.

Allow group DataScienceGroup to use data-science-model-sessions in compartment DataScience

C.

Allow dynamic-group DataScienceDynamicGroup to manage data-science-projects in compartment DataScience

D.

Allow dynamic-group DataScienceDynamicGroup to manage data-science-family in compartment DataScience

Question 49

For your next data science project, you need access to public geospatial images. Which Oracle Cloud service provides free access to those images?

Options:

A.

Oracle Big Data Service

B.

Oracle Analytics Cloud

C.

Oracle Cloud Infrastructure (OCI) Data Science

D.

Oracle Open Data

Question 50

You want to create a user group for a team of external data science consultants. The consultants should only have the ability to see Data Science resource details but not have the ability to create, delete, or update Data Science resources. What verb should you write in the policy?

Options:

A.

Use

B.

Inspect

C.

Manage

D.

Read

Question 51

Which statement about resource principals is true?

Options:

A.

When you authenticate using a resource principal, you need to create and manage credentials to access OCI resources.

B.

A resource principal is not a secure way to authenticate to resources, compared to the OCI configuration and API key approach.

C.

The Data Science service does not provide authentication via a notebook session’s or job run’s resource principal to access other OCI resources.

D.

A resource principal is a feature of IAM that enables resources to be authorized principal actors.

Question 52

You want to make your model more frugal to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method is appropriate to display the comparability between Continuous and Categorical features?

Options:

A.

pearson_plot()

B.

cramersv_plot()

C.

correlation_ratio_plot()

D.

corr()

Question 53

In which two ways can you improve data durability in Oracle Cloud Infrastructure Object Storage?

Options:

A.

Setup volumes in a RAID1 configuration

B.

Enable server-side encryption

C.

Enable Versioning

D.

Limit delete permissions

E.

Enable client-side encryption

Question 54

A bike sharing platform has collected user commute data for the past 3 years. For increasing profitability and making useful inferences, a machine learning model needs to be built from the accumulated data. Which of the following options has the correct order of the required machine learning tasks for building a model?

Options:

A.

Data Access, Feature Exploration, Data Exploration, Feature Engineering, Modeling

B.

Data Access, Data Exploration, Feature Exploration, Feature Engineering, Modeling

C.

Data Access, Data Exploration, Feature Engineering, Feature Exploration, Modeling

D.

Data Access, Feature Exploration, Feature Engineering, Data Exploration, Modeling

Question 55

You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your network configuration?

Options:

A.

The NAT Gateway with public internet access

B.

Service Gateway with private subnet access

C.

FastConnect to an on-premises network

D.

Primary Virtual Network Interface Card (VNIC)

Question 56

Which type of firewalls are designed to protect against web application attacks, such as SQL injection and cross-site scripting?

Options:

A.

Stateful inspection firewall

B.

Web Application Firewall

C.

Incident firewall

D.

Packet filtering firewall

Question 57

Where are OCI secrets stored?

Options:

A.

OCI Object Storage

B.

OCI Vault

C.

Autonomous Data Warehouse

D.

Oracle Databases

Question 58

You are a data scientist leveraging Oracle Cloud Infrastructure (OCI) to create a model and need some additional Python libraries for processing genome sequencing data. Which of the following THREE statements are correct with respect to installing additional Python libraries to process the data?

Options:

A.

OCI Data Science allows root privileges in notebook sessions

B.

You can install any open-source package available in a publicly accessible Python Package Index (PyPI) repository

C.

You can only install libraries using yum and pip as a normal user

D.

You cannot install a library that’s not preinstalled in the provided image

E.

You can install private or custom libraries from your own internal repositories

Question 59

What is the minimum active storage duration for logs used by Logging Analytics to be archived?

Options:

A.

60 days

B.

10 days

C.

30 days

D.

15 days

Question 60

Which Web Application Firewall (WAF) service component must be configured to allow, block, or log network requests when they meet specified criteria?

Options:

A.

Protection rules

B.

Bot Management

C.

Origin

D.

Web Application Firewall policy

Question 61

What is feature engineering in machine learning used for?

Options:

A.

To perform parameter tuning

B.

To interpret ML models

C.

To transform existing features into new ones

D.

To help understand the dataset features

Question 62

Which OCI service provides a managed Kubernetes service for deploying, scaling, and managing containerized applications?

Options:

A.

Oracle Cloud Infrastructure Container Registry

B.

Oracle Cloud Infrastructure Load Balancing

C.

Oracle Cloud Infrastructure Container Engine for Kubernetes

D.

Oracle Cloud Infrastructure Streaming

Question 63

How are datasets exported in the OCI Data Labeling service?

Options:

A.

As a binary file

B.

As an XML file

C.

As a line-delimited JSON file

D.

As a CSV file

Question 64

As a data scientist, you require a pipeline to train ML models. When can a pipeline run be initiated?

Options:

A.

Pipeline can be initiated once it is created.

B.

Pipeline can be initiated during the pipeline run state.

C.

Pipeline can be initiated after the active state.

D.

Pipeline can be initiated before the active state.

Question 65

You are asked to prepare data for a custom-built model that requires transcribing Spanish video recordings into a readable text format with profane words identified. Which Oracle Cloud Service would you use?

Options:

A.

OCI Anomaly Detection

B.

OCI Speech

C.

OCI Translation

D.

OCI Language

Question 66

In machine learning, what is the primary difference between supervised and unsupervised learning?

Options:

A.

Supervised learning involves data that has been labeled and classified, while unsupervised learning data is unlabeled and unclassified.

B.

Supervised learning is monitored closely by data scientists, while they don’t play a role inunsupervised learning.

C.

Supervised learning is only used for image recognition, while unsupervised learning can be used for various analytics applications.

D.

Supervised learning is created and managed by the Data Engineer.

Question 67

You are attempting to save a model from a notebook session to the model catalog by using the Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which two should you look for to ensure permissions are set up correctly?

Options:

A.

The model artifact is saved to the block volume of the notebook session

B.

A dynamic group has rules that match the notebook sessions in its compartment

C.

The policy for your user group grants manage permissions for the model catalog in this compartment

D.

The policy for a dynamic group grants manage permissions for the model catalog in this compartment

E.

The networking configuration allows access to Oracle Cloud Infrastructure services through a Service Gateway

Question 68

You want to make API calls against other OCI services from your instance without configuring user credentials. How would you achieve this?

Options:

A.

Create a dynamic group and add a policy

B.

Create a dynamic group and add your instance

C.

Create a group and add a policy

D.

No configuration is required for making API calls

Question 69

You are creating an Oracle Cloud Infrastructure (OCI) Data Science job that will run on a recurring basis in a production environment. This job will pick up sensitive data from an Object Storage Bucket, train a model, and save it to the model catalog. How would you design the authentication mechanism for the job?

Options:

A.

Create a pre-authenticated request (PAR) for the Object Storage bucket and use that in the job code

B.

Use the resource principal of the job run as the signer in the job code, ensuring there is a dynamic group for this job run with appropriate access to Object Storage and the model catalog

C.

Package your personal OCI config file and keys in the job artifact

D.

Store your personal OCI config file and keys in the Vault, and access the Vault through the job run resource principal

Question 70

True or false? Bias is a common problem in data science applications.

Options:

A.

True

B.

False

Question 71

Why is data sampling useful for data scientists?

Options:

A.

It lets them analyze datasets in small batches to reduce their use of system resources.

B.

It reduces the amount of data storage space that’s required for data science applications.

C.

It enables them to use a representative subset of data to build accurate analytical models more quickly.

Question 72

Which OCI service enables you to build, train, and deploy machine learning models in the cloud?

Options:

A.

Oracle Cloud Infrastructure Data Catalog

B.

Oracle Cloud Infrastructure Data Integration

C.

Oracle Cloud Infrastructure Data Science

D.

Oracle Cloud Infrastructure Data Flow

Question 73

As a data scientist, you create models for cancer prediction based on mammographic images. The correct identification is very crucial in this case. After evaluating two models, you arrive at the following confusion matrix. Which model would you prefer and why?

    Model 1 has Test accuracy is 80% and recall is 70%

    Model 2 has Test accuracy is 75% and recall is 85%

Options:

A.

Model 2, because recall is high

B.

Model 1, because the test accuracy is high

C.

Model 2, because recall has more impact on predictions in this use case

D.

Model 1, because recall has lesser impact on predictions in this use case

Question 74

As you are working in your notebook session, you find that your notebook session does not have enough compute CPU and memory for your workload. How would you scale up your notebook session without losing your work?

Options:

A.

Create a temporary bucket on Object Storage, write all your files and data to Object Storage, delete your notebook session, provision a new notebook session on a larger compute shape, and copy your files and data from your temporary bucket onto your new notebook session

B.

Ensure your files and environments are written to the block volume storage under the /home/datascience directory, deactivate the notebook session, and activate the notebook session with a larger compute shape selected

C.

Download all your files and data to your local machine, delete your notebook session, provision a new notebook session on a larger compute shape, and upload your files from your local machine to the new notebook session

D.

Deactivate your notebook session, provision a new notebook session on a larger compute shapeand re-create all of your file changes

Question 75

You are using a custom application with third-party APIs to manage application and data hosted in an Oracle Cloud Infrastructure (OCI) tenancy. Although your third-party APIs don’t support OCI’s signature-based authentication, you want them to communicate with OCI resources. Which authentication option must you use to ensure this?

Options:

A.

OCI username and password

B.

API Signing Key

C.

SSH Key Pair with 2048-bit algorithm

D.

Auth Token

Question 76

You have just started as a data scientist at a healthcare company. You have been asked to analyze and improve a deep neural network model, which was built based on the electrocardiogram records of patients. There are no details about the model framework that was built. What would be the best way to find more details about the machine learning models inside the model catalog?

Options:

A.

Refer to the code inside the model

B.

Check for model taxonomy details

C.

Check for metadata tags

D.

Check for provenance details

Question 77

You are a data scientist working for a utilities company. You have developed an algorithm that detects anomalies from a utility reader in the grid. The size of the model artifact is about 2 GB, and you are trying to store it in the model catalog. Which THREE interfaces could you use to save the model artifact into the model catalog?

Options:

A.

Oracle Cloud Infrastructure (OCI) Command Line Interface (CLI)

B.

Accelerated Data Science (ADS) Software Development Kit (SDK)

C.

ODSC CLI

D.

Console

E.

OCI Python SDK

F.

Git CLI

Question 78

Which cache rules criterion matches if the concatenation of the requested URL path and query are identical to the contents of the value field?

Options:

A.

URL_PART_CONTAINS

B.

URL_IS

C.

URL_PART_ENDS_WITH

D.

URL_STARTS_WITH

Question 79

You are given a task of writing a program that sorts document images by language. Which Oracle AI Service would you use?

Options:

A.

Oracle Digital Assistant

B.

OCI Vision

C.

OCI Speech

D.

OCI Language

Question 80

What is a common maxim about data scientists?

Options:

A.

They spend 80% of their time finding and preparing data and 20% analyzing it.

B.

They spend 80% of their time analyzing data and 20% finding and preparing it.

C.

They spend 80% of their time on failed analytics projects and 20% doing useful work.

Question 81

Which components are a part of the OCI Identity and Access Management service?

Options:

A.

Policies

B.

Regional subnets

C.

Compute instances

D.

VCN

Question 82

Using Oracle AutoML, you are tuning hyperparameters on a supported model class and have specified a time budget. AutoML terminates computation once the time budget is exhausted. What would you expect AutoML to return in case the time budget is exhausted before hyperparameter tuning is completed?

Options:

A.

The current best-known hyperparameter configuration

B.

The last generated hyperparameter configuration

C.

A hyperparameter configuration with a minimum learning rate

D.

A random hyperparameter configuration

Question 83

You have an embarrassingly parallel or distributed batch job on a large amount of data that you consider running using Data Science Jobs. What would be the best approach to run the workload?

Options:

A.

Create the job in Data Science Jobs and start a job run. When it is done, start a new job run until you achieve the number of runs required

B.

Create the job in Data Science Jobs and then start the number of simultaneous job runs required for your workload

C.

Reconfigure the job run because Data Science Jobs does not support embarrassingly parallel workloads

D.

Create a new job for every job run that you have to run in parallel, because the Data Science Jobs service can have only one job run per job

Question 84

You want to build a multistep machine learning workflow by using the Oracle Cloud Infrastructure (OCI) Data Science Pipeline feature. How would you configure the conda environment to run a pipeline step?

Options:

A.

Configure a compute shape

B.

Configure a block volume

C.

Use command-line variables

D.

Use environmental variables

Page: 1 / 16
Total 158 questions