Home / Microsoft / Microsoft Azure / DP-100 - Designing and Implementing a Data Science Solution on Azure Exam

Latest DP-100 Exam Questions


Question # 1



Note: This question is part of a series of questions that present the same scenario. Each question in the
series contains a unique solution that might meet the stated goals. Some question sets might have more
than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these
questions will not appear in the review screen.
You have a Python script named train.py in a local folder named scripts. The script trains a regression model
by using scikit-learn. The script includes code to load a training data file which is also located in the scripts
folder.
You must run the script as an Azure ML experiment on a compute cluster named aml-compute.
You need to configure the run to ensure that the environment includes the required packages for model
training. You have instantiated a variable named aml-compute that references the target compute cluster.
Solution: Run the following code:

Does the solution meet the goal?

A.

Yes

B.

No




B.
  

No



Explanation
The scikit-learn estimator provides a simple way of launching a scikit-learn training job on a compute target. It
is implemented through the SKLearn class, which can be used to support single-node CPU training.
Example:
from azureml.train.sklearn import SKLearn
}
estimator = SKLearn(source_directory=project_folder,
compute_target=compute_target,
entry_script='train_iris.py'
)
Reference:
https://docs.microsoft.com/en-us/azure/machine-learning/how-to-train-scikit-learn





Question # 2



You create a classification model with a dataset that contains 100 samples with Class A and 10,000 samples with Class B 
The variation of Class B is very high.
You need to resolve imbalances.
Which method should you use?

A.

Partition and Sample

B.

Cluster Centroids

C.

Tomek links

D.

Synthetic Minority Oversampling Technique (SMOTE)




D.
  

Synthetic Minority Oversampling Technique (SMOTE)







Question # 3



You create an Azure Machine Learning workspace and set up a development environment. You plan to train a
deep neural network (DNN) by using the Tensorflow framework and by using estimators to submit training
scripts.
You must optimize computation speed for training runs.
You need to choose the appropriate estimator to use as well as the appropriate training compute target
configuration.
Which values should you use? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.





Question # 4



You have a dataset that contains 2,000 rows. You are building a machine learning classification model by
using Azure Learning Studio. You add a Partition and Sample module to the experiment.
You need to configure the module. You must meet the following requirements:
Divide the data into subsets
Assign the rows into folds using a round-robin method
Allow rows in the dataset to be reused
How should you configure the module? To answer, select the appropriate options in the dialog box in the answer area.
NOTE: Each correct selection is worth one point





Question # 5



You are implementing a machine learning model to predict stock prices.
The model uses a PostgreSQL database and requires GPU processing.
You need to create a virtual machine that is pre-configured with the required tools.
What should you do?

A.

Create a Data Science Virtual Machine (DSVM) Windows edition.

B.

Create a Geo Al Data Science Virtual Machine (Geo-DSVM) Windows edition.

C.

Create a Deep Learning Virtual Machine (DLVM) Linux edition.

D.

Create a Deep Learning Virtual Machine (DLVM) Windows edition.

E.

Create a Data Science Virtual Machine (DSVM) Linux edition.




E.
  

Create a Data Science Virtual Machine (DSVM) Linux edition.







Question # 6



You use Azure Machine Learning Studio to build a machine learning experiment.
You need to divide data into two distinct datasets.
Which module should you use?

A.

Split Data

B.

Load Trained Model

C.

Assign Data to Clusters

D.

Group Data into Bins




D.
  

Group Data into Bins



Explanation
The Group Data into Bins module supports multiple options for binning data. You can customize how the bin
edges are set and how values are apportioned into the bins.
References:
https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/group-data-into-bins





Question # 7



You have a dataset that contains 2,000 rows. You are building a machine learning classification model by using Azure Learning Studio. You add a Partition and Sample module to the experiment. You need to configure the module. You must meet the following requirements: Divide the data into subsets
Assign the rows into folds using a round-robin method
Allow rows in the dataset to be reused
How should you configure the module? To answer, select the appropriate options in the dialog box in the answer area. NOTE: Each correct selection is worth one point.





Question # 8



You are implementing a machine learning model to predict stock prices.
The model uses a PostgreSQL database and requires GPU processing.
You need to create a virtual machine that is pre-configured with the required tools.
What should you do?

A.

Create a Data Science Virtual Machine (DSVM) Windows edition.

B.

Create a Geo Al Data Science Virtual Machine (Geo-DSVM) Windows edition.

C.

Create a Deep Learning Virtual Machine (DLVM) Linux edition.

D.

Create a Deep Learning Virtual Machine (DLVM) Windows edition.

E.

Create a Data Science Virtual Machine (DSVM) Linux edition.




E.
  

Create a Data Science Virtual Machine (DSVM) Linux edition.







Question # 9



A set of CSV files contains sales records. All the CSV files have the same data schema.
Each CSV file contains the sales record for a particular month and has the filename sales.csv. Each file in stored in a folder that indicates the month and year when the data was recorded. The folders are in an Azure blob container for which a datastore has been defined in an Azure Machine Learning workspace. The folders are organized in a parent folder named sales to create the following hierarchical structure:

At the end of each month, a new folder with that month’s sales file is added to the sales folder.
You plan to use the sales data to train a machine learning model based on the following requirements:
You must define a dataset that loads all of the sales data to date into a structure that can be easilyconverted to a dataframe.
You must be able to create experiments that use only data that was created before a specific previous
month, ignoring any data that was added after that month.
You must register the minimum number of datasets possible.
You need to register the sales data as a dataset in Azure Machine Learning service workspace.
What should you do?

A.

Create a tabular dataset that references the datastore and explicitly specifies each 'sales/mm-yyyy/ sales.csv' file every month. Register the dataset with the name sales_dataset each month, replacing the existing dataset and specifying a tag named month indicating the month and year it was registered. Use this dataset for all experiments.

B.

Create a tabular dataset that references the datastore and specifies the path 'sales/*/sales.csv', register the dataset with the name sales_dataset and a tag named month indicating the month and year it was registered, and use this dataset for all experiments.

C.

Create a new tabular dataset that references the datastore and explicitly specifies each 'sales/mm-yyyy/ sales.csv' file every month. Register the dataset with the name sales_dataset_MM-YYYY each month with appropriate MM and YYYY values for the month and year. Use the appropriate month-specific dataset for experiments.

D.

Create a tabular dataset that references the datastore and explicitly specifies each 'sales/mm-yyyy/ sales.csv' file. Register the dataset with the name sales_dataset each month as a new version and with a tag named month indicating the month and year it was registered. Use this dataset for all experiments, identifying the version to be used based on the month tag as necessary.




B.
  

Create a tabular dataset that references the datastore and specifies the path 'sales/*/sales.csv', register the dataset with the name sales_dataset and a tag named month indicating the month and year it was registered, and use this dataset for all experiments.



Specify the path.
Example:
The following code gets the workspace existing workspace and the desired datastore by name. And then passes
the datastore and file locations to the path parameter to create a new TabularDataset, weather_ds.
from azureml.core import Workspace, Datastore, Dataset
datastore_name = 'your datastore name'
# get existing workspace
workspace = Workspace.from_config()
# retrieve an existing datastore in the workspace by name
datastore = Datastore.get(workspace, datastore_name)
# create a TabularDataset from 3 file paths in datastore
datastore_paths = [(datastore, 'weather/2018/11.csv'),
(datastore, 'weather/2018/12.csv'),
(datastore, 'weather/2019/*.csv')]
weather_ds = Dataset.Tabular.from_delimited_files(path=datastore_paths)





Question # 10



You use the following code to run a script as an experiment in Azure Machine Learning: 

You must identify the output files that are generated by the experiment run.
You need to add code to retrieve the output file names.
Which code segment should you add to the script?

A.

files = run.get_properties()

B.

files= run.get_file_names()

C.

files = run.get_details_with_logs()

D.

files = run.get_metrics()

E.

files = run.get_details()




B.
  

files= run.get_file_names()



Explanation
You can list all of the files that are associated with this run record by called run.get_file_names()
Reference:
https://docs.microsoft.com/en-us/azure/machine-learning/how-to-track-experiments




Get 428 Designing and Implementing a Data Science Solution on Azure Exam questions Access in less then $0.12 per day.

Total Questions Answers: 428
Last Updated: 22-Oct-2024
Available with 1, 3, 6 and 12 Months Free Updates Plans
PDF: $15 $64

Test Engine: $20 $80

PDF + Engine: $25 $99


Microsoft DP-100 Dumps - Real Exam Questions


Exam Code: DP-100
Exam Name: Designing and Implementing a Data Science Solution on Azure Exam

  • 90 Days Free Updates
  • Microsoft Experts Verified Answers
  • Printable PDF File Format
  • DP-100 Exam Passing Assurance

Get 100% Real DP-100 Exam Dumps With Verified Answers As Seen in the Real Exam. Designing and Implementing a Data Science Solution on Azure Exam Exam Questions are Updated Frequently and Reviewed by Industry TOP Experts for Passing Microsoft Azure Exam Quickly and Hassle Free.

Microsoft DP-100 Exam Questions


Struggling with Designing and Implementing a Data Science Solution on Azure Exam prep? Get the edge you need!

Our carefully crafted DP-100 dumps give you the confidence to ace the exam. We offer:

  • Up-to-date Microsoft Azure practice questions: Stay current with the latest exam content.
  • PDF and test engine formats: Choose the study tools that work best for you.
  • Realistic Microsoft DP-100 practice exams: Simulate the real exam experience and boost your readiness.
Pass your Microsoft Azure exam with ease. Try our study materials today!


Ace your Microsoft Azure exam with confidence!



We provide top-quality DP-100 exam prep materials that are:
  • Accurate and up-to-date: Reflect the latest Microsoft exam changes and ensure you are studying the right content. 
  • Comprehensive: Cover all exam topics so you do not need to rely on multiple sources. 
  • Convenient formats: Choose between PDF files and online Designing and Implementing a Data Science Solution on Azure Exam practice tests for easy studying on any device.
Do not waste time on unreliable DP-100 practice exams. Choose our proven Microsoft Azure study materials and pass with flying colors.

Try Dumps4free Designing and Implementing a Data Science Solution on Azure Exam Exam 2024 PDFs today!

  • Assurance

    Designing and Implementing a Data Science Solution on Azure Exam practice exam has been updated to reflect the most recent questions from the Microsoft DP-100 Exam.

  • Demo

    Try before you buy! Get a free demo of our Microsoft Azure exam dumps and see the quality for yourself. Need help? Chat with our support team.

  • Validity

    Our Microsoft DP-100 PDF contains expert-verified questions and answers, ensuring you're studying the most accurate and relevant material.

  • Success

    Achieve DP-100 success! Our Designing and Implementing a Data Science Solution on Azure Exam exam questions give you the preparation edge.

If you have any question then contact our customer support at live chat or email us at support@dumps4free.com.