Databricks Certified Professional Data Scientist Exam Practice Test

Page: 1 / 14
Total 138 questions
Question 1

You are working in a data analytics company as a data scientist, you have been given a set of various types of Pizzas available across various premium food centers in a country. This data is given as numeric values like Calorie. Size, and Sale per day etc. You need to group all the pizzas with the similar properties, which of the following technique you would be using for that?



Answer : C


Question 2

Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?



Answer : B


Question 3

You are working in an ecommerce organization, where you are designing and evaluating a recommender system, you need to select which of the following metric wilt always have the largest value?



Answer : E


Question 4

In which lifecycle stage are appropriate analytical techniques determined?



Answer : A


Question 5

What is the best way to evaluate the quality of the model found by an unsupervised algorithm like k-means clustering, given metrics for the cost of the clustering (how well it fits the data) and its stability (how similar the clusters are across multiple runs over the same data)?



Answer : A


Question 6

Which of the following metrics are useful in measuring the accuracy and quality of a recommender system?



Answer : C


Question 7

You are designing a recommendation engine for a website where the ability to generate more personalized recommendations by analyzing information from the past activity of a specific user, or the history of other users deemed to be of similar taste to a given user. These resources are used as user profiling and helps the site recommend content on a user-by-user basis. The more a given user makes use of the system, the better the recommendations become, as the system gains data to improve its model of that user. What kind of this recommendation engine is ?



Answer : B


Page:    1 / 14   
Total 138 questions