An analyst is conducting a clinical study on the value of an analyte among a large population of healthy people. The analyst is going to use a Gaussian distribution to share the results. Which of the following represents a Gaussian distribution?
Answer : B
The Gaussian distribution, also known as the normal distribution, is a probability distribution that is symmetric about the mean, so that data near the mean occur more frequently than data far from the mean. In graph form, the Gaussian distribution appears as a bell curve, which is the shape shown in the correct option. It is defined by its mean (μ) and standard deviation (σ). It is a common assumption for the distribution of independent, randomly generated variables.
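The symmetry and the bell shape described above can be checked numerically. The following is a minimal sketch, not part of the original question: the function name `gaussian_pdf` and the analyte mean of 100 with a standard deviation of 15 are hypothetical values chosen only for illustration.

```python
import math


def gaussian_pdf(x, mu=0.0, sigma=1.0):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return coeff * math.exp(-0.5 * ((x - mu) / sigma) ** 2)


# Hypothetical analyte values: mean 100, standard deviation 15 (illustrative only).
mu, sigma = 100.0, 15.0

peak = gaussian_pdf(mu, mu, sigma)        # density at the mean
left = gaussian_pdf(mu - 10, mu, sigma)   # 10 units below the mean
right = gaussian_pdf(mu + 10, mu, sigma)  # 10 units above the mean

print(f"density at mean: {peak:.5f}")
print(f"density at mean - 10: {left:.5f}")
print(f"density at mean + 10: {right:.5f}")
```

The densities at equal distances on either side of the mean match, and both are lower than the density at the mean, which is what produces the bell curve.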
An organization has a customer database of 3000 customers and has accumulated 5 years of sales data. They want to make decisions about which products to retire and which to continue to offer. Management has turned to the analytics team to analyze the data and provide recommendations. The analytics team develops a survey to send to randomly selected customers. This is an example of:
Answer : D
Data sampling is the process of selecting a subset of data from a larger population to represent the characteristics of the whole population. Data sampling is often used when the population is too large or costly to collect data from every individual. Data sampling can help reduce the time, cost, and complexity of data analysis, while maintaining the validity and reliability of the results. Data sampling can also help avoid biases and errors that may arise from collecting data from the entire population. Data sampling can be done using various methods, such as random sampling, stratified sampling, cluster sampling, or convenience sampling, depending on the research objectives and the availability of data. In this example, the analytics team develops a survey to send to randomly selected customers, which is a form of data sampling. The survey aims to collect data from a representative sample of customers that can reflect the preferences and opinions of the entire customer population. The survey data can then be used to analyze the performance and demand of different products, and provide recommendations to management.
Reference:
[Business Data Analytics: A Practitioner's Guide], Chapter 4: Data Analysis, Section 4.2: Data Sampling, pp. 69-72.
[A Guide to the Business Analysis Body of Knowledge (BABOK Guide)], Version 3, Chapter 6: Solution Evaluation, Section 6.2: Analyze Performance Measures, pp. 152-153.
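As an illustration of the random selection step in this scenario, the sketch below draws a simple random sample without replacement from a list of 3000 customer IDs. The helper name `draw_simple_random_sample`, the ID numbering, the sample size of 300, and the seed are all assumptions made for the example; the question itself does not specify them.

```python
import random


def draw_simple_random_sample(population, sample_size, seed=None):
    """Return sample_size distinct items chosen uniformly at random."""
    rng = random.Random(seed)  # local generator so the draw is reproducible
    return rng.sample(population, sample_size)


# Hypothetical customer IDs 1..3000, matching the database size in the question.
customer_ids = list(range(1, 3001))

# Assumed sample size of 300 (10% of customers); the source gives no figure.
survey_recipients = draw_simple_random_sample(customer_ids, 300, seed=42)

print(len(survey_recipients), "customers selected for the survey")
```

Every customer has the same chance of being selected, which is what distinguishes simple random sampling from convenience sampling.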
A data dictionary is being developed for a dataset describing a company's customer base. Within the data dictionary, which of the following represents a composite data element?
Answer : A
A composite data element is a data element that is made up of smaller units called sub-elements, which are separated by a sub-element separator character, such as a colon (:). For example, ITEMNO is a composite data element that consists of three sub-elements: part number, aisle number, and bin number. A street address is also a composite data element that can consist of sub-elements such as street number, street name, city, state, and zip code. First name, total sale, and birthdate are simple data elements that do not have sub-elements.
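The ITEMNO example above can be sketched in code as a value split into its sub-elements on the separator character. This is only an illustration: the `ItemNo` type, the `parse_itemno` helper, and the sample value `"P1234:07:B12"` are hypothetical and do not come from any particular data dictionary.

```python
from typing import NamedTuple


class ItemNo(NamedTuple):
    """Composite data element made of three sub-elements."""
    part_number: str
    aisle_number: str
    bin_number: str


def parse_itemno(value, separator=":"):
    """Split a composite ITEMNO value into its sub-elements."""
    parts = value.split(separator)
    if len(parts) != 3:
        raise ValueError(f"expected 3 sub-elements, got {len(parts)}")
    return ItemNo(*parts)


# Hypothetical composite value using a colon as the sub-element separator.
item = parse_itemno("P1234:07:B12")
print(item.part_number, item.aisle_number, item.bin_number)
```

A simple data element such as a first name would have no such internal structure to split.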
An analyst supporting the Marketing department for a specialty retailer has been asked to look through past sales data to help guide product decisions. The business sponsor for this initiative would first like to know 'What is the most profitable product line?'. What type of analytics is the analyst going to perform to address this question?
Answer : C
An operations manager for a new hotel needs to determine the optimum number of vans to purchase to shuttle guests to and from the airport. It will be necessary to determine the most efficient routes and schedule to follow to ensure guests do not experience excessive delays. Which business analytics technique would lend itself to supporting these types of business decisions?
Answer : A
A food and beverage company would like to administer a survey to obtain customer insights about a recently launched cookie product. A data team is asked to build the survey, paying careful attention to reducing the degree of sampling error. Which criteria would help the team meet this objective?
Answer : B
A data scientist is working with a team of upper-level managers to develop a strategy for creating an enterprise analytics program. What critical success factor would help ensure the organization obtains the most value from its data?
Answer : C