Question 1
What is the advantage of median over mean?
Median is more outlier resistent. Odd values influence median less than mean.
Question 2
What is the mean of the following list?
1,2,4,5,34,1,32,4,34,2,1,3
Please use a decimal point instead of a comma
10.25
Question 3
What is the median of the following list?
1,2,4,5,34,1,32,4,34,2,1,3
Please use a decimal point instead of a comma
3.5
Question 4
Which of the following two plots has a higher standard deviation?
Plot 2
Question 5
What is the standard deviation of the following list?
34,1,23,4,3,3,12,4,3,1
Please enter at least 3 digits after the decimal
Please use a decimal point instead of a comma
10.5629
Question 6
Which of the two plots indicates a higher kurtosis value?
Plot 1
Question 7
What is the kurtosis of the following list?
34,1,23,4,3,3,12,4,3,1
Please enter at least three digits after the decimal
3.2968
Question 8
What is true about this value distribution?
This distribution is positively skewed
Question 9
What is the correlation between the two lists?
1,2,3,4,5,6,7,8,9,10
7,6,5,4,5,6,7,8,9,10
Please enter at least three digits after the decimal
0.7093
Question 10
What is the covariance between the two lists?
1,2,3,4,5,6,7,8,9,10
7,6,5,4,5,6,7,8,9,10
Please enter at least three digits after the decimal
3.653
Question 11
The correlation between the following two lists is zero, can you explain why?
1,2,3,4,5,6,7
7,6,5,4,5,6,7
Correlation of 1st half of the list is negative and between the last half of the list is positive so they cancel out
SparkML concepts
Question 1
What are Machine Learning Pipelines?
Please choose all correct answers
A way to do feature engineering within the pipeline framework
A way of speeding up ML development
A way of expressing your complete end-2-end machine learning flow within a single framework with a homogeneous API
Question 2
How is the class called which transforms a string class label to a class index in SparkML?
StringIndexer
Question 3
What is the class called which transforms a set of columns in a data frame to a single
DenseVector representation in SparkML?
VectorAssembler
Course Project Quiz