___________ uses artifacts to present data visually.
Statistics Analytics
Data Mining
Text Analytics
data visualizationCorrect
_____________ includes identifying groups of data records.
Statistics Analytics
Cluster analysisCorrect
Business Intelligence
Text Analytics
_____________ is rated as the number one business analytics software
KNIME
RapidMinerCorrect
WEKA
Orange
_______________ is a data structure in which every component has a unique predecessor and successor.
linearCorrect
nonlinear
static
dynamic
“All models are wrong, but some are useful.”
William Gibson
Georg Cantor
George E. P. BoxCorrect
DJ Patil
A bell shaped curve that is symmetric about a vertical line.
normal distributionCorrect
kurtic
skewed
standard distribution
A bell-shaped distribution that is symmetric about a vertical line.
standard
skewed
normalCorrect
symmetric
A data set in which every score occurs the same number of times is said to have
no modeCorrect
A distribution used when large sets of data are displayed.
Grouped frequency distributionCorrect
ogive
histogram
Relative frequency distribution
A frequently used method, as it enables binary variables, sums of binary variables, or polytomous variables to be modelled.
logistic regressionCorrect
exponential regression
linear regression
binomial regression
A graph that is used to indicate frequency distribution.
histogramCorrect
A graph used to indicate intervals in a frequency distribution is referred to as a ______________.
bar graph
pie graph
ogive
histogramCorrect
A matrix that has the same number of rows and columns is called
SquareCorrect
A model that corresponds to the case where the dependent variable has more than two categories.
multinomial logit modelCorrect
A negative correlation exists when ___________.
x increases as y decreasesCorrect
A network purporting to describe family memberships
network topologyCorrect
networking
network tautology
network adherence
A new phenomenon for the explosion of _________ data
communication
transient
transaction
interactionCorrect
A perfect positive correlation coefficient is equal to
1Correct
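The two correlation items above can be checked numerically. The following is a minimal sketch, assuming the Pearson coefficient is the one intended; the helper name `pearson_r` and the sample data are illustrative and not part of the quiz.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

xs = [1, 2, 3, 4, 5]
r_positive = pearson_r(xs, [2 * x + 1 for x in xs])   # y rises exactly with x
r_negative = pearson_r(xs, [10 - 3 * x for x in xs])  # y falls as x rises
print(round(r_positive, 6), round(r_negative, 6))
```

A perfectly linear increasing relationship should give a coefficient of 1, and a perfectly linear decreasing one should give -1.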
A positive z-score means that the score is
One standard deviation higher than the mean
Equal to the mean
Lower than the mean
Higher than the meanCorrect
A score of 3 in 2, 4, 4, 4, 5, 5, 6, 8, 9 is
1.02 below the meanCorrect
1.2 above the mean
1.92 above the mean
1.18 below the mean
A score of 50 lies 2 standard deviations above a mean of 30. What is the value of the standard deviation?
25
20
10Correct
15
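The item above follows from rearranging the z-score formula z = (score - mean) / sd. A small sketch, with hypothetical helper names chosen for illustration:

```python
def z_score(score, mean, sd):
    """Number of standard deviations a score lies above (+) or below (-) the mean."""
    return (score - mean) / sd

def sd_from_z(score, mean, z):
    """Solve z = (score - mean) / sd for the standard deviation."""
    return (score - mean) / z

sd = sd_from_z(score=50, mean=30, z=2)
print(sd)                    # standard deviation implied by the question
print(z_score(50, 30, sd))   # recovers the z-score of 2
```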
A special type of function where the domain is a set of consecutive integers.
sequenceCorrect
A survey of 100 consumers said that the price charged for a kilo of rice could be approximated by a normal distribution with a mean of 35 and a standard deviation of 4. How many are less than 39?
82
84Correct
80
78
A survey of 100 consumers said that the price charged for a kilo of rice could be approximated by a normal distribution with a mean of 35 and a standard deviation of 4. How many of them lie between 27 and 43?
90
95Correct
88
92
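The two rice-price items can be verified with the normal cumulative distribution function. This is a sketch using only the standard library; `normal_cdf` is an illustrative helper built on `math.erf`, not something named in the quiz.

```python
from math import erf, sqrt

def normal_cdf(x, mean, sd):
    """P(X <= x) for a normal distribution with the given mean and sd."""
    return 0.5 * (1 + erf((x - mean) / (sd * sqrt(2))))

n, mean, sd = 100, 35, 4

# 39 is one standard deviation above the mean.
below_39 = n * normal_cdf(39, mean, sd)

# 27 and 43 are two standard deviations below and above the mean.
between_27_and_43 = n * (normal_cdf(43, mean, sd) - normal_cdf(27, mean, sd))

print(round(below_39), round(between_27_and_43))
```

Rounded to whole consumers, the results agree with the empirical-rule figures of about 84 and 95.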
A vegetable distributor knows that during the month of August, the weights of tomatoes are normally distributed with a mean of 0.61 lb and a standard deviation of 0.15 lb. How many can be expected to weigh between 0.31 lb and 0.91 lb in a shipment of 4500 tomatoes?
4275Correct
A vegetable distributor knows that during the month of August, the weights of tomatoes are normally distributed with a mean of 0.61 lb and a standard deviation of 0.15 lb. How many can be expected to weigh more than 0.31 lb in a shipment of 6000 tomatoes?
5850Correct
A vegetable distributor knows that during the month of August, the weights of tomatoes are normally distributed with a mean of 0.61 lb and a standard deviation of 0.15 lb. What percent of the tomatoes weigh less than 0.76 lb?
84Correct
According to Hilary Mason, which is NOT a skill that a good data scientist must cultivate?
critical thinkingCorrect
Addition and subtraction of matrices is possible only if the two or more matrices
Have the same number of columns
Are square matrices
Have the same number of rows
Have the same sizeCorrect
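The size requirement can be illustrated with element-wise addition on nested lists. A minimal sketch; the function name `add_matrices` is hypothetical.

```python
def add_matrices(a, b):
    """Element-wise sum of two matrices given as lists of rows."""
    same_rows = len(a) == len(b)
    same_cols = same_rows and all(len(ra) == len(rb) for ra, rb in zip(a, b))
    if not same_cols:
        # Every entry needs a partner in the same position.
        raise ValueError("matrices must have the same size")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

print(add_matrices([[1, 2], [3, 4]], [[10, 20], [30, 40]]))

try:
    add_matrices([[1, 2], [3, 4]], [[1, 2, 3], [4, 5, 6]])  # 2x2 plus 2x3
except ValueError as err:
    print("rejected:", err)
```

Matching only the row count or only the column count is not enough, which is why the second call is rejected.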
A distribution with 4 modes is said to be a _________ distribution
trimodal
multimodalCorrect
bimodal
unimodal
Algorithm analysis is an important part of a broader _____________.
computational complexity theoryCorrect
All representations are ________.
unstable
perfect
stable
imperfectCorrect
An array is a good example of _________data structure.
nonlinear
linear
dynamic
staticCorrect
An example of an abstract computer.
Turing machineCorrect
Another term for an empty set.
nullCorrect
Another term for text analytics.
text miningCorrect
Another term for variability
mean
center
frequent
dispersionCorrect
Any way to get new expressions from old ones
semantic
surrogate
inferenceCorrect
reasoning
As of 2014, there are _______ million tweets a day.
500Correct
A classification table is also called a ________
criteria matrix
confidential matrix
confusion matrixCorrect
conditional matrix
Data involving two variables.
bivariateCorrect
Data is NOT information unless we add_________.
analyticsCorrect
Displays the performance of a model and enables a comparison to be made with other models.
DAC
ROC curveCorrect
SBC
GLM
Earlier name for data science.
datalogyCorrect
By the empirical rule for a normal distribution, ______% of the data lie within 1 standard deviation below and above the mean.
79
68Correct
64
75
By the empirical rule for a normal distribution, ________% of the data lie within 2 standard deviations above and below the mean.
85
80
90
95Correct
By the empirical rule for a normal distribution, ______% of the data lie within 3 standard deviations above and below the mean.
98
95
99.7Correct
92
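The three empirical-rule percentages (roughly 68%, 95% and 99.7%) come from the normal distribution itself. A short sketch, assuming the standard identity P(|Z| <= k) = erf(k / sqrt(2)); the helper name `within_k_sd` is illustrative.

```python
from math import erf, sqrt

def within_k_sd(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return erf(k / sqrt(2))

for k in (1, 2, 3):
    print(k, round(100 * within_k_sd(k), 2))
```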
Exabyte means ________ bytes
million million
billion billionCorrect
trillion trillion
thousand thousand
Example of a data product.
Google MapsCorrect
He coined the term "data scientist"
DJ PatilCorrect
He coined the term “analysis of algorithms”.
Donald KnuthCorrect
He is someone who asks interesting questions on formal and informal theory.
data scientistCorrect
He pointed out that until 2003, all of mankind had generated just 5 exabytes of data
Eric Smith
Eric SchmidtCorrect
Eric Smidth
Eric Smicht
He proposed the use of a penalized likelihood function.
Hein
Gombartz
Heitz
FirthCorrect
He said that “In mathematics the art of proposing a question must be held of higher value than solving it.”
Georg CantorCorrect
Francis Galton
William Gibson
Eric Schmidt
How many bytes of data are generated every two days in today's world?
5 megabytes
5 gigabytes
5 terabytes
5 exabytesCorrect
If A = {x | x is a distinct letter in the word "MATHEMATICS"} and B = {x | x is a distinct letter in the word "STATISTICS"}, then their intersection is
{C,I,S}
{A,C,I,S,T}Correct
{A,C,I,S}
{A,C,S}
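The intersection can be computed directly, since building a set from a string keeps each distinct letter once. A brief sketch using Python's built-in `set` type:

```python
a = set("MATHEMATICS")   # distinct letters of the first word
b = set("STATISTICS")    # distinct letters of the second word

common = sorted(a & b)   # letters that appear in both words
print(common)
```

The letters M, H and E occur only in "MATHEMATICS", so they drop out of the intersection.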
If A = {2,3} and B = {4,5}, which of the following is the Cartesian product of the two sets?
{(3,4), (3,5), (2,4), (2,2)}
{(3,4), (3,5), (2,4), (2,5)}Correct
{(3,4), (3,3), (2,4), (2,5)}
{(3,3), (3,5), (2,4), (2,5)}
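A Cartesian product pairs every element of the first set with every element of the second. A small sketch using `itertools.product` from the standard library:

```python
from itertools import product

a = [2, 3]
b = [4, 5]

# Each pair takes its first entry from a and its second entry from b.
pairs = set(product(a, b))
print(sorted(pairs))
```

Any option containing a pair such as (2,2) or (3,3) can be ruled out, because the second entry must come from B.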
If in a distribution all scores are distinct, then _____________
it is skewed
there is no modeCorrect
the mean is higher than the mode
it is normal
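The mode items in this set can be explored with a frequency count. This is a sketch under one common convention, namely that a data set has no mode when all of its distinct values occur equally often; the function name `modes` is hypothetical.

```python
from collections import Counter

def modes(scores):
    """Return the sorted list of modes, or [] when there is no mode."""
    counts = Counter(scores)
    if not counts:
        return []
    top = max(counts.values())
    # If several distinct values all occur equally often, none stands out.
    if len(counts) > 1 and all(c == top for c in counts.values()):
        return []
    return sorted(v for v, c in counts.items() if c == top)

print(modes([3, 7, 1, 9]))                  # all scores distinct
print(modes([2, 4, 4, 4, 5, 5, 6, 8, 9]))   # one value dominates
print(modes([1, 1, 2, 2, 3]))               # two values tie for most frequent
```

The length of the returned list distinguishes unimodal, bimodal and multimodal data.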
If R = {(3,3), (3,6), (5,5), (5,10), (6,12)} is a binary relation, then its domain is
{3,5,6}Correct
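The domain of a binary relation is the set of first components of its ordered pairs. A minimal sketch, assuming the last pair of the relation is read as (6, 12):

```python
# The relation as a set of ordered pairs (last pair assumed to be (6, 12)).
relation = {(3, 3), (3, 6), (5, 5), (5, 10), (6, 12)}

domain = sorted({x for x, _ in relation})        # first components
range_values = sorted({y for _, y in relation})  # second components
print(domain, range_values)
```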
If the standard deviation of a distribution is 3, the variance is
141
6
15
9Correct
If the standard deviation of a distribution is 3.5, the variance is
12.25Correct
If there are 101 scores the median is equal to the _____ ranked score
51stCorrect
54th
52nd
55th
If there are 103 scores the median is equal to the _____ ranked score.
52ndCorrect
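For an odd number of scores, the median sits at rank (n + 1) / 2 once the scores are sorted. A short sketch; the helper `median_rank` is illustrative and handles only odd counts.

```python
def median_rank(n):
    """1-based rank of the median in a sorted list of n scores (n odd)."""
    if n % 2 == 0:
        raise ValueError("an even count has two middle ranks, not one")
    return (n + 1) // 2

print(median_rank(101), median_rank(103))
```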
In 2,4,4,4,5,5,6,8,9 the range is
5
3
6
7Correct
In the equation of the regression line represented by Y= 1.24 X + 6.9 if X=2 then Y =?
9.38Correct
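Evaluating a regression line is a direct substitution. A small sketch with the slope and intercept taken from the question; the function name `predict` is hypothetical.

```python
def predict(x, slope=1.24, intercept=6.9):
    """Value of Y on the regression line Y = slope * X + intercept."""
    return slope * x + intercept

y = predict(2)
print(round(y, 2))
```

Rounding guards against floating-point noise in the last decimal places.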
IoT means
Internet of time
Interaction of time
Internet of thingsCorrect
Interconnection of things
It allows you to see which value of the explanatory variable corresponds to a given probability of success
probability analysis tableCorrect
ogive
probability table
histogram
It corresponds to the case where the dependent variable has more than 2 categories.
trinomial logit model
multinomial logit modelCorrect
binomial logit model
polynomial logit model
It does NOT require the assumption that the parameters are normally distributed
profile likelihoodCorrect
definite likelihood
mass likelihood
density likelihood
It does NOT require the assumption that the parameters are normally distributed.
profile likelihoodCorrect
It displays the performance of a model and enables a comparison to be made with other models.
LR
IOT
GML
ROCCorrect
It expands available data enormously since there is so much more text being generated than numbers.
Text miningCorrect
text analysis
data mining
data ranking
It expands available data enormously.
text miningCorrect
It extracts meaningful numerical indices from information and makes them available to statistical and machine learning algorithms.