___________ uses artifacts to present data visually.

- Statistics Analytics
- Data Mining
- Text Analytics
**data visualization**Correct

_____________ includes identifying groups of data records.

- Statistics Analytics
**Cluster analysis**Correct
- Business Intelligence
- Text Analytics

_____________ is rated as the number one business analytics software

- KNIME
**RapidMiner**Correct
- WEKA
- Orange

_______________ is a data structure in which every component has a unique predecessor and successor.

**linear**Correct
- nonlinear
- static
- dynamic

“All models are wrong, but some are useful.”

- William Gibson
- Georg Cantor
**George E. P. Box**Correct
- DJ Patil

A bell shaped curve that is symmetric about a vertical line.

**normal distribution**Correct
- kurtic
- skewed
- standard distribution

A bell-shaped distribution that is symmetric about a vertical line.

- standard
- skewed
**normal**Correct
- symmetric

Data in which every score occurs the same number of times is said to have

**no mode**Correct

A distribution in which large data sets are displayed.

**Grouped frequency distribution**Correct
- ogive
- histogram
- Relative frequency distribution

A frequently used method because it enables binary variables, sums of binary variables, or polytomous variables to be modelled.

**logistic regression**Correct
- exponential regression
- linear regression
- binomial regression

A graph that is used to indicate frequency distribution.

**histogram**Correct

A graph used to indicate intervals in a frequency distribution is referred to as a ______________.

- bar graph
- pie graph
- ogive
**histogram**Correct

A matrix that has the same number of rows and columns is called

**Square**Correct

A model that corresponds to the case where the dependent variable has more than two categories.

**multinomial logit model**Correct

A negative correlation exists when ___________.

**x increases as y decreases**Correct

A network purporting to describe family memberships

**network topology**Correct
- networking
- network tautology
- network adherence

A new phenomenon for the explosion of _________ data

- communication
- transient
- transaction
**interaction**Correct

A perfect positive correlation coefficient is equal to

**1**Correct

A positive z-score means that the score is

- One standard deviation higher than the mean
- Equal to the mean
- Lower than the mean
**Higher than the mean**Correct

A score of 3 in 2, 4, 4, 4, 5, 5, 6, 8, 9 is

**1.02 below the mean**Correct
- 1.2 above the mean
- 1.92 above the mean
- 1.18 below the mean
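
The keyed answer appears to be a z-score, that is, a distance from the mean measured in standard deviations. A minimal sketch of that calculation follows, assuming the sample standard deviation (n - 1 denominator) is intended; the variable names are illustrative. Computed without rounding intermediate values, the result is about -1.03, which agrees with the keyed 1.02 once the mean and standard deviation are rounded to two decimals first.

```python
import statistics

scores = [2, 4, 4, 4, 5, 5, 6, 8, 9]

mean = statistics.mean(scores)        # 47 / 9, about 5.22
sample_sd = statistics.stdev(scores)  # sample SD (n - 1 denominator), about 2.17

# z-score of a raw score of 3: negative means below the mean
z = (3 - mean) / sample_sd
print(round(z, 2))
```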

A score of 50 lies 2 standard deviations above a mean of 30. What is the value of the standard deviation?

- 25
- 20
**10**Correct
- 15

A special type of function where the domain is a set of consecutive integers.

**sequence**Correct

A survey of 100 consumers said that the price charged for a kilo of rice could be approximated by a normal distribution with a mean of 35 and a standard deviation of 4. How many are less than 39?

- 82
**84**Correct
- 80
- 78

A survey of 100 consumers said that the price charged for a kilo of rice could be approximated by a normal distribution with a mean of 35 and a standard deviation of 4. How many of them lie between 27 and 43?

- 90
**95**Correct
- 88
- 92
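
The two rice-price answers can be checked with the normal cumulative distribution function. This is a minimal sketch using `statistics.NormalDist` from the Python standard library; the variable names are illustrative. The exact values are roughly 84.1 and 95.4, so the keyed answers of 84 and 95 correspond to the rounded empirical-rule figures.

```python
from statistics import NormalDist

prices = NormalDist(mu=35, sigma=4)  # mean 35, standard deviation 4
n = 100                              # consumers surveyed

# Expected number of prices below 39 (one SD above the mean)
below_39 = n * prices.cdf(39)

# Expected number of prices between 27 and 43 (within two SDs of the mean)
between_27_43 = n * (prices.cdf(43) - prices.cdf(27))

print(round(below_39), round(between_27_43))
```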

A vegetable distributor knows that during the month of August, the weights of tomatoes are normally distributed with a mean of 0.61 lb and a standard deviation of 0.15 lb. How many can be expected to weigh between 0.31 and 0.91 lb in a shipment of 4500 tomatoes?

**4275**Correct

A vegetable distributor knows that during the month of August, the weights of tomatoes are normally distributed with a mean of 0.61 lb and a standard deviation of 0.15 lb. How many can be expected to weigh less than 0.31 lb in a shipment of 6000 tomatoes?

**150**Correct

A vegetable distributor knows that during the month of August, the weights of tomatoes are normally distributed with a mean of 0.61 lb and a standard deviation of 0.15 lb. What percent of the tomatoes weigh less than 0.71 lb?

**84**Correct

According to Hilary Mason, which is NOT a skill that a good data scientist must cultivate?

**critical thinking**Correct

Addition and subtraction of matrices is possible only if the two or more matrices

- Have same number of columns
- Are square matrices
- Have same number of rows
**Have same sizes**Correct
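
The same-size requirement follows from the fact that matrix addition works element by element. A minimal sketch, assuming matrices are stored as lists of rows; `add_matrices` is a hypothetical helper name, not a library function.

```python
def add_matrices(a, b):
    """Add two matrices given as lists of rows; both must have the same size."""
    same_rows = len(a) == len(b)
    same_cols = same_rows and all(len(ra) == len(rb) for ra, rb in zip(a, b))
    if not same_cols:
        raise ValueError("matrices must have the same size")
    # Add corresponding entries, row by row
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


total = add_matrices([[1, 2], [3, 4]], [[5, 6], [7, 8]])
print(total)
```

Calling the helper with a 1 x 2 matrix and a 2 x 1 matrix raises `ValueError`, because there is no matching entry to pair with each element.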

A distribution with 4 modes is said to be a _________ distribution

- trimodal
**multimodal**Correct
- bimodal
- unimodal

Algorithm analysis is an important part of a broader _____________.

**computational complexity theory**Correct

All representations are ________.

- unstable
- perfect
- stable
**imperfect**Correct

An array is a good example of a _________ data structure.

- nonlinear
- linear
- dynamic
**static**Correct

An example of an abstract computer.

**Turing machine**Correct

Another term for an empty set.

**null**Correct

Another term for text analytics.

**text mining**Correct

Another term for variability

- mean
- center
- frequent
**dispersion**Correct

Any way to get new expressions from old ones

- semantic
- surrogate
**inference**Correct
- reasoning

As of 2014, there are _______ million tweets a day.

**500**Correct

Classification table is also called ________

- criteria matrix
- confidential matrix
**confusion matrix**Correct
- conditional matrix

Data involving two variables.

**bivariate**Correct

Data is NOT information unless we add _________.

**analytics**Correct

Displays the performance of a model and enables a comparison to be made with other models.

- DAC
**ROC curve**Correct
- SBC
- GLM

Earlier name for data science.

**datalogy**Correct

By the empirical rule, ______% of the data in a normal distribution lie within 1 standard deviation below and above the mean.

- 79
**68**Correct
- 64
- 75

By the empirical rule, the range within 2 standard deviations above and below the mean of a normal distribution covers ________% of the data.

- 85
- 80
- 90
**95**Correct

Empirical rule for a normal distribution that is 3 standard deviations above and below the mean covers ______% of the data.

- 98
- 95
**99.7**Correct
- 92
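
The three empirical-rule percentages can be reproduced from the error function. This is a minimal sketch using only the standard library; `within_k_sd` is a hypothetical helper name. It relies on the identity that the fraction of a normal distribution within k standard deviations of the mean equals erf(k / sqrt(2)).

```python
import math


def within_k_sd(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    # Prints roughly 68.3, 95.4 and 99.7 percent
    print(k, round(100 * within_k_sd(k), 1))
```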

Exabyte means ________ bytes

- million million
**billion billion**Correct
- trillion trillion
- thousand thousand

Example of a data product.

**Google Maps**Correct

He coined the term "data scientist"

**DJ Patil**Correct

He coined the term “analysis of algorithms”.

**Donald Knuth**Correct

He is someone who asks interesting questions on formal and informal theory.

**data scientist**Correct

He pointed out that until 2003, all of mankind had generated just 5 exabytes of data.

- Eric Smith
**Eric Schmidt**Correct
- Eric Smidth
- Eric Smicht

He proposed the use of a penalized likelihood function.

- Hein
- Gombartz
- Heitz
**Firth**Correct

He said that “In mathematics the art of proposing a question must be held of higher value than solving it.”

**Georg Cantor**Correct
- Francis Galton
- William Gibson
- Eric Schmidt

How many bytes of data are generated every two days in today's world?

- 5 megabytes
- 5 gigabytes
- 5 terabytes
**5 exabytes**Correct

If A = {x | x is a distinct letter in the word "MATHEMATICS"} and B = {x | x is a distinct letter in the word "STATISTICS"}, then their intersection is

- {C,I,S}
**{A,C,I,S,T}**Correct
- {A,C,I,S}
- {A,C,S}
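
The intersection can be checked directly with Python's built-in `set` type, which keeps only the distinct letters of each word. This is a small sketch; the variable names are illustrative.

```python
a = set("MATHEMATICS")  # distinct letters: A, C, E, H, I, M, S, T
b = set("STATISTICS")   # distinct letters: A, C, I, S, T

# Letters that appear in both words, sorted for a stable display order
common = sorted(a & b)
print(common)
```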

If A = {2,3} and B = {4,5}, which of the following is the Cartesian product of the two sets?

- { (3,4), (3,5), (2,4), (2,2) }
**{ (3,4), (3,5), (2,4), (2,5) }**Correct
- { (3,4), (3,3), (2,4), (2,5) }
- { (3,3), (3,5), (2,4), (2,5) }
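
A Cartesian product pairs every element of the first set with every element of the second. A minimal sketch using `itertools.product` from the standard library; the variable names are illustrative.

```python
from itertools import product

a = [2, 3]
b = [4, 5]

# Every ordered pair (x, y) with x taken from a and y taken from b
pairs = set(product(a, b))
print(sorted(pairs))
```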

If in a distribution all scores are distinct then _____________

- it is skewed
**there is no mode**Correct
- the mean is higher than the mode
- it is normal

If R = {(3,3), (3,6), (5,5), (5,10), (6,12)} is a binary relation, its domain is

**{3,5,6}**Correct
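
The domain of a binary relation is the set of first components of its ordered pairs. A short sketch follows, assuming the last pair in the question is meant to be (6, 12); the variable names are illustrative.

```python
# Assumption: the final pair is (6, 12)
relation = {(3, 3), (3, 6), (5, 5), (5, 10), (6, 12)}

# Collect the first component of every ordered pair
domain = {x for x, _ in relation}
print(sorted(domain))
```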

If the standard deviation of a distribution is 3, the variance is

- 141
- 6
- 15
**9**Correct

If the standard deviation of a distribution is 3.5, the variance is

**12.25**Correct

If there are 101 scores, the median is equal to the _____ ranked score.

**51st**Correct
- 54th
- 52nd
- 55th

If there are 103 scores, the median is equal to the _____ ranked score.

**52nd**Correct
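
For an odd number of scores, the median sits at rank (n + 1) / 2 once the scores are sorted. A minimal sketch; `median_rank` is a hypothetical helper name and it only handles odd n, since an even count has no single middle score.

```python
def median_rank(n):
    """Rank (1-based) of the median among n sorted scores, for odd n."""
    if n % 2 == 0:
        raise ValueError("an even number of scores has no single middle rank")
    return (n + 1) // 2


print(median_rank(101), median_rank(103))
```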

In 2,4,4,4,5,5,6,8,9 the range is

- 5
- 3
- 6
**7**Correct

In the equation of the regression line represented by Y= 1.24 X + 6.9 if X=2 then Y =?

**9.38**Correct
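
Evaluating the regression line is a direct substitution of X into the equation. A small sketch; `predict` is a hypothetical helper name, with the slope and intercept taken from the question.

```python
def predict(x, slope=1.24, intercept=6.9):
    """Value of Y on the regression line Y = slope * X + intercept."""
    return slope * x + intercept


y = predict(2)  # 1.24 * 2 + 6.9
print(round(y, 2))
```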

IoT means

- Internet of time
- Interaction of time
**Internet of things**Correct
- Interconnection of things

It allows you to see which value of the explanatory variable corresponds to a given probability of success.

**probability analysis table**Correct
- ogive
- probability table
- histogram

It corresponds to the case where the dependent variable has more than 2 categories.

- trinomial logit model
**multinomial logit model**Correct
- binomial logit model
- polynomial logit model

It does NOT require the assumption that the parameters are normally distributed

**profile likelihood**Correct
- definite likelihood
- mass likelihood
- density likelihood

It does NOT require the assumption that the parameters are normally distributed.

**profile likelihood**Correct

It displays the performance of a model and enables a comparison to be made with other models.

- LR
- IOT
- GML
**ROC**Correct

It expands available data enormously since there is so much more text being generated than numbers.

**Text mining**Correct
- text analysis
- data mining
- data ranking

It expands available data enormously.

**text mining**Correct

It extracts meaningful numerical indices from information and makes them available to statistical and machine learning algorithms.

- business intelligence
**Text analytics**Correct
- data mining
- data visualization

