STA301 - Statistics and Probability Concepts

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the definition of an unbiased estimator?

An estimator that has the smallest variance among all possible estimators.
An estimator that always produces the exact true value of the parameter being estimated.
An estimator that is always consistent and efficient.
An estimator whose expected value is equal to the true value of the parameter being estimated. (correct)

What is the purpose of an experimental design?

To control for all possible confounding variables.
To ensure that the results of the experiment are generalizable to the entire population.
To collect data in a way that provides a basis for objective inference about the problem under study. (correct)
To collect data in a way that ensures the results are statistically significant.

Which variable is typically represented by 'Y' in regression analysis?

Confounding Variable
Control Variable
Independent Variable
Dependent Variable (correct)

What is the expected value of the sampling distribution given in the content?

2/3 (D) Signup and view all the answers

What is the variance of the sampling distribution given in the content?

1/3 (A) Signup and view all the answers

What is the purpose of calculating the mean and variance of a sampling distribution?

All of the above. (D) Signup and view all the answers

Which of the following is NOT a characteristic of a good experimental design?

Bias in the selection of subjects (D) Signup and view all the answers

In regression analysis, what is the relationship between the independent and dependent variables?

The independent variable influences the dependent variable. (B) Signup and view all the answers

What is the technical term for the difference between a sample statistic and the corresponding population parameter?

Sampling error (D) Signup and view all the answers

Which of the following is NOT a property of a hypergeometric distribution?

The trials are independent. (C) Signup and view all the answers

What is the formula for calculating sampling error?

$X - \mu$ (C) Signup and view all the answers

Which of the following scenarios accurately describes the use of a hypergeometric distribution?

A researcher draws 5 cards from a deck without replacement to determine the probability of getting all hearts. (A) Signup and view all the answers

What is the primary advantage of using the median as a measure of central tendency?

It is not influenced by extreme values in the dataset. (D) Signup and view all the answers

What condition indicates that a statistic is a biased estimator?

The expected value of the statistic is not equal to the true parameter. (D) Signup and view all the answers

What does the term "five-number summary" refer to?

The minimum, first quartile (Q1), median, third quartile (Q3), and maximum of a dataset. (C) Signup and view all the answers

What is a key disadvantage of using the median as a measure of central tendency?

It does not take into account all data points in the dataset. (C) Signup and view all the answers

What is the formula for the mathematical expectation (E) of a discrete random variable X?

E(X) = ∑ xi f(xi) (C) Signup and view all the answers

Which of the following is a property of the expected value of a random variable?

Both A and B (D) Signup and view all the answers

What is the purpose of a statistical test?

To test a hypothesis about a population parameter. (D) Signup and view all the answers

What is the general rule for determining if a sample is considered small or large?

A sample is considered small if n is less than or equal to 30 and large otherwise. (C) Signup and view all the answers

What does the Least Significant Difference (LSD) test determine?

The significance of the difference between two sample means. (A) Signup and view all the answers

What is the formula for the combined or pooled proportion of two samples, where p1 is the proportion of the first sample, n1 is the size of the first sample, and p2 and n2 are the proportion and size of the second sample, respectively?

(n1p1 + n2p2) / (n1 + n2) (C) Signup and view all the answers

Given a class interval of approximately 2.96 and a range of 14.8, what is the approximate number of classes?

5 (B) Signup and view all the answers

What does the coefficient of variation (C.V) measure?

The spread of a distribution relative to the mean. (A) Signup and view all the answers

What is the significance of the method of maximum likelihood in estimation?

It offers the most probable estimate based on observed data. (C) Signup and view all the answers

In a standard normal distribution, what is the value of the lower quartile?

-0.6745 (B) Signup and view all the answers

In the context of an ANOVA, what does SST represent?

Sum of squares total (B) Signup and view all the answers

What is the expected value of 2X if the expected value of X is 0.7?

1.4 (D) Signup and view all the answers

Which of the following statements is true regarding the probability distribution of a statistic?

The probability distribution of a statistic is called the sampling distribution. (D) Signup and view all the answers

What is the inter-quartile range in a standard normal distribution?

1.349 (D) Signup and view all the answers

In a hypothesis test, why is the critical region defined as z > 1.645 for α = 0.10?

It represents the rejection region for a one-tailed test with a significance level of 0.10. (C) Signup and view all the answers

Suppose a department claims that the average value exceeds Rs. 2500. What would be the null and alternative hypotheses to test this claim at a 0.05 level of significance?

H0: μ ≤ 2500, H1: μ > 2500 (B) Signup and view all the answers

In the provided ANOVA table, what is the degrees of freedom for the 'Error' source?

8 (B) Signup and view all the answers

Which of the following scenarios represents mutually exclusive events?

Getting heads and tails when tossing a coin. (D) Signup and view all the answers

What is the value of β2 for a normal distribution?

3 (D) Signup and view all the answers

In the given ANOVA table, what specific term is represented by the 'MS' value?

Mean Square (D) Signup and view all the answers

Given the formula for calculating the test statistic z, which of the following values is NOT required to perform the hypothesis test?

μ (B) Signup and view all the answers

Which of these events is NOT an example of a partition in the context of probability?

The numbers on a standard die: 1, 2, 3, 4, 5, or 6. (B) Signup and view all the answers

According to Bayes' theorem, what does P(Ai/B) represent?

The probability of event Ai occurring given that event B has already occurred. (B) Signup and view all the answers

In the statement "The 95% confidence interval for the population mean is 1.3 to 4.7", what does "95% confidence" mean?

There is a 95% chance that the true population mean lies within the interval 1.3 to 4.7. (C) Signup and view all the answers

What is the formula used to calculate the chi-square goodness of fit test statistic?

$\chi^2 = \sum_{i=1}^{n} \frac{(o_i - e_i)^2}{e_i}$ (D) Signup and view all the answers

What is the approximate value of the F-statistic at a 0.05 significance level with 7 degrees of freedom in the numerator and 10 degrees of freedom in the denominator?

4.075 (D) Signup and view all the answers

In a z-test statistic for a proportion, if the sample proportion (X/n) is greater than the hypothesized proportion (p0), which of the following is used in the calculation of the z-statistic?

X + \frac{1}{2} - np0 (B) Signup and view all the answers

The formula $\sigma_{\hat{p}_1 - \hat{p}_2}$ calculates:

The standard deviation of the difference between two sample proportions (B) Signup and view all the answers

What is the value of q1 in the context of calculating the standard deviation $\sigma{\hat{p}_1 - \hat{p}_2}$ when p1 is 0.3?

0.7 (B) Signup and view all the answers

What is the null hypothesis in the scenario involving the mayoral candidate?

p1 - p2 > 0.10 (C) Signup and view all the answers

Which of the following is the correct formula for the standard deviation of the difference between two sample proportions in the context of the mayoral candidate scenario?

$\sqrt{\frac{p_1q_1}{n_1} + \frac{p_2q_2}{n_2}}$ (C) Signup and view all the answers

What is the alternative hypothesis in the mayoral candidate scenario?

p1 - p2 < 0.10 (D) Signup and view all the answers

Flashcards

Un-Biased Estimator

An estimator that is correct on average across many samples.

Experimental Design

A plan to collect data objectively for inference.