Introduction to Statistics PDF
Document Details
Uploaded by Deleted User
Tags
Summary
This document provides an introduction to statistics, covering various aspects such as the different meanings of the word, its origin, and features. It also explores the different types of statistics (descriptive and inferential), levels of data measurement (nominal, ordinal, interval, and ratio), and key statistical terms such as population, sample, parameter, and statistic.
Full Transcript
# Introduction to Statistics ## Introduction The word statistics conveys a variety of meaning to people in different walks of life. The word statistics comes from the Italian words *Statista* meaning *STATEMENT*. The German word *Statistik* meaning *POLITICAL STATE*. Statistics is a branch of...
# Introduction to Statistics ## Introduction The word statistics conveys a variety of meaning to people in different walks of life. The word statistics comes from the Italian words *Statista* meaning *STATEMENT*. The German word *Statistik* meaning *POLITICAL STATE*. Statistics is a branch of scientific methods used in dealing with phenomena that can be described numerically either by count or by measurement. ## Statistics - Statistics refers to dealing with quantitative information. A branch of mathematics which deals with gathering, organizing, analyzing, and interpreting numerical data. - Statistics is the sciences and art of dealing with figure and facts. - Father of statistics is Gottfried Achenwall. ## Features of Statistics 1. It is aggregated of facts. 2. Numerically expressed (*Quantitative*). 3. Affected to a marked extent by multiplicity of causes. 4. Estimated according to a reasonable standard of accuracy. 5. Collecting data in a systematic manner. 6. Statistics are collected for a predetermined purpose, well defined & specific 7. Should be placed in relation to each other. 8. Data can't be expressed in qualitative form. ## Statement All Statistics are numerical statements (*data*) but all numerical statements (*data*) are not statistics. ## Types of Statistics Statistics can be divided into two branches: Descriptive statistics and Inferential statistics. - **Descriptive Statistics:** Descriptive statistics results from gathering data from a body, group, or population and reaching conclusions only about that particular group or population. For example: If an instructor produces statistics to summarize a class's examination effort and uses those statistics to reach conclusions about that class only, the statistics are descriptive. - **Inferential Statistics:** Sample is collected from the group or population and statistics is used to reach conclusions about the group or population from which the sample was taken, this statistics is called inferential statistics. For example: Testing of new drug on a small sample of patients to reach the conclusions about the drug and make inferences about the population. ## Inferential Statistics Two major types of inferential statistics are: - **Parametric Statistics:** Certain assumptions are made about the distribution of the data. Only Interval and Ratio data are used in parametric statistics. - **Non-Parametric Statistics:** When there is no assumptions made about the distribution of the data. Only Nominal and Ordinal data are used in non-parametric statistics. ## Data Measurement Scaling is an extension of the concept of measurement. It is said the result of measurement is a scale comprises of set of numerals on which an object score is placed using a certain rule of assignment. According to Adward “Scaling is a procedure for the assignment of numbers to a property of objects in order to impart some of characteristics of numbers to the properties in question”. ## Levels of Data Measurement Four levels of data measurement as follow: - **Nominal** - **Ordinal** - **Interval** - **Ratio** ## Levels of Measurement - **Nominal:** Named Variables - **Ordinal:** Named + Ordered + Proportionate Interval between Variables - **Interval:** Named + Ordered + Proportionate Interval between Variables - **Ratio:** Named + Ordered + Proportionate Interval between Variables + Can accommodate Absolute Zero ## Nominal Level - **The lowest level of data measurement is the nominal level.** Nominal level data measurement is used to define data as numbers, alphabets or symbols without any quantitative value. Numbers representing nominal level data can be used only to classify or categorize. - **Examples:** Employee identification numbers. The numbers are used to differentiate employees and not to make a value statement about them. ## Ordinal Level - **Ordinal level data measurement is higher than the nominal level.** Ordinal level measurement can be used to rank, order or give preference to the objects. - **For example:** Using ordinal data, a supervisor can evaluate three employees by ranking their productivity with the numbers 1 to 3. (+) Positive - Strongly Agree - Agree - Neutral - Disagree - Strongly Disagree (-) Negative ## Interval Level - **Interval level data measurement is the next to the highest level of data in which the distances between consecutive numbers have meaning and the data are always numerical.** The distances represented by the differences between consecutive numbers are equal; i.e. interval data have equal intervals. - **For example,** when we measure temperature (*in Fahrenheit*), the distance from 30-40 is same as distance from 70-80. ## Ratio Level - **Ratio level data measurement is the highest level of data measurement.** Ratio data have the same properties as interval data, but ratio data have an absolute zero and the ratio of two numbers is meaningful. The value of zero can not be arbitrarily assigned because it represent a fixed point. - **Examples of ratio data are height, weight, time, volume etc.** - **Note:** Ratio scale is also the part of interval scale. Interval scale ranges from negative to positive while the ratio scale starts from positive i.e. zero “0 ”. ## Important Statistical Terms 1. **Population:** Population is a collection of persons, objects or items of interest. 2. **Census:** A process of gathering data from the whole population for a given measurement of interest. 3. **Sample:** A sample is a portion of the whole population. A sample represents the whole population. 4. **Parameter:** A descriptive measure of the population is called a parameter. Parameters are usually denoted by Greek letters. Examples of parameters are population mean (µ), population variance (σ2) and population standard deviation (σ). 5. **Statistic:** A descriptive measure of a sample is called a statistic. Statistic are usually denoted by Roman letters. Example of statistics are sample mean(x), sample variance(s2) and sample standard variation (s). ## Process of Inferential Statistics to estimate a Population mean (µ) **Calculate (x) to estimate (µ) ** - Population (µ) (*Parameter*) - Select a random Sample - Sample (x) (*Statistics*)