Lecture 14 Analysis of Quantitative Data-FN-slides (1).pptx

Full Transcript

Analysis of Quantitative Data Technical Report Writing BITS F112 Two kinds data and their analysis ➢ Qualitative data: Contains words and descriptions of experiences or pertains to quality of the parameter. ○ Contains written or spoken words, images or videos. ➢ Quantitative data: is a set of tech...

Analysis of Quantitative Data Technical Report Writing BITS F112 Two kinds data and their analysis ➢ Qualitative data: Contains words and descriptions of experiences or pertains to quality of the parameter. ○ Contains written or spoken words, images or videos. ➢ Quantitative data: is a set of techniques that use mathematical and statistical modeling, measurement, and research to understand behavior. ○ Such data can be measured, calculated, grouped or ranked. Outline 1. IKW pyramid and analysis of data 2. What is data analysis 3. Broad strategy employed in data analysis 4. The use of statistics 5. Descriptive statistics: Visualisation of data 6. Descriptive statistics: Measures of central tendency 7. Descriptive statistics: Measures of dispersal DIKW Pyramid and Analysis of data The science of examining data by subjecting it to operations in order to reveal insights. Source: Knowledge Hub (n.d.) What is Data Analysis? ● The process of systematically applying statistical and/or logical techniques to clean, condense, describe, illustrate and evaluate data. ● The practice of getting insights from data that can enable us to achieve our research goals and make informed decisions. ● The process of reducing raw data to smaller fragments that make sense. What does primary data look like? IDa RAIN APR-GOA 3.6 ID JUN-GOA RAIN 764 ID DEC-PUNJAB FEB-GOA (Source: IMD) RAIN 0.7 ID MAY-PUNJAB 16.7 NOV-GOA 27.3 APR-KERALA 214.1 JUN-KERALA 563.6 APR-PUNJAB 29.8 JUN-PUNJAB 48.3 FEB-KERALA 5.8 NOV-KERALA 31.3 NOV-PUNJAB 0 AUG-GOA 377.3 MAR-GOA 36.8 FEB-PUNJAB AUG-KERALA 252.2 MARKERALA 50.1 JAN-GOA 2.7 OCT-GOA AUG-PUNJAB 88.6 MAR-PUNJAB 68.5 JAN-KERALA 3.1 OCT-KERALA MAY-GOA 11.3 JAN-PUNJAB 17.7 OCT-PUNJAB MAYKERALA 201.8 DEC-GOA 0 DEC-KERALA 79.4 JUL-PUNJAB 130.2 SEP-PUNJAB 69.2 JUL-GOA JUL-KERALA 526.5 406 RAIN 223.6 0.8 91.4 308.1 9 SEP-GOA 240.9 SEP-KERALA 292.9 Broad strategy when analyzing data Organization of data (cleaning and arranging the archive) Month KERALA GOA PUNJAB JAN 3.1 2.7 17.7 FEB 5.8 0 31.3 MAR 50.1 36.8 68.5 APR 214.1 3.6 29.8 MAY 201.8 11.3 16.7 JUN 563.6 764 48.3 JUL 406 526.5 130.2 AUG 252.2 377.3 88.6 SEP 292.9 240.9 69.2 OCT 308.1 91.4 9 NOV 223.6 27.3 0.8 DEC 79.4 0 0.7 Summarization and categorization of data ● to reveal patterns, themes and relationships Analysis ● using inductive and deductive approaches to give raw data structure and meaning Max Rainfall - Kerala Min Rainfall - Punjab The use of Statistics ▪ Statistics: Methods to investigate numerical data ▪ Statistics: Broadly, refers to methods to systematically observe and describe data and to carefully/cautiously make generalisations and predictions. ▪ Two broad branches of statistics: • Descriptive statistics: methods used to summarise or describe our observations • Inferential statistics: Using previous observations as a basis for making estimates or predictions (inferences) about a situation that is not fully investigated. Descriptive statistics ● What is it that is actually described? Population: consists of all cases/data points Sample: relatively small selection of the population/data points Rainfall data for very single place in Goa Source: Gokulakrishnan (2020) Rainfall data was collected from representative points such as Mapusa, Cancona, Vasco, Madgaon in Goa. Descriptive statistics ● Once we collect data from a sample (observations), making sense of the same involves summarising the observations using: 1. Sum, percentage, percentile 2. Measures of central tendency 3. Measures of dispersion And, visualising the observations using tables, diagrams, and graphs. Descriptive statistics: Tabular representation of data Table Our study intended to find out the amount of rainfall in different states across india. We then sampled six geographically representative states, and measured the annual rainfall there. State Cumulative Annual rainfall (in mm) KERALA 2600.7 GOA 2081.8 ANDAMAN & NICOBAR ISLANDS 2904.5 PUNJAB 510.8 HIMACHAL PRADESH 1210.5 JAMMU & KASHMIR 1572.9 Descriptive statistics: Diagramatic representation of Charts data ● Representation of the same data set mentioned above as pie chart. I. Organize and show data as a percentage of a whole II. Represent the relative sizes of data Descriptive statistics: Graphical representation of Graphs - Bar charts data A bar chart is a graphical representation of data using rectangular bars. The length of each bar is proportional to the value it represents. Bar charts are used to display and compare discrete categories of data Descriptive statistics: Measures of Central Tendency Frequency Distribution Number of instances with which each value is observed (see the excel sheet for values) An example – distribution of student’s marks Descriptive statistics: Measures of Central Tendency The ‘tendency’ of the observations to pile around a particular value (or a particular group). Measures of central tendency: the mean, median and mode. ● Mean: the arithmetic mean is determined by the sum of the values in the distribution divided by the number of values. ● Median: the point on the scale below which half of the values lie. ● Mode: the value with the most frequent occurrence. Mean, Mode, Median An example – distribution of student’s marks Mode = 40 Mean = 46.15 Range in this data set = 0 - 86 Median=46 Descriptive statistics: Measures of Dispersion The degree of the spread, variability or distribution of the data. Measures of spread or distribution: range and standard deviation. Range: the difference between the maximum and minimum values on the scale of measurement. Range = [0-86] Descriptive statistics: Measures of Dispersion Variance: measures the average degree to which each point differs from the mean Standard deviation: the average deviation from the mean of the distribution. Standard deviation and variance An example – distribution of student’s marks Mode = 40 Mean = 46.15 Range in this data set = 0 - 86 Std Dev. in this data set = 17.92 So this dataset can be described as 46.15±17.92 References Derek Rowntree (1981) Statistics Without Tears: A Primer for Nonmathematicians CHARLES SCRIBNER's SONS; New York. Gokulakrishnan (2020) Probability and Statistics: Population and Sample, Medium.com, Accessed online: https://medium.com/analytics-vidhya/statistics-population-andsample-993a488572ac [06 June 2022]. Knowledge Hub (n.d.) What is the Data, Information, Knowledge, Wisdom (DIKW) Pyramid?, Ontotext, Accessed online: https://www.ontotext.com/knowledgehub/fundamentals/dikwpyramid/ [06 June 2022]. Pyrczak, F & Oh, D.M. (2018) Making Sense of Statistics: A Conceptual Overview (7th Edition), Routledge: New York.

Use Quizgecko on...
Browser
Browser