Data Management & Presentation PDF
Summary
This document discusses data management, including different types of data, various presentation methods (textual, tabular, graphical), and computing measures of central tendency. It also covers concepts in understanding data ethically and includes basic probability problems.
Full Transcript
Module 6: Data Management: Responsible Handling of Data and Data Presentation

**Data** is a set of facts that give us information about things. It does not show us everything; it only gives us part of the picture.

Ethical Awareness & Responsible Handling

Principles of Ethical Awareness & Responsible Handling

**Textual Presentation** combines text and numerical facts in statistical reports and is narrative in nature.

**Text** is any written or printed words that convey a message.

**Tabular Presentation** is used when the values are numeric and independent.

**Table** is a set of data arranged in rows and columns.

**Graph** is an effective tool because it displays data at a glance, facilitates comparison, and can reveal trends and relationships within the data.

**Graphical Presentation** is a way of showing information using pictures, charts, or graphs instead of just words.

**Case Study Presentation** presents real-life situations where people have had to make tough decisions about how to handle data ethically and responsibly.

TYPES OF DATA

**Categorical Data** organizes individuals or things into groups based on their characteristics. Example: gender (male/female).

**Numerical Data** consists of values for which exact numbers are expected. Examples: height, weight, age, number of children.

CLASSIFYING NUMERICAL DATA

**Discrete Variables** take values obtained by counting, such as the number of children or the number of male students.

**Continuous Variables** take values obtained by measuring, such as height, temperature, area, and distance.

Module 7: Interpreting Different Graphs

**Graphs** are like a visual map made of dots and lines. The dots represent things, and the lines show connections between them.

**Bar Graph** uses rectangular bars to represent data.

**Line Graph** is good for showing how things change over time.

**Histogram** shows how often different numbers or ranges of numbers appear in a dataset.

**Pie Graph** is a circular chart divided into slices to show how different parts make up a whole.
**Scatter Plot** uses dots to show how different things are related.

Concepts in Understanding Data Ethically

Module 8: Computing Measures of Central Tendency

**Measures of central tendency** are methods used to find the "middle" or "center" of a dataset, which gives us a sense of where most of the data points are located.

Three main measures:

**Mean** is found by adding all the numbers together and then dividing by how many numbers there are. Example: (2 + 4 + 6 + 8 + 10) / 5 = 30 / 5 = 6, so the mean is 6.

**Median** is the middle number when the dataset is arranged in order. It is appropriate to use with ordinal variables and with interval variables that have outliers.

**Mode** is the number that appears most often. Example: in 1, 2, 2, 3, 4 the mode is 2.

These measures help summarize and describe datasets, making it easier to understand what is typical or common in the data.

**Weighted Mean** assigns a different weight to each number, unlike the ordinary mean, which treats all numbers equally.

Module 9: Computing Measures of Dispersion

**Measures of Dispersion** are statistics used to quantify the spread or variability of data points within a dataset.

**Range** is the difference between the highest and lowest values in a dataset.

**Variance** is a measurement of the spread between numbers in a dataset.

**Sample Variance** measures how spread out the scores in a sample are from the sample average. It is like guessing what the whole pizza tastes like by tasting one slice.

**Standard Deviation** provides an average measure of how far the data points are from the mean. It is derived from the variance (it is the square root of the variance).

**Sample Standard Deviation** is a tool we use to see how spread out the numbers in a small group are compared to the average number in that group.

Module 10: Tree Diagram & Basic Probability Problems

**Tree Diagram** is a picture used for making smart choices, figuring out values, or calculating the chances of things happening.
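The measures of central tendency and dispersion defined in Modules 8 and 9 can be checked with a short script. The following is a minimal sketch using Python's standard `statistics` module; the `weighted_mean` helper and the grade and weight values are hypothetical and only there for illustration.

```python
import math
import statistics

# Dataset from the mean example in Module 8.
scores = [2, 4, 6, 8, 10]

mean = statistics.mean(scores)             # (2+4+6+8+10) / 5
median = statistics.median(scores)         # middle value of the sorted data
mode = statistics.mode([1, 2, 2, 3, 4])    # most frequent value


def weighted_mean(values, weights):
    """Hypothetical helper: sum of value*weight divided by sum of weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Illustrative (made-up) grades: the second grade counts three times as much.
final_grade = weighted_mean([80, 90], [1, 3])

# Measures of dispersion from Module 9.
data_range = max(scores) - min(scores)         # highest minus lowest
population_var = statistics.pvariance(scores)  # divides by n
sample_var = statistics.variance(scores)       # divides by n - 1
population_sd = statistics.pstdev(scores)      # square root of population variance
sample_sd = statistics.stdev(scores)           # square root of sample variance

print(mean, median, mode, final_grade)
print(data_range, population_var, sample_var)
print(round(population_sd, 3), round(sample_sd, 3))
```

The sample versions divide by n - 1 instead of n, which is why the sample variance here is slightly larger than the population variance for the same five numbers.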
Basic Rules of Computing Probability

**Probability Computation:** For equally likely outcomes, the probability of event A is given by:

$$P(A) = \frac{\text{Number of favorable outcomes}}{\text{Total number of possible outcomes}}$$

**Addition Rule:** For two events, A and B, the probability that A or B occurs is given by:

$$P(A \text{ or } B) = P(A) + P(B) - P(A \text{ and } B)$$

**Multiplication Rule:** For two independent events, A and B, where the outcome of A does not change the probability of B, the probability of A and B is given by:

$$P(A \text{ and } B) = P(A) \cdot P(B)$$
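The three rules above can be verified by listing outcomes, which is what the leaves of a tree diagram represent. The sketch below is illustrative only: the fair die, the fair coin, the chosen events, and the `probability` helper are assumptions made for the example, not part of the module.

```python
from fractions import Fraction
from itertools import product


def probability(event, sample_space):
    """P(event) for equally likely outcomes: favorable / total."""
    favorable = [outcome for outcome in sample_space if event(outcome)]
    return Fraction(len(favorable), len(sample_space))


# Addition rule on one roll of a fair six-sided die.
die = list(range(1, 7))


def is_even(x):
    return x % 2 == 0       # event A: 2, 4, 6


def is_above_three(x):
    return x > 3            # event B: 4, 5, 6


p_a = probability(is_even, die)
p_b = probability(is_above_three, die)
p_a_and_b = probability(lambda x: is_even(x) and is_above_three(x), die)
p_a_or_b = p_a + p_b - p_a_and_b
# Counting the outcomes in "A or B" directly gives the same answer.
p_a_or_b_direct = probability(lambda x: is_even(x) or is_above_three(x), die)

# Tree diagram: the first branch is a coin flip, the second a die roll.
# Each (coin, die) pair is one leaf of the tree.
tree_leaves = list(product(["H", "T"], die))

# Multiplication rule: the coin and the die do not affect each other.
p_heads = probability(lambda leaf: leaf[0] == "H", tree_leaves)
p_six = probability(lambda leaf: leaf[1] == 6, tree_leaves)
p_heads_and_six = probability(lambda leaf: leaf == ("H", 6), tree_leaves)

print(p_a_or_b, p_a_or_b_direct)          # both use the same sample space
print(p_heads * p_six, p_heads_and_six)   # product rule vs direct count
```

`Fraction` keeps every probability exact, so the rule-based result and the direct count can be compared with `==` rather than with a rounding tolerance.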