Data_Mining.pdf
Document Details
Tags
Full Transcript
IT (9626) Theory Notes Data Mining What is Data Mining? Data mining is the process of uncovering patterns and finding anomalies and relationships in large datasets that can...
IT (9626) Theory Notes Data Mining What is Data Mining? Data mining is the process of uncovering patterns and finding anomalies and relationships in large datasets that can be used to make predictions about future trends. The main purpose of data mining is extracting valuable information from available data. Applications of Data Mining Data mining offers many applications in business. For example, the establishment of proper data (mining) processes can help a company to decrease its costs, increase revenues, or derive insights from the behavior and practices of its customers. Certainly, it plays a vital role in the business decision‐making process nowadays. Data mining is also actively utilized in finance. For instance, relevant techniques allow users to determine and assess the factors that influence the price fluctuations of financial securities. The field is rapidly evolving. New data emerges at enormously fast speeds while technological advancements allow for more efficient ways to solve existing problems. In addition, developments in the areas of artificial intelligence and machine learning provide new paths to the precision and efficiency in the field. Data Mining Process Generally, the process can be divided into the following steps: 1. Define the problem: Determine the scope of the business problem and objectives of the data exploration project. 2. Explore the data: The step includes the exploration and collection of data that will help solve the stated business problem. 3. Prepare the data: Clean and organize collected data to prepare it for the further modeling procedures. 4. Modeling: Create the model using data mining techniques that will help solve the stated problem. 5. Interpretation and evaluation of results: Draw conclusions from the data model and assess its validity. Translate the results into a business decision. Data Mining Techniques The most commonly used techniques in the field include: 1. Detection of anomalies: Identifying unusual values in a dataset. [email protected] +92 300 8460713 IT (9626) Theory Notes 2. Dependency modeling: Discovering existing relationships within a dataset. It frequently involves regression analysis. 3. Clustering: Identifying structures (clusters) in unstructured data. 4. Classification: Generalizing the known structure and applying it to the data. Data mining is a huge step forward for businesses. If they are able to predict what will be wanted beforehand, they are able to make the most profit by changing what they offer early, to easily meet the new demand. [email protected] +92 300 8460713