Introduction to Tools for Data Analytics PDF

Summary

This document provides an introduction to tools for data analytics, covering various concepts and techniques. It explains the steps involved in data analysis and different types of data analytics, like descriptive, diagnostic, etc.

Full Transcript

Introduction to Tools for Data 15 Analytics T ools of Data Analytics actually refer to different software used for the purpose of Data Analytics. Before discussing about these, it is...

Introduction to Tools for Data 15 Analytics T ools of Data Analytics actually refer to different software used for the purpose of Data Analytics. Before discussing about these, it is felt that a few words about Analytics is going to be helpful to understand the necessity of various types of tools related to it. Analytics is the scientific process of discovering and communicating the meaningful patterns which can be found in data. In other words it is the science that analyses raw data to fetch information contained in it. It is concerned with turning raw data into insight for making better decisions. Analytics relies on the application of statistics, computer programming and operations research in order to quantify and gain insight to the meanings of data. It is especially useful in areas which record a lot of data or information. In today`s information age, data is everywhere and ever increasing. As soon as internet is used by on1e (which may be from a desktop computer or laptop or tab or from a mobile phone etc.) digital foot print is left which gives rise to data of some kind. Thus `Data Analytics` is a broad term that encompasses many diverse types of Data Analysis. Any type of data or information can be subjected to the techniques of data analytics to get insight that can be used to improve things. These techniques can reveal trends and metrics that would otherwise be lost in the mass of information, The output of data analytics can then be used to optimize processes to increase the overall efficiency of a system, including that of a business. Implementing it into business models means companies can help reduce costs by identifying more efficient ways of doing business and by storing large amounts of data. Actually this helps to make better business decisions by analysing customer trends and level of satisfaction which can lead to development of new and better products and services. So the role of Data Analytics for a business can be pointed out as follows – ~ Gather hidden insights – Hidden insights from data are gathered and then analysed with respect to the requirements of the business. ~ Generate reports – Reports are generated and passed on to the respective teams and individuals to deal with further actions for a high rise in business. ~ Perform Market Analysis – Market analysis is performed to understand the strengths and weaknesses of the competitors. ~ Improve business requirements – Analysis of data facilitates improvement in business by meeting the actual requirements of the customer. Steps in Data Analytics Step 1 The first step is to determine the data requirements or how the data is grouped. Data may be separated by age, demography, income, gender etc. Data values may be numerical or be divided by category. 722 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics Step 2 The second step in Data Analytics is the process of collecting it. This can be done through a variety of sources – computers, on line sources, cameras, satellites or through personnel. Step 3 Once the data is collected, it must be organized so that it can be analysed. This may take place on a Spread-sheet or other form of software that can take statistical data. Step 4 The data is then cleaned up before analysis. This means it is scrubbed and checked to ensure there is no duplication or error and that it is not incomplete. This step helps correct any error before being analysed. This process is also known as Data Wrangling. Step 5 Analyse data and generate report. Types of Data Analytics There are four types of Data Analytics depending on the type of data available and the type of knowledge one is looking for from the available data. These are – 1. Descriptive Analytics: This is the most basic type of Analytics. It looks at data to examine, understand and describe something that has already happened. It provides quantitative information on “What happened” by analysing the data with the help of statistical techniques like calculation of Mean, Median, Mode etc. Daily report of Cost of Production of an item being produced continuously in an automatic machine is an example. 2. Diagnostic Analytics: It goes deeper than Descriptive Analytics by seeking to understand the “Why” behind what happened. While the production report may show an increase in the Cost of Production of the item, but the reason behind the same is not mentioned. Diagnostic Analytics will look into the patterns and deviations to find the root cause of increase in the cost. Diagnostic Analytics involve correlation of two different datasets. 3. Predictive Analytics: This relies on historical data, past trends and assumptions to answer questions about “What will happen” in future. This is of immense importance as far as the situations of future prediction (like Sales forecasting) are concerned. Predictive Analytics correlates the results of Descriptive and Diagnostic Analytics with external datasets. 4. Prescriptive Analytics: This identifies the specific actions an individual or organisation should take to reach future targets or goals. In other words it comes up with recommendations of “What actions should be taken”. In fact different possible scenarios are taken into account and the corresponding outcomes of each is considered of which the best possible one is recommended. Huge computing power is necessary for going through such type of analytics. This type of Analytics skill is observed in Artificial Intelligence (AI) and Machine Learning (ML). The Institute of Cost Accountants of India 723 Strategic Cost Management What action is to be taken? What is likely to happen? Value Diagnostic Why it happened? What happened? Descriptive Complexity Figure 15.1:Types of Data Analytics Besides the above four, there can be other type known as – Exploratory and Confirmatory Data Analytics As the name implies, Exploratory Data Analytics explores the dataset to find answer as to which describes the data well. In this case no hypothesis is stated before analysing the data. It starts by framing questions about what to do with the data. Thereafter the data is manipulated to the best possible extent to find the answers to the questions. This involves understanding data structure, identifying erroneous or missing data, establishing a margin of error and figuring out the hypothesis. Confirmatory Data Analytics deals with some stated hypothesis. Same is tested for the dataset using statistical tools and inference is drawn about its correctness. An example of hypothesis can be – “I predict that a person`s likelihood of recommending our product is directly proportional to their reported satisfaction with the product”. Now if the trend analysis is carried out on the available data then from that the hypothesis is either proved or disproved. But the availability of the stated hypothesis gives a clear cut direction towards which the data analysis is done. In a way it is obviously advantageous compared to the Exploratory Data Analytics. Tools for Data Analytics From the above discussions, it is clear that acquiring and storing data, processing the same and finally report generation are the steps in Data Analytics in a nutshell. For these, the tools used are nothing but different Software. Depending on the purpose for which a tool is used, one can classify them as follows. 1. Data Transformation Tools These are basically used for the purpose of cleansing of the available data. A prime requirement of getting meaningful output from any software is to provide it with the data in specific field types and structures. Normally lot of issues are noticed in the datasets as far as formats, structures, spellings etc. are concerned. To take care of these, ETL (Extract, Transform and Load) tools are used. It can be mentioned here that according to some other line of thinking, the tools are ELT (Extract, Load and Transform) type and not ETL type which works better. As far as our purpose is concerned both work perfectly alright. Such tools facilitate building a pipeline through which the data are brought together from multiple sources, 724 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics cleansed and stored in a place called data warehouse for analysing. This is applicable when the data is structured. In case of unstructured or semi structured data, the storage is called Lake instead of Warehouse. Data Stores filling start by data extraction from primary sources. Various data inconsistency is removed during this phase. Before their transformation into the data schema, extracted data can be loaded in a temporary dumping ground known as Data Staging Area (DSA). The data component in DSA used to be most frequently a part of those solutions of data stores which has a source in heavy transaction systems. Use of DSA will reduce requirement of transaction system’s utilization in the ETL Process. Use of DSA is possible also in the case when it is necessary to transfer data from a text file into the required database format. After the extraction follows data transformation which will convert data obtained from single data source into unified data model. This model makes it possible to create aggregation and clustering. The final phase of ETL is data transmission from source data memories or temporary dumping ground to database tables of the data store. At the primary filling it can be a gigantic quantity of data. Because ETL works in batch mode, such amount of data will be brought again at the time of next regular updating. The time interval of updating could be a day, a week, a month or a year. The transformed data coming from various service systems and external databases are saved in Data Warehouse which is an extensive central business database. Actually the data in the Data Warehouses are not optimized for quick transaction processing but meant for quick administration of analytical information obtained from big amount of data. When the concept of decentralization is applied to Data Warehouse then we get Data Marts. Basically these are decentralized, based on certain themes. Data Marts are meant to provide analytical information to specific groups like Marketing, Sales, Finance etc. Data Transformation Tools are also called Data Smoothing Tools. Many software for visualization and analysis have such ETL (or ELT) tools built in. Besides, there are many special tools which are capable of handling this cleansing process. Xplenty, Stitch, ABS Glue, Skyvia etc. are some of the commonly used ETL Tool for data cleansing. 2. Data Analysis Tools These tools are meant for Analysis part of the data. Broad classification of these tools are – Spread-sheets Business Intelligence (BI) tools Financial data Analytics tools Programming Languages Tools for Statistical Data Analysis Industry Specific tools I. Spread-sheets Although there are several Spread-sheet database software solutions that offer more than a traditional spread- sheet, Spread-sheet software is still indispensible for many businesses. From generating reports and creating a budget to becoming a glorified to-do list, spread-sheets have the flexibility to meet just about every need of user. The right Spread-sheet software can not only help the user to shift through thousands of datasets, but also keeps the user on track for setting and completing goals. Of the several available Spread-sheet tools of today, MS-Excel remains the most common. The simplest and most obtainable analysis proceedings of business data is offered by MS-Excel. Certainly it is the cheapest The Institute of Cost Accountants of India 725 Strategic Cost Management option available because there is no user having MS-Excel not installed in the computer system. So there is no necessity to buy license for specialized software. Users can straightway go for creation of analytical reports and graphs. Data analyses created by MS-Excel are very dynamic and effective. They enable a lot of different views and graphical representations. Data feeding to MS-Excel is possible in different ways. Most common is the manual data feeding from business reports. The other way is data import from business information system. The third way is direct connection to database of business information system. This way is most operative in the situations where huge volume of data is handled. Pivot Tables are one of the most powerful tools of MS-Excel This enables data summarization, filtration and ordering. It is possible to create a lot of different views, reports and graphs from one data source. Created Pivot Table is easily editable – we can add or delete data, columns, rows or change summaries without the influence of Data Source. Some of the reasons why MS-Excel finds its utility as a tool used in data science are as follows - 1. Awareness: Irrespective of the demographic background, MS-Excel is a widely known tool to almost all people who work with data, and prepare reports. The tables, pivots, VBA-scripts and charts are having high awareness. MS-Excel is treated as a personal productivity tool used extensively in business organisations for preparing dashboards and reporting. 2. User-friendliness: Microsoft Excel is incredibly easy to use. Especially for beginner data scientists, the ease of use and the wide-scale availability of Microsoft Excel makes it an excellent introductory program for both students and professionals. 3. Data visualization: MS Excel is one of best editors for data visualization in 2D format. The tables are easily edited, formatted, colourized, and shared. Google Sheets is a clear confirmation of the Excel design for editing data — but scaled for multiple users. Add to that the built-in tables, filters, slicers, groupings, window splitting, cell formulas and other features and any alternative table or database editor set MS Excel apart from many tools. MS Excel can handle as many as 1,048,576 rows and 16,384 columns per sheet. Therefore, other than a set of complex real-life problems dealing with truly big data, MS-Excel can solve the purpose in many cases. MS Excel also helps in creating the metadata and also to change metadata over time as a dataset grows. 4. Data Analysis: Microsoft Excel has multiple functions programmed into the software which makes it simple to explore a dataset through sorting, filtering, and pivot tables. Tools like Scenario Manager also make it easy to create and compare hypothetical scenarios across datasets. Moreover, MS Excel helps in enfolding the pattern and psychology of the data. For a beginner in data science, MS Excel is a very useful tool. Further, Excel comes with various analytical tool packs that can be activated to unleash more advanced calculations. In addition, one can write customized scripts vis-à-vis the algorithm in use, for data analysis. Even there is an option for using Python with MS Excel. Comparability with other software: This is one of the very powerful features of MS Excel as a tool. The other tools like SPSS, JAMOVI, R, Python etc directly allow to import the excel sheet (in CSV form) as the input data for further analysis. Data science professionals that are committed to writing code and queries can use SQL. Besides MS-Excel there are other Spread-sheet software available for Data Analytics and some of those are – Google Sheets, Quip etc. Google Sheets is a free alternative to Excel. It requires a Google account to begin work (which one will 726 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics readily have if the person is user of gmail). It offers real-time collaboration, commenting and version history to accurately track all progress between contributors. Google Sheets can immediately sync with G Suite apps like Google Analytics and Google Data Studio. It can also populate a spread-sheet with data pulled from Google Forms. In addition to its collaborative features, Google Sheets also comes equipped with a slew of traditional mathematical and logical functions. Even better, its available in the web, Android and IOS Systems, making it one of the most easily accessible software options. Google Sheets and all of its features are available up to 15 GB storage. Quip is a unique productivity tool that combines spread-sheets, documents and team chat all into one app. The tool features the flexibility and customization. For instance you can toggle between document and spread-sheet layout as you work. Quip also boasts more than 400 built-in functions and offers key board shortcuts for most menu items, such as (Shift + Space) for row selection and (Crtl + Space) for column selection. Plus just like Excel, creating a Graph or Chart is as easy as selecting a data range and clicking a button (in this case, Insert and Chart) II. Business Intelligence (BI) Tools Meaning of BI Software Business Intelligence (BI) Software is a set of business analytics solutions used by companies to retrieve, analyse and transform data into useful business insights usually within easy-to-read visualization - like charts, graphs and dashboards. Examples of the best BI Tools include data visualization, data warehouses, interactive dashboards and BI reporting tools. A BI Solution pulls internal data produced by a company, into an Analytics platform for deep insights as to how different parts of a business affect one another. As Big Data has gained in prominence, the tendency for companies to collect, store and mine their business data has increased many times and so has the popularity of BI Software. Companies generate, track and compile business data at a scale never seen before. The ability to integrate cloud software directly with proprietary systems has further driven the need to combine multiple data sources and take advantage of data preparation tools. But all this data is nothing if we can`t make sense of it and use it to improve business outcomes. To make informed choices, businesses need to make their decisions on evidence. The mountains of data that businesses and their customers produce contain evidence of purchasing patterns and market trends. By aggregating, standardising and analysing that data, businesses can better understand their customers, better forecast their revenue growth and better protect themselves against business pitfalls. Business intelligence has traditionally taken the form of quarterly or yearly reports that report on a defined set of Key Performance Indicators (KPI). But today’s BI Reporting software is backed by Data Analytics tools that work continuously at the speed of light. These insights can help a company take a course of action within minutes. BI Software interprets a sea of quantifiable customer and business actions and returns queries based on patterns in the data. BI comes in many forms and spans, many different types of technology. The chart below shows a comparison of few top Business Intelligence Tools according to user popularity and major features. The Institute of Cost Accountants of India 727 Strategic Cost Management Table 16.1: Chart showing comparison of top BI Tools [Source – www.technologyadvice.com] Natural Data Real-Time Embedded Product TA Rating Langauge Analytics Reporting Analytics Processing 4/5 435/5 4.5/5 4.5/5 4.5/5 4/5 4.5/5 4/5 In the chart TA Rating means the rating given by Technology Advice Data storage for BI Throughout an organisation data is available in a number of systems. Companies should ensure standardised formatting across data types from each of these systems so that most accurate analysis is possible. A large enterprise can have information about the customers in the Customer Relationship Management (CRM) application and have financial data in the Enterprise Resource Planning (ERP) application and several other key revenue datasets in various cloud software applications. These separate programs may label and categorise data differently and the company will need to standardise the data before analysis. Some BI Platforms pull data for analysis directly from the source applications through a native API connection or Webhook. Other BI Tools require the use of a cloud data storage system to aggregate diverse data sets in a common location. Small businesses, single departments or individual users may find that a native connection works well, but big organisations which generate large datasets will need a more comprehensive BI setup. In case they want to go for centralised storage location then a Data Warehouse or Data Mart could be a 728 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics solution and purchase an ETL Tool to facilitate their Big Data storage. Alternatively they may use a Data Storage Framework like Hadoop to manage their data. Analysing Big Data with BI Regardless of whether businesses choose to store their data in a Data Warehouse, a Cloud Database, an On- premise Server or run queries on the Source System, data analysis and the resulting insights make the field appealing to the business users. Data Analytics tools vary in terms of complexity, but the general method of combining large amounts of normalised data to identify patterns remains consistent across BI Platforms. Data Mining: This is the activity of “Data Discovery” because here the patterns and inconsistencies of data unveiled through automated or semi-automated data analysis. Common correlations drawn from Data Mining include grouping specific sets of data, finding outliers in data and drawing connections and dependencies from disparate datasets. Data Mining often uncovers the patterns used in more complex analyses, like Predictive modelling which makes it an essential part of the BI Process whose growth is correlated directly with the rise of Big Data in businesses of all sizes. Of the standard processes performed by Data Mining, association rule learning presents the greatest benefit. By examining data to draw dependencies and construct correlations, the association rule can help businesses better understand the way customers interact with their website or even what factors influence their purchasing behaviour. Association rule learning was originally introduced to uncover connections between purchase data recorded in point of sale systems at supermarkets. For example if a customer purchased Tomato Sauce and Cheese, the association rules would likely uncover that the customer purchased Hamburger Meat as well. Though this is a very simple example but it works well to understand the type of analysis that now connects incredibly complex chains of events in all sorts of industries and helps users find correlations that would have remained hidden otherwise. Predictive Analytics with BI Software: One of the most exciting aspects of BI, advanced analytics features like Predictive and Prescriptive analytics function as a subset of Data Mining. The tools use existing datasets and algorithmic models to help companies make better decisions. As discussed before, Predictive Analytics forecast future events based on current and past data. By drawing connections between datasets, these software applications predict the likelihood of future events which can lead to a huge competitive advantage for businesses. Predictive analysis involves detailed modelling. Even it ventures into the realms of Artificial Intelligence (AI) and Machine Learning (ML) where software actually learns from past events to predict future consequences. The three main forms of Predictive analysis are Predictive Modelling, Descriptive Modelling and Decision Analytics. Predictive Modelling is the most well-known segment of Predictive Analytics. This type of software predicts, particularly in reference to a single element. Predictive models use algorithms to search for correlation between a particular unit of measurement and at least one or more features pertaining to that unit. The goal is to find same correlation across different datasets. An example of this modelling is “Likelihood of a customer switching Insurance Providers”. The Institute of Cost Accountants of India 729 Strategic Cost Management Descriptive Modelling seeks to reduce data into manageable sizes and groupings. Descriptive Analytics works well to summarise information such as unique page views or social media mentions. Decision or Prescriptive Analytics take into account all the factors related to a discrete decision. It predicts the cascading effect an action will have across all the variables involved in making that decision. In other words Decision Analytics gives businesses the concrete information they need to predict outcomes and take action. Natural Language Processing: Data comes in three forms – structured, semi structured and unstructured. Of these, the most common is unstructured data which includes text documents and other types of files that exists in formats which computer cannot read easily. Unstructured data cannot be stored in neatly categorised sets of similarly formatted data rows or columns. This makes the traditional data mining software unable to analyse the data. But often such data is seen to become crucial for understanding business outcomes. With so much data in unstructured form, Text Analytics should be a key consideration while deciding on the purchase of BI Tool. Natural Language Processing (NLP) software, also known as Text Analytics software combs large sets of unstructured data to find hidden patterns. NLP is particularly interesting for businesses that work with social media. Using the right software mix of data ingestion and AI, a business can set up rules to track key words or phrases – for example “Name of a business” – to find patterns in how customers use that language. NLP Tools also measure customer sentiment, provide actionable insight into lifetime customer value and learn customer trends that can inform future product lines. BI Software for Corporate reporting The previous two applications of BI Software dealt with the mechanics of BI System – how business data is stored and how software refines this data into meaningful intelligence. BI reporting focuses on the presentation of these findings. Online Analytical Processing (OLAP): This uses multidimensional databases to enable users to query data warehouses and create reports that view data from multiple perspectives. OLAP gives BI Software the ability to combine data, drill down into single metrics and view data for combinations of single metrics that are unobtainable in a traditional spread-sheet setup. For example, a supply chain`s data metrics can include Location, SKU, Date of purchase, Name of the Salesperson and Expiration date. OLAP Tools can provide the analysts with the power to surface insights that would otherwise be hidden within two or three dimensional spread-sheets. Data Visualisation: One of the most popular trends of BI is Data Visualisation. This allows companies to graphically the results of data mining or other analytics. Presenting findings in a visual format like – a graph, chart or on a map provides immediate insights into the most important metrics – awareness that does not surface within the context of a spread-sheet. Dashboards: Every user of a system does not need full access to everything available in the dashboard. Most employees 730 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics only need access to a dashboard of their most important metrics. It gives at a glance access to a range of predefined visualisations. While each company can define its own dashboards based on custom business needs, some of the possible dashboards that BI Tools provide are as follows – Sales Dashboards that includes the total number of leads and prospects in each stage of the sales funnel, KPI metrics of the total number of meetings scheduled per salesperson, a total revenue leader-board, gas gauge tool that shows total revenue towards monthly goal Marketing Dashboards that shows a line chart with the total number of marketing qualified leads per day, top performing blog posts per month, latest social post Customer Success Dashboards with visualisations for the total number of open tickets, number of closed tickets per day, average time to close, ticket totals leader-board IT Dashboards with key metrics regarding sprint progress, total number of open bug tickets, current on- call developers, feature request leader-board Some BI solutions also offer interactive dashboard tools where business users can manipulate the data visualisations, dig for a more detailed view and zoom out for more context. Alerts and Notifications: While interactive dashboards and reports greatly extend the usability of BI Software for non IT users, alerts and notifications can provide even further practical applications for all business users. Alerts notify users who do not spent most of their time in the tool to data changes that need immediate action. When the companies set alerts for threshold of high and low performance, they can track when they need to mobilize a response or investigate an issue before it becomes an emergency. Even better, companies that set alerts for goal metrics can celebrate and recognise their team efforts early and often. Embedded Analytics: BI applications promise to clarify data analytics tools for the most non-technical of employees, which has driven the demand for Embedded BI. These features let companies build data visualisations within their cloud BI Software and dynamically serve those visualisations to internal and external customers within company apps. Visualisations, reports and dashboards that are embedded in a company webpage or cloud app save companies thousands of hours and lots of money they would otherwise use to build BI Reporting tools and Analytics dashboards from scratch to track company`s performance. These tools now give the business users access to custom, plug-and-plug visualisations, greatly speeding the time to market. Different BI Software from various vendors Business Intelligence Tools and Platforms come in several forms for varying business needs. Companies looking to provide data services to business users will find Self Service BI meeting the needs of most of their users. Data Visualisation Tools are helpful for teams that are dipping their toes into Data Analytics but may not have lots of extra development resources available. Data Warehousing Tools provide the underlying infrastructure that can house and cleanse data before serving it up through visualisations and BI Tools provide end-to-end dashboard tools to store, cleanse, visualise and publish data. A list of top BI Tool Software Vendors is given below: The Institute of Cost Accountants of India 731 Strategic Cost Management Table 16.2 Self Service Data Visualisation Data Warehousing Bi Platforms SAP Crystal Reports iDashboards Sisense Tableau Chartio Dundas Oracle BI InsightSquared Alteryx Segment SAS Domo SAP Lumira (formerly Jaspersoft Geckoboard Birst Business Objects) Some of the important Business Intelligence tools are discussed in the following paragraphs. Tableau specialises in making beautiful visualisations. It is a very advanced tool and one of the early entrants in the market. It is considered to be the industry leaders among BI Tools. Tableau`s platform certainly live up to the hype. In addition to easy –to –use Reporting, a full API, Report Sharing and good customer support are the plus points in support of its purchase. But it is quite costly and affordable for corporate business houses. That`s why much of their advertising is focussed on corporate environments with data engineers and bigger budgets. Tableau has a public (free) version but with limited capabilities. The more you pay the more you can access with Tableau including benchmarked data from third parties. Tableau allows to pull data from an array of cloud software and data preparation sources, including but not limited to Excel, SQL and Oracle. InsightSquared is built specifically to help with Sales, Marketing, Finance, Staffing and Support Services. It is perfect for detailed insights about specific aspects of a company`s operations. In addition to data analysis and reports InsightSquared also allows one to get an idea about Closed business deals, Sales success rates of certain salesperson and more. Integrating InsightSquared with Salesforce gives users access to predictions about which prospects are most likely to lead to a sale, which salespeople are likely to win most sales and more. Domo is the first BI Tool to run completely on the cloud. Domo is a quickly growing system that allows the user to access insights about the business from anywhere. Domo offers over 500 data connectors and its own app store allows the user to plug in apps specifically tailored for the industry to which the user`s business belong. Names of few more important BI Softwares are – QlikView, Birst, GoodData, Hubble, Looker, MicroStrategy, BOARD, Microsoft PowerBI, Oracle BI, Logi Analytics, Sisense, Alteryx etc. III. Tools for Financial Data Analytics Harnessing the power of technology, Financial Analytics tools are transforming business operations and planning. Financial Analytics, a subset of Business Intelligence (BI) and Enterprise Performance Management (EPM), has become indispensible in the modern business environment. Using financial analytics tools companies can examine huge volumes of financial and other data to identify patterns and predict trends, boost revenues and slash costs. Financial Analytics involves the use of software applications to examine all the data generated by an organisation. This includes Financial and Accounting information as well as other data like capturing customer interactions, monitoring supply chains, deep diving into historical trends and more. As the complexity of business world has increased, companies need to deploy more sophisticated tools to understand what is going on and to plan. Financial Analytics improves on traditional financial analysis 732 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics because of its ability to capture larger volumes of data as well as to drill down to a highly detailed level. Financial Analytics does not only reveal historical trends, but using Predictive Analytics it can peer into the future. Answers to the following critical issues can be provided by the Financial Analytics tools. ~ What is the profitability of different products or services? ~ What are the most valuable customer segments? ~ What risks are the business currently facing and what are the potential future challenges? ~ How are different sales channels performing? ~ What future events could impact stock price performance? Types of Financial Analytics Financial Analytics can provide business leaders a better understanding of business processes and drivers. This grants them the ability to monitor and improve business operations and to plan effectively for future performance. Different types of Financial Analytics are – Predictive Sales Analysis Product Profitability Analytics Value Driver Analytics Financial Ratio Analytics Scenario and Sensitivity Analysis Growth Rate Analysis Cash Flow Analysis Variance Analysis Predictive Sales Analysis: This enables an organisation to assess and adjust Sales Forecasts more rapidly than when using Excel based models. Predictive Sales Analysis can incorporate many more data points to predict future sales because of the ability to drill down deeper, for example, into the performance of different sales channels and different product lines. Product Profitability Analytics: A company`s overall profit is the money left at the end of an accounting period after subtracting total costs from the total revenue. Product Profitability Analysis is used to determine the profitability of each product or service. This is an invaluable insight for decision makers as the profitability of different product lines can be masked by factors such as high sales volume or high revenue growth. Product revenues are stripped off all associated costs – production costs, logistics, discounts, rebates, marketing and commissions – to reveal the profit derived from each product. Business leaders need to know how to allocate their resources in the most effective way – finding out which products are most profitable is critical. Value Driver Analytics: In business Value Driver are the elements that increase the worth of a product, service, asset or business. In the case of a product it could be a differentiating capability that makes the product a “Must have” for customers. For a business it could be a trusted staff or a customer base which increase the value of the business. The Institute of Cost Accountants of India 733 Strategic Cost Management Most businesses have identified their main value drivers as part of their strategy development. What value driver analytics provides is a way to quantify, measure and assess these drivers using transparent financial metrics. Financial Ratio Analytics: Ratio Analysis is simply the analysis of different financial metrics in a business` financial statements. Financial Ratio Analysis compares the relationship or ratio between two or more financial items. Financial ratios are used in six main areas – Liquidity, Coverage, Solvency, Profitability, Efficiency and Market Prospects. Analytics allows us not only to see different ratios more quickly and accurately than in the past but also let Data Analysts visualise this data in more understandable formats, providing managers with the evidence they need to make informed decisions. Scenario and Sensitivity Analysis: Business leaders make business decisions based on an analysis of risks and benefits. Any decision is based on a set of assumptions – i,e market and economy. Scenario and Sensitivity analysis model these assumptions and manipulate them to show the impact of different decisions or different conditions. Scenario Analysis examines the results of all the assumptions – while Sensitivity Analysis examines the impact of changing a single variable. This can include macro or micro economic changes. The impact of pricing and volume changes can also be examined. Analytics takes this to another level by allowing for the development of more complex models that can be updated more rapidly to reveal which factors the business is most sensitive to – such as inflation or customer confidence. Growth Rate Analysis: Growth Rates are the most common financial metrics used in business. Simply put, growth rates are the percentage change of a specific variable within a given time period. They are used by business executives to track performance. Businesses typically track revenue and profitability growth, sales and cost evolution and so on. Investors and financial analysts typically use growth rate analysis to assess potential investments including revenues and earnings, price to earnings ratio (P/E), price to earnings to growth ratio (PEG) and return on equity (ROE). Cash flow Analysis: Cash is most important resource in business. Being able to determine at any point in time how much cash a company has or will have in a specific future period is essential. Tracking cash flow is perhaps the most important function performed by the finance department. Cash flow analysis reveals how much money is coming into the business from sales, services or investments or how much money is going out in the form of costs and interest payments. Being able to control and predict these flows determines the viability of any business. Cash flow is divided into – (1) Cash flow from operating activities, (2) Cash flow from investing activities and (3) Cash flow from financing activities. Variance Analysis: Variance Analysis is an important aspect of budgeting activities. At the most basic level, we use it to 734 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics compare actual performance to what was predicted in company forecasts or budgets. In other words it compares actual results with planned one. Variance Analysis can highlight various issues. Calculating and analysing variances can identify areas where management needs to put more focus – for example in optimising costs or increasing operational efficiency. It can also point out the flaws in the budgeting or planning process if there is significant or persistant variance between the actual and planned. Data analytics is a thrilling development in financial analysis because many of these analytics can now be performed much more quickly and accurately. Business Intelligence and Automation Platform tools now help us to extract, organise and examine data from different sources and systems. This provides invaluable support to the company executives – allowing them to make faster, more informed and better decisions. Financial Analytics Tools These are mainly BI Tools with option for Financial Analytics. Some of the BI applications with specific modules for Financial Analytics are given below. Jedox – Salient features of this tool are as follows. ~ SAAS based solution Business Intelligence within seamlessly integrated software platform. ~ Deployable in the cloud as well as on-premise server or hybrid ~ Facilitates business planning, reporting, data consolidation and financial data analytics ~ Offers Excel add-in enabling data modelling, ad-hoc and predictive analytics, KPI reporting, budgeting, forecasting activities within the familiar Excel environment. Zoho Analytics – Following are the salient features of this tool ~ BI and Analytics platform ~ Deployable in the cloud as well as on-premises server ~ Can be used across various functional areas and by a wide range of users for their reporting and analytics needs ~ Allows users to easily create and share powerful reports within minutes ~ Current version of Zoho Analytics has additional features like Zoho DataPrep,, a Unified Data Management and Analytics Platform, Deep Augmented capabilities like Zia insights and Conversational BI, Data Story telling features Quickbooks – This has the salient features given as – ~ One of the most feature rich accounting solutions for small businesses ~ User friendly and accessible platform, providing all the tools needed for day to day financial management. ~ Ability to track and record business information at high level of detail – providing treasure trove of “Small” Data. ~ Can generate reports but requires integration with other data analysis/ reporting tools to provide more advanced Financial Analytics. Few other tools for Financial Data Analytics are Hyper Anna, NetSuite, FICO etc. IV. Programming Languages Data scientists use several Programming Languages in their work. Programming allows the creation of The Institute of Cost Accountants of India 735 Strategic Cost Management specific analytical solutions which may not be available in other ready to use software packages. Some of the most popular languages for this purpose are – R Programming, Python, SAS etc. R Programming : R is a open source programming language that is widely used as a statistical software and data analysis tool. It generally comes with command line interface. R is available across widely used platforms like Windows, Linux, Mac-OS. It is the latest cutting edge tool. Due to its expressive syntax and easy-to-use interface, it has grown in popularity in recent years. In fact this is one of the most popular languages amongst statisticians, data analysts, researchers and marketers to retrieve, clean, analyse, visualize and present data. It was designed by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand and currently developed by the R Development Core team. R programming language is an implementation of the S programming language. The project was conceived in 1992, with an initial version released in 1995 and a stable beta version in 2000. Reasons for using R The figure below is meant for showing the reasons of using R. The points mentioned are discussed briefly in the paragraphs next. Figure 15.2: Reasons for using R ~ It`s an Open Source language. It can be installed for free because it is licensed under the GNU General Public License. There are many R Packages available under the same license. So there is no need to pay any license fee for the usage of any of these packages – even for commercial applications. ~ R Programming is Platform Independent language. Actually it is compatible with all the popularly used Operating Systems like Windows, Linux and Mac-OS. R code written on one platform can easily be ported to another without any issues. Cross platform interoperability is an important feature to have in today`s computing world – even Microsoft is making its coveted.NET Platform available on all platforms after realising the benefits of technology that runs on all platforms. 736 The Institute of Cost Accountants of India Introduction to Tools for Data Analytics ~ It is used as a leading tool for Machine Learning, Statistics and Data Analysis. R can easily create objects, functions and packages based on user`s requirements. ~ It allows the user to integrate it with other languages like C, C++ etc. As a result, easy interaction with many data sources and statistical packages is possible. ~ R is currently the most wanted programming language in the Data Science jobs. Hence it is considered to have the hottest trend in the job market. ~ The R Programming language has a vast community of users and it`s growing day by day. It is being used by the biggest tech giants which is a sign of the potential of the language. R has the right mix of power and simplicity. Here are a few ways industry stalwarts are using R and contributing to the R ecosystem. COMPANY APPLICATION / CONTRIBUTION Twitter Monitor user experience Ford Analyse social media to support design decisions for the cars. New York Times Info-graphics, Data journalism Microsoft Released Microsoft R Open, an enhanced R distribution and Microsoft R Server after acquiring Revolution Analytics. Human Rights Data Analysis Group Measure the impact of war Google Created the R style guide for the R user community within Google ~ R is currently the most wanted programming language in the Data Science jobs. Hence it is considered to have the hottest trend in the job market. Applications of R Programming in the real world 1. Data Science – With the advent of “Internet of things” (IoT) devices creating terabytes and terabytes of data that can be used to make better decisions, Data Science is a field that has no other way but to go up. Simply explained, a data scientist is a statistician with an extra asset – computer programming skills. Programming languages like R give a data scientist superpowers that allow them to collect data in real time, perform statistical and predictive analysis, create visualisation and communicate actionable results to the stakeholders. 2. Statistical Computing – R is the most popular programming language among statisticians. In fact it was initially built by statisticians for carry work related to statistical data. It has a rich package repository with over 9000 packages having every statistical function one can think of. R`s expressive syntax allows researchers – even those from non-computer science backgrounds to quickly import, clean and analyse data from various data sources. R also has charting capabilities which means one can plot the data and create interesting visualisations from any dataset. 3. Machine Learning – R has found a lot of use in predictive analytics and machine learning. It has various packages for common ML tasks like linear and non linear regression, decision trees, linear and non-linear classification and many more. Everyone from machine learning enthusiasts to researchers use R to implement machine learning algorithms in fields like finance, genetics research, retail, marketing and health care. The Institute of Cost Accountants of India 737 Strategic Cost Management Features of R Programming language 1. Statistical features of R:- ~ Basic Statistics – Most common terms of basic statistics are Mean, Median and Mode which are the Measures of Central Tendency for a dataset. These can be very easily computed using R. ~ Static Graphics – R is rich with facilities for creating and developing interesting Static Graphics. R contains functionality for many plot types including graphic maps, mosaic plots, bi-plots and the list goes on. ~ Probability Distributions – Probability Distributions play vital role in statistics. By using R various types of problems related to probability distributions (such as Binomial Distribution, Normal Distribution, Student`s t Distribution, Chi Square Distribution etc.) can be handled very easily. ~ Data Analysis – It provides a large, coherent and integrated collection of tools for data analysis. 2. Programming features of R:- ~ R Packages – One of the major features of R is the fact that it has a wide availability of libraries. R has CRAN (Comprehensive R Archive Network) which is repository holding more than 10,000 packages. ~ Distributed Computing – Distributed computing is a model in which components of a software system are shared among multiple computers to improve efficiency and performance. Packages like ddR and multidplyr are used for distributed programming in R. Programming in R Since R is much similar to other widely used languages syntactically, it is easier to code and learn in R. Programs can be written in R in any of the widely used IDE like R Studio, Rattle, Tinn-R etc. After writing the program, the file needs to be saved with the extension.r To run the program following command has to be used in the command line. R file_name.r Some examples of programs in R language are given as follows. Example Write R program to generate 10,000 numbers in a Random distribution, organise them based on the frequency and create a Bar Chart. Solution: Following three lines of code need to be written for the purpose n

Use Quizgecko on...
Browser
Browser