ITF 1-5 text.txt
Document Details
Uploaded by RestfulCanyon
Full Transcript
Objectives On completion of this unit, you will be able to: □ Describe basic support and troubleshooting procedures. □ Use websites and tools to obtain support and search for advice and help. Syllabus Objectives and Content Examples This unit covers the following exam domain objectives and cont...
Objectives On completion of this unit, you will be able to: □ Describe basic support and troubleshooting procedures. □ Use websites and tools to obtain support and search for advice and help. Syllabus Objectives and Content Examples This unit covers the following exam domain objectives and content examples: □ 1.6 Explain the troubleshooting methodology. Identify the problem (Gather information, Duplicate the problem, if possible, Question users, Identify symptoms, Determine if anything has changed, Approach multiple problems individually) Research knowledge base/Internet, if applicable Establish a theory of probable cause (Question the obvious, Consider multiple approaches, Divide and conquer) Test the theory to determine the cause (Once the theory is confirmed [confirmed root cause], determine the next steps to resolve the problem, If the theory is not confirmed, establish a new theory or escalate) Establish a plan of action to resolve the problem and identify potential effects Implement the solution or escalate as necessary Verify full system functionality and, if applicable, implement preventive measures Document findings/lessons learned, actions and outcomes Support and Troubleshooting To some extent, being an effective troubleshooter simply involves having a detailed knowledge of how something is supposed to work and of the sort of things that typically go wrong. However, the more complex a system is, the less likely it is that this sort of information will be at hand, so it is important to develop general troubleshooting skills to approach new and unexpected situations confidently. Troubleshooting is a process of problem solving. It is important to realize that problems have causes, symptoms, and consequences. For example: A computer system has a fault in the hard disk drive (cause). Because the disk drive is faulty, the computer is displaying a "bluescreen" (symptom). Because of the loss of service, the user cannot do any work (consequence). From a business point-of-view, resolving the consequences of the problem is often more important than solving the original cause. For example, the most effective solution might be to provide the user with another workstation, then get the drive replaced. CompTIA Troubleshooting Model Make sure that you know the order of the steps in CompTIA's troubleshooting model: 1) Identify the problem. ○ Gather information. ○ Duplicate the problem, if possible. ○ Question users. ○ Identify symptoms. ○ Determine if anything has changed. ○ Approach multiple problems individually. 2) Research knowledge base/Internet, if applicable. 3) Establish a theory of probable cause. ○ Question the obvious. ○ Consider multiple approaches. ○ Divide and conquer. 4) Test the theory to determine cause. ○ Once the theory is confirmed (confirmed root cause), determine the next steps to resolve the problem. ○ If the theory is not confirmed, establish a new theory or escalate. 5) Establish a plan of action to resolve the problem and identify potential effects. 6) Implement the solution or escalate as necessary. 7) Verify full system functionality and, if applicable, implement preventive measures. 8) Document findings/lessons learned, actions and outcomes. These steps and the approach and attitude you should apply when troubleshooting are explained in a bit more detail below. A methodical process is the ideal, but troubleshooting in help desk and IT support departments is often a time-critical process. In the real world, you often have to balance being methodical with being efficient. Identifying the Problem It is important that when you encounter a troubleshooting situation, you approach it logically. The first stage in the troubleshooting process is to identify the problem. This stage involves a number of techniques. Question Users and Determine Changes When a user first discovers a problem and reports it to the help desk, you will attempt to classify the problem in terms of the problem’s nature and scope. Most of your troubleshooting information will have to be obtained by questioning the user. You must be patient and address your questions and any test actions you ask them to perform to the user's level of expertise and knowledge. Be polite, avoid blaming the user for causing the problem, and help them to help you. You will probably start by asking the user to describe the symptoms of the problem and the status of the computer, including any error messages or unusual conditions. During a discussion with the user, you should also ask: How many people are affected by the problem? This helps determine the severity of a problem. Clearly, if all users are affected, that is more serious than if only one user is experiencing the problem. When did the problem first occur? This could help you identify possible causes of the problem. For example, if the problem occurred on Monday morning during sign in, then that might indicate that the user’s password expired over the weekend. The user might tell you that the problem has been intermittent for the last three weeks, but that it suddenly got worse. Either of these responses might help you focus your troubleshooting. What might have changed? This is a crucial question because often, problems occur after something changed. Sometimes, the thing that changed might seem totally unrelated. For example, someone experiences a problem right after some desks were moved. This might indicate a cabling problem that has arisen after a cable was broken or unplugged. The change might be a configuration change on your network or on the specific user’s computer. Rolling back that change might resolve the issue. Duplicate the Problem and Identify Symptoms It is very helpful to be able to observe the issue as it occurs. You might be able to do this via remote desktop or by visiting the user, but if these are not practical you could test whether the problem can be repeated on a lab system or virtual machine. If the problem cannot be duplicated on a reference system, this points to some issue with the user's local environment. You can also ask the user to describe symptoms, such as error messages appearing on the screen, or ask them to navigate to the relevant log file and report on its contents. Approach Multiple Problems Individually When you start to investigate symptoms, you might discover symptoms of more than one problem. Perhaps a user has reported that a machine has lost Internet connectivity, and you discover that it has also not been receiving maintenance updates. The issues could be related, or one might be incidental to the other. If the problems do not seem to be related, treat each issue as a separate case. If they seem to be related, check for outstanding support or maintenance tickets that might indicate existing problems. It may also be the case that a user reports two different problems at the same time (often preceded by, "While you're on the line…" sort of statements). Treat each problem as a separate case. In most cases you should advise the user to initiate a separate support ticket. Gather Information and Research Symptoms Your main source of information about a problem is likely to be the user reporting it, but if this is insufficient to successfully troubleshoot, you may have to consider other sources. Use a remote desktop tool to access the system or travel to the user to observe it in operation. View system, application, or network log files. Monitor other support requests to identify similar problems. Once you have gathered sufficient information it is possible that you are able to learn enough to resolve the problem. This is likely with simple issues, such as password lockouts. If not, during this classification stage, you will document the problem in your ticket system and provide as much background information as you are able to determine. This will help you, or a colleague to whom you might escalate the issue, resolve the problem. If you do not recognize the problem, use a product Knowledge Base or a web/Internet search tool to research the symptoms you have identified. Using support resources and web searches is discussed in more detail later in this unit. Understanding the Problem After you have gathered sufficient information about a reported problem, you must start to determine a theory of probable cause from analysis of the symptoms. This is a process of thinking about possible causes then eliminating those possible causes through testing to arrive at the root cause. Ultimately, the purpose is to complete the testing stage with a single probable cause, enabling you to search for and implement a resolution. Question the Obvious If you can prove that there is no inherent fault (perhaps by failing to duplicate the problem on a reference system), make sure that the system is set up and being operated in the correct way. Step through the process of using the system or application making sure that you verify even the simplest steps by questioning the obvious, best illustrated by the age-old "Is it switched on?" question. Divide and Conquer Often, a testing process might consist of a divide and conquer approach. This means that you try to envisage different problem areas. In computing, this might be like making the distinction between a workstation problem, a server problem, a storage problem, or a network problem. You then devise a testing routine to eliminate one or more categories of problem. This helps you more quickly identify probable causes. For example, if you suspect a physical cabling problem, try plugging the device into a different wall socket. If that makes no difference, the problem lies elsewhere. If you think about a typical automotive problem – your car won’t start – you might go through a testing process yourself. You might start by listing possible causes, and then eliminating them one by one. You check that there’s fuel in the tank, so you move on. You verify, by turning the ignition, that the starter motor turns, so the battery is good and so is the starter. You progress through a series of tests to eliminate those possibilities that are not the cause of the symptoms you have recorded. With luck, you end up with a single possibility which you can then seek to resolve. Bear in mind that although there is fuel in your tank, it might be contaminated or unable to flow to the engine. So, you might need to perform several tests for a given possible cause. This process also works with computer-related problems. Think back to the cabling example. You plugged the computer into a different socket, but perhaps the cabling issue lies at the wiring closet end? Don’t jump to conclusions. Sometimes, reported symptoms can be identical for several different causes. If you jump to a conclusion, you might waste valuable time attempting to resolve the wrong issue. You will also frustrate your users by taking longer to fix their problem. For example, if the light in your kitchen does not come on, you could assume it’s a problem with the bulb when the cause could also lie with the light fitting, the fuse box, or even the electrical supply to your house. They all have the same symptom of your kitchen light not coming on when you flick the switch. Before you head out to the store to buy a new bulb, run simple tests to eliminate those other possible causes, by flicking on another light or running an appliance for instance. Consider Multiple Approaches If one troubleshooting method does not yield results, be prepared to try something different. For example, you might start by a process of questioning the obvious and step through the operating procedure but switch to a divide and conquer approach of testing each component in the process in isolation. You might also suggest workarounds. A workaround doesn't actually solve the reported problem but provides a way for the user to continue to work with the system. That way you can deal with the consequences of the problem quickly and give yourself more time to investigate the underlying issue. Test the Theory and Escalate the Problem As you devise different theories of cause, you will naturally also be testing them to see if they fit the facts. While "testing" follows "establishing" in the methodology, the process is iterative (establish a theory, test it, if it doesn't work, establish another theory). You might discover that the immediate cause of a problem is itself just the symptom of a wider problem with a different cause. The end result of this process, therefore, is to establish a root cause for the problem. If you are unable to resolve a problem during the initial phases, you might need to escalate the problem. This largely depends on how your help desk organization is structured. Most help desks use a tiered system, with Tier 1 staff performing the classification and some of the testing stages. If a problem cannot be resolved quickly, then the problem is recorded and escalated to Tier 2. Staff at this level have more experience and are allocated more time to work on these more complex problems. However, for even more difficult problems, Tier 2 staff can escalate to Tier 3. These are IT specialists who have many years of experience in troubleshooting. They might also be skilled in specific technologies, such as email, networking, cloud platforms, and so on. In the unlikely situation where a Tier 3 specialist is unable to resolve an issue, it can be escalated outside your organization to a manufacturer. When testing your theory of probable cause, it is important not to make actual changes to the production system. If you make uncontrolled changes, reversing what you’ve done might be very difficult and you could cause worse problems than were originally reported. Resolving and Documenting the Problem The result of the identifying and understanding phases of troubleshooting is a diagnosis of the root cause of the reported problem. The final phase of troubleshooting is to establish a plan of action to eliminate the root cause without destabilizing some other part of the system. Establish a Plan of Action There are typically three solutions to any problem. Repair—you need to determine whether the cost of repair/time taken to reconfigure something makes this the best option. Replace—often more expensive and may be time-consuming if a part is not available. There may also be an opportunity to upgrade the device or software. A basic technique when troubleshooting a cable, connector, or device is to have a "known good" duplicate on hand (that is, another copy of the same cable or device that you know works) and to test by substitution. Ignore—as any software developer will tell you, not all problems are critical. If neither repair nor replace is cost-effective, it may be best either to find a workaround or just to document the issue and move on. When you consider solutions, you have to assess the cost and time required. Another consideration is potential effects on the rest of the system. A typical example is applying a software patch, which might fix a given problem in one piece of software but cause other programs not to work. This is where an effective configuration management system comes into play, as it should help you to understand how different systems are interconnected and cause you to seek the proper authorization for your plan. Implement the Solution Your plan of action should contain the detailed steps and resources required to implement the solution. As well as these practical steps, you have to consider the issue of authorization. If you do not have the authority to implement a solution, you will need to escalate the problem to more senior personnel. If applying the solution is disruptive to the wider network, you also need to consider the most appropriate time to schedule the reconfiguration work and plan how to notify other network users. When you make a change to the system as part of implementing a solution, test after each change. If the change does not fix the problem, reverse it and then try something else. If you make a series of changes without recording what you have done, you could find yourself in a tricky position. Verify Full System Functionality and Implement Preventive Measures When you apply a solution, validate that it fixes the reported problem and that the system as a whole continues to function normally (that is, identify the results and effects of the solution). Ensure that you were right and that the problem is resolved. Can the user now log in properly? Is there any way you can induce the problem again? Before you can consider a problem closed, you should both be satisfied in your own mind that you have resolved it and get the customer's acceptance that it has been fixed. Restate what the problem was and how it was resolved then confirm with the customer that the incident log can be closed. To fully solve the root cause of a problem, you should try to eliminate any factors that may cause the problem to recur. For example, if a user plugs their laptop into the wrong network jack, ensure that the jacks are clearly labeled to help users in the future. If a faulty server induces hours of network downtime, consider implementing failover services to minimize the impact of the next incident. Document Findings, Actions, and Outcomes All the way through the preceding steps, it is important that information about the problem, tests performed, and attempted resolutions are recorded. That way, when a problem is resolved, a complete record exists documenting the symptoms, possible causes investigated, and the ultimate resolution. This information can be very helpful when troubleshooting similar symptoms on subsequently reported problems and provides the foundation of a knowledge base for your help desk. Most troubleshooting takes place within the context of a ticket system. This shows who is responsible for any particular problem and what its status is. This gives you the opportunity to add a complete description of the problem and its solution (findings, actions, and outcomes). This is massively useful for future troubleshooting, as problems fitting into the same category can be reviewed to see if the same solution applies. It also helps to analyze IT infrastructure by gathering statistics on what type of problems occur and how frequently. When you complete a problem log, remember that people other than you may come to rely on it. Also, logs may be presented to customers as proof of troubleshooting activity. Write clearly and concisely, checking for spelling and grammar errors. Troubleshooting PC Issues If you are troubleshooting a computer that will not start or a peripheral device that will not work, first inspect the component for physical damage. Look for dents, scratches, or cracks that might show a device has been dropped or banged. This might have caused damage to the internal components. Inspect cables and connectors for signs of wear and dirt. Inspect the ports on the computer case for dirt and damage. When you start a computer, it automatically runs a program stored in firmware called the Power-On Self-Test (POST). The POST routine ensures that all the components required to start the computer are present. If the tests complete successfully, the computer may issue a single beep. Not many computers beep these days, so do not be worried if the computer boots silently. You should also be able to see images on the screen. Otherwise, try to isolate the issue using the following tests: No beep—check whether the power light has come on and whether the disk light is flickering and whether there is an image on the screen. You should also be able to hear some disk activity and the whir of fans inside the PC. If you can detect none of these things, there is a power problem. Check the power cable and fuse. If these are OK, then the problem is either with the computer's internal power supply or the electrical outlet (try plugging in a lamp to test). Solid green lights (LEDs) generally show that something is switched on and working. A flickering LED is usually a sign that something is working too (for example, a disk or network activity LED). A flashing or blinking LED or an LED colored other than green can be a sign that there is a problem. Check the manual to understand what the LED signifies. More than one beep—the beeps specify where the problem is (and there may be an error message on the screen), but you will probably need to get help to diagnose and fix it. Do check that nothing is resting on the keyboard; if a key is pushed down it can cause this type of error. Screen is dark—check that the monitor is plugged in and switched on, that the power cable and fuse are good, and that the cable from the monitor to the computer is properly connected. If you see a message such as "No sync," the cable is probably disconnected or damaged. If you can see a very dim image, check that the brightness control hasn't been turned all the way down. If there are no problems during POST, the firmware then passes control of the computer to the operating system, which finishes loading before displaying the logon prompt. If the operating system fails to load, there should be an error message. The error message should help to diagnose the cause of the boot failure. If the system boots but a peripheral device does not work, first check for loose connections between the device and the computer. If you can discount physical problems, the device's driver might need updating or replacing. If a computer crashes during operation and stops responding to any mouse clicks or keyboard presses, there could be a fault in either hardware or software. These issues can be difficult to diagnose, but do check that the computer is not becoming too hot. If the computer overheats, it can stop working suddenly, and overheating can be a relatively common occurrence. Getting Support When troubleshooting a system, it is important not to attempt solutions that may be beyond your experience or expertise. You could easily cause more damage by "experimenting." In the first instance, follow the basic guidelines for troubleshooting specific problems in the manufacturer's documentation. The setup or maintenance guide should be included with a new PC system, though often it is available as a PDF rather than printed copy. You can also use the vendor's website to look for the documentation; typically, you will need to use the product code to find this. For example, on HP's website, click the link for "Support" then choose the "Drivers and Downloads" option. Enter the product name or code to view the available files. This will include drivers for hardware components and system guides and manuals. Note that the site also hosts online troubleshooting guides and product support forums, where you can post questions to HP employees and other system owners. Windows Help and Tips If you have a software problem rather than a computer one, you can access online help for Windows by pressing F1. In Windows 10, you can also use the Tips app from Start. Contacting Technical Support If there is a problem with your computer or with a software application, it is likely that you will need help to fix it. You may be able to get help from your business's help desk, from the computer manufacturer, or from the software vendor. You may be able to access support services by phone, by email, or through a website. A lot of support is now done remotely. This means that the operator takes control of your computer to try to diagnose and fix the problem. Most IT support works on the basis of a job ticket. When you contact a support operator, they will open the ticket and ask you to provide a description of the problem. You should provide the following information: Your name and contact information. The software or device you are having trouble with (including its version number, which can usually be found through the Help > About command, or model and serial number). The date you purchased the system (if applicable). A description of the problem, including any error message or number. Be polite and calm when reporting problems. You may feel frustrated or be a bit fearful that you will lose important work, but if you are rude or cannot provide clear information it will just make the problem harder to resolve. The operator should then work with you to try to resolve the problem. If the operator asks you to perform troubleshooting actions, listen carefully to the steps and confirm when you have completed each one. If the operator has to call you back, make sure you get their name and your job ticket number. When you agree that the problem has been solved, the ticket can be closed. Technical Community Groups Formal technical support may not be available to you or a user. A product might be out of warranty and support may be expensive for instance. In this case, you might want to use alternative support methods. Do be aware that these may not be as reliable. If warranty support is available to you, use that in the first instance. As mentioned above, PC and hardware vendors all operate forums to support their products. There are also many community forums not tied to particular vendors but hosting reviews, advice, support, and discussions about laptops, or phones (or pretty much anything else). If you use an Internet search engine to look for the symptoms of the problem you are having, the results are certain to include posts in forums such as these. When posting a problem to a newsgroup or forum, remember that people are responding to you out of goodwill. Pick the appropriate forum and check the Frequently Asked Questions (FAQ) before posting. The FAQ is usually the first post in the forum. Do not "cross-post" problems to multiple forums. Describe the problem accurately, and be patient. Software vendors also maintain forums and support websites. For example, Microsoft's support website support.microsoft.com hosts solution centers for each product. You can access troubleshooting articles and (if applicable) assisted support via phone, email, or chat. More advanced articles plus product documentation and some online books can be found at technet.microsoft.com. The communities site (answers.microsoft.com) features newsgroups, blogs, webcasts, and forums dedicated to different Microsoft products. These resources are generally monitored by Microsoft Most Valued Professional (MVP) volunteers. mvps.org contains links to useful sites maintained by MVPs. There are also numerous other technical community groups, some of which are subscription-based. Some well-known ones include expertsexchange. com, petri.com, and tomshardware.com. Using a Search Engine A search engine is a tool to help locate web pages. A search engine may be designed to search the entire web for pages or to locate pages within a particular site. A search engine usually works by compiling a database of information about web pages. This is often done automatically by software agents called robots or spiders. When users want to perform a search, they enter keywords on the topic they want to read about. The search engine checks these keywords against its database and returns a list of links that are relevant. The user can then click one or more links to go to the web page. The most widely used search engines are Google and Microsoft's Bing, though there are many more. Also, search engines have different sites for different countries. For example, you can access Google at google.co.uk, google.com, google.co.za, google.com.au, and so on. In most browsers, if you type text into the address bar, the browser will convert the text into a search using the default search provider if it does not match an actual web address. You can use browser settings or preferences to change the default search provider. When you construct a search phrase, it helps to use as many words as possible. Do not use common words such as "and" or "the." Using more unusual words will help to limit the number of matches. Search Selection Criteria As you start to build more complex search phrases, you can use special syntax and search engine tools to combine keywords and perform advanced searches. As an alternative to using syntax, you can usually access the search engine's Advanced Search page to specify criteria using a form. This table provides a summary of search syntax: Quotation Marks (" "): Use to find the exact phrase as typed. Example: "Monty Python" Plus Sign (+): Requires certain words in the search results. Example: +snake +Python Minus Sign (-): Excludes specific words from search results. Example: python -Monty OR / Pipe (|): Finds either of the specified words. Example: snake | python Asterisk (*): Acts as a wildcard for unknown words. Example: genius * python Fields: Restricts search to specific fields like URLs or titles. Example: inurl:monty python