Summary

This document provides troubleshooting steps and information for VMware Cloud Foundation (VCF). It details the SOS tool, including its command-line interface, health checks, and log collection features. It also explains how to troubleshoot failed workflows using reference token IDs.

Full Transcript

**VCF Troubleshooting**

- **SOS Tool (Supportability and Serviceability Tool)**
  - Command-line Python tool
  - Can be invoked via API
  - Performs checks and creates log bundles
    - Run health checks
    - Collect logs for VCF components
  - You CANNOT run SOS commands if workflows are in progress in SDDC Manager!
  - Run from SDDC Manager
    - SSH to SDDC Manager
    - `vcf` user account
    - Access root with sudo
    - `sudo /opt/vmware/sddc-support/sos --` plus the option to run
    - Enter password when prompted
  - Health Check Example
    - `--connectivity-health`
    - `--services-health`
    - `--ntp-health`
    - `--dns-health`
    - `--certificate-health`
    - `--password-health`
    - `--get-inventory-info`
    - `--health-check`
  - Password Validity Example
    - `/home/vcf`
    - `/opt/vmware/sddc-support/sos --password-health --skip-known-host-check`
    - Reports component, services user, changed date, expiry date, expiry in days, etc.
- **Collecting Log Files**
  - Collecting for all components in a specific domain (no options)
    - `sudo /opt/vmware/sddc-support/sos --domain-name MGMT`
  - For specific components, use the following options
    - `--api-logs`
    - `--esx-logs`
    - `--nsx-logs`
    - `--psc-logs`
    - `--sddc-manager-logs`
    - `--vc-logs`
    - `--wcp-logs`

**VCF Services and Log Files**

- ID
- Best practices for opening an SR
  - Always open SRs with the VCF product name
  - Start collecting logs BEFORE opening the ticket
  - Use the relevant component
- Try these
  - Reboot SDDC Manager
  - `sddcmanager_restart_services.sh` script
  - `systemctl` to restart individual services
    - `systemctl` with an action and a service name
    - Ex: `systemctl restart domainmanager`
- Key Log Files
  - `/var/log/vmware/vcf` folder in SDDC Manager
    - `/commonsvcs`
    - `/domainmanager`
    - `/lcm`
    - `/operationsmanager`
    - `/sddc-manager-ui-app`
    - `/sddc-support` (SoS utility)

**SDDC Manager Provides the Following Key Services**

- HTML5-based interface
- Lifecycle manager
- Domain manager
- SoS utility
- Network pools
- Inventory

**Using Reference Token IDs to Troubleshoot**

- When something fails, a description of the error and a reference token ID are created for the error

Troubleshooting Recommended Steps

1. Expand the workflow tasks in SDDC Manager and identify the failed subtask
2. Note any errors described in the SDDC Manager UI (simple errors can be resolved and the workflow can be restarted)
3. Record the reference token ID
4. Search for the reference token ID in the log
5. Use the opID in the log to view the entire workflow
6. Restart the task after the root cause is addressed

**Summary: Generate Log Bundle Command Example**

- `/opt/vmware/sddc-support/sos --esx-logs --domain-name sa-wld01 --log-dir /tmp`
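
Steps 4 and 5 of the recommended troubleshooting steps (searching the logs for the reference token ID, then following the opID) could be sketched roughly as below. This is only an illustration, not part of the SOS tool: the function name `find_token_lines` is hypothetical, and the sketch assumes the logs are plain-text files ending in `.log` under the log folder, which may not match every VCF release.

```python
from pathlib import Path


def find_token_lines(log_root, token):
    """Return (file, line number, text) for every log line containing token.

    Assumption: logs are plain-text files ending in ".log" somewhere
    under log_root. Rotated or compressed logs are not searched.
    """
    hits = []
    for path in sorted(Path(log_root).rglob("*.log")):
        # errors="replace" keeps the scan going if a log holds odd bytes
        with path.open(errors="replace") as handle:
            for number, line in enumerate(handle, start=1):
                if token in line:
                    hits.append((str(path), number, line.rstrip("\n")))
    return hits
```

For example, `find_token_lines("/var/log/vmware/vcf", token)` would scan the key log folders listed earlier for the token recorded from the failed task. Because the function matches any substring, the same call can be repeated with the opID found on a matching line to pull together the rest of the workflow.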
