SCOM Ultimate Troubleshooting Guide
This page is a community-driven list of troubleshooting guides, tips and tricks and other useful articles covered on the TechNet Wiki.
Introduction
When troubleshooting an issue System Center Operations Manager (SCOM), there are some important steps that should be done before start reviewing data and performing troubleshooting itself.
We can summarize the troubleshooting process in the following six core phases:
- Identify the problem.
- Establish a theory of probable cause.
- Test the theory to determine the cause.
- Establish a plan of action to resolve the problem and implement the solution.
- Verify the full system functionality and if applicable implement preventative measures.
- Document findings, actions and outcomes.
SCOM Components
In order to have a better experience troubleshooting SCOM, it is recommended that you have a brief understanding of the SCOM components.
- Management server
- Gateway server
- Web console server
- Reporting server
- Operational database
- Data warehouse database
- ACS collector
- ACS database
- ACS forwarder
Troubleshooting
1. Installation Issues
This section describes some of the common SCOM installation issues.
1.1 SCOM Component Installation
- Management server installation
- Gateway server installation
- Database & data warehouse database installation
- Web Console installation
- Console installation
1.2 Agent Installation
2. Configuration Issues
This section describes some of the common configuration issues in SCOM.
2.1 Database / Data warehouse Configuration Issues
- Database / Data warehouse database low disk space
- Database / Data warehouse database slow performance
3. Discovery Issues
This section describes some of the common discovery issues when trying to deploy SCOM agents to either Windows, UNIX or Linux operating systems.
3.1 Server / Client discovery
- Windows agent discovery
- UNIX / Linux agent discovery
4. DMZ / Workgroup Issues
This section describes some of the common issues when trying to set up out-of-band monitoring with SCOM, by using a Gateway server and/or certificates.
4.1 Connectivity Issues
- Gateway server
- Agent server
4.2 Certificate Issues
- Gateway server certificate
- Agent server certificate
5. Management Pack Issues
This section describes some of the common management pack issues.
- Importing management packs
- Downloading management packs
- Creating management packs
Best Practices
- SCOM Best Practices
- System Center Operations Manager (SCOM) Management Group Performance Optimizations
Tools
- MP Viewer
- Override Explorer
- Alert Update Connector
- DWDataRP
- SCOM DataWarehouse Grooming Settings tool (GUI)
- MP Event Analyzer
- Maintenance Mode Scheduler
- Visual Studio Authoring Extensions (VSAE)