Device Crashes Test
In the context of a Cisco SD-WAN overlay network, port hopping is the process by which devices try different ports when attempting to establish connections with each other, in the event that a connection attempt on the first port fails.
If a controller component (Cisco vBond Orchestrator, Cisco vManage, or Cisco vSmart Controller) goes down or crashes while the vEdge routers remain up, then when that controller comes back up, the connection between it and a vEdge router might shut down and restart. This behavior occurs because of port hopping. When one device loses its control connection to another device, it port hops to another port in an attempt to re-establish the connection. For example, when Cisco vBond Orchestrator crashes, Cisco vManage might lose or close all its control connections. Cisco vManage then port hops to try to establish connections to the Cisco vSmart Controllers on a different port. This port hopping on Cisco vManage shuts down and then restarts all its control connections, including those to the vEdge routers.
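The port hopping behavior described above can be sketched roughly as follows. This is only an illustration of the idea (try the first port, then move to the next one on failure), not how Cisco SD-WAN implements it. The function name `connect_with_port_hopping`, the `try_connect` callback, and the port numbers are all assumptions made for the example.

```python
from typing import Callable, Iterable, Optional


def connect_with_port_hopping(
    candidate_ports: Iterable[int],
    try_connect: Callable[[int], bool],
) -> Optional[int]:
    """Return the first port on which try_connect succeeds, or None.

    try_connect is a hypothetical callback that attempts a control
    connection on the given port and reports success or failure.
    """
    for port in candidate_ports:
        if try_connect(port):
            return port  # connection established, stop hopping
    return None  # every candidate port failed


# Illustrative values only: pretend that just one port accepts connections.
reachable_ports = {12366}
chosen = connect_with_port_hopping(
    [12346, 12366, 12386],
    lambda port: port in reachable_ports,
)
print(chosen)
```

Each failed attempt and each hop implies a closed and re-opened connection, which is why frequent crashes translate into frequent connection restarts.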
If a component crashes frequently, its connections will also be closed and re-established frequently. This may affect the performance of the entire Cisco SD-WAN overlay network. To avoid such performance degradation, administrators need to know how often each component of the Cisco SD-WAN appliance crashes. The Device Crashes test helps administrators in this regard.
This test reports the number of times each component of the target Cisco SD-WAN appliance crashed. The detailed diagnosis of this test helps administrators determine when each crash occurred and the name of the crash file. Using this test, administrators can analyze the reason behind such crashes and rectify issues before end users start complaining of frequent failures on the overlay network.
Target of the test : Cisco SD-WAN
Agent deploying the test : An external agent
Outputs of the test : One set of results for each component of the target Cisco SD-WAN that is being monitored.
Parameter | Description |
---|---|
Test period | How often the test should be executed. |
Host | The IP address of the host for which this test is to be configured. |
Port | Specify the port at which the specified Host listens. By default, this is NULL. |
Username | By default, the eG agent executes REST APIs on the target appliance to collect the required metrics. For the eG agent to execute the REST APIs, a special user with monitoring privilege is required. Specify the name of such a user in this text box. |
Password | Specify the password corresponding to the user mentioned in the Username text box here. |
Confirm Password | Confirm the Password by retyping it here. |
REST API Port | By default, the REST API Port is set to none. This implies that by default, this test connects to the default port to execute the REST API commands and collect the required metrics. In some environments, however, a different port can be used to execute the REST API commands. In such a case, specify that port number against the REST API Port text box. |
SSL | The eG agent collects performance metrics by executing REST APIs on the target appliance. Typically, the REST APIs can be invoked through HTTP or HTTPS mode. By default, the eG agent invokes REST APIs using the HTTPS mode. This is why the SSL flag is set to Yes by default. If the target appliance is not SSL-enabled, then the REST APIs can be accessed through HTTP mode only. In this case, set the SSL flag to No. |
Detailed Diagnosis | To make diagnosis more efficient and accurate, eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option. The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled: |
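As a rough illustration of how the Host, REST API Port, and SSL parameters fit together, the sketch below builds the base URL that a REST client might use. It is not the eG agent's actual logic. The function name `build_rest_base_url` is hypothetical, and leaving the port out of the URL when REST API Port is none is an assumption about what "connects to the default port" means.

```python
from typing import Optional


def build_rest_base_url(
    host: str,
    rest_api_port: Optional[int] = None,
    ssl: bool = True,
) -> str:
    """Combine Host, REST API Port and SSL into a base URL (illustrative)."""
    # SSL = Yes means HTTPS; SSL = No means plain HTTP.
    scheme = "https" if ssl else "http"
    # REST API Port = none: omit the port so the scheme's default applies
    # (an assumption made for this sketch).
    if rest_api_port is None:
        return f"{scheme}://{host}"
    return f"{scheme}://{host}:{rest_api_port}"


# 192.0.2.10 is a documentation-only address used here as a placeholder.
print(build_rest_base_url("192.0.2.10"))
print(build_rest_base_url("192.0.2.10", rest_api_port=8443))
print(build_rest_base_url("192.0.2.10", ssl=False))
```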
Measurement | Description | Measurement Unit | Interpretation |
---|---|---|---|
Total crashes | Indicates the number of times this component crashed. | Number | The detailed diagnosis of this test reports the crash index, crash time and crash file. |
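To make the relationship between the Total crashes measure and its detailed diagnosis concrete, here is a minimal sketch that derives a per-component crash count from a list of crash records. The record layout (the `component`, `crash_index`, `crash_time`, and `crash_file` keys) and the sample values are invented for illustration. The actual data returned by the appliance may be shaped differently.

```python
from collections import Counter
from typing import Dict, Iterable, Mapping


def total_crashes_per_component(
    records: Iterable[Mapping[str, str]],
) -> Dict[str, int]:
    """Count crash records per component (hypothetical record layout)."""
    return dict(Counter(record["component"] for record in records))


# Made-up sample records, mirroring the crash index, crash time and
# crash file columns mentioned in the detailed diagnosis.
sample_records = [
    {"component": "vmanage-1", "crash_index": "0",
     "crash_time": "2024-01-05 10:12:00", "crash_file": "core.0"},
    {"component": "vmanage-1", "crash_index": "1",
     "crash_time": "2024-01-06 03:40:00", "crash_file": "core.1"},
    {"component": "vsmart-1", "crash_index": "0",
     "crash_time": "2024-01-06 04:02:00", "crash_file": "core.0"},
]

print(total_crashes_per_component(sample_records))
```

A component whose count keeps rising between measurement periods is the one most likely to be triggering repeated connection restarts.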