The current state of monitored services and hosts is determined by two components:
- The status of service or host i.e. OK, WARNING, UP, DOWN etc..
- The type of state the service or host is in.
- There are two types of states SOFT states and HARD states.
Now explain what is Soft and Hard states:
When a service or host check results are in a non-OK or non-UP state and the service check has not yet been rechecked the number of times specified by the max_check_attempts directives in the service or host definition. This is called Soft Error. When a service or a host recovers from Soft Error that is considered as Soft Recovery.
When a service or host check results are in a non-OK or non-UP state and the service check has been rechecked the number of times specified by the max_check_attempts directives in the service or host definition. This is called Hard Error. When a service or a host recovers from Hard Error that is considered as Hard Recovery.
This is the end of my blog on Nagios interview questions and if you want in-depth knowledge about the whole DevOps life-cycle click on the button below: