Creating Alerts with Netdata Alerts Configuration Manager
The Netdata Alerts Configuration Manager enables users with Business subscriptions to create alerts from the Netdata Dashboard with an intuitive user interface.
Using Alerts Configuration Manager
Go to the Metrics tab and navigate to the chart you want to configure an alert for.
Click the Alert icon on the top right corner of the chart.
The Alerts Configuration Manager opens with the default thresholds. You can modify the configuration as required, and the alert definition on the right updates dynamically.
If you want more fine-grained control or access to more advanced settings, enable Show advanced.
Copy the alert definition that is generated in the code box and add it to an existing health configuration file or a new custom file under <path to netdata install>/etc/netdata/health.d/ on a Parent Agent or a Standalone Child Agent.
Reload Netdata Alert Health checks with <path to netdata install>/usr/sbin/netdatacli reload-health and the new alert is now configured.
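As a rough illustration, a definition saved to a custom file might look like the sketch below. The file name custom_cpu.conf, the template name, the chart context, and the threshold values are all hypothetical; the definition that the Alerts Configuration Manager generates for your chart will differ.

```conf
# Hypothetical file: <path to netdata install>/etc/netdata/health.d/custom_cpu.conf
# Names and values below are illustrative assumptions, not generated output.
template: custom_cpu_usage
      on: system.cpu
  lookup: average -5m unaligned of user,system
   units: %
   every: 1m
    warn: $this > 75
    crit: $this > 90
    info: average CPU utilization over the last 5 minutes
```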
Alerts Configuration Manager Sections
Alert Detection Method
An alert is triggered whenever a metric crosses a threshold:
- Based on a standard threshold
- Based on metric variance
- Based on the anomaly rate of the metric
Metrics Lookup, Filtering and Formula Section
Metrics Lookup: This is the Evaluate line of fields in the modal and it defines the parameters for the database lookup that is needed to get the value that will be compared against the alert definition. It corresponds to the lookup line of the Alert configuration file. The Alerts Configuration Manager provides a default selection for the lookup, which can be modified to suit your requirements. The parameters that can be modified are:
- METHOD (avg, sum, min, max, cv, stddev)
- COMPUTATION (sum of all dimensions or individually for each dimension)
- DIMENSIONS (All dimensions, or a selection of dimensions)
- DURATION (the period in time to run the lookup)
- OPTIONS (absolute, unaligned, percentage, min2max)
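A lookup line assembled from these fields could look like the following sketch. The method, duration, option, and dimension names are example values chosen for illustration only.

```conf
# METHOD = average, DURATION = the last 10 minutes,
# OPTIONS = unaligned, DIMENSIONS = user and system (assumed dimension names)
lookup: average -10m unaligned of user,system
```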
Alert Filtering: This functionality can be accessed through the Show advanced checkbox and it allows for filtering the alert health checks to be run only for specific components of the infrastructure. It helps in achieving a fine-grained configuration for any given alert.
- HOSTS/NODES: By default all hosts are selected. You can pick nodes from the dropdown list, or enter a wildcard matching a list of hosts that you want the alert health check to run on. This field corresponds to the hosts line of the Alert configuration file.
- INSTANCES: All instances are selected by default. You can pick instances from the dropdown list, or enter a wildcard matching a list of instances that you want the alert health check to run on. This field corresponds to the charts line of the Alert configuration file.
- CHART LABELS: All chart labels are selected by default. You can pick a chart label from the dropdown list or enter a wildcard matching a list of chart labels that you want the alert health check to run on. This field corresponds to the chart labels line of the Alert configuration file.
- OS: All Operating Systems are selected by default. You can choose which OS(s) an alert health check should run on. This field corresponds to the os line of the Alert configuration file.
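For orientation, the filter lines might end up looking something like this in the alert definition. The host names, instance pattern, and label value are hypothetical, and the exact lines emitted depend on your selections.

```conf
# Illustrative filters only; every name and pattern here is an assumption.
          os: linux
       hosts: web-* db-01
      charts: disk_space.*
chart labels: mount_point=/data
```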
Formula / Calculation: This field is available through the Show advanced checkbox and it is used to define a formula to be run on top of the lookup value. The result of the lookup is available in the $this variable, and after the formula is run, the result is also stored in $this and can be accessed while setting the alert thresholds. This field corresponds to the calc line of the Alert configuration file.
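A minimal sketch of a lookup followed by a calc line, assuming a metric reported in KiB with a dimension named used (both assumptions made for the example):

```conf
# The lookup result is placed in $this ...
lookup: average -1m unaligned of used
# ... and calc overwrites $this with the converted value (KiB to MiB).
  calc: $this / 1024
```

Any warn or crit expression that follows would then compare against the converted value in $this.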
Alerting conditions
Warning and Critical Thresholds: These fields are used to set the thresholds for the Warning and Critical alert states, while also having the option to set the condition for the alert to be raised if it is above or below the given threshold. If the advanced settings are selected, a formula option can also be used, to define a custom formula instead of a threshold. These fields correspond to the warn and crit lines of the Alert configuration file.
- Recovery Thresholds: This field is available through the Show advanced checkbox, and it is used to set the threshold that the metric value needs to meet to de-escalate from a given severity status, like Critical to Warning and from Warning to Clear. The logic is appended to the warn and crit lines of the Alert configuration file and resembles a one-line IF-THEN-ELSE clause.
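The one-line IF-THEN-ELSE form can be sketched as follows. The threshold numbers are example values; $status, $WARNING, and $CRITICAL are the variables the health engine provides for the alert's current state.

```conf
# Raise Warning above 85; once in Warning (or worse), stay there until the value drops to 75 or below.
warn: $this > (($status >= $WARNING)  ? (75) : (85))
# Raise Critical above 95; once in Critical, stay there until the value drops to 85 or below.
crit: $this > (($status == $CRITICAL) ? (85) : (95))
```

Using a lower recovery threshold than the trigger threshold keeps the alert from flapping when the value hovers around a single limit.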
Check Interval: This field is used to define the frequency of the health check for the alert and corresponds to the every line of the Alert configuration file.
Delay Notifications: This field is available through the Show advanced checkbox and it is used to set delay parameters on notifications for an alert severity escalation or de-escalation. It corresponds to the delay line of the Alert configuration file.
Agent Specific Options: These options are only available on the Netdata Agent and not honored on Netdata Cloud. They can be accessed through the Show advanced checkbox.
- Repeat Notifications: This field defines the repeat frequency for the alert notification when the alert is in either warning or critical status and corresponds to the repeat line of the Alert configuration file.
- Send to: This field is used to define a user role to which the alert notifications will be sent. If set to silent, then the alert won't be sent to any role. It corresponds to the to line of the Alert configuration file.
- Custom Exec Script: This field is used to define a custom script that will be executed when the alert is triggered (but needs to be carefully designed as it needs to call the health_alarm_notify.sh module) and corresponds to the exec line of the Alert configuration file.
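Put together, the timing and notification lines might look like the sketch below. The durations and the sysadmin role are example values, not defaults produced by the Alerts Configuration Manager.

```conf
# Run the health check once per minute.
 every: 1m
# Delay notifications by 1m on escalation and 5m on de-escalation;
# the delay grows by the multiplier on repeated state changes, capped at 1h.
 delay: up 1m down 5m multiplier 1.5 max 1h
# Agent only: re-send the notification while the alert stays raised.
repeat: warning 10m critical 5m
# Agent only: role that receives the notification (use silent to send to no one).
    to: sysadmin
```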
Alert Name, Description and Summary Section
- Alert Template Name: This field uniquely identifies an alert and corresponds to the template field of the Alert configuration file. The Alerts Configuration Manager provides a default name for an Alert template but we recommend you modify this to have a meaningful name for your configured alert.
- Alert Template Description: This field provides a brief explanation of the alert and corresponds to the info line of the Alert configuration file.
- Alert Summary: This field enables the users to create a custom title for the alert notification (via Notification integrations) and corresponds to the summary line of the Alert configuration file.
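These three fields could map to lines like the following. The name and texts are placeholders invented for the example.

```conf
# Hypothetical name and texts; choose ones that describe your own alert.
template: custom_disk_space_usage
 summary: Disk space usage
    info: Percentage of used space on the monitored mount point
```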