Notable Events
  • 4 Minutes to read
  • Contributors
  • Dark
    Light
  • PDF

Notable Events

  • Dark
    Light
  • PDF

Notable events displays all the details of the device and system monitor errors and alerts that should be notified to the Administrator. It would assist in identifying the conditions in which the alert was raised so that a further investigation can be done in identifying and resolving the issue.

Following are the list of notable events, these are events that inform you about the failures in your environment along with the duration so that you can take the appropriate measures to resolve it.

The events can be segregated into the following categories:

  • Connector Monitor: The events under the category dev monitor indicates that there was some error in log collection.
Events Action
Connector Pico: Invalid Port
  1. Reconfigure the connector
  2. Contact DNIF support.
Connector AWS: Encountered an error Contact DNIF support
  • System Monitor: These events indicate that there was some kind of process failure.
Events Action
celery_scheduler is FATAL Export snapshot of the logs belonging to the component and contact DNIF support
Disk usage at 82.67%
    Note: Alerts will be raised once the disk usage is equal to or above 82.67%
Increase the size or free up disk space whichever is applicable. For troubleshooting steps click here
System unreachable
  1. Check for network connectivity between reported components and CORE.
  2. If network connectivity is present download snapshot for the component/Cluster and send it to DNIF support. For troubleshooting steps click here
Datanode not responding For troubleshooting steps click here
Disk usage at 82.67%
    Note: Alerts will be raised once the disk usage is equal to or above 60-80%
  1. Increase the disk space provided.
  2. Configure retentions for the required streams
  3. For troubleshooting steps click here
Storage Leader down
  1. Check for network connectivity
  2. For troubleshooting steps click here.
Storage Service down
  1. Check for network connectivity
  2. For troubleshooting steps click here.
Compute Leader down
  1. Check for network connectivity
  2. Contact DNIF support if connectivity is present.
Compute Service down
  1. Check for network connectivity
  2. Contact DNIF support if connectivity is present.
Event processor pipeline is down Start Event Processor pipeline
High CPU Utilization (> 90%) Contact DNIF support. For troubleshooting steps click here
High Mem Utilization (> 90%) Contact DNIF support. For troubleshooting steps click here
Query server is down For troubleshooting steps click here
Correlation server is down For troubleshooting steps click here
Report server is down For troubleshooting steps click here.
  • Alerts: These events indicate that there was some kind of unusual behaviour
Events Action
Email delivery failure, check SMTP configuration Reconfigure SMTP with proper credentials
Retention service failed to clear [number of] buckets
  1. Check for network connectivity
  2. Contact DNIF support if connectivity is present.
Unable to snapshot logs
  1. Check for network connectivity
  2. Contact DNIF support if connectivity is present.
Degradation in datanode transfer
  1. Check for network connectivity and bandwidth between AD and DN
  2. If network connectivity is present, download log snapshot and send it to DNIF support
Datanode(s) not available, stopping ingestion For troubleshooting steps click here.
Potential Datanode transfer failure
  1. Check network connectivity.
  2. Check datanode services For troubleshooting steps click here.
EVTMEM has crossed threshold For troubleshooting steps click here
"System Name": EPS Governor dropped 50 events in last 5 minutes Investigate EPS Spike

Warning: These events indicate that there could be some critical failures

Events Action
Datanode transfer cache full, ingestion stopped Increase the disk space provided. For more information contact DNIF support

Service Monitor: These events indicate that there were some Service Failures

Events Action
Memory store down Download log snapshot and send it to DNIF support
Signaling framework down Download log snapshot and send it to DNIF support
Queue down Download log snapshot and send it to DNIF support

License Alert: These events indicate that there are some License related errors

Events Action
Unable to connect to UNET For all license related issues contact DNIF support
Breached 100% Licensed Volume Contact DNIF support
Device Enforcement Enabled The system will restrict log collection to an arbitrary set of devices equaling in count to the licensed limit
Device Enforcement Disabled The system will automatically disable enforcement mode if log collection is within the permissible count of devices.

How to view Notable Events?

  • Hover on the Administration icon on the left navigation panel of the Home screen, from the option displayed select Manage Components, the following screen will be displayed.

image.png

  • Scroll to the end of the page to view the notable events of the cluster.
  • Click the particular component to view the notable events of a particular component.

image.png

  • The Notable Events screen displays the following fields:
Field Name Description
Component Displays the name of the component
Event Type Displays the event type example : system monitor / device monitor
Message Displays the error message
Severity Displays the severity level of the error event
Start Time Displays the start time when the error was found
End Time Displays the end time when the error was found

You can filter notable events on the basis of event types such as System Monitor, Connector Monitor, and Alert.

image.png

Similarly, the notable events list will be displayed for all other components.

Switchboard

Switchboard lists all the possible notable events that can be encountered in DNIF. You can configure email addresses against each Notable event. An alert notification is sent to configured email addresses whenever the particular notable event has occurred.

  • Hover on the Administration icon on the left navigation bar of the Home screen, from the options displayed select Switchboard, the following screen will be displayed.

image.png

The Switchboard screen displays the following fields:

Field Name Description
Name Displays the name of the notable event (error message)
Severity Displays the severity level of the error event
Configure Notification Enter the email id of the User to be notified and click Save.

Was this article helpful?

What's Next