System Monitoring

System Monitoring helps administrators to monitor different performance KPIs from a centralized module. It helps track the performance and health of the systems by delivering real-time statistics on critical metrics such as GPU usage, CPU usage, available memory, network consumption, and much more. This is especially useful for larger organizations with multiple servers and endpoints, as it allows administrators to drill down and examine how each system is functioning. This section covers the insights provided by the System Monitoring module.

Server

System Info

This section details the system components that power your deployment.

Section
Description

Summary

This shows details like server manufacturer, server name, serial number, family, product UUID, SKU number, maximum capacity, memory slots, etc.

GPU

It displays the details of the GPU installed on the system.

Memory Info

It displays information like the memory card size, serial number, speed, and manufacturer.

Processor Info

It displays the processor information like the vendor, number of cores in the processor, its version, etc.

Capture Card

It shows the inputs and ports of the capture card.

Chasis Info

It shows the chassis information like the manufacturer, asset tag, and serial number.

BIOS Info

It shows the BIOS information like the version, release date, ROM size, and vendor.

Networking

This allows you to track the usage and performance of your network through various graphs capturing network details. You can monitor these charts while streaming content and analyze network performance.

Graph
Description

Receive Bandwidth

This graph shows the number of bytes of data received by the server over a specified time period.

Transmit Bandwidth

This graph displays the number of bytes of data transmitted by the server over a specified time period.

Receive Drops

This graph depicts the number of network packets received by the server during data transfer over a specified time period.

Transmit Drops

This graph shows the number of network packet losses caused due to network congestion, hardware issues, software bugs, or any other factor during data transfer over a specified time period.

Receive Errors

This graph shows the number of errors that occurred during the processing of received data packets owing to logical or hardware defects over a specified time period.

Transmit Errors

This graph depicts the number of errors that occurred during data packet transmission over a specified time period.

A time filter at the top right corner helps filter results by defining an absolute or relative time range.

Select the Absolute time range

  1. Click on the From text field.

  2. On the calendar that appears, select the date. Enter the time in hh: mm: ss format if required. Its default value ranges from 00:00:00 to 23:59:59.

  3. Click on the To text field. On the calendar that appears, select the date. Enter the time in hh: mm: ss format if required. Its default value ranges from 00:00:00 to 23:59:59.

  4. Click on the Apply time range option. The charts show results as per the specified time period.

Select the Relative time range

Click on the required time range, and the graphs update to show the results as per the selected option. For example, selecting the Last 5 minutes option, modifies the chart to show live results accurate to data transferred per second.

Zoom in the details for a specific time period.

  1. Left-click and select the time period (drag the pointer horizontally). The time period gets highlighted.

  2. Release the left-click. The graph values get modified to reflect the data recorded for the selected time range. Note that the platform modifies the values of the entire dashboard to match the selected time period.

Refresh Dashboard

Actions

GPU

This section allows you to monitor GPU utilization and performance.

Chart
Description

Information Card

It shows the details of the GPU installed on the system.

Utilization

It shows the percentage of GPU utilized over a specified time period.

Memory Total vs Used

It shows the total GPU memory vs used memory (in GBs) over a specified time period. The GPU uses memory to to perform operations like managing multiple applications simultaneously or create images in a frame buffer to output to a display device.

Video Encoder Utilization

It displays the levels of video endor utilization over a specified time period. A video encoder encodes the digital data stream or signal. It compresses raw video and audio files.

Video Decoder Utilization

It displays the levels of video endor utilization over a specified time period. A video decoder decodes the digital data stream or signal. It decompresses raw video and audio files to a defined output format (based on the deployment).

CPU

This allows you to track the usage and performance of your CPU. A high CPU usage can indicate that your system is under strain and may need additional resources or optimization.

Graph
Description

Information Card

It shows the details of the CPU installed on the system.

Average CPU Usage

It shows the percentage of average CPU resource consumption over a specified period of time.

Load Averages

It shows the CPU load capacity plotted over a duration of 5, 10, and 15 minutes. The load capacity denotes the number of processes being executed or waiting to be executed by the CPU.

It is important to note that each operation uses CPU resources differently and that every CPU has a threshold specified to achieve operating efficiency. Please use best practices to get the most out of the system.

Temperature

It shows the temperature (in °C) of the CPU over a specified period of time. It is advised that a sufficient cooling system be used to transfer the generated heat into the surrounding air and keep the chip at optimal temperatures.

Memory Details

This allows you to track the usage and availability of your system's RAM, which stores data temporarily while it is being processed.

Graph
Description

Memory- Total vs Used

It shows the total RAM memory available and the used memory over a specified period of time.

Memory Available

It displays the available memory over a specified period of time.

Cached

It shows used cache memory over a specified period of time. To ensure operational efficiency, the system caches some data for faster access by the CPU.

Swap- Total vs Used

It shows the total swamp memory available and the used memory over a specified period of time. In scenarios where the RAM is fully utilized the system uses the Swap memory to utilize space for process execution.

Buffers

It shows the buffer memory utilized over a specified period of time.

Memory- Free

Unused physical memory

Thread Details

This section shows the thread utilization (in percentage) by the CPU core over a specified period of time.

Storage

Graph
Description

Storage- Total vs Used

It shows a graphical representation of the total storage and used storage over a specified period of time.

Storage- Used vs Available

It shows the used storage and available storage.

Data retention:

For better disk utilization, system monitoring data will be cleaned up at 50 GB space utilization or one year, whichever comes first.

Capacity

This section helps monitor the health of the system by viewing peak and minimum capacity usage over a specified period of time. The usage levels are indicated as 1UL, 2UL, and 3UL, with 1 being the lowest and 3 being the highest.

Graph
Description

Server Capacity

The server capacity graph plots the highest CPU usage level value or GPU usage level value over a specified time period.

Overall CPU Usage Levels

It plots the overall CPU usage level over a specified time period.

Overall GPU Usage Levels

It plots the overall GPU usage level over a specified time period.

Capacity Usage Warnings

These warnings alert about the health of the system resources. Only after reaching a lower usage level benchmark does the system trigger a higher usage level (2UL or 3UL). For example, if the CPU or GPU usage level exceeds 70% of the total usage capacity at a specific time it would be plotted in the Overall CPU/ GPU Usage graph and Server Capacity graph as UL2. This would also trigger a warning notification indicating system levels have reached usage levels of 60% or higher. You can now balance system resources or buy additional licenses/ CPUs/ GPUs to accommodate your requirements.

Endpoints

Userful uClient Adapter is a versatile, powerful, cutting-edge client for the Userful Infinity Platform that facilitates streaming content to destinations. The end-point monitoring section enables IT to centrally monitor end-points (uClients) and view status and other uClient information.

Status

Below is the list of filters you can use to identify uClients based on their status.

  • Down View powered-off or physically disconnected uClients.

  • Online View uClients physically connected to the platform.

  • Orphaned View uClients connected to a server that is down at the moment.

  • Free View available uClients that can be connected to the platform.

uClient Type

  • Android View the list of Android uClient adapters

  • WebOS View uClients connected to WebOS displays

Endpoint Statistics

Determine uClient operational efficiency and identify downtime through statistics visualized on dynamic and interactive graphs.

Click on the client to view the statistics. Client information like its connection type, location, IP Address, and MAC Address are displayed along with a line graph. You can apply a time range and filter status for selected periods of time.

A shortcut to filter results for a specific time range:

  • Position the pointer at the starting point on the timeline, left-click, and drag the pointer to create a time range.

  • Release the left-click. The graph reloads to show the details for that time range.

Note: To edit client details, navigate to Admin Center > System configuration > uClients.

Notification Channel

Outbound alerts proactively deliver and monitor systems health. Configure email notifications alerting about the system's health once the system reaches high resource consumption levels- Usage Level 2 (UL2) or Usage Level 3 (UL3) or when an end-point device unexpectedly goes offline.

Set up SMTP Configuration

  1. In your Userful account, navigate to Admin Center > System Monitoring.

  2. Click Notification channel in the left pane.

  3. Click SMTP Configuration at the top right corner. The SMTP Server Configuration menu appears.

  4. Enter the Username and password.

  5. Enter the Host URL and Port number.

  6. Enter the address from which the emails would be sent.

  7. Click Save.

Create Notification Channel

  1. In your Userful account, navigate to Admin Center > System Monitoring.

  2. Click Notification channel in the left pane.

  3. Click Create Notification Channel at the top right corner.

  4. Enter the channel name. Note: This field does not accept any spaces. You can use underscores or capital letters.

  5. Enter the email addresses. To separate the email addresses, use a semicolon (;).

  6. Enable the Single Email option to send one email containing all the recipients.

  7. Notification Settings Disable Resolve Message: Enabling this option does not send a resolve email notification when the system resumes its optimal usage level. It is recommended not to enable this option. Send reminders: Enable this option to specify the duration, after which the system should re-send the email notification.

  8. Click Save.

When the system reaches a state of usage level 2, it waits 5 minutes before sending the notification.

Email triggered when the system hits Usage Level 2. It is a state where the system utilizes 60% or higher GPU resources.

Email triggered when the system reaches usage level 3 (UL3). It is a state where the system utilizes 85% or higher GPU resources.

Email triggered when an Endpoint, i.e., uClient, goes offline.

Last updated

Copyright © 2024 Userful Corporation. All rights reserved.