System Monitoring

System Monitoring allows administrators to monitor different performance KPIs from a centralized module. It helps track the performance and health of the systems by delivering real-time statistics on critical metrics such as GPU usage, CPU usage, available memory, network consumption, and much more. This is especially useful for larger organizations with multiple servers and endpoints, as it allows administrators to drill down and examine how each system is functioning. This section covers the insights provided by the System Monitoring module.

Server

System Info

This section details the system components that power your deployment.

SectionDescription

Summary

This shows details like server manufacturer, server name, serial number, system ID, SKU number, maximum capacity slots, etc.

GPU

It displays the details of the GPU installed on the system.

Memory Info

It displays information like the memory card's size, serial number, speed, and manufacturer.

Processor Info

It displays the processor information like the vendor, number of cores in the processor, its version, etc.

Capture Card

It shows the inputs and ports of the capture card.

Chasis Info

It shows the chassis information like the manufacturer, asset tag, and serial number.

BIOS Info

It shows the BIOS information like the version, release date, ROM size, and vendor.

Networking

This allows you to track the usage and performance of your network through various graphs capturing network details. You can monitor these charts while streaming content and analyze network performance.

GraphDescription

Receive Bandwidth

This graph shows the number of bytes of data received by the server over a specified time period.

Transmit Bandwidth

This graph displays the number of bytes of data transmitted by the server over a specified time period.

Receive Drops

This graph depicts the number of network packets received by the server during data transfer over a specified time period.

Transmit Drops

This graph shows the number of network packet losses caused due to network congestion, hardware issues, software bugs, or any other factor during data transfer over a specified time period.

Receive Errors

This graph shows the number of errors that occurred during the processing of received data packets owing to logical or hardware defects over a specified time period.

Transmit Errors

This graph depicts the number of errors that occurred during data packet transmission over a specified time period.

Filter to time

The filter-to-time option helps filter results by defining an absolute or relative time range.

Select the Absolute time range

  • Click on the From text field. On the calendar that appears, select the date. Enter the time in hh: mm: ss format if required. Its default value ranges from 00:00:00 to 23:59:59.

  • Click on the To text field. On the calendar that appears, select the date. Enter the time in hh: mm: ss format if required. Its default value ranges from 00:00:00 to 23:59:59.

Select the Relative time range Relative time range options provide an easier method to select a time range.

  • Click on the required time range, and the graphs update to show the results as per the selected option. For example, selecting the Last 5 minutes option, modify the charts to show live results accurate to data transferred per second.

Zoom in the details for a specific time period.

  • Left-click and select the time period (drag the pointer horizontally). The time period gets highlighted.

  • Release the left-click. The graph values get modified to reflect the data recorded for the selected time range. Note that the platform modifies the values of the entire dashboard to match the selected time period.

Refresh Dashboard

Actions

GPU

This section allows you to monitor GPU utilization and performance.

ChartDescription

Information Card

It shows the details of the GPU installed on the system.

Utilization

It shows the percentage of GPU utilized over a specified time period.

Memory Total vs Used

It shows the total GPU memory vs used memory (in GBs) over a specified time period. The GPU uses memory to to perform operations like managing multiple applications simultaneously or create images in a frame buffer to output to a display device.

Video Encoder Utilization

It displays the levels of video endor utilization over a specified time period. A video encoder encodes the digital data stream or signal. It compresses raw video and audio files.

Video Decoder Utilization

It displays the levels of video endor utilization over a specified time period. A video decoder decodes the digital data stream or signal. It decompresses raw video and audio files to a defined output format (based on the deployment).

CPU

This allows you to track the usage and performance of your CPU. A high CPU usage can indicate that your system is under strain and may need additional resources or optimization.

GraphDescription

Information Card

It shows the details of the CPU installed on the system.

Average CPU Usage

It shows the percentage of average CPU resource consumption over a specified period of time.

Load Averages

It shows the CPU load capacity plotted over a duration of 5, 10, and 15 minutes. The load capacity denotes the number of processes being executed or waiting to be executed by the CPU.

It is important to note that each operation uses CPU resources differently and that every CPU has a threshold specified to achieve operating efficiency. Please use best practices to get the most out of the system.

Temperature

It shows the temperature (in °C) of the CPU over a specified period of time. It is advised that a sufficient cooling system be used to transfer the generated heat into the surrounding air and keep the chip at optimal temperatures.

Memory Details

This allows you to track the usage and availability of your system's RAM, which stores data temporarily while it is being processed.

GraphDescription

Memory- Total vs Used

It shows the total RAM memory available and the used memory over a specified period of time.

Memory Available

It displays the available memory over a specified period of time.

Cached

It shows used cache memory over a specified period of time. To ensure operational efficiency, the system caches some data for faster access by the CPU.

Swap- Total vs Used

It shows the total swamp memory available and the used memory over a specified period of time. In scenarios where the RAM is fully utilized the system uses the Swap memory to utilize space for process execution.

Buffers

It shows the buffer memory utilized over a specified period of time.

Swap- Free

It displays the swap memory that the system used when the available (unused) physical memory was insufficient to complete the task over a specified period of time.

Thread Details

This section shows the thread utilization (in percentage) by the CPU core over a specified period of time.

Storage

GraphDescription

Storage- Total vs Used

It shows a graphical representation of the total storage and used storage over a specified period of time.

Storage- Used vs Available

It shows the used storage and available storage.

Capacity

This section helps monitor the health of the system by viewing peak and minimum capacity usage over a specified period of time. The usage levels are indicated as 1UL, 2UL, and 3UL, with 1 being the lowest and 3 being the highest.

GraphDescription

Server Capacity

The server capacity graph plots the highest CPU usage level value or GPU usage level value over a specified time period.

Overall CPU Usage Levels

It plots the overall CPU usage level over a specified time period.

Overall GPU Usage Levels

It plots the overall GPU usage level over a specified time period.

Capacity Usage Warnings

These warnings alert about the health of the system resources. Only after reaching a lower usage level benchmark does the system trigger a higher usage level (2UL or 3UL). For example, if the CPU or GPU usage level exceeds 70% of the total usage capacity at a specific time it would be plotted in the Overall CPU/ GPU Usage graph and Server Capacity graph as UL2. This would also trigger a warning notification indicating system levels have reached usage levels of 60% or higher. You can now balance system resources or buy additional licenses/ CPUs/ GPUs to accommodate your requirements.

Endpoints

Userful uClient Adapter is a versatile, powerful, cutting-edge client for the Userful Infinity Platform that facilitates streaming content to destinations. The end-point monitoring section enables IT to centrally monitor end-points (uClients) and view status and other uClient information.

Status

It helps filter end-points according to

uClient Type

  1. Android It filters Android uClients.

  2. WebOS It filters uClients with WebOS.

Endpoint Statistics

Determine client’s operational efficiency and identify downtime through statistics visualized on dynamic and interactive graphs.

Client States

  1. Offline It is activated when the client goes offline.

  2. Unknown It is activated when the client’s status can not be detected by the server.

  3. Orphan It is activated when the server the client is connected to faces downtime.

  4. Occupy

  5. Available It is activated when the client is physically secured to an online server and is available to connect.

  6. Connected It is activated when the client is connected to the platform.

A shortcut to filter results for a specific time range: Position the pointer at the starting point on the timeline, left-click, and drag the pointer to create a time range. Release the left-click. The graph reloads to show the details for that time range.

Note: To edit client details, navigate to Admin Center > System configuration > uClients.

Notification Channel

Outbound alerts proactively deliver and monitor systems health. Configure email notifications alerting about the system's health once the system reaches high resource consumption levels- Usage Level 2 (UL2) or Usage Level 3 (UL3) or when an end-point device unexpectedly goes offline.

Setup SMTP Configuration

  1. Server configuration details.

    • Enter the Username.

    • Enter the Password.

    • Enter the Host URL.

    • Provide the Port number.

    • Enter the address from which the emails would be sent.

  2. A notification indicating that the SMTP Server is created appears.

Create Notification Channel

  1. Channel notification details.

    • Enter the channel name. Please note that the name does not accept any spaces. You can use underscores or capital letters.

    • Enter the email addresses. To separate the email addresses, please use a semicolon (;).

    • Enable the Single Email option to send one email containing all the recipients.

    • Notification Settings Disable Resolve Message: Enabling this option does not send a resolve email notification when the system resumes its optimal usage level. It is recommended not to enable this option. Send reminders: Enable this option to specify the duration, after which the system should re-send the email notification.

  2. The notification channel appears on the screen. Click on the three-dot menu to edit or delete the channel.

When the system reaches a state of usage level 2, it waits 5 minutes before sending the notification.

Email triggered when the system hits Usage Level 2. It is a state where the system utilizes 60% or higher GPU resources.

Email triggered when the system reaches usage level 3 (UL3). It is a state where the system utilizes 85% or higher GPU resources.

Email triggered when an Endpoint, i.e., uClient, goes offline.

Last updated

Copyright © 2024 Userful Corporation. All rights reserved.