Questions tagged [monitoring]
Applications or appliances that observe machines, systems and networks to find problems and notify administrators.
2,471
questions
1
vote
0
answers
50
views
Monitor Windows Server network data transfer [duplicate]
I would like to monitor the amount of data transferred (Network Out) in a Windows Server. The traffic is composed mainly of files downloaded from IIS.
Is there some software or Windows Feature that I ...
3
votes
2
answers
7k
views
Prometheus: scrape interval is 1m, but resolution is still 15s
tl;dr: My scrape interval is 1m, yet I have a 15s resolution. Why?
My prometheus configuration includes a job to scrape kong metrics:
- job_name: kong_blue
honor_timestamps: true
scrape_interval:...
0
votes
1
answer
833
views
How is possible that free memory goes down and used memory is not going up?
I am collecting the results of free -m command on an ubuntu server every 5 seconds, and preparing a time series graph based on the values of free and used columns.
free -m
total ...
0
votes
1
answer
78
views
Best way to monitor for SSL / Connection failures on a server that's still up [closed]
We had an outage issue today (NGinx not successfully restarted by our Let's Encrypt renewal CRON job, so not serving the renewed cert) leading to the server being up according to Pingdom but not ...
0
votes
1
answer
9k
views
Debian on HP Gen9 — most recent hpssacli appears to be too old
I've installed a recent Debian 10 Buster system on to the server, which is HPE DL360 Gen9. It has P440ar adapter, which works with the "new" hpsa driver. As far as I remember, RAIDs were configured ...
0
votes
1
answer
4k
views
Check_MK: How to add service to Servicegroup?
I'm new at monitoring, and would to know is it possible to add a Services to a Servicegroup.
What I will do:
I have Servers how i monitor and created a Servicegroup, and now I will add queries like ...
0
votes
1
answer
659
views
Measuring/monitoring Unbound Resolver Performance
I am running Unbound resolver on FreeBSD, and i am looking for a solution to measure its performance.sometimes there are some delays in regard of resolving domain names,and my customers are ...
0
votes
0
answers
142
views
MySQL least privileges to get database/tables size
I'm using centreon_plugins to monitor some mysql databases. As a best practice I create a dedicated user for monitoring and I give it the least possible privileges.
For simple monitoring I usually ...
0
votes
1
answer
41
views
Programs to monitor CentOS servers and workstations?
Are there programs / solutions that would allow me to monitor the hardware, software, issues, performance, networking and I/O of approximately 100 CentOS/Rhel servers and workstations?
Google finds ...
0
votes
1
answer
304
views
NAgios: add/import a list of host to ping, automatically?
I am searching for a way to automatically add a few hundreds of hosts to ping in Nagios.
I guess there could be procedure or a file to fill with something like an /etc/hosts file to have Nagios to ...
0
votes
1
answer
53
views
Azure Monitor Metrics
About 4 weeks ago we had a development SQL database on azure and it was great as we could see the CPU and other information.
After doing what we needed to do to make our database work with Azure, we ...
0
votes
0
answers
425
views
How to manually install Stackdriver Monitoring on Google Cloud Platform
We are running several microservices on Google Cloud Platform (GKE using master version v1.12.9-gke.15) and have been using the Stackdriver integration for logging and monitoring (https://cloud.google....
2
votes
1
answer
331
views
HTML formatted alert emails using monit
Is it possible to send HTML formatted alert emails using monit service ?
I would like to include coloured status based on the alert.
Ref: https://linux.die.net/man/1/monit
5
votes
2
answers
10k
views
Smoothing Grafana graph
I'm setting up montinoring on my company infrastructure. I installed Grafana with the Prometheus Datasource. I built my dashboards and have rather "Spiky" graphs. I was wondering if there was any way ...
3
votes
2
answers
8k
views
Disk usage metrics missing from Prometheus node exporter
I used Prometheus and node exporter a while ago and had access to node_filesystem_* metrics to monitor disk usage but I've recently fired it up on some other servers (Ubuntu Linux) and those metrics ...
0
votes
1
answer
1k
views
Zabbix eth0 incoming traffic, triggering false alarms?
I'm a bit confused right now, I just got this false alarm
I had an item set up to monitor incoming and outgoing traffic on my servers and despite it working as intended for A WHOLE MONTH, suddenly I ...
0
votes
1
answer
371
views
Google App Engine Traffic Analytics by UserAgent/Operating system
I have an app running in Google App Engine. Their ecosystem seems to provide some facilities for analytics from logs in StackDriver.
For the life of me I can't figure out how to get a simple chart ...
2
votes
2
answers
13k
views
How to visualize current CPU usage of a pod with Grafana
Is there a way to visualize current CPU usage of a pod in a K8S cluster?
I tried different Prometheus metrics like namespace_pod_name_container_name:container_cpu_usage_seconds_total:sum_rate and ...
0
votes
0
answers
539
views
Analysing latency for a single request
I have the following (very simplified) network :
(ingress) -> DMZ (nginx) -> HA Proxy -> Reverse proxy (nginx) -> Application Server (tomcat)
All the layers send information to ...
3
votes
1
answer
166
views
Is it safe to use SSH for periodic command execution on remote Server regarding usage of entropy?
I'm planning to set up periodic checks for system health on a multiple remote systems. To achieve this I've set up key-based authentication to execute scripts on remote side without interaction
i.e.:
...
0
votes
1
answer
1k
views
Centreon - Plugin command does not work with simple quote
I have a problem with one of my sonde (plugin) in Centreon.
On my poller server, I launch this command:
/usr/bin/perl /usr/lib/nagios/plugins/centreon-plugins-master/centreon_plugins.pl --plugin=os::...
0
votes
1
answer
115
views
Nagios3: Different CRITICAL color for certain host groups
My colleagues want a new host group to have more visibility and want the Critical status to be different color. This is defined in nagios3/stylesheets/status.css as statusBGCRITICAL but as I can't ...
0
votes
1
answer
177
views
How do i make stackdriver graphs show 24 hour format dates
In my stackdriver dashboard, the horizontal axis is always date/time but it displays in am/pm format, e.g. 5pm, which is a pain as everything else in my workflow is 24hour.
I cannot for the life of ...
1
vote
3
answers
5k
views
How to monitor HTTPS and TCP Load Balancers on GCP
we are using HTTPS and TCP Load balancers in the Google Cloud Platform (GCP) and we want to monitor them (e.g. with Stackdriver).
For us, one main criteria seems to be the amount of healthy backend ...
0
votes
1
answer
461
views
How to monitor network and disk i/o
This is specifically for AWS EC2 monitoring but I'm also interested in the generic context of monitoring input/output.
How do I know what the network or disk i/o limit is for any server/service and ...
0
votes
1
answer
1k
views
Zabbix macros, how to access hosts by name, and their items or keys
I have a zabbix 4.2 installation.
I have it currently monitoring a network device via generic SNMP queries with my supplied OIDs. The data is being ingested just fine, graphs are populating as ...
0
votes
1
answer
6k
views
Monitor all processes in Linux with telegraf and grafana
I have a task to configure monitoring of all processes in a Linux system with Telegraf and Grafana. But there is some issue that there are many processes in a system.
For gathering information about ...
0
votes
2
answers
2k
views
nrpe for Windows - filter and arguments change threshold values
i have my monitoring Linux server with
NRPE Plugin for Nagios
Version: 3.2.1
and i installed on the windows machine NSClient++ with this .bat script.
set setupfile="NSCP-0.5.2.35-x64.msi"
msiexec /...
2
votes
0
answers
723
views
Clean old release files in Sentry
I'm self-hosting Sentry 8 and /var/lib/sentry/files grew to a significant size. I tried launching a script to go through each project's releases via the API, select those older than X days, and remove ...
0
votes
1
answer
171
views
How to check Azure VM Port Heatbeat
I have an application providing http services in a specific port (eg: 8181).
This app is autonomous and independent of IIS and any other application server. It is installed in a Azure VM.
I want to ...
0
votes
2
answers
623
views
Multiple HTTP checks per host with Icinga2
What is best practice for configuring multiple HTTP checks for a host?
I know there is that "apply" method:
apply Service "http" for (http_vhost => config in host.vars.vhosts) {
...
}
But i ...
0
votes
1
answer
371
views
Monitor unix process with Nagios Core
I'm begginer with Nagios and i'm trying to monitor if a specific process on my unix machine is up or not.
Server and client are on the same machine : Trying to monitor that machine from a Nagios ...
0
votes
0
answers
31
views
OCSInventory handover table and form
I need to make some kind of handover table.
Records would look like this:
PC , user , action , date;
X , Y , return , 10.07;
X , Z , hire , 11.07;
A , Y , hire , 16.07;
etc. Each record would ...
0
votes
1
answer
2k
views
Azure Monitor Alert for Windows Services
Is it possible to add an Azure Monitor Alert for Windows Services running on Cloud Services?
We have a web app running on Cloud Services depending on a service we installed on the instance VM, we ...
0
votes
1
answer
3k
views
Zabbix service.discovery multiple templates for distinct items and triggers
service.discovery in Zabbix returns a JSON array of all current running services (based on filters) for each host.
I would like to create a template hierarchy where i have a parent template that ...
0
votes
0
answers
27
views
Server component version number reporting and collecting - what is this called?
What is the name given to software which reports versions for a given set of software components (libraries, applications, etc) on one or more servers, and the collection software that runs on a ...
3
votes
2
answers
6k
views
Zabbix - Web scenarios trigger. How to ignore the first failed attempts?
Problem:
Trigger send an e-mail immediately when the site is unreachable.
System:
Linux Debian Stretch, Zabbix 4.2.1.
Current configuration of the web scenario:
Scenario:
Name: website.com
...
0
votes
2
answers
244
views
Count instances of a pattern's match in a real-time log file, in a specific time frame, using PowerShell?
I've to intercept a pattern in a log file and, when this pattern is matched for 5 times in 5 minutes, send an alert...
I've wrote this lines, but I'm stuck... I don't know how to handle the time ...
0
votes
1
answer
262
views
How to identify the source of growing blocked processes on a system container (LXC/OpenVZ)
When the number of blocked processes grow indefinitely, how to identify exactly what are these processes.
Since I'm inside a system container (LXC/OpenVZ), I can't modify the hosts' kernel modules to ...
0
votes
1
answer
94
views
How to suppress recovery notification in Stackdriver?
I have Alerting Policy in Google's Stackdriver which sends a notification when there is an error in the specific log.
For the obvious reason, there is no recovery for such condition, i.e. I don't want ...
2
votes
1
answer
3k
views
Microsoft Telemetry Data
I work at a large company and started digging through firewall logs to get my head around the network and what is going on. I noticed a HUGE amount of traffic going to 72.21.81.240. I cross-correlated ...
2
votes
2
answers
4k
views
kubectl top node doesn't work. Looks like an issue with heapster
I have a fresh k8s cluster on gke.
Whenever I run kubectl top node gke-data-custom-vm-6-25-0cbae9b9-hrkc
I get
Error from server (NotFound): the server could not find the requested resource (get ...
0
votes
1
answer
1k
views
Stackdriver logs stopped appearing in GCP Stackdriver Logging Console
Background
I have a Flask server running on GCE's container-optimized image. This server prints a log for every request. Normally, I can view the logs my application prints in GCP Stackdriver Logging ...
0
votes
0
answers
372
views
How to use snmp remotly with MRTG
I'm using CentOS 7 as the operating system. I'm trying to setup a client MRTG to fetch several servers informations. I don't want to install a web server on the remote server. Is there some official ...
3
votes
3
answers
4k
views
NRPE not working after NRPE plugin upgrade from v2.15 to v3.2.1
I'm migrating our Icinga 2 from Debian 8 to Ubuntu 18.04. The old server had NRPE plugin 2.15. The new server has NRPE plugin 3.2.1.
If I try to connect with the new plugin to old NRPE servers (v2.15 ...
1
vote
0
answers
50
views
Mapping network connections between servers
Sometimes I'd like an easy way to produce lists over everything a server accepts or connects to. Not what it can connect to/accept connections from (firewall rules etc) but actual connections.
I can ...
0
votes
1
answer
664
views
Monitor SSL Certificate Changes
Is there a good tool to check when a certificate has been changed, in our environment we have certificates being changed by an operational team that is creating issues. I looked at NAGIOS ...
0
votes
1
answer
435
views
Telegraf not working with sysstat options like `-d` or `-n ALL`?
I am attempting to setup Telegraf on a Proxmox (Debian-based distro) to send metrics to InfluxDB. However, it seems to be complaining about sysstat syntax.
I am using the sysstat plugin in Telegraf.
...
1
vote
1
answer
820
views
How to check to exist a unit systemd on the servers by Prometheus?
I have a service 'service_1' on the servers. And I need checking by Prometheus existing this service on the servers. node_systemd_unit_state showing only metrics for existing services.
Maybe I can ...
0
votes
0
answers
365
views
snmpd.conf - TImeout with Centos
First of all i want to say i did a lot of researches and i read tons of examples for the last 3 days but i didn't find a solution that's why i am posting here.
I would like to monitor services and ...