Questions tagged [monitoring]
Applications or appliances that observe machines, systems and networks to find problems and notify administrators.
2,471
questions
606
votes
4
answers
354k
views
What exactly do the colors in htop status bars mean?
By default, htop shows colored status bars for processors, memory, and swap. From left to right, the bars are colored green, blue, yellow, and red depending on some thresholds.
What does it mean when ...
187
votes
73
answers
248k
views
What tool do you use to monitor your servers?
For a more comprehensive list of monitoring tools and their features, check out this Wikipedia page.
As the question states, what are the most commonly used tools used for this task and what are ...
60
votes
13
answers
91k
views
How to monitor a windows log file in real time? [closed]
On windows how can I easily monitor a log file and see updates to the file in real time?
Basically, same functionality like tail -f log_file on Unix systems
Platform: Windows XP/2003/2008 server
[...
54
votes
4
answers
187k
views
ipmitool - can't find /dev/ipmi0 or /dev/ipmidev/0
Based on a previous question, I installed ipmitool (yum install ipmitool).
Even after a reboot, though, i get the following error when trying to run ipmitool power status:
Could not open device at /...
47
votes
7
answers
65k
views
View content of memcached
Are there any tools to explore what is currently cached inside a memcached pool? Not some much graphs, but the actual key/values currently stored.
45
votes
8
answers
277k
views
Get a history of CPU/memory usage
Is there someway to record Task Managers info about CPU and memory usage to examine later? Or an equivalent tool?
44
votes
5
answers
57k
views
monit: check process without pidfile
I'm looking for a way to kill all processes with a given name that have been running for more than X amount of time. I spawn many instances of this particular executable, and sometimes it goes into a ...
44
votes
11
answers
101k
views
How to monitor and log the memory/cpu usage of processes over time? [closed]
I am looking for a way to diagnose issues, such as swap death, where a balooning memory process fills up swap and kills the whole machine (such as apache).
I'm already using cacti and I can set up ...
42
votes
19
answers
9k
views
Can anyone recommend a website monitoring service? [closed]
What are people using for website monitoring services?
I am referring to a service that I can configure specific hits to my site to monitor if the site is up, and how fast it is responding to the ...
41
votes
8
answers
33k
views
Colorize Monitoring of Logs
I sometimes monitor apache and php error logs using tail under FreeBSD. Is there any way to get colorized output, either using tail or some other command line app?
Alternatively, what is your ...
40
votes
34
answers
8k
views
Best smartphone for sysadmins?
I need a cell phone that will help me keep an eye on my servers and services when I am away from my computer/desk/workplace.
Which smart phone would you recommend for sysadmins?
An SSH client is a ...
39
votes
6
answers
125k
views
How to see incoming IPs in Linux?
I am wondering what is the command/utility to have a real-time view of incoming IPs to my server, ideally along with the port and connected.
38
votes
10
answers
310k
views
Testing a website from Linux command line [closed]
I am looking for a tool to test a website from a Linux command line.
From the output, I need to know the http response (status codes) but also benchmark the time it takes to download the different ...
38
votes
2
answers
17k
views
Get notification from supervisord when a job exits
Is there any way supervisord can automatically restart a failed/exited/terminated job and send me a notification email with a dump of the last x lines of log file?
34
votes
6
answers
25k
views
Linux: logwatch(8) is too noisy. How can I control the noise level?
Our Linux systems run logwatch(8) utility by default. On a RedHat/CentOS/SL system, Logwatch is called by the /etc/cron.daily/ cronjob, which then sends a daily email with the results. These emails ...
33
votes
4
answers
28k
views
How i configure monit to start a process with a specific user?
Monit runs with root, but i don't want to start my processes as root.. like mysql, mongrel, apache..
33
votes
24
answers
11k
views
What are your most useful/valued Nagios plugins? [closed]
We're expanding our Nagios 3 setup, and are frequently coming across new NRPE or general Nagios plugins to test our existing infrastructure. This is in dribs and drabs though - it would be useful to ...
30
votes
5
answers
44k
views
Do SSDs support SMART?
S.M.A.R.T. (for Self-Monitoring Analysis and Reporting Technology) is a wonderful technology to detect hard drive failure before it really happens.
But is S.M.A.R.T. relevant for SSDs?
29
votes
2
answers
10k
views
Is there a Windows equivalent of Unix 'CPU steal time'?
In order to assess performance monitoring accuracy on virtualization platforms, the CPU steal time has become an increasingly relevant metric - see EC2 monitoring: the case of stolen CPU for an ...
28
votes
2
answers
21k
views
What is the difference between OpenTSDB and Graphite?
As far as I can tell, here are the main differences:
OpenTSDB does not deteriorate data over time, unlike Graphite where the size of the database is pre-determined.
OpenTSDB can store metrics per ...
28
votes
5
answers
125k
views
How can I manually run a nagios check from the command line?
When defining and testing new services in nagios I have been restarting nagios, then clicking the service, and rescheduling a check for as soon as possible, then waiting until the check happens.
Is ...
28
votes
6
answers
59k
views
How to find out the number of time series stored in Prometheus LevelDB
i'm responsible for maintaining the Prometheus servers in our company. The metrics however are provided by the teams.
Is there a way to find out the number of time series stored in the Prometheus ...
27
votes
6
answers
104k
views
A better "top" command for Mac OS X? [closed]
The top command on OS X is pretty crappy.. The one included with most Linux distros allows you to change the sort-by column using < and >, there is a coloured mode (by pressing the z key), and a ...
27
votes
8
answers
31k
views
How can I see how much bandwidth each Apache Virtual Host is using?
I have Apache set up to serve several Virtual Hosts, and I would like to see how much bandwidth each site uses. I can see how much the entire server uses, but I would like more detailed reports.
...
25
votes
7
answers
59k
views
Enable Ping in Windows Server Firewall?
I've just installed Windows Server 2008 on a server and I'm able to connect through Remote Desktop but can't ping. Do I need to open an special port on the firewall to be able to ping a server?
25
votes
12
answers
57k
views
Techniques to Monitor cron tasks?
Are there good techniques for monitoring cron tasks over a cluster?
We're starting to use cron to launch tasks at daily intervals. A few ideas for checking out information:
Add special application ...
25
votes
3
answers
4k
views
Shinken - Anyone using it? [closed]
I've recently discovered Shinken, which is a new implementation of Nagios using python. Shinken "divides" Nagios in 5 different types of daemons, each one performing separated tasks. I haven't tried ...
24
votes
7
answers
15k
views
Continuously monitor logs with tail that are occasionally rotated
We're using tail to continuously monitor several logs, but when a log is rotated the tail for that file will cease.
As far as I understand, the problem is that when the log is rotated, there is a new ...
24
votes
9
answers
80k
views
Monitor network traffic volume over interface
Is there a way to monitor the traffic (e.g., get a live view of the utilization) over a particular network interface, say eth0?
The catch here is that the set of tools on the box is fixed, and is ...
24
votes
7
answers
31k
views
Process Monitor fails to start on a 64 bit computer
When I try to start Process Monitor from SysInternals on some 64 bit windows 7 machines,the process fails to start. There is no error message. I double click and nothing happens. Other 64 bit windows ...
24
votes
7
answers
35k
views
Monitor an incoming SSH session in real time
Is there any linux software to monitor an incoming ssh session. At a previous job I was told that if you ever needed support from Red Hat for example you could have them SSH into your machine and you ...
21
votes
9
answers
3k
views
Best practices for backup checking?
It is a common situation, when administrator makes system for automatic backuping and forgets it. Only after a system fails administrator notices, that backup system has broken before or backups are ...
21
votes
1
answer
29k
views
htop - show I/O wait percentage
Is there a way to show I/O wait percentage in htop? I can't find a way how to enable this. On the following link there is an example of the functionality I need (the "Avg:" row): http://www.concept47....
21
votes
5
answers
3k
views
What am I looking for in a Monitoring Solution?
This is a Canonical Question about Monitoring Software.
Also Related: What tool do you use to monitor your servers?
I need to monitor my servers; what do I need to consider when deciding on a ...
21
votes
2
answers
20k
views
Sending a test/example alert from monit?
I've configured monit tests and alerts — but I want to make sure that everything in my alert stack (outgoing email servers, sms email gateways…) is functioning properly. Is there a handy way to fire ...
20
votes
4
answers
47k
views
How to generate netflow data in linux
We have a number of Linux servers for which I would like to capture netflow data to be processed by a netflow analyzer. I've been spoiled by the ease in which Mikrotik routers allow netflow data ...
19
votes
6
answers
21k
views
Is Zabbix the right tool for me?
I just want to monitor a small handful of servers (less than 10).
From reading various places it sounds like the top leading contenders (for open source at least) are:
nagios
munin
zabbix
From ...
19
votes
1
answer
16k
views
Monit versus Nagios
Does nagios offer any additional functionality when compared to monit?
This answer states that monit is a clumsy tool for situational awareness, which is the reason I'm asking the question.
19
votes
2
answers
112k
views
How to find the .pid file for a given process
I'm setting up monit and want to monitor a given python application. Monit does this by looking at the .pid files for processes, but I don't know where this would be.
I also tried creating my own ...
18
votes
10
answers
5k
views
Network Traffic Monitoring
What is the best tool to monitor/analyze network traffic on an entire network (several subnets)?
I'm looking for something that will help me toubleshoot bandwidth problems when, for instance, users ...
18
votes
4
answers
3k
views
Monitoring production server [closed]
We have 3 dedicated server, splitted in several VPS using openVZ.
We're using munin to monitor the VPS with the production sites, and monit on some one of the VPS to make sure it restarts the service ...
17
votes
8
answers
11k
views
There is any monitoring hosted solution? [closed]
I know Zabbix and Nagios, but I don't know any company that host those services for others to use.
I would like to find a solution that monitored at least some of my public URLs (if they are ...
17
votes
7
answers
12k
views
How can I force a MySQL table to become corrupted?
I have written a simple Nagios plugin that calls mysqlcheck (which checks for corrupted tables) and will give a warning if any are corrupt.
However none of my tables are corrupt now. So I'm not 100% ...
17
votes
2
answers
10k
views
What's difference between monitoring, tracing and profiling?
I have seen these three words showing up a lot, but don't understand the exact differences between them. For example, collecting CPU utilisation is often called profiling and can also fall into ...
16
votes
3
answers
13k
views
Finding short-lived TCP connections owner process
Running tcpdump on local connections to an apache server, I found TCP connections being established and closed immediately every 2 seconds. How do I find which process is responsible for these? ...
16
votes
8
answers
12k
views
Nagios alerts by telephone?
I have a couple of nagios servers that currently alert me via SMS and email, but I'd like to add outgoing calls to that. The ideal solution would have the nagios machines call my cellphone and play an ...
16
votes
6
answers
31k
views
How do you monitor the health of a mirrored disk in Windows?
I have a Mirrored Dynamic disk on my Windows 2003 Server. How do you monitor the health of the volume?
Is there a way to have the server send an email when there is an issue with the volume?
Is there ...
16
votes
1
answer
9k
views
SNMP Deprecated in Windows Server 2012
According to Technet's Features Removed or Deprecated in Windows Server 2012, SNMP is being depreciated:
SNMP is deprecated. Instead, use the Common Information Model (CIM),
which is supported by ...
15
votes
7
answers
94k
views
What Warning and Critical values to use for check_load?
Right now I am using these values:
# y = c * p / 100
# y: nagios value
# c: number of cores
# p: wanted load procent
# 4 cores
# time 5 minutes 10 minutes 15 minutes
# warning: 90% ...
15
votes
2
answers
5k
views
What is the strategy for detecting time drift in a linux based data centre?
What is the strategy for detecting time drift in all linux based data centre? This is a more difficult problem than it seems at first.
Time drift can cause serious problems for certain applications ...