Questions tagged [metrics]
Metrics are a way to measure things.
114
questions
29
votes
2
answers
10k
views
Is there a Windows equivalent of Unix 'CPU steal time'?
In order to assess performance monitoring accuracy on virtualization platforms, the CPU steal time has become an increasingly relevant metric - see EC2 monitoring: the case of stolen CPU for an ...
28
votes
2
answers
21k
views
What is the difference between OpenTSDB and Graphite?
As far as I can tell, here are the main differences:
OpenTSDB does not deteriorate data over time, unlike Graphite where the size of the database is pre-determined.
OpenTSDB can store metrics per ...
24
votes
2
answers
49k
views
Routing selection: specificity vs metric
I understand that Linux chooses the most specific route to the destination when it does routing selection. But what about a route's metric? Does it have a higher priority than a route's specificity?
A ...
13
votes
9
answers
1k
views
When to move a virtualized server to physical?
Virtualization has some great benefits, but there are times when a virtualized server needs more performance and should be moved to physical.
My question is, how do you tell when these times are? I'm ...
11
votes
4
answers
52k
views
How can I monitor VM memory usage on Azure?
I have a few VMs set up as ADO (VSTS) build servers and thought I'd look into CPU and RAM usage over time to optimize the VM size.
However I have had a hard time finding anything in Azure that allows ...
11
votes
7
answers
1k
views
Looking for a recommendation on measuring a high availability app that is using a CDN
I work for a Fortune 500 company that struggles with accurately measuring performance and availability for high availability applications (i.e., apps that are up 99.5% with 5 seconds page to page ...
8
votes
5
answers
14k
views
I need to replace Munin with something more scalable [closed]
I have used Munin on multiple servers for many years with great success. However, with more than 100 munin-nodes and load on the clients, the processing times out.
I have made some ...
8
votes
2
answers
6k
views
Tracking Apache by VirtualHost
I have an Apache web server running many VirtualHosts.
Recently it has been bogging down and becoming unresponsive, and I'm wondering how I can determine which VirtualHosts are causing most of the ...
8
votes
2
answers
3k
views
Graphite stops collecting data randomly
We have a Graphite server to collect data through collectd, statsd, JMXTrans ... For the past few days, we have frequently had holes in our data. Digging through the data we still have, we can see an increase ...
5
votes
3
answers
476
views
How to compare CPUs?
What are the metrics and their priorities to compare task specific performance of various CPUs?
4
votes
4
answers
3k
views
Monitor and collect statistics on TCP connection to src or dst host
I would like to collect stats on TCP/UDP packets sent between two hosts (A and B). I am looking for a tool to install on A and B to get the following information:
kb/seconds
packets/seconds
(...
4
votes
5
answers
6k
views
What metrics should I watch when I monitor a server?
I've seen a lot of monitoring tools, and most of them show the same things. But I'm wondering if it's really necessary to watch all of these things.
I would like to know which metrics really matter, for example ...
4
votes
4
answers
4k
views
How to define appropriate measurements for a Service Level Agreement?
I work for a small development house who are increasingly being asked to put together formal SLAs for our products based on particular configurations.
From a development side of things I'm ...
4
votes
3
answers
25k
views
How to change Interface Metric permanently in CentOS
My Config:
Guest OS - CentOS7
Host OS - Windows 7
Interfaces - 3 (enp0s3, enp0s8, enp0s9)
Installed in VirtualBox
enp0s8 is bridged to wlan0 (for Internet)
enp0s3 and enp0s9 are host-only
Kernel IP ...
4
votes
1
answer
5k
views
Why does my Prometheus dashboard only show 2 weeks of data when I have longer term data available?
I have Prometheus configured with Thanos. I'm using the Node Exporter Full dashboard to view data from the node exporter. Regardless of what time range I select (e.g., now-20d), the dashboard only ...
4
votes
1
answer
2k
views
Do Prometheus metrics have some sort of "freshness" lifetime?
I'm scraping some metrics (openstack cinder volume sizes) every 15 minutes, and the results produce a discontinuous graph, like this:
(That's the result of the simple query cinder_volume_size_gb).
...
4
votes
1
answer
175
views
Reading Docker metrics in machine readable format
I'm running some Docker containers for my users. Users are able to start and stop their containers dynamically, so I would like to watch my Data space available metrics from docker info with some sort ...
4
votes
1
answer
5k
views
How to display a Graphite graph of used memory when you have total memory and free memory
My servers send mem_total and mem_free metrics to Graphite, but I'd like to display mem_used on my dashboard. I'm currently solving the problem by doing:
offset(scale(servers.foo.*.mem_free,-1),...
4
votes
4
answers
2k
views
Monitoring tools that can take high rate and high volume? [closed]
We're using Cacti with RRDTool to monitor and graph about 100,000 counters spread across about 1,000 Linux-based nodes. However, our current setup generally only gives us 5-minute graphs (with some ...
3
votes
2
answers
1k
views
How to compare CDN services?
What are the metrics and their priorities to compare various CDN services?
3
votes
2
answers
8k
views
Disk usage metrics missing from Prometheus node exporter
I used Prometheus and node exporter a while ago and had access to node_filesystem_* metrics to monitor disk usage but I've recently fired it up on some other servers (Ubuntu Linux) and those metrics ...
3
votes
5
answers
5k
views
LAMP monitoring with Zabbix
Does anyone have experience with LAMP monitoring? I mean standard monitoring, slow query monitoring for MySQL, and virtual host load as well, with alerting if something goes wrong.
Cacti is very ...
3
votes
1
answer
2k
views
Kubernetes Nginx Ingress Controller Metrics
I've tried to find documentation about the metrics exposed by the NGINX ingress controller in Kubernetes, but so far I haven't found any reliable source about the metrics and what they mean.
For ...
2
votes
9
answers
4k
views
Measure data transfer throughput?
How can I benchmark the network transfer speed of my servers? We already have an existing file server, but it is fairly old. We recently got a donation of a (relatively) newer P4 box, and I wanted to ...
2
votes
2
answers
4k
views
kubectl top node doesn't work. Looks like an issue with heapster
I have a fresh k8s cluster on gke.
Whenever I run kubectl top node gke-data-custom-vm-6-25-0cbae9b9-hrkc
I get
Error from server (NotFound): the server could not find the requested resource (get ...
2
votes
5
answers
463
views
How can I measure precisely the resources consumed by each Web site on my server?
We are having performance issues on our server which may or may not be the inevitable result of high traffic to our sites. We would like to optimize each site for performance so that we can be sure ...
2
votes
3
answers
470
views
What tools are available to capture and report system (application, database and sys) metrics?
I'm looking at setting up a tool for collecting usage data and KPIs from multiple systems on various platforms. We'd like to regularly report on key indicators on system usage and health. It doesn't ...
2
votes
2
answers
3k
views
GKE metrics agent logging many errors
We have created a GKE cluster and we are getting errors from gke-metrics-agent. The errors show up roughly every 30 minutes. It's always the same 62 errors.
All the errors have label k8s-pod/k8s-app: "...
2
votes
1
answer
64
views
Something for Icinga like sensu-metrics-relay
I used to work with Sensu and I was using a plugin called sensu-metrics-relay so I could easily forward some metrics collected using a Jenkins plugin called "Metrics Plugin" to Graphite.
Now I am ...
2
votes
1
answer
3k
views
What is the counter type for Linux's /sys/block/$DEVICE/stat
What are the counter types for /sys/block/$DEVICE/stat?
$ cat /sys/block/sda/stat
8323741 42381356 464031898 19207478 1940738 49062039 408122282 79935459 0 22315734 99172579
Specifically, do ...
2
votes
1
answer
974
views
How to send an alert on the same incident using logs-based metrics
On GCP we have created various logs-based metrics where we basically look for log entries with keywords like 'error', 'warning', etc.
We also created alerts based on those metrics.
Most of ...
2
votes
1
answer
387
views
How to extract Kubernetes core metrics usage over time - metrics-server
I am using metrics-server in Kubernetes. How do I extract data relating to these core metrics:
Number of pods running for each horizontal pod over a period of time
CPU usage at certain points in time
...
2
votes
1
answer
87
views
The "Use" field in the route command output is always zero and does not count route lookups
The route command output shows me the "Use" field, but it is always zero. Isn't it supposed to count route lookups? Is there something I need to enable so that it starts counting?...
2
votes
2
answers
781
views
Stackdriver agent metric isn't propagated to GCE instance-group-manager autoscaler
I'm trying to set up an autoscaling group of instances on GCE. The autoscaling policy is set up to add instance(s) based on the percentage of used memory.
On every instance I installed Stackdriver Monitoring Agent (...
2
votes
1
answer
335
views
Negative Request Per Second observed?
I'm currently looking at my Nagios metrics, especially for my web servers, and I suddenly noticed that I sometimes have a negative number of requests per second, along with other negative metrics. How is this possible?
...
2
votes
1
answer
891
views
Graphite Clustered Webapp - Not Sharing Data
I have Graphite set up on three instances on EC2:
carbon-relay - relay1.graphite.prod.example.ec2
carbon-cache + webapp - cache3.graphite.prod.example.ec2
carbon-cache + webapp - cache4.graphite.prod....
2
votes
1
answer
1k
views
Alternative Ganglia interfaces?
I'm evaluating various metric collection systems and appreciate just about everything in Ganglia, except for the less than clean interface. Does anyone know of alternative Ganglia interfaces or ...
2
votes
0
answers
363
views
Does Prometheus efficiently compress series of identical samples?
Does Prometheus efficiently store and de-duplicate long repeats of identical sample values for a metric?
I'm working with a system where some of the inputs change at a low frequency - say every 30 ...
2
votes
1
answer
1k
views
How do I configure Prometheus for aggregated metrics?
I have five related questions around the configuration of Prometheus for aggregated metric reception.
There are some invariants in my system:
Python server code that needs to be instrumented
...
1
vote
2
answers
2k
views
Calculating IOPS for a single HDD - what am I doing wrong?
So I know there is no standardized way of calculating IOPS for an HDD, but from everything I have read it appears one of the most accurate formulas is the following:
IOP/ms = + {rotational latency} +...
1
vote
3
answers
1k
views
Monitoring / metric collection for system collectives that change a lot in time (a.k.a. cloud) [closed]
When your server fleet doesn't change a lot in time, like when you're using bare-metal hosting, classic monitoring and metric collection solutions (Nagios, Munin) work well.
But if the number of ...
1
vote
2
answers
1k
views
Dealing with redundant Ethernet and WiFi under Windows
What happens under Windows (XP Pro SP3, specifically) when you have multiple network interfaces configured on the same subnet, like when your ethernet is plugged in and your WiFi is active at the same ...
1
vote
1
answer
118
views
Are Azure VM Metrics Always Accurate?
I have an Azure VM that is supposedly sized at [Standard B8ms (8 vcpus, 32 GiB memory)]. The Azure metric is pegged at 16%. When I log in to the server, the server is maxed out. The services/...
1
vote
1
answer
3k
views
AWS Application Load Balancer - processed bytes metric
I have just finished setting up Application Load Balancer on AWS.
I am trying to figure out why my Processed Bytes metric is giving such a high number (and as a result Consumed Capacity Units go up and ...
1
vote
1
answer
687
views
Prometheus with remote_write and backups
I'm building a Prometheus stack that uses the remote_read/remote_write feature for long term storage, and I have some questions about if/how Prometheus's TSDB needs to be backed up, given that it is ...
1
vote
2
answers
347
views
What is the best way to get an Ubuntu web server CPU and memory load chart? [duplicate]
Possible Duplicate:
What tool do you use to monitor your servers?
I have a simple Ubuntu 12.04 server that runs nginx, gunicorn and some Python WSGI websites. Every morning I want to see the server's ...
1
vote
3
answers
383
views
Simple tools to quickly capture and graph some arbitrary metrics on a server?
I want something that's quick to set up, doesn't have a lot of dependencies, outputs pretty graphs, and is easily customizable (e.g. calling external scripts that simply output a numerical value).
I'...
1
vote
1
answer
106
views
Filtering logs sent by opentelemetry-agent to Coralogix
I'm trying to figure out how to configure this filterprocessor to filter some kind of logs.
I've created a playground for myself in minikube with an app that randomly generates logs, and installed otel ...
1
vote
1
answer
135
views
AWS CloudWatch metric for EC2 security updates
Is it possible to create an AWS CloudWatch metric which keeps track of the pending security updates on an EC2 (Ubuntu) instance? The idea is to have a quicker overview of EC2 instances that are in need ...
1
vote
1
answer
983
views
I'm unable to change network interfaces' metrics in Linux
I am trying to change the priority of network interfaces on my server using ifmetric. When I execute the command it looks like it worked (exit code 0), but it simply won't affect the interfaces - in ...