Questions tagged [metrics]

Metrics are a way to measure things.

29 votes
2 answers
10k views

Is there a Windows equivalent of Unix 'CPU steal time'?

In order to assess performance monitoring accuracy on virtualization platforms, the CPU steal time has become an increasingly relevant metric - see EC2 monitoring: the case of stolen CPU for an ...
Steffen Opel's user avatar
  • 5,668
28 votes
2 answers
21k views

What is the difference between OpenTSDB and Graphite?

As far as I can tell, here are the main differences: OpenTSDB does not deteriorate data over time, unlike Graphite where the size of the database is pre-determined. OpenTSDB can store metrics per ...
vedang's user avatar
  • 382
24 votes
2 answers
49k views

Routing selection: specificity vs metric

I understand that Linux chooses the most specific route to the destination when it does routing selection. But what about a route's metric? Does it have a higher priority than route's specificity? A ...
Eugene Yarmash's user avatar
13 votes
9 answers
1k views

When to move a virtualized server to physical?

Virtualization has some great benefits, but there are times when a virtualized server needs more performance and should be moved to physical. My question is, how do you tell when these times are? I'm ...
Alex Angas's user avatar
  • 2,057
11 votes
4 answers
52k views

How can I monitor VM memory usage on Azure?

I have a few VMs set up as ADO (VSTS) build servers and thought I'd look into CPU and RAM usage over time to optimize the VM size. However I have had a hard time finding anything in Azure that allows ...
Pelle's user avatar
  • 251
11 votes
7 answers
1k views

Looking for a recommendation on measuring a high availability app that is using a CDN

I work for a Fortune 500 company that struggles with accurately measuring performance and availability for high availability applications (i.e., apps that are up 99.5% with 5 seconds page to page ...
Tim Reddy's user avatar
  • 213
8 votes
5 answers
14k views

I need to replace munin with something more scalable [closed]

I have used munin on multiple servers for many years with great success. However, with more than 100 munin-nodes and load on the clients, the processing is timing out. I have made some ...
Tom's user avatar
  • 11.4k
8 votes
2 answers
6k views

Tracking Apache by VirtualHost

I have an Apache web server running many VirtualHosts. Recently it has been bogging down and becoming unresponsive, and I'm wondering how I can determine which VirtualHosts are causing most of the ...
Brent's user avatar
  • 23.5k
8 votes
2 answers
3k views

Graphite stops collecting data randomly

We have a Graphite server to collect data through collectd, statsd, JMXTrans ... For the past few days, we have frequently had holes in our data. Digging through the data we still have, we can see an increase ...
Guillaume's user avatar
  • 1,063
5 votes
3 answers
476 views

How to compare CPUs?

What are the metrics, and their priorities, for comparing the task-specific performance of various CPUs?
Jeff's user avatar
  • 461
4 votes
4 answers
3k views

Monitor and collect statistics on TCP connection to src or dst host

I would like to collect stats on TCP/UDP packets sent between two hosts (A and B). I am looking for a tool to install on A and B to get the following information: kb/seconds packets/seconds (...
Skarab's user avatar
  • 327
4 votes
5 answers
6k views

What metrics should I watch when I monitor a server?

I have seen a lot of monitoring tools, and they mostly show the same things. But I'm wondering if it's really necessary to watch all of these things. I would like to know which metrics really matter for example ...
yvan's user avatar
  • 141
4 votes
4 answers
4k views

How to define appropriate measurements for a Service Level Agreement?

I work for a small development house who are increasingly being asked to put together formal SLAs for our products based on particular configurations. From a development side of things I'm ...
Jon Hopkins's user avatar
4 votes
3 answers
25k views

How to change Interface Metric permanently in CentOS

My config: Guest OS: CentOS 7; Host OS: Windows 7; Interfaces: 3 (enp0s3, enp0s8, enp0s9); installed in VirtualBox. enp0s8 is bridged to wlan0 (for Internet); enp0s3 and enp0s9 are host-only. Kernel IP ...
Veerendra K's user avatar
4 votes
1 answer
5k views

Why does my Prometheus dashboard only show 2 weeks of data when I have longer term data available?

I have Prometheus configured with Thanos. I'm using the Node Exporter Full dashboard to view data from the node exporter. Regardless of what time range I select (e.g., now-20d), the dashboard only ...
larsks's user avatar
  • 45.6k
4 votes
1 answer
2k views

Do Prometheus metrics have some sort of "freshness" lifetime?

I'm scraping some metrics (OpenStack Cinder volume sizes) every 15 minutes, and the results produce a discontinuous graph, like this: (That's the result of the simple query cinder_volume_size_gb). ...
larsks's user avatar
  • 45.6k
4 votes
1 answer
175 views

Reading Docker metrics in machine readable format

I'm running some Docker containers for my users. Users are able to start and stop their containers dynamically, so I would like to watch my Data space available metrics from docker info with some sort ...
Věroš K.'s user avatar
4 votes
1 answer
5k views

How to display a graphite graph of used memory when you have total memory and free memory

My servers send mem_total and mem_free metrics to Graphite, but I'd like to display mem_used on my dashboard. I'm currently solving the problem by doing: offset(scale(servers.foo.*.mem_free,-1),...
phrend's user avatar
  • 41
4 votes
4 answers
2k views

Monitoring tools that can take high rate and high volume? [closed]

We're using Cacti with RRDTool to monitor and graph about 100,000 counters spread across about 1,000 Linux-based nodes. However, our current setup generally only gives us 5-minute graphs (with some ...
Jon Watte's user avatar
  • 270
3 votes
2 answers
1k views

How to compare CDN services?

What are the metrics, and their priorities, for comparing various CDN services?
Jeff's user avatar
  • 461
3 votes
2 answers
8k views

Disk usage metrics missing from Prometheus node exporter

I used Prometheus and node exporter a while ago and had access to node_filesystem_* metrics to monitor disk usage but I've recently fired it up on some other servers (Ubuntu Linux) and those metrics ...
Sam's user avatar
  • 183
3 votes
5 answers
5k views

LAMP monitoring with Zabbix

Does anyone have experience with LAMP monitoring? I mean standard monitoring plus slow-query monitoring for MySQL and virtual host load, as well as alerting if something goes wrong. Cacti is very ...
3 votes
1 answer
2k views

Kubernetes Nginx Ingress Controller Metrics

I've tried to find documentation about the metrics exposed by the NGINX ingress controller in Kubernetes, but so far I haven't found any reliable source about the metrics and what they mean. For ...
MysteriousPerson's user avatar
2 votes
9 answers
4k views

Measure data transfer throughput?

How can I benchmark the network transfer speed of my servers? We already have an existing file server, but it is fairly old. We recently got a donation of a (relatively) newer P4 box, and I wanted to ...
lfaraone's user avatar
  • 1,621
2 votes
2 answers
4k views

kubectl top node doesn't work. Looks like an issue with heapster

I have a fresh k8s cluster on GKE. Whenever I run kubectl top node gke-data-custom-vm-6-25-0cbae9b9-hrkc I get Error from server (NotFound): the server could not find the requested resource (get ...
Korjavin Ivan's user avatar
2 votes
5 answers
463 views

How can I measure precisely the resources consumed by each Web site on my server?

We are having performance issues on our server which may or may not be the inevitable result of high traffic to our sites. We would like to optimize each site for performance so that we can be sure ...
Aaron Lozier's user avatar
2 votes
3 answers
470 views

What tools are available to capture and report system (application, database and sys) metrics?

I'm looking at setting up a tool for collecting usage data and KPIs from multiple systems on various platforms. We'd like to regularly report on key indicators on system usage and health. It doesn't ...
Mark Nold's user avatar
  • 285
2 votes
2 answers
3k views

GKE metrics agent logging many errors

We have created a GKE cluster and are getting errors from gke-metrics-agent. The errors show up about every 30 minutes. It's always the same 62 errors. All the errors have the label k8s-pod/k8s-app: "...
Melchy's user avatar
  • 83
2 votes
1 answer
64 views

Something for Icinga like sensu-metrics-relay

I used to work with Sensu and I was using a plugin called sensu-metrics-relay so I could easily forward some metrics collected using a Jenkins plugin called "Metrics Plugin" to Graphite. Now I am ...
Mallouli Boulbeba's user avatar
2 votes
1 answer
3k views

What is the counter type for Linux's /sys/block/$DEVICE/stat

What are the counter types for /sys/block/$DEVICE/stat? $ cat /sys/block/sda/stat 8323741 42381356 464031898 19207478 1940738 49062039 408122282 79935459 0 22315734 99172579 Specifically, do ...
dialt0ne's user avatar
  • 3,075
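
The sample output quoted in the question above can be split into named fields. The following is a minimal sketch, assuming the 11-field layout described in the Linux kernel's block stat documentation; the field labels and the `parse_block_stat` helper are hypothetical names chosen for illustration, and newer kernels append further fields that this sketch ignores.

```python
# Hypothetical labels for the first 11 fields of /sys/block/<dev>/stat,
# following the order given in the kernel's block stat documentation.
# All are cumulative counters except in_flight, which is an instantaneous value.
STAT_FIELDS = (
    "read_ios", "read_merges", "read_sectors", "read_ticks_ms",
    "write_ios", "write_merges", "write_sectors", "write_ticks_ms",
    "in_flight", "io_ticks_ms", "time_in_queue_ms",
)


def parse_block_stat(line):
    """Map the whitespace-separated values of a stat line to field labels."""
    values = [int(v) for v in line.split()]
    if len(values) < len(STAT_FIELDS):
        raise ValueError("expected at least %d fields, got %d"
                         % (len(STAT_FIELDS), len(values)))
    # zip() stops at the shorter sequence, so extra trailing fields are dropped.
    return dict(zip(STAT_FIELDS, values))


# The sample line quoted in the question excerpt.
sample = ("8323741 42381356 464031898 19207478 1940738 49062039 "
          "408122282 79935459 0 22315734 99172579")
stats = parse_block_stat(sample)
```

Because most of these values only ever increase, a monitoring tool would normally graph the difference between two successive readings rather than the raw numbers.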
2 votes
1 answer
974 views

How to send an alert on the same incident using logs-based metrics

On GCP we have created various logs-based metrics where we look for log entries with keywords like 'error', 'warning' etc... We also created alerts based on those metrics. Most of ...
bogumbiker's user avatar
2 votes
1 answer
387 views

How to extract Kubernetes core metrics usage over time - metrics-server

I am using metrics-server in Kubernetes. How do I extract data relating to these core metrics: number of pods running for each horizontal pod over a period of time; CPU usage at certain points in time ...
cfbd's user avatar
  • 137
2 votes
1 answer
87 views

The "Use" field of the route command is always zero; it does not count route lookups

The route command output shows the "Use" field, but it is always zero. Isn't it supposed to count route lookups? Is there something I need to enable so that it starts counting?...
Ricardo Barbosa's user avatar
2 votes
2 answers
781 views

Stackdriver agent metric isn't propagated to GCE instance-group-manager autoscaler

I'm trying to set up an autoscaling group of instances on GCE. The autoscaling policy is set up to add instance(s) based on the percentage of used memory. On every instance I installed Stackdriver Monitoring Agent (...
vtrv101's user avatar
  • 21
2 votes
1 answer
335 views

Negative requests per second observed?

I'm currently looking at my Nagios metrics, especially for my web servers, and I suddenly noticed that I sometimes have a negative number of requests per second and other metrics. How is this possible? ...
Dr I's user avatar
  • 975
2 votes
1 answer
891 views

Graphite Clustered Webapp - Not Sharing Data

I have Graphite set up on three instances on EC2: carbon-relay - relay1.graphite.prod.example.ec2 carbon-cache + webapp - cache3.graphite.prod.example.ec2 carbon-cache + webapp - cache4.graphite.prod....
organicveggie's user avatar
2 votes
1 answer
1k views

Alternative Ganglia interfaces?

I'm evaluating various metric collection systems and appreciate just about everything in Ganglia, except for the less than clean interface. Does anyone know of alternative Ganglia interfaces or ...
CarpeNoctem's user avatar
  • 2,457
2 votes
0 answers
363 views

Does Prometheus efficiently compress series of identical samples?

Does Prometheus efficiently store and de-duplicate long repeats of identical sample values for a metric? I'm working with a system where some of the inputs change at a low frequency - say every 30 ...
Craig Ringer's user avatar
  • 11.3k
2 votes
1 answer
1k views

How do I configure Prometheus for aggregated metrics?

I have five related questions around the configuration of Prometheus for aggregated metric reception. There are some invariants in my system: Python server code that needs to be instrumented ...
Reinderien's user avatar
1 vote
2 answers
2k views

Calculating IOPS for a single HDD - what am I doing wrong?

So I know there is no standardized way of calculating IOPS for a HDD, but from everything I have read it appears one of the most accurate formulas is the following: IOP/ms = + {rotational latency} +...
red888's user avatar
  • 4,273
1 vote
3 answers
1k views

Monitoring / metric collection for system collectives that change a lot in time (a.k.a. cloud) [closed]

When your server fleet doesn't change a lot in time, like when you're using bare-metal hosting, classic monitoring and metric collection solutions (Nagios, Munin) work well. But if the number of ...
Florin Andrei's user avatar
1 vote
2 answers
1k views

Dealing with redundant Ethernet and WiFi under Windows

What happens under Windows (XP Pro SP3, specifically) when you have multiple network interfaces configured on the same subnet, like when your Ethernet is plugged in and your WiFi is active at the same ...
wfaulk's user avatar
  • 6,968
1 vote
1 answer
118 views

Are Azure VM Metrics Always Accurate?

I have an Azure VM that is supposedly sized at [Standard B8ms (8 vcpus, 32 GiB memory)]. The Azure metric is pegged at 16%. When I log in to the server, the server is maxed out. The services/...
user2634253's user avatar
1 vote
1 answer
3k views

AWS Application Load Balancer - processed bytes metric

I have just finished setting up an Application Load Balancer on AWS. I am trying to figure out why my Processed Bytes metric is giving such a high number (and as a result Consumed Capacity Units go up and ...
Mateusz Drankowski's user avatar
1 vote
1 answer
687 views

Prometheus with remote_write and backups

I'm building a Prometheus stack that uses the remote_read/remote_write feature for long term storage, and I have some questions about if/how Prometheus' TSDB needs to be backed up, given that it is ...
user307927's user avatar
1 vote
2 answers
347 views

What is the best way to get an Ubuntu web server CPU and memory load chart? [duplicate]

Possible Duplicate: What tool do you use to monitor your servers? I have a simple Ubuntu 12.04 server that runs nginx, gunicorn and some Python WSGI websites. Every morning I want to see server's ...
grigoryvp's user avatar
  • 3,855
1 vote
3 answers
383 views

Simple tools to quickly capture and graph some arbitrary metrics on a server?

I want something that's quick to set up, doesn't have a lot of dependencies, outputs pretty graphs, and is easily customizable (e.g. calling external scripts that simply output a numerical value). I'...
toupeira's user avatar
  • 121
1 vote
1 answer
106 views

Filtering logs sent by opentelemetry-agent to Coralogix

I'm trying to figure out how to configure this filterprocessor to filter certain kinds of logs. I've created a playground for myself in minikube with an app that randomly generates logs, and installed otel ...
Anatoly's user avatar
  • 152
1 vote
1 answer
135 views

AWS CloudWatch metric for EC2 security updates

Is it possible to create an AWS CloudWatch metric which keeps track of the pending security updates on an EC2 (Ubuntu) instance? The idea is to have a quicker overview of EC2 instances that are in need ...
SolveSoul's user avatar
  • 113
1 vote
1 answer
983 views

I'm unable to change network interfaces' metrics in Linux

I am trying to change the priority of network interfaces on my server using ifmetric. When I execute the command it looks like it worked (exit code 0), but it simply won't affect the interfaces - in ...
eden881's user avatar
  • 215