Commit Graph

28 Commits

Author SHA1 Message Date
Clark Boylan 4ef6ea4d10 Fix nodepool API ops Grafana graphs
openstacksdk statsd records for API operations are not response code
specific. This change adds a glob to the statsd record path to include
all return code response metrics in our API graphs. I believe this
change may have come with the openstacksdk 0.103.0 update. We also
update paths for servers POST, server details, and flavor details as
they have changed.

Note the network info is empty which is why we don't get graphs for
them, but the paths appear correct. I think this may be beacuse we don't
need to query network info in any of our clouds currently.

While we are at it we stop updating the airship and inap cloud graphs
since those should be cleaned up and this keeps the review overhead
smaller.

Change-Id: I5a6b80118afaf3b7782a1d1c131787f208583799
2022-12-08 15:42:45 -08:00
Ian Wienand d4f91931e3 grafana: further path fixes
I mistyped these in I6bd82b38d80db2b56a399f80132a723564f9bc40

Change-Id: I9eb083ce14e4814decf639fee18da9e43ed3a421
2021-09-22 14:55:10 +10:00
Ian Wienand b803305b41 grafana: fix openstack API stats for providers
I think I generally messed these up on the original import as every
stat seems to refer to the same thing.  Over time, the layout of
openstacksdk stats has changed meaning this doesn't work at all now.

Use stats that are actually in graphite which should show the overall
health of API requests.

Change-Id: I6bd82b38d80db2b56a399f80132a723564f9bc40
2021-09-22 08:23:24 +10:00
Ian Wienand ed86eb100b Add OSUOSL resources to nodepool
These resouces kindly donated by OSU OSL (https://osuosl.org/)

We have about 15 ARM64 nodes, we're sticking with just the regular
"os.large" instances.

(re)generate the nodepool graphs to account for this

The mirror site is active

Depends-On: https://review.opendev.org/c/opendev/system-config/+/786155
Change-Id: I8bc34beabd130d4a8bb004b0e029ec96945a95df
2021-04-14 12:26:22 +10:00
Zuul 9045766252 Merge "grafana: don't use bool for refresh" 2020-06-24 01:23:30 +00:00
Ian Wienand 99c8bd953b grafana: don't use bool for refresh
grafyaml actually knows this is deprecated, but it's actually more
than that and doesn't work to refresh the variable at all now.  "1"
means "on load" which is what we want.

Change-Id: I34ecdd30c2188cb7e6ec32e33c6a6e99b6240934
2020-06-24 10:54:04 +10:00
Clark Boylan f00445b977 Try set explicit datasource on the OVH dashboard
The templating we end up with in the running grafana for the OVH regions
on the OVH dashboard is null. We set our OpenStack datasource to be our
default datasource but maybe we need to set it explicitly. Do this to
see if it changes the behavior.

Change-Id: Ie95dd980a5c117e1849b08a3611330ff06987c34
2020-06-22 12:18:33 -07:00
Andreas Jaeger a494d238b0 Update infra-manual links
The infra-manual publishes now to docs.opendev.org, change links to it.

New location is: https://docs.opendev.org/opendev/infra-manual/latest/

Link specs.o.o contributors link to OpenStack Contributor Guide.

Change http to https for docs.o.o and specs.o.o everywhere.

Depends-On: https://review.opendev.org/713929
Needed-By: https://review.opendev.org/713930
Change-Id: I46bf9e9527941f2120de59fdbc355337f4c019c1
2020-03-21 09:29:14 +01:00
James E. Blair d624c23335 grafana: add FortNebula dashboard
The minor updates are apparently due to us not having run the script
the last time it was updated with new urls.

Change-Id: I255d1e47b5cff29a3ed377b65ceab677ab1c272e
2019-07-08 09:29:13 -07:00
Andreas Jaeger c1c135087d Update grafana for OpenDev
Use opendev.org everywhere.

Change-Id: I688fe1d949eaee39e20bc74f7aea39583ca14f6f
2019-04-20 19:39:11 +02:00
Ian Wienand d34d7ad763 Consolidate nodepool graphs
All of these dashboards are the same, and have mostly copied all the
same issues with them.  This makes updating anything a massive pain.

This implements a single dashboard template with a small script to
create individual dashboards for each provider and its regions.

I have included a range of fixes.  The y-axis format has changed in
later versions of grafana.  The API time tracking is no longer scaled,
but we just tell grafana it is in ms and it displays it correctly.
The test nodes history graph is moved to the top, as it is probably
the most interesting graph (note this splits itself out per region, if
mulitple regions are selected).  Values for "null as zero" are
consistently set.  Various formatting fixes for the labels are
included.

Change-Id: I5fbffaec3c82aa1fce0947f771de67edd15f7dfc
2018-06-22 12:21:10 +10:00
Ian Wienand 8e8c4b6c62 nodepool grafana : remove job runtimes
These stats aren't updating any more.  Unfortunately, I don't think
there's any current replacement as nodepool doesn't have any insight
into the job it is satisfying a request for.

Change-Id: Ib69fbda5ee019180cd8761d0ead474b426bce379
2018-06-19 14:07:26 +10:00
Paul Belanger 371cd4f944
Add ComputeGetLimits API to nodepool provider dashboards
Since we now query a cloud for its quota information, lets track the
response rate in grafana.

Change-Id: Ie9e2727b5dc3d18f5e5fc37be89a9a5f9492eb47
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2018-01-16 16:57:33 -05:00
David Moreau-Simard a281b06c27
Update Nodepool graphite metric names
Following the update to Zuul v3 some things changed:
- nodes.delete became nodes.deleting
- nodes.used became nodes.in-use but nodes.used is still relevant
  as it's the status between 'in-use' and 'deleting'
- Add a panel for displaying failed nodes

Change-Id: I240d082115bd9078e45984d8fcff212a4e40e842
Depends-On: I6a89752d74ed7424267c3af3937ad01fb4bb8f86
2017-10-02 10:31:52 -04:00
Mathieu Gagné 1188d6866e grafana: Update metric names for API operations timers
Change-Id: I9e713646b40f0be37b101f661561c04bb4f31d7f
2017-09-25 13:12:04 -04:00
Andreas Jaeger e40acc4543 Remove gate-tempest-dsvm-full from grafana
This job does not exist anymore, remove it from grafana.

Change-Id: Ia9df559aa4f9da3aab9624557c44e51c1cec3f23
2017-08-02 21:42:07 +02:00
Andreas Jaeger e698d683c7 grafana: Follow xenial renames
Rename the jobs for xenial changes.

Change-Id: Ie69c371d6bf92582236eda92d5c99f94a9bbc123
2016-07-29 19:28:54 +02:00
Monty Taylor 27a946b77f
Add ServerGet task to nodepool graphs
We also perform a direct ServerGet in some cases - specifically once for
each server create.

Change-Id: I58fd87a8efbf66be785ecec9d3db1abaec5a4816
2016-04-20 17:23:25 -05:00
Paul Belanger 393f6d9201
Switch to shade specific statsd for ServerTasks
Now that nodepool has been switch to use shade, we need to update
grafana to use the new shade syntax for Server related tasks.

Change-Id: I7698d54d89bda5327ac434fd8e662f0fe58d7f5e
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2016-04-17 09:31:04 -04:00
Clark Boylan ceda9719f4 Switch rest of devstack-trusty to ubuntu-trusty
Switch all remaining devstack-trusty jobs to ubuntu-trusty to complete
the migration.

Change-Id: I7f94b59f318cab581aa4f194f33acf8ab597fc36
2016-03-14 12:58:43 -07:00
James E. Blair dbb3544761 Grafana: add image upload times for OVH
This is a new graph; adding it to one provider first for testing.

Change-Id: Id74b33606d294c1de7ff76d39e0868d822c9b3b3
2016-01-28 12:30:15 -08:00
James E. Blair f371d051be Grafana: Add max server line to node graph
OVH for now, the rest if this checks out.

Change-Id: I45635f9d414357533ba0063b3067bb781f7d7985
Depends-On: I22743f566b32aaa5527237dd023551a8f3a3dbc4
2016-01-08 14:36:26 -08:00
James E. Blair 5a8db2f7e8 Grafana: clean up OVH graphs
I missed some tweaks on the previous Test Nodes graph change.

Also make the job runtimes wider like Paul suggested.

Change-Id: I5ac43909a679d273a557112ad8526a68de15f4f1
2016-01-08 12:24:14 -08:00
James E. Blair 6d951009f3 Grafana: Update OVH graph with axis labels
Add axis labels and units where appropriate.

Change the launch attempts graphs to summarize to 1m rather than
1h since grafana lets us zoom in.  1m is the lowest native unit
of time that will always show whole numbers for this metric (whose
lowest non-zero value is 1 event / 10 seconds).

Change the test nodes graph to stacked to match the way we normally
draw this graph, but change the tooltip to 'individual' so that
when hovering, individual values for the different states are
displayed, rather than cumulative (which does not make sense for
this application).

Also change the tooltip for the node graphs on the zuul dashboard
in the same manner.

Change-Id: I500aa486362476cff76a3d254093723f27021bed
Depends-On: Ie542dc4d0e151a00e84cc970c2cfa8c02377d7bf
2016-01-07 17:23:27 -08:00
James E. Blair 2339af1fe4 Grafana: add sparklines to remaining providers
To match those added to rackspace.

Change-Id: I6e7f412ea0ef7cd6b3088622782f2f6f6d50c561
2016-01-07 11:43:18 -08:00
James E. Blair 0c5f11e855 Grafana: Add per-region node graphs
These are per-region versions of the nodepool node state graph,
except that the values are not stacked in order to make the
individual values more accessible.

Change-Id: I8ec90758828484a9ffb7a90d2eacbcccc8b78bb4
2016-01-06 16:58:00 -08:00
James E. Blair c5dc3cacad Grafana: Fix node launch error graphs
There is no .error metric, but rather, errors are broken out by
cause.  For this graph, simply display their sum.

Change-Id: Iae19e4e78098f3373c3195ff3ec52a11c5e92a3b
2016-01-06 16:24:30 -08:00
James E. Blair 2207224179 Grafana: Add OVH dashboard
Change-Id: I8a814cd3dfae0cdcc04a0e2230cd35866a033863
2016-01-06 16:18:31 -08:00