Commit Graph

125 Commits

Author SHA1 Message Date
Andreas Jaeger c929899400 Retire repository
Fuel repositories are all retired in openstack namespace, retire
remaining fuel repos in x namespace since they are unused now.

This change removes all content from the repository and adds the usual
README file to point out that the repository is retired following the
process from
https://docs.openstack.org/infra/manual/drivers.html#retiring-a-project

See also
http://lists.openstack.org/pipermail/openstack-discuss/2019-December/011675.html

A related change is: https://review.opendev.org/699752 .

Change-Id: I8aded54f1b9f3b79f3a4bf8f607d3695b92f528b
2019-12-18 19:39:39 +01:00
Swann Croiset d2fc3a9fd8 Enable pagination for Neutron
Change-Id: Ib1be3884a26d7bec9d0cf9dbc4ade9dfd6fab31d
2017-02-09 12:55:08 +00:00
Swann Croiset 88323c91d3 Purge metric_collector toml files from previous version
Change-Id: I17ee4dfa70c242daf16da91637f54ef1edbc4801
2017-01-25 14:49:07 +01:00
Swann Croiset b7c7e7bdc2 Remove the SMTP standalone alerting_mode
This feature was broken and not stable enough for production deployment.

Related-bug: #1606831
Related-bug: #1643542

Change-Id: I0ce52ec01838d891c43d6e797617d3044a02d10f
2017-01-09 13:23:14 +01:00
Guillaume Thouvenin 9dbf48dbfe Replace the workers AFD filter
This patch uses the generic AFD filter with new alarms to replace
the custom AFD for workers.

Blueprint: allow-all-alarms-to-be-specified-in-alarming-file
Change-Id: I6c432e60a16da5bb3c8d0ecd0bd22a1246fe6f82
2016-10-06 09:05:30 +02:00
Guillaume Thouvenin 215f693307 Replace the API backends AFD filter
This patch uses the generic AFD filter with new alarms to replace the
custom AFD for API backends.

Blueprint: allow-all-alarms-to-be-specified-in-alarming-file
Change-Id: Id139e45a9942a9c86a2d35d1966b083d9c75af89
2016-10-05 15:41:55 +00:00
Swann Croiset 553d2040cc Send GSE service clusters status to alerting
Change-Id: Iad33e1f4bffd81066a82a0d73a46e7b489eb23d7
blueprint: alarming-refactoring
2016-09-20 09:41:54 +02:00
Simon Pasquier 38ec02fe46 Add a dedicated manifest to configure collectd
This removes duplication of code and limitations we had to deal with
because the collectd Puppet resources don't play well when they are
created at different times from several manifests.

Change-Id: I52fabb1fb5795a33f552168553a148b1520fc496
2016-08-26 15:59:04 +02:00
Jenkins d46ed8070c Merge "Check memcached service on controller nodes" 2016-08-26 13:21:44 +00:00
Swann Croiset 26c5788684 Check memcached service on controller nodes
The patch replaces the service_heartbeat mechanism.

Change-Id: I060e10320cf6f8b874a39037b1f9257ed1996342
2016-08-26 10:56:06 +02:00
Simon Pasquier ea6c8d3ae5 Fix notification_driver for Cinder and Heat
Change-Id: Ic9fd9f7d71ba9dbd9f4979612aefb114176a96ad
Closes-Bug: #1616456
2016-08-24 15:50:24 +02:00
Simon Pasquier 3a3ef6f2e3 Add Pacemaker collectd plugin
This change adds a collectd plugin that gets metrics from the Pacemaker
cluster:

  - cluster's metrics
  - node's metrics
  - resource's metrics

Most of the metrics are only collected from the node that is the
designated controller except pacemaker_resource_local_active and
pacemaker_dc_local_active.

The plugin also removes the 'pacemaker_resource' plugin by providing the
exact same metrics and notifications for the other collectd plugins.

Finally the plugin is also installed on the standalone-rabbitmq and
standalone-database nodes if they are present.

Change-Id: I8b5b987704f69c6a60b13e8ea982f27924f488d1
2016-08-11 14:53:43 +02:00
Simon Pasquier 79a906d619 Use Hiera data to configure the RabbitMQ plugin
This change uses the information that is already available in the
collector's Hiera data to decide whether the RabbitMQ collectd
plugin should be deployed or not.

Change-Id: Ib1df231d6bf99ee6f34ee199fd5241d6b264fc00
2016-08-09 16:17:17 +02:00
Swann Croiset 687ddce421 Configure AFD alarms against 'check' metrics
Change-Id: I849f5ce2ea05457bb5cb0706b6a0c671e93af0e6
2016-08-04 15:16:20 +00:00
Swann Croiset 1ae8829823 Use RabbitMQ management API
The patch uses the management API to retrieve metrics instead of
executing rabbitmqctl command.

A side effect is that per-queue metrics are no longer collected.

Change-Id: I5dab785321e369ec0e1a69a79e0700b276810925
Closes-bug: #1594337
2016-07-13 09:19:06 +02:00
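
The switch from rabbitmqctl to the management API described in the commit above can be illustrated with a minimal sketch. It is not the plugin's code: the helper names, the host, the credentials and the default port 15672 are assumptions for illustration, and only the `/api/overview` endpoint with its `queue_totals` section is taken from the RabbitMQ management plugin itself.

```python
import base64
import json
import urllib.request


def build_overview_request(host, user, password, port=15672):
    """Build an authenticated GET request for the management overview.

    host, user, password and port are placeholders (assumptions).
    """
    url = f"http://{host}:{port}/api/overview"
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})


def extract_queue_totals(overview):
    """Pull the cluster-wide message counters out of an overview document."""
    totals = overview.get("queue_totals", {})
    return {
        "messages": totals.get("messages", 0),
        "messages_ready": totals.get("messages_ready", 0),
        "messages_unacknowledged": totals.get("messages_unacknowledged", 0),
    }


def fetch_queue_totals(host, user, password):
    """Query a live broker (needs network access, not run at import time)."""
    request = build_overview_request(host, user, password)
    with urllib.request.urlopen(request, timeout=5) as response:
        return extract_queue_totals(json.load(response))
```

The overview endpoint only exposes cluster-wide totals, which fits the side effect noted in the commit message: figures for individual queues would need additional calls.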
Simon Pasquier 6c2573d09a Remove support for Nova EC2 API
Change-Id: I6180a5396c4fb704832bcd44c073868ae6a90a81
Closes-Bug: #1556803
Depends-On: I5699c8b1e695670678250c3c5483d627ef9e6244
2016-06-01 14:55:25 +02:00
Swann Croiset a9a802c56d Avoid to collect notifications from 2 distinct locations
With the Rabbitmq cluster hosted on dedicated nodes, the notifications
must not be collected from controller nodes.

Change-Id: I28b2d3d0c35d16815812af447b2ab8a716276645
2016-05-26 09:28:02 +02:00
Swann Croiset 9d7efe4161 Avoid collecting logs and notifications uselessly
This patch avoids collecting logs and notifications when
both Elasticsearch and InfluxDB are not (yet) deployed.
Collecting them in that situation is useless and leads to losing all logs
and notifications produced before the backends are deployed.

Change-Id: I30a39d65f7a732251def32ccfb8202c34d6408c5
2016-05-26 09:27:16 +02:00
Simon Pasquier c579286779 Move mod_status class to fuel_lma_collector
The mod_status class shouldn't live in the lma_collector class because
the (re)configuration of Apache isn't the responsibility of the LMA
collector module.

Change-Id: If80c9d100263436922e06aea02d2050236ff05cf
Closes-Bug: #1547424
2016-05-25 15:51:28 +00:00
Simon Pasquier 1f759e7f3d Use hiera_hash for network data
Change-Id: I171d08e974d635d85b391c3bc29366f0f4dd7b59
Closes-Bug: #1585350
2016-05-25 10:44:31 +02:00
Swann Croiset 123a86590f Remove the remote Nagios alerting mode
This use case is not really supported because Nagios configuration is
too dependent on the LMA Collector plugin and in practice, no one
chooses this option.

DocImpact

Change-Id: Ia09efb40f476c1daec51530e2c0fb16bc6f99393
2016-05-24 17:05:49 +02:00
Swann Croiset a513e7aaea Factorize infra_alerting configuration
All logic is done once and persisted into hiera.

Change-Id: I352c17393eeba57b5bde460591c2db567c452c45
2016-05-24 16:59:01 +02:00
Swann Croiset debe1883d7 Allow deployment without InfluxDB and Elasticsearch
This supports several deployment scenarios where backends are not
deployed initially, for instance when using the 'virt' nodes to deploy
LMA backends.

The patch factorizes manifests by moving all the configuration data of
InfluxDB and Elasticsearch into hiera.

DocImpact

Fixes-bug: #1570386
Change-Id: I8688bbd10d88bc8ef68b5d31e9edd62a764dc23d
2016-05-23 13:29:50 +02:00
Swann Croiset 306448e8fa Update the Nagios URL used to sent GSE and AFD
Implements: blueprint scalable-nagios-api
Depends-On: I55613dd650b039142767174d3f19fa9262a2a7bc

Change-Id: I152dd8fcb0ec7dfddc99c8a52fa2098a414811c0
2016-05-12 11:46:27 +00:00
Swann Croiset d2eafcb750 Remove all lma_collector:params references in manifests
Change-Id: I3af0eb48aca1aeb0f4d3bb6b0798a7a343bd072e
2016-05-12 13:43:49 +02:00
Guillaume Thouvenin 2d72b53784 Support for the detach-database plugin
This patch adds support for the case where the database is deployed on a
dedicated node [1].

[1] https://github.com/openstack/fuel-plugin-detach-database

Change-Id: If800d9d09204a1456640863a3ed3c5dc66d29017
Closes-Bug: #1547089
2016-05-10 09:00:44 +02:00
Guillaume Thouvenin 6dc87065fb Support for the detach-rabbitmq plugin
This patch adds support for the case where the RabbitMQ cluster is deployed
on dedicated nodes [1].

[1] https://github.com/openstack/fuel-plugin-detach-rabbitmq

Change-Id: Icc337e48d9a836ccab85dfc0b8ca86ff58c5cd4d
Closes-Bug: #1547086
Closes-Bug: #1575046
2016-05-10 09:00:44 +02:00
Swann Croiset 391ca132b3 Emit aggregated HTTP metrics
HTTP metrics are now statistics aggregated every 10 seconds.
A new metric, openstack_<service>_response_times, is emitted with these
values:
- min
- max
- sum
- count
- percentile

Hence, the previous metric disappears (openstack_<service>_responses).

Implements-blueprint: aggregated-http-metrics

Change-Id: I48e92df6f4baa7be942ad138b7f23c3d15f5a24e
2016-05-04 14:34:39 +02:00
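
As a rough illustration of the aggregation described in the commit above, the sketch below groups response-time samples into 10-second windows and computes min, max, sum, count and a percentile per service. The real implementation runs inside Heka and is not shown in this log; the function names, the 90th-percentile default and the nearest-rank method are assumptions.

```python
import math
from collections import defaultdict

WINDOW_SECONDS = 10  # aggregation interval stated in the commit message


def percentile(sorted_values, pct):
    """Nearest-rank percentile of an already sorted, non-empty list."""
    rank = max(1, math.ceil(pct / 100.0 * len(sorted_values)))
    return sorted_values[rank - 1]


def aggregate(samples, pct=90):
    """Aggregate (timestamp, service, response_time) samples per window.

    Returns {(window_start, metric_name): statistics}. The percentile rank
    (pct=90) is an assumption; the commit does not say which one is used.
    """
    buckets = defaultdict(list)
    for timestamp, service, response_time in samples:
        window_start = int(timestamp // WINDOW_SECONDS) * WINDOW_SECONDS
        name = f"openstack_{service}_response_times"
        buckets[(window_start, name)].append(response_time)

    result = {}
    for key, values in buckets.items():
        values.sort()
        result[key] = {
            "min": values[0],
            "max": values[-1],
            "sum": sum(values),
            "count": len(values),
            "percentile": percentile(values, pct),
        }
    return result
```

Emitting one statistics record per window, rather than one data point per request, keeps the number of points written to the time-series backend independent of the request rate.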
Swann Croiset ebac150f8a Separate the (L)og of the LMA collector
This change separates the processing of the logs/notifications and
metric/alerting into 2 dedicated hekad processes, these services are
named 'log_collector' and 'metric_collector'.

Both services are managed by Pacemaker on controller nodes and by Upstart on
other nodes.

All metrics computed by log_collector (HTTP response times and creation time
for instances and volumes) are sent directly to the metric_collector via TCP.
Elasticsearch output (log_collector) uses full_action='block' and the
TCP output uses full_action='drop'.

All outputs of metric_collector (InfluxDB, HTTP and TCP) use
full_action='drop'.

The buffer size configurations are:
* metric_collector:
  - influxdb-output buffer size is increased to 1Gb.
  - aggregator-output (tcp) buffer size is decreased to 256Mb (vs 1Gb).
  - nagios outputs (x3) buffer sizes are decreased to 1Mb.
* log_collector:
  - elasticsearch-output buffer size is decreased to 256Mb (vs 1Gb).
  - tcp-output buffer size is set to 256Mb.

Implements: blueprint separate-lma-collector-pipelines
Fixes-bug: #1566748

Change-Id: Ieadb93b89f81e944e21cf8e5a65f4d683fd0ffb8
2016-05-04 14:34:14 +02:00
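
The difference between full_action='drop' and full_action='block' in the commit above can be pictured with a toy bounded buffer. This only models the policy and is not Heka's disk-backed buffering; the class and method names are hypothetical.

```python
from collections import deque


class OutputBuffer:
    """Toy byte-bounded queue showing 'drop' versus 'block' when full."""

    def __init__(self, max_bytes, full_action):
        if full_action not in ("drop", "block"):
            raise ValueError("full_action must be 'drop' or 'block'")
        self.max_bytes = max_bytes
        self.full_action = full_action
        self.queue = deque()
        self.used = 0
        self.dropped = 0

    def offer(self, message):
        """Try to enqueue a bytes message; return True if it was accepted.

        'drop':  a message that does not fit is discarded and counted.
        'block': the message is refused so the caller must retry later,
                 which applies back-pressure instead of losing data.
        """
        if self.used + len(message) > self.max_bytes:
            if self.full_action == "drop":
                self.dropped += 1
            return False
        self.queue.append(message)
        self.used += len(message)
        return True

    def drain(self):
        """Remove and return the oldest message, freeing its space."""
        message = self.queue.popleft()
        self.used -= len(message)
        return message
```

Under this model, blocking on the Elasticsearch output keeps logs at the cost of stalling that pipeline when the backend is unavailable, while dropping on the metric outputs keeps the pipeline moving and accepts losing some data points.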
Guillaume Thouvenin 9f80252538 Add notice to identify StackLight manifests in puppet logs
Change-Id: Ieeb7d32c77166b234940f4bf0e57202312a62f0c
2016-04-15 15:53:17 +02:00
Jenkins 7baaa4f203 Merge "Monitor Ceilometer HAProxy backends" 2016-03-30 15:20:18 +00:00
Jenkins ec80e04cd7 Merge "Monitor RabbitMQ queues used by Ceilometer" 2016-03-30 15:20:12 +00:00
Vladimir Kuklin 590590b907 Remove usage of deprecated filter_nodes function
filter_nodes function is deprecated - do not use it anymore

Change-Id: Ibacaf4e0aa263d2fe2b98df19a5e7d1e62a14c51
Partial-bug: #1550253
2016-03-21 09:01:20 +00:00
Guillaume Thouvenin bcdedbf1aa Use OpenStack API to collect the status of Neutron agents
This change uses the Neutron API to get the status of the Neutron
agents instead of querying the MySQL database.

Change-Id: I60fa2386a887e9dac2fe4f1234d225ad6402bf2d
Partial-Bug: #1546188
2016-03-21 07:51:55 +01:00
Guillaume Thouvenin e60e187f32 Use OpenStack API to collect the status of Cinder workers
This change uses the Cinder API to get the status of the Cinder workers
instead of querying the MySQL database.

Change-Id: If92596b3cee8a4c9f0dcf84454fdff2a2532160f
Partial-Bug: #1546188
2016-03-21 07:51:55 +01:00
Guillaume Thouvenin 4e5f35e3a6 Use OpenStack API to collect the status of Nova workers
This change uses the Nova API to get the status of the Nova workers
instead of querying the MySQL database.

Change-Id: I24e84b21f988e4c748d0ead134d60df4bf9dd8b1
Partial-Bug: #1546188
2016-03-21 06:48:04 +00:00
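
The three commits above replace MySQL queries with API calls. As a hedged sketch of the Nova case, the function below counts workers per binary and state from a payload shaped like the response of the Compute API's `os-services` resource. How the actual collectd plugin classifies services is not shown in this log, so the function name and the choice to report disabled services separately are assumptions.

```python
from collections import Counter


def count_workers(payload):
    """Count Nova services per (binary, state) from an os-services payload.

    Assumption: an administratively disabled service is reported as
    'disabled' regardless of its up/down state.
    """
    counts = Counter()
    for service in payload.get("services", []):
        if service.get("status") == "disabled":
            state = "disabled"
        else:
            state = service.get("state", "unknown")
        counts[(service["binary"], state)] += 1
    return dict(counts)
```

Reading the status through the API avoids coupling the collector to the database schema, which is the motivation the commit messages give for moving away from direct MySQL queries.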
Swann Croiset 232a8f7f44 Monitor Ceilometer HAProxy backends
Change-Id: I95adf5e4fe1a7940f589e73ed8794c03d794be9d
Fixes-bug: #1557558
2016-03-15 15:17:06 +01:00
Swann Croiset 56cf43b1bb Monitor RabbitMQ queues used by Ceilometer
Change-Id: If3e4a421c248db9235343a50dca338fae6c596ab
2016-03-15 15:01:30 +01:00
Simon Pasquier 65ff6607c1 Add backward compatibility for MOS7/LMA 0.8
This change makes sure that the Puppet manifests can be executed with
the 0.8 versions of the InfluxDB/Grafana, Elasticsearch-Kibana and
Nagios plugins.

Change-Id: Ib8bb0aff3497ff7b9e7a307ddb04d15798fbd070
2016-03-10 17:13:03 +01:00
Simon Pasquier 4d7f70832f Ensure manifests fit the 140-character limit
Change-Id: I6663f402c3c64a76190543531ae70c8294414d04
2016-03-10 14:24:55 +01:00
Guillaume Thouvenin e2beeaeef8 Add support for Sahara logs and notifications
Closes-Bug: #1493699
Change-Id: If3d7688d5f60d38ab21ba44de57b5cfdb4b7adc3
2016-03-07 09:28:56 +00:00
Swann Croiset 74cb032123 Collect RabbitMQ queue metrics for specific queues
Fixes-bug: #1549721

Change-Id: Ic45bb22e2aeed43e9139e62360f74e36d7977c1d
2016-03-03 11:06:34 +01:00
Guillaume Thouvenin 7d86ec48e6 Modify the definition of Neutron cluster when Contrail is installed
This change removes the Neutron agents AFD when the Contrail plugin is used
to avoid reporting a DOWN status for Neutron. It also stops collecting
the Neutron agent metrics.

Change-Id: I02ecb67489d244aca85bc4b1e3d4a5cd79df1b5b
Closes-Bug: #1546017
2016-02-23 14:02:38 +00:00
Swann Croiset 59d8fd8604 Specify explicitly Neutron log file names parsed by Hekad
Fixes-bug: #1546424
Change-Id: Icacfb8f7c6b81817856df468aeb592978a8d26e8
2016-02-19 15:44:18 +00:00
Jenkins 8dcca0b26f Merge "Restart cinder-volume on controllers to emit notifications" 2016-02-18 09:11:06 +00:00
Swann Croiset 7ffedf2957 Configure InfluxDB VIP HTTP check
Change-Id: I503ef168c785751758a9c8568d9d90b2fd152890
2016-02-18 09:08:14 +00:00
Swann Croiset a8f6265c38 Restart cinder-volume on controllers to emit notifications
When Ceph is used as a Cinder backend, the cinder-volume process runs on each
controller.

Fixes-bug: #1546555

Change-Id: I077bcebe0b637d001cf66803a24102db9c507c15
2016-02-17 16:06:53 +01:00
Jenkins 15487fd9ab Merge "Make changes to lma_collector::collectd::mysql" 2016-02-10 12:17:16 +00:00
Éric Lemoine 54a2e4b4b9 Make changes to lma_collector::collectd::mysql
This commit is related to the usage and documentation of the
lma_collector::collectd::mysql class.

The following changes are made:

1. Make the "username" and "password" parameters required. Today
   they default to the empty string, which doesn't make much sense.
2. Change the internal resource name from "nova" to "config". The
   name "nova" was confusing as the collection of MySQL statistics
   is unrelated to Nova. With this change the generated collectd
   configuration file is named "mysql-config.conf", which makes
   more sense than "mysql-nova.conf" and is consistent with other
   collectd config file names we have (e.g. "python-config.conf").
3. Add a unit test for the class.
4. Adjust the documentation.

Change-Id: I281c28d9f4da7ae728615041e175845ad5829b34
2016-02-09 07:50:31 -08:00
Jenkins ec62ea5a61 Merge "Try to configure Nagios by default" 2016-02-09 09:06:21 +00:00