fuel-plugin-lma-collector

Commit Graph

Author	SHA1	Message	Date
Andreas Jaeger	c929899400	Retire repository Fuel repositories are all retired in openstack namespace, retire remaining fuel repos in x namespace since they are unused now. This change removes all content from the repository and adds the usual README file to point out that the repository is retired following the process from https://docs.openstack.org/infra/manual/drivers.html#retiring-a-project See also http://lists.openstack.org/pipermail/openstack-discuss/2019-December/011675.html A related change is: https://review.opendev.org/699752 . Change-Id: I8aded54f1b9f3b79f3a4bf8f607d3695b92f528b	2019-12-18 19:39:39 +01:00
Swann Croiset	d2fc3a9fd8	Enable pagination for Neutron Change-Id: Ib1be3884a26d7bec9d0cf9dbc4ade9dfd6fab31d	2017-02-09 12:55:08 +00:00
Swann Croiset	88323c91d3	Purge metric_collector toml files from previous version Change-Id: I17ee4dfa70c242daf16da91637f54ef1edbc4801	2017-01-25 14:49:07 +01:00
Swann Croiset	b7c7e7bdc2	Remove the SMTP standalone alerting_mode This feature was broken and not stable enough for production deployment. Related-bug: #1606831 Related-bug: #1643542 Change-Id: I0ce52ec01838d891c43d6e797617d3044a02d10f	2017-01-09 13:23:14 +01:00
Guillaume Thouvenin	9dbf48dbfe	Replace the workers AFD filter This patch uses the generic AFD filter with new alarms to replace the custom AFD for workers. Blueprint: allow-all-alarms-to-be-specified-in-alarming-file Change-Id: I6c432e60a16da5bb3c8d0ecd0bd22a1246fe6f82	2016-10-06 09:05:30 +02:00
Guillaume Thouvenin	215f693307	Replace the API backends AFD filter This patch uses the generic AFD filter with new alarms to replace the custom AFD for API backends. Blueprint: allow-all-alarms-to-be-specified-in-alarming-file Change-Id: Id139e45a9942a9c86a2d35d1966b083d9c75af89	2016-10-05 15:41:55 +00:00
Swann Croiset	553d2040cc	Send GSE service clusters status to alerting Change-Id: Iad33e1f4bffd81066a82a0d73a46e7b489eb23d7 blueprint: alarming-refactoring	2016-09-20 09:41:54 +02:00
Simon Pasquier	38ec02fe46	Add a dedicated manifest to configure collectd This removes duplication of code and limitations we had to deal with because the collectd Puppet resources don't play well when they are created at different times from several manifests. Change-Id: I52fabb1fb5795a33f552168553a148b1520fc496	2016-08-26 15:59:04 +02:00
Jenkins	d46ed8070c	Merge "Check memcached service on controller nodes"	2016-08-26 13:21:44 +00:00
Swann Croiset	26c5788684	Check memcached service on controller nodes The patch replaces the service_heartbeat mechanism. Change-Id: I060e10320cf6f8b874a39037b1f9257ed1996342	2016-08-26 10:56:06 +02:00
Simon Pasquier	ea6c8d3ae5	Fix notification_driver for Cinder and Heat Change-Id: Ic9fd9f7d71ba9dbd9f4979612aefb114176a96ad Closes-Bug: #1616456	2016-08-24 15:50:24 +02:00
Simon Pasquier	3a3ef6f2e3	Add Pacemaker collectd plugin This change adds a collectd plugin that gets metrics from the Pacemaker cluster: - cluster's metrics - node's metrics - resource's metrics Most of the metrics are only collected from the node that is the designated controller except pacemaker_resource_local_active and pacemaker_dc_local_active. The plugin also removes the 'pacemaker_resource' plugin by providing the exact same metrics and notifications for the other collectd plugins. Finally the plugin is also installed on the standalone-rabbitmq and standalone-database nodes if they are present. Change-Id: I8b5b987704f69c6a60b13e8ea982f27924f488d1	2016-08-11 14:53:43 +02:00
Simon Pasquier	79a906d619	Use Hiera data to configure the RabbitMQ plugin This change uses the information that is already avaiable in the collector's Hiera data to decide whether the RabbitMQ collectd plugin should be deployed or not. Change-Id: Ib1df231d6bf99ee6f34ee199fd5241d6b264fc00	2016-08-09 16:17:17 +02:00
Swann Croiset	687ddce421	Configure AFD alarms against 'check' metrics Change-Id: I849f5ce2ea05457bb5cb0706b6a0c671e93af0e6	2016-08-04 15:16:20 +00:00
Swann Croiset	1ae8829823	Use RabbitMQ management API The patch uses the management API to retrieve metrics instead of executing rabbitmqctl command. A side effect is that all metrics per-queues are not collected anymore. Change-Id: I5dab785321e369ec0e1a69a79e0700b276810925 Closes-bug: #1594337	2016-07-13 09:19:06 +02:00
Simon Pasquier	6c2573d09a	Remove support for Nova EC2 API Change-Id: I6180a5396c4fb704832bcd44c073868ae6a90a81 Closes-Bug: #1556803 Depends-On: I5699c8b1e695670678250c3c5483d627ef9e6244	2016-06-01 14:55:25 +02:00
Swann Croiset	a9a802c56d	Avoid to collect notifications from 2 distinct locations With the Rabbitmq cluster hosted on dedicated nodes, the notifications must not be collected from controller nodes. Change-Id: I28b2d3d0c35d16815812af447b2ab8a716276645	2016-05-26 09:28:02 +02:00
Swann Croiset	9d7efe4161	Avoid collecting logs and notifications uselessly This patches avoids to collect logs and notifications when both Elasticsearch and InfluxDB are not (yet) deployed. This is useless and leads to lose all logs and notifications produced before backends are deployed. Change-Id: I30a39d65f7a732251def32ccfb8202c34d6408c5	2016-05-26 09:27:16 +02:00
Simon Pasquier	c579286779	Move mod_status class to fuel_lma_collector The mod_status class shouldn't live in the lma_collector class because the (re)configuration of Apache isn't the responsibility of the LMA collector module. Change-Id: If80c9d100263436922e06aea02d2050236ff05cf Closes-Bug: #1547424	2016-05-25 15:51:28 +00:00
Simon Pasquier	1f759e7f3d	Use hiera_hash for network data Change-Id: I171d08e974d635d85b391c3bc29366f0f4dd7b59 Closes-Bug: #1585350	2016-05-25 10:44:31 +02:00
Swann Croiset	123a86590f	Remove the remote Nagios alerting mode This use case is not really supported because Nagios configuration is too dependent on the LMA Collector plugin and in practice, no one chooses this option. DocImpact Change-Id: Ia09efb40f476c1daec51530e2c0fb16bc6f99393	2016-05-24 17:05:49 +02:00
Swann Croiset	a513e7aaea	Factorize infra_alerting configuration All logic is done once and persisted into hiera. Change-Id: I352c17393eeba57b5bde460591c2db567c452c45	2016-05-24 16:59:01 +02:00
Swann Croiset	debe1883d7	Allow deployment without InfluxDB and Elasticsearch This allows to support several deployment scenarii where backends are not deployed initialy, for instance when using the 'virt' nodes to deploy LMA backends. The patch factorizes manifests by moving all the configuration data of InfluxDB and Elasticsearch into hiera. DocImpact Fixes-bug: #1570386 Change-Id: I8688bbd10d88bc8ef68b5d31e9edd62a764dc23d	2016-05-23 13:29:50 +02:00
Swann Croiset	306448e8fa	Update the Nagios URL used to sent GSE and AFD Implements: blueprint scalable-nagios-api Depends-On: I55613dd650b039142767174d3f19fa9262a2a7bc Change-Id: I152dd8fcb0ec7dfddc99c8a52fa2098a414811c0	2016-05-12 11:46:27 +00:00
Swann Croiset	d2eafcb750	Remove all lma_collector:params references in manifests Change-Id: I3af0eb48aca1aeb0f4d3bb6b0798a7a343bd072e	2016-05-12 13:43:49 +02:00
Guillaume Thouvenin	2d72b53784	Support for the detach-database plugin This patch adds the support when the database is deployed on a dedicated node [1]. [1] https://github.com/openstack/fuel-plugin-detach-database Change-Id: If800d9d09204a1456640863a3ed3c5dc66d29017 Closes-Bug: #1547089	2016-05-10 09:00:44 +02:00
Guillaume Thouvenin	6dc87065fb	Support for the detach-rabbitmq plugin This patch adds the support when the RabbitMQ cluster is deployed on dedicated nodes [1]. [1] https://github.com/openstack/fuel-plugin-detach-rabbitmq Change-Id: Icc337e48d9a836ccab85dfc0b8ca86ff58c5cd4d Closes-Bug: #1547086 Closes-Bug: #1575046	2016-05-10 09:00:44 +02:00
Swann Croiset	391ca132b3	Emit aggregated HTTP metrics HTTP metrics are now statistics aggregated every 10 seconds. A new metric is emitted openstack_<service>_response_times with these values: - min - max - sum - count - percentile Hence, the previous metric disappears (openstack_<service>_responses). Implements-blueprint: aggregated-http-metrics Change-Id: I48e92df6f4baa7be942ad138b7f23c3d15f5a24e	2016-05-04 14:34:39 +02:00
Swann Croiset	ebac150f8a	Separate the (L)og of the LMA collector This change separates the processing of the logs/notifications and metric/alerting into 2 dedicated hekad processes, these services are named 'log_collector' and 'metric_collector'. Both services are managed by Pacemaker on controller nodes and by Upstart on other nodes. All metrics computed by log_collector (HTTP response times and creation time for instances and volumes) are sent directly to the metric_collector via TCP. Elasticsearch output (log_collector) uses full_action='block' and the TCP output uses full_action='drop'. All outputs of metric_collector (InfluxDB, HTTP and TCP) use full_action='drop'. The buffer size configurations are: * metric_collector: - influxdb-output buffer size is increased to 1Gb. - aggregator-output (tcp) buffer size is decreased to 256Mb (vs 1Gb). - nagios outputs (x3) buffer size are decreased to 1Mb. * log_collector: - elasticsearch-output buffer size is decreased to 256Mb (vs 1Gb). - tcp-output buffer size is set to 256Mb. Implements: blueprint separate-lma-collector-pipelines Fixes-bug: #1566748 Change-Id: Ieadb93b89f81e944e21cf8e5a65f4d683fd0ffb8	2016-05-04 14:34:14 +02:00
Guillaume Thouvenin	9f80252538	Add notice to identify StackLight manifests in puppet logs Change-Id: Ieeb7d32c77166b234940f4bf0e57202312a62f0c	2016-04-15 15:53:17 +02:00
Jenkins	7baaa4f203	Merge "Monitor Ceilometer HAProxy backends"	2016-03-30 15:20:18 +00:00
Jenkins	ec80e04cd7	Merge "Monitor RabbitMQ queues used by Ceilometer"	2016-03-30 15:20:12 +00:00
Vladimir Kuklin	590590b907	Remove usage of deprecated filter_nodes function filter_nodes function is deprecated - do not use it anymore Change-Id: Ibacaf4e0aa263d2fe2b98df19a5e7d1e62a14c51 Partial-bug: #1550253	2016-03-21 09:01:20 +00:00
Guillaume Thouvenin	bcdedbf1aa	Use OpenStack API to collect the status of Neutron agents This change uses the Neutron API to get the status of the Neutron agents instead of querying the MySQL database. Change-Id: I60fa2386a887e9dac2fe4f1234d225ad6402bf2d Partial-Bug: #1546188	2016-03-21 07:51:55 +01:00
Guillaume Thouvenin	e60e187f32	Use OpenStack API to collect the status of Cinder workers This change uses the Cinder API to get the status of the Cinder workers instead of querying the MySQL database. Change-Id: If92596b3cee8a4c9f0dcf84454fdff2a2532160f Partial-Bug: #1546188	2016-03-21 07:51:55 +01:00
Guillaume Thouvenin	4e5f35e3a6	Use OpenStack API to collect the status of Nova workers This change uses the Nova API to get the status of the Nova workers instead of querying the MySQL database. Change-Id: I24e84b21f988e4c748d0ead134d60df4bf9dd8b1 Partial-Bug: #1546188	2016-03-21 06:48:04 +00:00
Swann Croiset	232a8f7f44	Monitor Ceilometer HAProxy backends Change-Id: I95adf5e4fe1a7940f589e73ed8794c03d794be9d Fixes-bug: #1557558	2016-03-15 15:17:06 +01:00
Swann Croiset	56cf43b1bb	Monitor RabbitMQ queues used by Ceilometer Change-Id: If3e4a421c248db9235343a50dca338fae6c596ab	2016-03-15 15:01:30 +01:00
Simon Pasquier	65ff6607c1	Add backward compatibility for MOS7/LMA 0.8 This change makes sure that the Puppet manifests can be executed with the 0.8 versions of the InfluxDB/Grafana, Elasticsearch-Kibana and Nagios plugins. Change-Id: Ib8bb0aff3497ff7b9e7a307ddb04d15798fbd070	2016-03-10 17:13:03 +01:00
Simon Pasquier	4d7f70832f	Ensure manifests fit the 140-character limit Change-Id: I6663f402c3c64a76190543531ae70c8294414d04	2016-03-10 14:24:55 +01:00
Guillaume Thouvenin	e2beeaeef8	Add support for Sahara logs and notifications Closes-Bug: #1493699 Change-Id: If3d7688d5f60d38ab21ba44de57b5cfdb4b7adc3	2016-03-07 09:28:56 +00:00
Swann Croiset	74cb032123	Collect RabbitMQ queue metrics for specific queues Fixes-bug: #1549721 Change-Id: Ic45bb22e2aeed43e9139e62360f74e36d7977c1d	2016-03-03 11:06:34 +01:00
Guillaume Thouvenin	7d86ec48e6	Modify the definition of Neutron cluster when Contrail is installed This change removes the Neutron agents AFD when Contrail plugin is used to avoid reporting a DOWN status for Neutron. It also removes the collect of the metrics of the Neutron agents. Change-Id: I02ecb67489d244aca85bc4b1e3d4a5cd79df1b5b Closes-Bug: #1546017	2016-02-23 14:02:38 +00:00
Swann Croiset	59d8fd8604	Specify explicitly Neutron log file names parsed by Hekad Fixes-bug: #1546424 Change-Id: Icacfb8f7c6b81817856df468aeb592978a8d26e8	2016-02-19 15:44:18 +00:00
Jenkins	8dcca0b26f	Merge "Restart cinder-volume on controllers to emit notifications"	2016-02-18 09:11:06 +00:00
Swann Croiset	7ffedf2957	Configure InfluxDB VIP HTTP check Change-Id: I503ef168c785751758a9c8568d9d90b2fd152890	2016-02-18 09:08:14 +00:00
Swann Croiset	a8f6265c38	Restart cinder-volume on controllers to emit notifications When Ceph is used as a Cinder backend, the cinder-volume process runs on each controller. Fixes-bug: #1546555 Change-Id: I077bcebe0b637d001cf66803a24102db9c507c15	2016-02-17 16:06:53 +01:00
Jenkins	15487fd9ab	Merge "Make changes to lma_collector::collectd::mysql"	2016-02-10 12:17:16 +00:00
Éric Lemoine	54a2e4b4b9	Make changes to lma_collector::collectd::mysql This commit is related to the usage and documentation of the lma_collector::collectd::mysql class. The following changes are made: 1. Make the "username" and "password" parameters required. Today they default to the empty string, which doesn't make much sense. 2. Change the internal resource name from "nova" to "config". The name "nova" was confusing as the collection of MySQL statistics is unrelated to Nova. With this change the generated collectd configuration file is named "mysql-config.conf", which makes more sense than "mysql-nova.conf" and is consistent with other collectd config file names we have (e.g. "python-config.conf"). 3. Add a unit test for the class. 4. Adjust the documentation. Change-Id: I281c28d9f4da7ae728615041e175845ad5829b34	2016-02-09 07:50:31 -08:00
Jenkins	ec62ea5a61	Merge "Try to configure Nagios by default"	2016-02-09 09:06:21 +00:00

1 2 3

125 Commits