Run script generate-gitmodules.py to sync with governace repository to
add new repositories and remove retired ones.
Change-Id: Ia42b717ad242958ba4a9689ac814c956f460712f
* Update elastic-recheck from branch 'master'
- Simplify 1708704's query
The previous query for 1708704 was causing e-r to OOM and resulting in a
lack of graph data. This is beacuse tripleo's logstash.txt file is not
getting parsed properly and ends up with the entire file as the single
event. Then e-r downloads all those file copies and fills its memory and
breaks. We can work around this by only looking in job-output.txt files
for this bug. Then when tripleo's fix has flushed the bad events out (10
days after fix merges) we can revert this change.
Change-Id: Id619f90ffe84b3d4de334ea4b17026b9b3239d33
* Update elastic-recheck from branch 'master'
- Merge "Don't pretty print json files"
- Don't pretty print json files
The json file outputs of e-r are loaded by web browsers in order to
render our graphs. These json files are actually quite large and part of
the reason why is we pretty print them with 4 space indents and they
have large nesting. Stop pretty printing (humans can pass the files
through a filter if necessary) in order to reduce the size of these
files and make browsers happier (less time spent downloading).
Change-Id: I19dedc2994169932eb0e90b6cdea3856637f5ef0
* Update elastic-recheck from branch 'master'
- Fix flot xaxis labels
After the flot update the xaxis labels aren't meaningful to humans (in
fact I'm not quite sure what they were showing us). We can explicitly
state the input type as milliseconds and the label render format. Doing
this gives us labels that are meaningful to humans.
Change-Id: I7912a536f3de2756404f8c7e7f31d8bd5890ab22
* Update elastic-recheck from branch 'master'
- Update flot js file inclusion to match new flot
This was necessary after upgrading the status server which pulled in
newer flot.
Change-Id: I28c844d216eefdf6d186b04ce47889fcd044d23a
* Update elastic-recheck from branch 'master'
- Handle ElasticHttpError in graph generation
Getting elasticsearch data for bug 1708704 is failing
in the check queue with:
pyelasticsearch.exceptions.ElasticHttpError: \
(500, 'ArrayIndexOutOfBoundsException[null]')
This might have to do with the size of the resulting
messages from the hits on the tripleo and kolla jobs,
I'm not sure.
What's clear though is the graph generation is blowing
up in the check queue on that bug but not the gate queue,
maybe due to a smaller result set, so this adds some
error handling in the graph generation for when a specific
bug query fails so it does not halt the entire build of the
graph.
Change-Id: Ibe18c9cccc421a6549a18148f1a2ce3c1e4339d4
* Update elastic-recheck from branch 'master'
- Add query for network_sanity_check ping check fail bug 1856760
45 hits in the last 7 days, with a spike since Dec 16, mostly
fortnebula nodes, all failures, check and gate.
Change-Id: Ic856d11e183075244de556b86d4ecdc7bcc78abd
* Update elastic-recheck from branch 'master'
- Update query for bug 1820007
This also shows up in multi-node jobs when attaching a volume
so update the query to include volume attach failures.
Change-Id: Ie77e3998b2ff4a508fa3b6078acb3ceec15d7e37
* Update elastic-recheck from branch 'master'
- Add query for NetworkAmbiguous fail bug 1844568
211 hits in 7 days, check and gate, all failures.
The message shows up in the n-api logs but filtering
on n-api logs isn't sufficient to get 100% failure rate
in logstash because there are tempest tests like
test_create_list_show_delete_interfaces_by_network_port
which handle the error and work around it so even though
the message shows up not all jobs fail because of it so
we use the tempest failure log in the console to fingerprint
this bug.
Change-Id: I7b4a3f4a483c5166e9aee1507f12bb31069a8fe0
* Update elastic-recheck from branch 'master'
- Update query for bug 1853453
This brings the hit count up from 14 to 74. Now we get hits
in the gate queue as well.
Change-Id: Ieb50f0c796cfccbbd6b3da0a534f6b7a7ae3800e
* Update elastic-recheck from branch 'master'
- Add query for test_shelve_volume_backed_instance bug 1853453
This is split off from bug 1848078 since this test specifically
seems to hit this issue a lot in multinode jobs where the instance
is shelved from one node and unshelved on another.
14 hits in 7 days, all failures.
Change-Id: I9bd41e356abf72ff08415693b7b2b11a035a542d
* Update elastic-recheck from branch 'master'
- Add query for nova install on py27 bug 1853166
908 hits in the last 24 hours, check and gate, all failures.
Change-Id: I62f0ad207eee41ce4738b0117bd6718248502d61
* Update elastic-recheck from branch 'master'
- Add query for inap package mirror bug 1852617
80 hits in 7 days (really the last 2), check and gate,
all failures on all branches.
Change-Id: I1d6f827903f4fd4ccf7c0398575ae2ed26a4b539
* Update elastic-recheck from branch 'master'
- Add query for nova functional test fail bug 1850682
26 hits in 7 days, check and gate, all failures. Seems
to be restricted to stable/rocky for some reason.
Change-Id: Ic9463841764c5c4dc552e0b2332d854234b609f5
* Update elastic-recheck from branch 'master'
- Merge "Fix link formatting in CONTRIBUTING.rst"
- Fix link formatting in CONTRIBUTING.rst
The indent made the rst formatting weird so drop it.
Change-Id: Ie0290be5a5c18e08f512911fb21607f177add620
* Update elastic-recheck from branch 'master'
- Merge "Remove H233 from flake8 ignore list"
- Remove H233 from flake8 ignore list
We support python3 so we can stop ignoring H233.
Change-Id: Ifeda88a3af03a15ea35dd8dc491472df0b3bc7ab
* Update elastic-recheck from branch 'master'
- Merge "Hard-code os-brick into TestQueries.openstack_projects"
- Hard-code os-brick into TestQueries.openstack_projects
The elastic-recheck-tox-queries job is failing because
there is a query on an os-brick bug and the os-brick
project in launchpad is not part of the openstack project
group. This change simply hard-codes it since we know os-brick
is part of openstack.
Change-Id: Ia05c009226f88da427ec6ad9724410cd6ebed859
Story: 2006736
Task: 37197
* Update elastic-recheck from branch 'master'
- Add query for ssh fail bug 1849857
20 hits in voting jobs since Oct 21, multiple branches,
all failures, mostly OVH nodes.
Change-Id: Iefdeb509938ef9512fe0ee75fb04cc9e70790b60
* Update elastic-recheck from branch 'master'
- Add query for networkx 2.4 bug 1848499
1278 hits in 24 hours since networkx 2.4 was released.
In nova's case the problem is upper-constraints weren't
being applied to runtime dependencies or transitive
dependencies in tox.ini.
Change-Id: Ie15f3ebbe33116704fa6e066933e69bb6b170f33
* Update elastic-recheck from branch 'master'
- Add query for hard reboot kernel fail bug 1847759
6 hits in 7 days, check queue only right now but multiple
changes, all failures since it breaks SSHing into the guest
after it's hard-rebooted.
Change-Id: Ic9be31f67af3796760b0d514a903f5e9bcd4aca1
* Update elastic-recheck from branch 'master'
- Cap irc package requirement for <py34
irc 17.0 requires python >= 3.4 [1] which breaks
our py27 tox job. This caps the package at <17.0
if python<3.4 and requires irc=>17.0 if python>=3.4.
[1] https://pypi.org/project/irc/17.0/
Change-Id: I85f79c7e28dc2d6e8dfbfe96372c192c3d43b3be
Story: 2006736
Task: 37196
* Update elastic-recheck from branch 'master'
- Merge "Add query for bug 1836754"
- Add query for bug 1836754
It has a low hit rate because it's a race in a very specific case
(deleting an instance before it finished building), but it's a
legitimate bug nonetheless.
Change-Id: Ib5014af4518f7682b79fdc85b7af06814747bacf
* Update elastic-recheck from branch 'master'
- Add query for test_minimum_basic_scenario bug 1838202
This is pretty rare but a known failure, 2 hits in 7 days.
Change-Id: Ib799fba9f3b7b12244fa51bfe696fa01aba0247f
* Update elastic-recheck from branch 'master'
- Add query for 1846093
This tracks an issue in the ansible used to persist firewall rules.
Related-Bug: 1846093
Change-Id: I6fcfb655d418cdc8b91f8680ba753424b686dd52
* Update elastic-recheck from branch 'master'
- Generalize query for bug 1808010
We hit this failure sometimes trying to get other metadata
API routes besides meta-data, e.g. user-data:
failed to get http://169.254.169.254/2009-04-04/user-data
This makes the query more generic to get more hits. This
brings the hit count up to 86 in voting jobs in the last
10 days, multiple branches, check and gate, all failures.
Looks like "ovh-bhs1" is the top offender node provider.
Change-Id: I9fb3885fdc674c7176fc2439a2bbdb65b84048a1
* Update elastic-recheck from branch 'master'
- Add query for timeout waiting for response from cell bug 1844929
1101 hits since Sept 17, check and gate, all failures. Those
numbers are a bit inflated since the message gets duplicated
while the problem occurs.
Change-Id: I12e174effad802928a01800df975990acc908c7a
* Update elastic-recheck from branch 'master'
- Add query for test_boot_server_from_encrypted_volume_luks bug 1820007
This is pretty rare, only 1 hit in 7 days but it's in the gate queue
and is a known issue (the bug was reported against stein) so we might
as well track it.
Change-Id: Ic36fa2b1afe11bdea7af30b897dcc242e442582f
* Update elastic-recheck from branch 'master'
- Add query for configparser 4.0.1 missing bug 1843715
357 hits in 48 hours, check and gate, all failures. It looks
like this is resolved now so this is just getting it off the
uncategorized bugs page.
Change-Id: I88a7137c7dfa1fb50a2385d6a8fa74565348dbc3
Related-Bug: #1843715
* Update elastic-recheck from branch 'master'
- Add query for nova functional test race bug 1843433
8 hits in 24 hours, check and gate, all failures.
Change-Id: Ia16ce7231823e7435de1cf4c096b08367244d674
* Update elastic-recheck from branch 'master'
- Delete old queries: 2019-08-26
Some of these are removed by the cleanup script
and some were removed by hand since they were
duplicates of invalid or fixed bugs and the cleanup
script doesn't yet handle chasing down duplicates.
Change-Id: I2f691511fe620654d5949d617f8131c1360949f7
* Update elastic-recheck from branch 'master'
- Include "Invalid" bugs in cleanup CLI
If a bug is invalid in a project then we should probably
consider its query for removal in the cleanup command.
For example, bug 1663529 and bug 1828244 were both marked
Invalid and had no hits but weren't processed by the
cleanup command.
Change-Id: I7bac9fc169601c86a26565e9fa5b3d72c362a8fc
* Update elastic-recheck from branch 'master'
- Uncap SQLAlchemy in requirements.txt
Unit tests are failing because we're pulling in uncapped
alembic (by way of oslo.db which is pulled in from subunit2sql).
alembic 1.1.0, released on 2019-8-26, requires SQLAlchemy>=1.1.0
which e-r has capped at <1.1.0.
This removes the capped SQLAlchemy dependency and blacklists the
known bad versions from global-requirements.
Change-Id: Iaca9825ea2f5d75c5151b8a726ba1c0283f6536a
Closes-Bug: #1841964
* Update elastic-recheck from branch 'master'
- Add script to remove queries for fixed bugs
This automates the process to remove old queries
for fixed bugs. It's a bit conservative to start
so it doesn't check for open reviews nor does it
filter out affected projects with non-Fix* status
on the bug. It can be made more robust once we're
confident in how it works and play with it on the
open queries.
Change-Id: Iaaf17892804453b99a846be27457c88e5a8f8a55
* Update elastic-recheck from branch 'master'
- Add query for remoteDispatchDomainMigratePerform3Params lock bug 1840159
This is pretty rare in the nova-grenade-live-migration job so right now
it only has 1 hit in 10 days in the check queue, but I've seen it before
so wanted to start tracking it.
Change-Id: I11011f2d60455721490c48fae17c39ffafe28264
* Update elastic-recheck from branch 'master'
- Add query for nova functional test race bug 1839515
73 hits since August 5th, check and gate, all failures.
Change-Id: I8238688e09caeb4b7cd392141f9bfcc9cff99064
* Update elastic-recheck from branch 'master'
- Add query for test_novnc invalid token bug 1669468
This is an old bug that still shows up in multi-node
jobs. 60 hits in the last 7 days (but logstash seems
to be a bit off so there are probably more), all failures.
Change-Id: Ic6135ddef4e503fa1618d5a4fe4fd04309f18939
* Update elastic-recheck from branch 'master'
- Add query for devstack log file permissions bug 1838811
44 hits since https://review.opendev.org/#/c/203698/ merged,
check and gate, all failures, but only in neutron-functional
and fullstack type jobs, I'm not sure why.
Change-Id: Ic2ef0281481f33f10b2b563bd15fa59c191ab1ea