Task #2211
opengreylog/log parsing
0%
Description
We get way too many cronjob and logrotate e-mails and should switch to a better approach. There's a test installation of greylog available, running on the Sun24 box, that needs evaluating and putting into production (or an alternative identified). That helps to better spot possible issues and reduces mail load at the same time.
Related issues
Updated by Florian Effenberger about 7 years ago
- Target version changed from Q2/2017 to Q3/2017
Updated by Florian Effenberger about 7 years ago
- Target version changed from Q3/2017 to Q4/2017
Shifting to Q4, but then we definitely should look into that, to better manage our incoming log messages
Updated by Florian Effenberger over 6 years ago
Is that task the same as the monitoring ticket (#2210)?
Updated by Guilhem Moulin over 6 years ago
- Related to Task #2210: monitoring notifications added
Updated by Guilhem Moulin over 6 years ago
Not exactly the same but related, and they're likely to be solved at the same time.
Updated by Florian Effenberger over 6 years ago
- Target version changed from Q4/2017 to Q2/2018
Updated by Florian Effenberger about 6 years ago
Is there any ETA?
I really would like to get the cronjob mails cleaned up, and have some very basic notification system in place
Updated by Florian Effenberger over 5 years ago
- Target version changed from Q2/2018 to Q2/2019
Any updates on this? :)
Updated by Florian Effenberger about 5 years ago
- Target version changed from Q2/2019 to Q4/2019
Updated by Florian Effenberger over 4 years ago
- Target version changed from Q4/2019 to Q2/2020
Updated by Florian Effenberger about 4 years ago
- Target version changed from Q2/2020 to Q4/2020
Monitoring in place
Still many e-mails each day from cronjobs and other stuff
Updated by Guilhem Moulin about 3 years ago
- Target version changed from Q4/2020 to Q4/2021
The point here is not to clean up mail notification, as long as infra team members are able to handle the mail flood, filter what's relevant, and order on the fly by priority there is nothing wrong with these IMHO. This task is about improving whitebox monitoring in general. This has been a recurring topic in infra calls for a while now, and we're planning to make node monitoring part of the baseline which should solve part of this.