centralcloud/oncall-engine

Author	SHA1	Message	Date
Vadim Stepanov	b8f54f1c53	Add docs & logo for AppDynamics integration (#1916 ) # What this PR does Adds docs & logo for AppDynamics integration. Main PR in private repo: https://github.com/grafana/oncall-private/pull/1790. ## Which issue(s) this PR fixes https://github.com/grafana/oncall-private/issues/1621 ## Checklist - [x] Unit, integration, and e2e (if applicable) tests updated - [x] Documentation added (or `pr:no public docs` PR label added if not required) - No changelog (AppDynamics integration will be only available in cloud)	2023-05-11 16:41:51 +00:00
Joey Orlando	014a9c2ec2	allow the POST incoming alert endpoints to queue create_alert tasks independent of the database status (#1896 ) # What this PR does https://www.loom.com/share/18cc445117de4895a10892d56c7d3699 In preparation to upgrade our cloud databases, this PR makes some minor changes which, after testing locally, allowed the `POST /<integration_type>/<alert_channel_key>` endpoints to successfully receive incoming alerts and queue the celery tasks. I've tested all of the defined `POST /integrations/v1/<integration_type>/<alert_channel_key>` endpoints by sending `POST` requests to an integrations' URL while the MySQL database was down, bringing the database back up, and ensuring the alerts were created. ## Some other findings - the integration heartbeat endpoints will not work as we interact w/ the database to persist the incoming heartbeat instance - if the integration was created in the last 180 seconds, incoming alerts will fail due to the way we cache the integration IDs ([code](https://github.com/grafana/oncall/blob/dev/engine/apps/integrations/mixins/alert_channel_defining_mixin.py#L47-L50)) - The `create_alert` celery task is set to `max_retries=None` and `retry_backoff=True`. This means that the queued tasks will continue retrying forever w/ an exponential backoff, until the alerts can be created in the database (ie. when the database is back online). ## Checklist - [ ] Unit, integration, and e2e (if applicable) tests updated (N/A) - [ ] Documentation added (or `pr:no public docs` PR label added if not required) (N/A) - [ ] `CHANGELOG.md` updated (or `pr:no changelog` PR label added if not required) (N/A)	2023-05-10 12:36:23 +00:00
Oleg Zaytsev	41f7c23c65	Fix and tidy alertmanager heartbeat template (#1865 ) # What this PR does There was an unnecessary indentation in the `rules:` key which made it invalid YAML. Also replaced the mentions to Amixr with Grafana OnCall, used some `<code>` tags and reworded some sentences. Also removed the anchor tag from the webhook link: we don't want people to follow that in their browser, we want them to copy it ## Result screenshot ![image](https://user-images.githubusercontent.com/1511481/236173565-b5201b81-4d69-4d0b-944a-a2106f8fbab3.png) ## Which issue(s) this PR fixes ## Checklist - [ ] Unit, integration, and e2e (if applicable) tests updated - [ ] Documentation added (or `pr:no public docs` PR label added if not required) - [ ] `CHANGELOG.md` updated (or `pr:no changelog` PR label added if not required) --------- Signed-off-by: Oleg Zaytsev <mail@olegzaytsev.com> Co-authored-by: Joey Orlando <joey.orlando@grafana.com>	2023-05-05 00:25:05 +00:00
Vadim Stepanov	d198b932c1	Zendesk inbound integration docs (#1860 ) # What this PR does Add docs & logo for Zendesk integration. Main PR in private repo: https://github.com/grafana/oncall-private/pull/1772 ## Which issue(s) this PR fixes https://github.com/grafana/oncall-private/issues/1627 ## Checklist - [x] Unit, integration, and e2e (if applicable) tests updated - [x] Documentation added (or `pr:no public docs` PR label added if not required) - [x] No changelog (Zendesk integration will be only available in cloud)	2023-05-03 11:38:07 +01:00
Vadim Stepanov	50eb1fed5d	Jira inbound integration docs (#1842 ) # What this PR does Add docs & logo for Jira integration. Main PR in private repo: https://github.com/grafana/oncall-private/pull/1769 ## Which issue(s) this PR fixes https://github.com/grafana/oncall-private/issues/1620 ## Checklist - [x] Unit, integration, and e2e (if applicable) tests updated - [x] Documentation added (or `pr:no public docs` PR label added if not required) - [x] No changelog (Jira integration will be only available in cloud)	2023-05-02 09:37:49 +00:00
Ildar Iskhakov	6e61643750	Limit number of alertmanager alerts in alert group to autoresolve (#1779 ) # What this PR does This PR set the limit so that workers won't attempt to autoresolve too big alertmanager alert groups. ## Which issue(s) this PR fixes ## Checklist - [ ] Unit, integration, and e2e (if applicable) tests updated - [ ] Documentation added (or `pr:no public docs` PR label added if not required) - [ ] `CHANGELOG.md` updated (or `pr:no changelog` PR label added if not required)	2023-04-24 05:38:21 +00:00
Vadim Stepanov	ea60c0d247	Inbound email integration (#837 ) This PR add Inbound Email integration. It designed to support some variety of ESPs, but in prod we will use Mailgun, so locally I tested it only with mailgun ESP. Important: To make it work on different clusters I'm planning to provide different email domains for different regions, like ....@us.oncall.grafana.net, ...@eu.oncall.grafana.net --------- Co-authored-by: Innokentii Konstantinov <innokenty.konstantinov@grafana.com>	2023-03-16 13:59:21 +08:00
Kristian Bremberg	b6d65ebb66	Chore: add integrity hash to templates (#1473 ) # What this PR does Adds integrity hash for scripts loaded from CDN's.	2023-03-07 11:17:07 +00:00
Michael Derynck	49946e6a4e	Change Organization Deleted/Moved Precedence (#1402 ) # What this PR does When an organization is migrated to a different cluster it has it's `migration_destination_slug` set for redirection purposes but it also needs to be deleted so scheduled tasks for it do not run in the old cluster. By changing the order so moved has precedence over deleted API calls will be correctly redirected for moved organizations while the organization is still considered deleted to suppress tasks that are no longer needed in the old cluster. ## Which issue(s) this PR fixes ## Checklist - [ ] Tests updated - [ ] Documentation added - [ ] `CHANGELOG.md` updated	2023-02-24 11:45:21 +00:00
Innokentii Konstantinov	c733d8b9f2	Cleanup ScenarioStep (#1213 ) # What this PR does This PR cleanup ScenarioStep. It's needed to simplify moving Slack to the messaging backends in future. 1. Introduce AlertGroupSlackService to move logic from ScenarioStep. Also it allowed to get rid of importing ScenarioSteps in the code not related to processing of slack callbacks. 2. Remove tags from ScenarioSteps, they are unused. 3. Remove ScenarioStep.dispatch method. It just was calling ScenarioStep.process_scenario. 4. Remove "action" param from process_scenario, it was unused. 5. Remove creation of SlackActionRecord on handling SlackEvents. We are not using it, but it generates INSERT query on most of the user-slack interactions. 6. Remove "random_prefix_for_routing" from ScenarioStep, it was unused. ## Which issue(s) this PR fixes ## Checklist - [ ] Tests updated - [ ] Documentation added - [ ] `CHANGELOG.md` updated --------- Co-authored-by: Joey Orlando <joey.orlando@grafana.com>	2023-02-21 20:22:11 +01:00
Matias Bordese	90def88752	Add escalation chain option when creating a direct page alert group (#1143 ) Also changes the default integration used when creating an alert group for a direct page to a custom manual integration to avoid conflicts/unexpected behaviors with existing manual alerts.	2023-01-18 12:58:26 -03:00
Innokentii Konstantinov	8abbcee050	Org soft-delete (#1073 ) # What this PR does It introduces soft-delete of organization, since grafana stacks are soft-deleted too. Also, we had a problem with deleting orgs with large amounts of alerts, so soft-deletion will fix this problem. I think, that problem of cleaning alerts of deleted orgs should be solved as a part of alert retention	2023-01-05 12:42:55 +08:00
Ildar Iskhakov	1ff0a7da99	1.1.5.5 -> dev (#1060 ) # What this PR does ## Which issue(s) this PR fixes ## Checklist - [ ] Tests updated - [ ] Documentation added - [ ] `CHANGELOG.md` updated Co-authored-by: Vadim Stepanov <vadimkerr@gmail.com> Co-authored-by: Julia <ferril.darkdiver@gmail.com> Co-authored-by: Innokentii Konstantinov <innokenty.konstantinov@grafana.com> Co-authored-by: Matias Bordese <mbordese@gmail.com>	2023-01-03 11:57:16 +08:00
Michael Derynck	6267e31b22	Check id instead of object to avoid unnecessary query	2022-10-28 15:45:51 -06:00
Michael Derynck	febe1b2185	Add basic organization moved exception handling and middleware	2022-10-20 15:04:58 -06:00
Vadim Stepanov	e67d3519fe	Restore email notifications (#621 ) * remove email verification related code * remove email verification related code * remove sendgrid callback * remove sendgrid related code * remove sendgrid related code * rename sendgrid app to email * remove email from built-in channels * remove email from built-in channels * remove email from built-in channels * add email backend: https://github.com/grafana/oncall/pull/50 * add email templater * add email templater * convert md to html * add email settings to live settings * use task to send email, handle some exceptions to create logs * remove ERROR_NOTIFICATION_MAIL_DELIVERY_FAILED usage * add email limit logic * fix tests * add docs * remove old email templates * remove old email templates * add template_fields to messaging backend * add messaging backends templates to public api * add comment for deprecated fields * fix test * fix tests * disable email by default * don't retry on SMTPException and TimeoutError * add tests * bring email back to public api docs * return ERROR_NOTIFICATION_MAIL_LIMIT_EXCEEDED * make template_fields tuple * build_subject_and_title -> build_subject_and_message * add one more comment about template deprecation * use 8 as backend id * add comment about gaierror and BadHeaderError * add comment on importing in notify_user_async * edit oss docs	2022-10-19 12:32:56 +01:00
Michael Derynck	5f5f427c9f	Add middleware to catch exception for missing integration, reduce spamminess of logs	2022-10-13 17:18:22 -06:00
Vadim Stepanov	b84b174e20	Allow multiple database and celery broker types (#582 ) * add libs for celery + redis * move redis & cache config to settings/base.py * move rmq & celery config to settings/base.py * BROKER -> BROKER_TYPE * allow multiple database types * flake8 * add sqlite db creation to dockerfile * fix ci * fix ci * debug * remove some defaults * remove prints * use local memory as cache on ci * debug * add DATABASE_DEFAULTS * add ci test for sqlite + redis * add ci test for sqlite + redis * add ci test for sqlite + redis * debug * add redis healthcheck * fix sqlite * fix dev settings * refactor dev settings * tweak ci settings * clear cache properly between tests * move db and broker types to constants * add librabbitmq deps * use amqp instead of librabbitmq	2022-10-04 09:25:53 +01:00
Michael Derynck	ce8f4e53fa	Conform URLs (#281 ) * Make any URLs build from env vars tolerant of path prefix, trailing/leading slashes * Add comment * Lint	2022-07-25 09:12:50 -06:00
Roman Pertl	7f6077e07f	Fix Typo (HearBeat vs HeartBeat)	2022-06-25 11:19:40 +02:00
Michael Derynck	66e8cf2cbc	Merge dev to main (#54 ) * Log (failed) attempt to notify a user with viewer role * Remove https:// prefix from BASE_URL docker env var * Fix cloud heartbeat name * Polishing telegram * Update docker-compose.yml * Update plugin README (#48) * Update README and screenshot, remove plop for build info since version is now displayed prominently * Sign build Co-authored-by: Michael Derynck <michael.derynck@grafana.com> * Build actions (#38) * Drone, github action changes * Minor version updates * Update frontend dependencies * Re-enable unit test Co-authored-by: Michael Derynck <michael.derynck@grafana.com> * Revert stylelint version (#52) * Revert stylelint version * Build plugin as well as lint * Build in previous step Co-authored-by: Michael Derynck <michael.derynck@grafana.com> * Update screenshot (#53) Co-authored-by: Michael Derynck <michael.derynck@grafana.com> Co-authored-by: Matias Bordese <mbordese@gmail.com> Co-authored-by: Matvey Kukuy <Matvey-Kuk@users.noreply.github.com> Co-authored-by: Innokentii Konstantinov <innokenty.konstantinov@grafana.com> Co-authored-by: Matvey Kukuy <matvey@amixr.io> Co-authored-by: Michael Derynck <michael.derynck@grafana.com>	2022-06-13 16:39:58 -06:00
Ildar Iskhakov	7b385a8790	Refactor integrations	2022-06-09 23:27:40 +03:00
Michael Derynck	6b40f95033	World, meet OnCall! Co-authored-by: Eve832 <eve.meelan@grafana.com> Co-authored-by: Francisco Montes de Oca <nevermind89x@gmail.com> Co-authored-by: Ildar Iskhakov <ildar.iskhakov@grafana.com> Co-authored-by: Innokentii Konstantinov <innokenty.konstantinov@grafana.com> Co-authored-by: Julia <ferril.darkdiver@gmail.com> Co-authored-by: maskin25 <kengurek@gmail.com> Co-authored-by: Matias Bordese <mbordese@gmail.com> Co-authored-by: Matvey Kukuy <motakuk@gmail.com> Co-authored-by: Michael Derynck <michael.derynck@grafana.com> Co-authored-by: Richard Hartmann <richih@richih.org> Co-authored-by: Robby Milo <robbymilo@fastmail.com> Co-authored-by: Timur Olzhabayev <timur.olzhabayev@grafana.com> Co-authored-by: Vadim Stepanov <vadimkerr@gmail.com> Co-authored-by: Yulia Shanyrova <yulia.shanyrova@grafana.com>	2022-06-03 08:09:47 -06:00

23 commits