Grafana OnCall engine fork — self-hosted on-call scheduler and alert router

Find a file

Joey Orlando 1df1b1eaa0 patch redis cluster multi-key operations (#3496 ) # Which issue(s) this PR fixes Related to https://github.com/grafana/oncall-private/issues/2363 Addresses this issue that arises when using `cache.get_many`/`cache.set_many` operations with a Redis Cluster: ```python3 File "/usr/local/lib/python3.11/site-packages/redis/cluster.py", line 1006, in determine_slot raise RedisClusterException( redis.exceptions.RedisClusterException: MGET - all keys must map to the same key slot ``` From the Redis Cluster [docs](https://redis.io/docs/reference/cluster-spec/#hash-tags), this can be addressed with this 👇 . Basically this will ensure that keys in multi-key operations will resolve to the same hash slot (read: node): > Hash tags > There is an exception for the computation of the hash slot that is used in order to implement hash tags. Hash tags are a way to ensure that multiple keys are allocated in the same hash slot. This is used in order to implement multi-key operations in Redis Cluster. > > To implement hash tags, the hash slot for a key is computed in a slightly different way in certain conditions. If the key contains a "{...}" pattern only the substring between { and } is hashed in order to obtain the hash slot. However since it is possible that there are multiple occurrences of { or } the algorithm is well specified by the following rules: > > IF the key contains a { character. > AND IF there is a } character to the right of {. > AND IF there are one or more characters between the first occurrence of { and the first occurrence of }. > Then instead of hashing the key, only what is between the first occurrence of { and the following first occurrence of } is hashed. ## Checklist - [x] Unit, integration, and e2e (if applicable) tests updated - [ ] Documentation added (or `pr:no public docs` PR label added if not required) - [ ] `CHANGELOG.md` updated (or `pr:no changelog` PR label added if not required)		2023-12-04 13:08:57 -05:00
.github	Revert "upgrade to Python 3.12 (#3456 )" and "bump uwsgi version to latest #3466 " (#3483 )	2023-12-01 09:56:26 -05:00
dev	Revert "upgrade to Python 3.12 (#3456 )" and "bump uwsgi version to latest #3466 " (#3483 )	2023-12-01 09:56:26 -05:00
docs	Update `make docs` procedure (#3462 )	2023-11-30 09:58:59 +00:00
engine	patch redis cluster multi-key operations (#3496 )	2023-12-04 13:08:57 -05:00
grafana-plugin	Disallow creating and deleting direct paging integrations (#3475 )	2023-12-04 13:13:53 +00:00
helm	Release oncall Helm chart 1.3.63	2023-11-28 02:17:40 +00:00
terraform	Remove unnecessary team checks (#2606 )	2023-07-21 15:55:57 +01:00
tools	Revert "upgrade to Python 3.12 (#3456 )" and "bump uwsgi version to latest #3466 " (#3483 )	2023-12-01 09:56:26 -05:00
.dockerignore	WIP: Direct paging improvements (#3064 )	2023-09-28 03:57:49 +00:00
.drone.yml	Revert "upgrade to Python 3.12 (#3456 )" and "bump uwsgi version to latest #3466 " (#3483 )	2023-12-01 09:56:26 -05:00
.gitignore	Use Tilt for local development (#1396 )	2023-09-07 19:38:19 +08:00
.markdownlint.json	don't enforce line-length rule for markdownlint for code-blocks or tables (#2145 )	2023-06-09 06:57:19 +00:00
.markdownlintignore	Add tracing support	2022-12-19 17:15:06 +08:00
.pre-commit-config.yaml	Revert "upgrade to Python 3.12 (#3456 )" and "bump uwsgi version to latest #3466 " (#3483 )	2023-12-01 09:56:26 -05:00
.yamllint.yml	configure yamllint pre-commit step (#2728 )	2023-08-03 02:35:08 -04:00
CHANGELOG.md	Disallow creating and deleting direct paging integrations (#3475 )	2023-12-04 13:13:53 +00:00
CODE_OF_CONDUCT.md	add precommit rules for markdown/json files (#915 )	2022-12-01 14:26:54 +01:00
docker-compose-developer.yml	Telegram long polling (#2250 )	2023-08-24 09:12:24 +02:00
docker-compose-mysql-rabbitmq.yml	fix make start command when using mysql/postgres as db (#2744 )	2023-08-03 11:50:40 -04:00
docker-compose.yml	Update docker-compose.yml (#3266 )	2023-11-03 17:09:24 +08:00
GOVERNANCE.md	add precommit rules for markdown/json files (#915 )	2022-12-01 14:26:54 +01:00
LICENSE	World, meet OnCall!	2022-06-03 08:09:47 -06:00
LICENSING.md	add precommit rules for markdown/json files (#915 )	2022-12-01 14:26:54 +01:00
MAINTAINERS.md	add precommit rules for markdown/json files (#915 )	2022-12-01 14:26:54 +01:00
Makefile	Revert "upgrade to Python 3.12 (#3456 )" and "bump uwsgi version to latest #3466 " (#3483 )	2023-12-01 09:56:26 -05:00
README.md	[README]fix prometheus yml indentation error (#2327 )	2023-06-26 13:44:33 +00:00
screenshot.png	Merge dev to main (#54 )	2022-06-13 16:39:58 -06:00
screenshot_mobile.png	Readme updates	2023-04-11 15:43:52 +03:00
Tiltfile	Revert "Cache independent ingestion" (#3417 )	2023-11-23 21:38:06 +08:00

README.md

Grafana OnCall

Developer-friendly incident response with brilliant Slack integration.

Android & iOS:

Collect and analyze alerts from multiple monitoring systems
On-call rotations based on schedules
Automatic escalations
Phone calls, SMS, Slack, Telegram notifications

Getting Started

We prepared multiple environments:

production
developer
hobby (described in the following steps)

Download docker-compose.yml:

curl -fsSL https://raw.githubusercontent.com/grafana/oncall/dev/docker-compose.yml -o docker-compose.yml

Set variables:

echo "DOMAIN=http://localhost:8080
# Remove 'with_grafana' below if you want to use existing grafana
# Add 'with_prometheus' below to optionally enable a local prometheus for oncall metrics
# e.g. COMPOSE_PROFILES=with_grafana,with_prometheus
COMPOSE_PROFILES=with_grafana
# to setup an auth token for prometheus exporter metrics:
# PROMETHEUS_EXPORTER_SECRET=my_random_prometheus_secret
# also, make sure to enable the /metrics endpoint:
# FEATURE_PROMETHEUS_EXPORTER_ENABLED=True
SECRET_KEY=my_random_secret_must_be_more_than_32_characters_long" > .env

(Optional) If you want to enable/setup the prometheus metrics exporter (besides the changes above), create a prometheus.yml file (replacing my_random_prometheus_secret accordingly), next to your docker-compose.yml:

echo "global:
  scrape_interval:     15s
  evaluation_interval: 15s

scrape_configs:
  - job_name: prometheus
    metrics_path: /metrics/
    authorization:
      credentials: my_random_prometheus_secret
    static_configs:
      - targets: [\"host.docker.internal:8080\"]" > prometheus.yml

NOTE: you will need to setup a Prometheus datasource using http://prometheus:9090 as the URL in the Grafana UI.

Launch services:

docker-compose pull && docker-compose up -d

Go to OnCall Plugin Configuration, using log in credentials as defined above: admin/admin (or find OnCall plugin in configuration->plugins) and connect OnCall plugin with OnCall backend:
```
OnCall backend URL: http://engine:8080
```
Enjoy! Check our OSS docs if you want to set up Slack, Telegram, Twilio or SMS/calls through Grafana Cloud.

Update version

To update your Grafana OnCall hobby environment:

# Update Docker image
docker-compose pull engine

# Re-deploy
docker-compose up -d

After updating the engine, you'll also need to click the "Update" button on the plugin version page. See Grafana docs for more info on updating Grafana plugins.

README.md

Grafana OnCall

Getting Started

Update version

Join community

Stargazers over time

Further Reading