keycloak-scim/docs/guides/high-availability/concepts-threads.adoc

<#import "/templates/guide.adoc" as tmpl>
<#import "/templates/links.adoc" as links>

<@tmpl.guide
title="Concepts for configuring thread pools"
summary="Understand these concepts to avoid resource exhaustion and congestion"
tileVisible="false" >


This section is intended when you want to understand the considerations and best practices on how to configure thread pools connection pools for {project_name}.
For a configuration where this is applied, visit <@links.ha id="deploy-keycloak-kubernetes" />.

== Concepts

=== Quarkus executor pool

{project_name} requests, as well as blocking probes, are handled by an executor pool. Depending on the available CPU cores, it has a maximum size of 50 or more threads.
Threads are created as needed, and will end when no longer needed, so the system will scale up and down automatically.
{project_name} allows configuring the maximum thread pool size by the link:{links_server_all-config_url}?q=http-pool-max-threads[`http-pool-max-threads`] configuration option. See <@links.ha id="deploy-keycloak-kubernetes" /> for an example.

When running on Kubernetes, adjust the number of worker threads to avoid creating more load than what the CPU limit allows for the Pod to avoid throttling, which would lead to congestion.
When running on physical machines, adjust the number of worker threads to avoid creating more load than the node can handle to avoid congestion.
Congestion would result in longer response times and an increased memory usage, and eventually an unstable system.

Ideally, you should start with a low limit of threads and adjust it accordingly to the target throughput and response time.
When the load and the number of threads increases, the database connections can also become a bottleneck.
Once a request cannot acquire a database connection within 5 seconds, it will fail with a message in the log like `Unable to acquire JDBC Connection`.
The caller will receive a response with a 5xx HTTP status code indicating a server side error.

If you increase the number of database connections and the number of threads too much, the system will be congested under a high load with requests queueing up, which leads to a bad performance.
The number of database connections is configured via the link:{links_server_all-config_url}?q=db-pool[`Database` settings `db-pool-initial-size`, `db-pool-min-size` and `db-pool-max-size`] respectively.
Low numbers ensure fast response times for all clients, even if there is an occasionally failing request when there is a load spike.

=== JGroups connection pool

[NOTE]
====
* This currently applies to single-site setups only.
In a multi-site setup with an external {jdgserver_name} this is not a restriction.
* This currently applies if virtual threads are disabled.
Since {project_name} 26.1, virtual threads are enabled in both embedded Infinispan and JGroups if running on OpenJDK 21 or higher.
====


The combined number of executor threads in all {project_name} nodes in the cluster should not exceed too much the number of threads available in JGroups thread pool to avoid the warning `thread pool is full (max=<value>, active=<value>)`.

The warning includes a thread dump when the Java system property `-Djgroups.thread_dumps_enabled=true` is set.
It may incur in a penalty in performance collecting those thread dumps.

--
include::partials/threads/executor-jgroups-thread-calculation.adoc[]
--

Use metrics to monitor the total JGroup threads in the pool and for the threads active in the pool.
When using TCP as the JGroups transport protocol, the metrics `vendor_jgroups_tcp_get_thread_pool_size` and `vendor_jgroups_tcp_get_thread_pool_size_active` are available for monitoring. When using UDP, the metrics `vendor_jgroups_udp_get_thread_pool_size` and `vendor_jgroups_udp_get_thread_pool_size_active` are available.
This is useful to monitor that limiting the Quarkus thread pool size keeps the number of active JGroup threads below the maximum JGroup thread pool size.

WARNING: The metrics are not available when virtual threads are enabled in JGroups.

[#load-shedding]
=== Load Shedding

By default, {project_name} will queue all incoming requests infinitely, even if the request processing stalls.
This will use additional memory in the Pod, can exhaust resources in the load balancers, and the requests will eventually time out on the client side without the client knowing if the request has been processed.
To limit the number of queued requests in {project_name}, set an additional Quarkus configuration option.

Configure `http-max-queued-requests` to specify a maximum queue length to allow for effective load shedding once this queue size is exceeded.
Assuming a {project_name} Pod processes around 200 requests per second, a queue of 1000 would lead to maximum waiting times of around 5 seconds.

When this setting is active, requests that exceed the number of queued requests will return with an HTTP 503 error.
{project_name} logs the error message in its log.

[#probes]
=== Probes

{project_name}'s liveness probe is non-blocking to avoid a restart of a Pod under a high load.

// Developer's note: See KeycloakReadyHealthCheck for the details of the blocking/non-blocking behavior
The overall health probe and the readiness can probe in some cases block to check the connection to the database, so they might fail under a high load.
Due to this, a Pod can become non-ready under a high load.

=== OS Resources

In order for Java to create threads, when running on Linux it needs to have file handles available.
Therefore, the number of open files (as retrieved as `ulimit -n` on Linux) need to provide head-space for {project_name} to increase the number of threads needed.
Each thread will also consume memory, and the container memory limits need to be set to a value that allows for this or the Pod will be killed by Kubernetes.

</@tmpl.guide>
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00			`<#import "/templates/guide.adoc" as tmpl>`
			`<#import "/templates/links.adoc" as links>`

			`<@tmpl.guide`
			`title="Concepts for configuring thread pools"`
			`summary="Understand these concepts to avoid resource exhaustion and congestion"`
			`tileVisible="false" >`


			`This section is intended when you want to understand the considerations and best practices on how to configure thread pools connection pools for {project_name}.`
			`For a configuration where this is applied, visit <@links.ha id="deploy-keycloak-kubernetes" />.`

			`== Concepts`

			`=== Quarkus executor pool`

fix: scaling and tuning getting started guide closes: #29388 Signed-off-by: Steve Hawkins <shawkins@redhat.com> Signed-off-by: Steven Hawkins <shawkins@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: andymunro <48995441+andymunro@users.noreply.github.com> 2024-07-18 13:31:37 +00:00			`{project_name} requests, as well as blocking probes, are handled by an executor pool. Depending on the available CPU cores, it has a maximum size of 50 or more threads.`
Improve wording for Concepts for configuring thread pools in docs Closes #26402 Signed-off-by: Thomas Darimont <thomas.darimont@googlemail.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-01-23 12:56:55 +00:00			`Threads are created as needed, and will end when no longer needed, so the system will scale up and down automatically.`
Keycloak users should not need to understand the depths of Quarkus configuration to implement Keycloak HA (#27122) Closes #27121 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> 2024-02-21 12:49:14 +00:00			{project_name} allows configuring the maximum thread pool size by the link:{links_server_all-config_url}?q=http-pool-max-threads[`http-pool-max-threads`] configuration option. See <@links.ha id="deploy-keycloak-kubernetes" /> for an example.
Use http-pool-max-threads in HA guides Closes #26849 Signed-off-by: Michal Hajas <mhajas@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-02-13 10:01:59 +00:00
Improve wording for Concepts for configuring thread pools in docs Closes #26402 Signed-off-by: Thomas Darimont <thomas.darimont@googlemail.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-01-23 12:56:55 +00:00			`When running on Kubernetes, adjust the number of worker threads to avoid creating more load than what the CPU limit allows for the Pod to avoid throttling, which would lead to congestion.`
Adress keycloak high-availability guide follow-up items Closes #24975 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> 2023-11-23 14:19:11 +00:00			`When running on physical machines, adjust the number of worker threads to avoid creating more load than the node can handle to avoid congestion.`
			`Congestion would result in longer response times and an increased memory usage, and eventually an unstable system.`

			`Ideally, you should start with a low limit of threads and adjust it accordingly to the target throughput and response time.`
Improve wording for Concepts for configuring thread pools in docs Closes #26402 Signed-off-by: Thomas Darimont <thomas.darimont@googlemail.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-01-23 12:56:55 +00:00			`When the load and the number of threads increases, the database connections can also become a bottleneck.`
Reduce internal unsupported options in the Keycloak HA documentation Closes #26068 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> 2024-01-10 10:34:16 +00:00			Once a request cannot acquire a database connection within 5 seconds, it will fail with a message in the log like `Unable to acquire JDBC Connection`.
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00			`The caller will receive a response with a 5xx HTTP status code indicating a server side error.`

Improve wording for Concepts for configuring thread pools in docs Closes #26402 Signed-off-by: Thomas Darimont <thomas.darimont@googlemail.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-01-23 12:56:55 +00:00			`If you increase the number of database connections and the number of threads too much, the system will be congested under a high load with requests queueing up, which leads to a bad performance.`
Keycloak users should not need to understand the depths of Quarkus configuration to implement Keycloak HA (#27122) Closes #27121 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> 2024-02-21 12:49:14 +00:00			The number of database connections is configured via the link:{links_server_all-config_url}?q=db-pool[`Database` settings `db-pool-initial-size`, `db-pool-min-size` and `db-pool-max-size`] respectively.
Reduce internal unsupported options in the Keycloak HA documentation Closes #26068 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> 2024-01-10 10:34:16 +00:00			`Low numbers ensure fast response times for all clients, even if there is an occasionally failing request when there is a load spike.`
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00
			`=== JGroups connection pool`

Enable virtual threads in Infinispan and JGroups by default Closes #33939 Signed-off-by: Pedro Ruivo <pruivo@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-10-21 16:02:28 +00:00			`[NOTE]`
			`====`
			`* This currently applies to single-site setups only.`
			`In a multi-site setup with an external {jdgserver_name} this is not a restriction.`
			`* This currently applies if virtual threads are disabled.`
			`Since {project_name} 26.1, virtual threads are enabled in both embedded Infinispan and JGroups if running on OpenJDK 21 or higher.`
			`====`
Revisit Multi Site Guide Closes #32745 Signed-off-by: Pedro Ruivo <pruivo@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-09-24 12:08:15 +00:00
Enable virtual threads in Infinispan and JGroups by default Closes #33939 Signed-off-by: Pedro Ruivo <pruivo@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-10-21 16:02:28 +00:00
			The combined number of executor threads in all {project_name} nodes in the cluster should not exceed too much the number of threads available in JGroups thread pool to avoid the warning `thread pool is full (max=<value>, active=<value>)`.

			The warning includes a thread dump when the Java system property `-Djgroups.thread_dumps_enabled=true` is set.
			`It may incur in a penalty in performance collecting those thread dumps.`
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00
			`--`
			`include::partials/threads/executor-jgroups-thread-calculation.adoc[]`
			`--`

Add default stack in cache-ispn.xml A bug in Infinispan prevents the metrics to be registered if the "stack" is not specified. Change the default configuration shipped with Keycloak to use the UDP stack as default. UDP is the default in previous Keycloak versions. Fixes #31218 Signed-off-by: Pedro Ruivo <pruivo@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-07-16 10:05:38 +00:00			`Use metrics to monitor the total JGroup threads in the pool and for the threads active in the pool.`
			When using TCP as the JGroups transport protocol, the metrics `vendor_jgroups_tcp_get_thread_pool_size` and `vendor_jgroups_tcp_get_thread_pool_size_active` are available for monitoring. When using UDP, the metrics `vendor_jgroups_udp_get_thread_pool_size` and `vendor_jgroups_udp_get_thread_pool_size_active` are available.
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00			`This is useful to monitor that limiting the Quarkus thread pool size keeps the number of active JGroup threads below the maximum JGroup thread pool size.`

Enable virtual threads in Infinispan and JGroups by default Closes #33939 Signed-off-by: Pedro Ruivo <pruivo@redhat.com> Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Alexander Schwartz <aschwart@redhat.com> 2024-10-21 16:02:28 +00:00			`WARNING: The metrics are not available when virtual threads are enabled in JGroups.`

Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00			`[#load-shedding]`
			`=== Load Shedding`

			`By default, {project_name} will queue all incoming requests infinitely, even if the request processing stalls.`
			`This will use additional memory in the Pod, can exhaust resources in the load balancers, and the requests will eventually time out on the client side without the client knowing if the request has been processed.`
			`To limit the number of queued requests in {project_name}, set an additional Quarkus configuration option.`

Use the http-max-queued-requests option for load shedding in HA docs Resolves #26223 Signed-off-by: Ryan Emerson <remerson@redhat.com> 2024-01-17 14:44:08 +00:00			Configure `http-max-queued-requests` to specify a maximum queue length to allow for effective load shedding once this queue size is exceeded.
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00			`Assuming a {project_name} Pod processes around 200 requests per second, a queue of 1000 would lead to maximum waiting times of around 5 seconds.`

			`When this setting is active, requests that exceed the number of queued requests will return with an HTTP 503 error.`
			`{project_name} logs the error message in its log.`

			`[#probes]`
			`=== Probes`

Update HA guide about non-blocking probes (#26783) Closes #26781 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> 2024-02-07 15:16:50 +00:00			`{project_name}'s liveness probe is non-blocking to avoid a restart of a Pod under a high load.`

			`// Developer's note: See KeycloakReadyHealthCheck for the details of the blocking/non-blocking behavior`
			`The overall health probe and the readiness can probe in some cases block to check the connection to the database, so they might fail under a high load.`
			`Due to this, a Pod can become non-ready under a high load.`
Adding a Keycloak High Availability section to Keycloak's docs The content was moved over from the Keycloak Benchmark subproject. Closes #24844 Signed-off-by: Alexander Schwartz <aschwart@redhat.com> Co-authored-by: Pedro Ruivo <pruivo@redhat.com> Co-authored-by: Michal Hajas <mhajas@redhat.com> Co-authored-by: Kamesh Akella <kakella@redhat.com> Co-authored-by: Ryan Emerson <remerson@redhat.com> Co-authored-by: Anna Manukyan <amanukya@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: Stian Thorgersen <stian@redhat.com> Co-authored-by: Thomas Darimont <thomas.darimont@googlemail.com> Co-authored-by: AndyMunro <amunro@redhat.com> 2023-11-23 12:27:47 +00:00
			`=== OS Resources`

			`In order for Java to create threads, when running on Linux it needs to have file handles available.`
			Therefore, the number of open files (as retrieved as `ulimit -n` on Linux) need to provide head-space for {project_name} to increase the number of threads needed.
			`Each thread will also consume memory, and the container memory limits need to be set to a value that allows for this or the Pod will be killed by Kubernetes.`

			`</@tmpl.guide>`