Prepare to upgrade
Before you upgrade Gloo Edge, complete the following preparatory steps:
- Prepare your environment, such as upgrading your current version to the latest patch and upgrading any dependencies to the required supported versions.
- Review important changes made to Gloo Edge in version 1.15, including CRD, Helm, CLI, and feature changes.
- Review frequently-asked questions about the upgrade process.
Prepare your environment
Review the following preparatory steps that might be required for your environment.
Upgrade your current minor version to the latest patch
Before you upgrade your minor version, first upgrade your current version to the latest patch. For example, if you currently run Gloo Edge Enterprise version 1.14.22, first upgrade your installation to version 1.14.23. This ensures that your current environment is up-to-date with any bug fixes or security patches before you begin the minor version upgrade process.
- Find the latest patch of your minor version by checking the Open Source changelog or Enterprise changelog.
- Go to the documentation set for your current minor version. For example, if you currently run Gloo Edge Enterprise version 1.14.22, use the drop-down menu in the header of this page to select v1.14.x.
- Follow the upgrade guide, using the latest patch for your minor version.
If required, perform incremental minor version updates
If you plan to upgrade to a version that is more than one minor version greater than your current version, such as to version 1.15 from 1.13 or older, you must upgrade incrementally. For example, you must first use the upgrade guide in the v1.14.x documentation set to upgrade from 1.13 to 1.14, and then follow the upgrade guide in the v1.15.x documentation set to upgrade from 1.14 to 1.15.
Upgrade dependencies
Check that your underlying infrastructure platform, such as Kubernetes, and other dependencies run a version that is supported for 1.15.
- Review the supported versions for dependencies such as Kubernetes, Helm, and more.
- Compare the supported versions against the versions you currently use.
- If necessary, upgrade your dependencies, such as consulting your cluster infrastructure provider to upgrade the version of Kubernetes that your cluster runs.
Consider settings to avoid downtime
You might deploy Gloo Edge in Kubernetes environments that use the Kubernetes load balancer, or in non-Kubernetes environments. Depending on your setup, you can take additional steps to avoid downtime during the upgrade process.
- Kubernetes: Enable Envoy readiness and liveness probes during the upgrade. When these probes are set, Kubernetes sends requests only to the healthy Envoy proxy during the upgrade process, which helps to prevent potential downtime. The probes are not enabled in default installations because they can lead to timeouts or other poor getting started experiences.
- Non-Kubernetes: Configure health checks on Envoy. Then, configure your load balancer to leverage these health checks, so that requests stop going to Envoy when it begins draining connections.
Review version 1.15 changes
Review the following changes made to Gloo Edge in version 1.15. For some changes, you might be required to complete additional steps during the upgrade process.
Changelogs
Check the changelogs for the type of Gloo Edge deployment that you have. Focus especially on any Breaking Changes that might require a different upgrade procedure. For Gloo Edge Enterprise, you might also review the open source changelogs because most of the proto definitions are open source.
- Open Source changelogs
- Enterprise changelogs: Keep in mind that Gloo Edge Enterprise pulls in Gloo Edge Open Source as a dependency. Although the major and minor version numbers are the same for open source and enterprise, their patch versions often differ. For example, open source might use version
x.y.a
but enterprise uses versionx.y.b
. If you are unfamiliar with these versioning concepts, see Semantic versioning. Because of the differing patch versions, you might notice different output when checking your version withglooctl version
. For example, your API server might run Gloo Edge Enterprise version 1.15.22, which pulls in Gloo Edge Open Source version 1.15.33 as a dependency.~ > glooctl version Client: {"version":"1.15.33"} Server: {"type":"Gateway","enterprise":true,"kubernetes":...,{"Tag":"1.15.22","Name":"grpcserver-ee","Registry":"quay.io/solo-io"},...,{"Tag":"1.15.33","Name":"discovery","Registry":"quay.io/solo-io"},...}
You can use the changelogs' built-in comparison tool to compare between your current version and the version that you want to upgrade to.
Feature changes
Review the following summary of important new, deprecated, or removed features.
The following lists consist of the changes that were initially introduced with the 1.15.0 release. These changes might be backported to earlier versions of Gloo Edge. Additionally, there might be other changes that are introduced in later 1.15 patch releases. For patch release changes, check the changelogs.
New or improved features:
- Access log flushing: You can now flush the access log on a periodic basis by setting the
tcpProxySettings.accessLogFlushInterval
field in thetcp
CRD. The default behavior is to write to the access log only when a connection is closed. For long-running TCP connections, this process can take a long time. If you flush periodically, you ensure that access logs can be written on a regular interval. - Debug logging on transformations: If you apply transformations in your
VirtualService
resources, you can now enable debug logging with thelogRequestResponseInfo
field. - Inja 3.4: Inja version 3.4 provides access to numerous new templating features for your transformations.
- Kubernetes 1.26 and 1.27: Gloo Edge version 1.15 is now supported on clusters that run Kubernetes version 1.26 and 1.27.
- Online Certificate Status Protocol (OCSP) stapling: If you use servers with OCSP stapling, and fetch or pre-fetch OCSP responses for server domains from OCSP responders, you can now provide an OCSP staple policy for a listener in the
ocspStaplePolicy
field of thessl
CRD. You can store OCSP responses in TLS secrets either by using theglooctl create secret tls
command or by manually storing the OCSP response in thetls.ocsp-staple
field of thesecret
CRD. - Proxy protocol support for upstreams: You can now set the
proxyProtocolVersion
field in yourupstream
Gloo resource. - Redis databases (Enterprise only): When you use a Redis solution other than the default deployment installed by Gloo, you can specify a database other than
0
by setting theredis.service.db
field. Note that this field is ignored for clustered Redis or whenClientSideShardingEnabled
is set to true. - Symmetric encryption: In the
UserSession
section of theExtAuthConfig
resource, you can now apply symmetric encryption to cookie session tokens and values by using thecipherConfig
field. - Timeouts for GraphQL resolutions (Enterprise only): In the
GraphQLApi
resource, you can now define atimeout
for the REST or gRPC resolver. - Upgraded Envoy version dependency: The Envoy dependency in Gloo Edge 1.15 was upgraded from 1.25.x to 1.26.x. This upgrade includes the following changes. For more information about these changes, see the Envoy changelog documentation.
- New header validation: Envoy now validates header names and values before a request is forwarded to the upstream. Header validations are performed after transformation filters are applied to the request. If you use transformation policies to alter header names or values, and an incorrect format is introduced by this transformation, your request might not be forwarded to the upstream. To temporarily revert this change, you can set the
envoy.reloadable_features.validate_upstream_headers
runtime flag to false. For more information about the header manipulation, see the Envoy documentation. - Bugfix for
x-envoy-original-path
header: Envoy fixed a bug where the internalx-envoy-original-path
header was not removed when being sent from untrusted clients. This bugfix is crucial to prevent malicious actors from injecting this header to alter the behavior in Envoy. To temporarily revert this change, you can set theenvoy.reloadable_features.sanitize_original_path
runtime flag to false. - Sanitize non-UTF-8 header data: Envoy now automatically sanitizes non-UTF-8 data and replaces it with a
!
character in gRPC service calls. This behavior fixes a bug where invalid protobuf messages were sent when non-UTF-8 HTTP header data was received. The receiving service typically generated an error when the protobuf message was decoded. This error message led to unintended behavior and other unforseen errors, such as a lack of visibility into requests as requests were not logged. By fixing this bug, Envoy now ensures that data in gRPC service calls is sent in valid UTF-8 format. To temporarily revert this change, you can set theenvoy.reloadable_features.service_sanitize_non_utf8_strings
runtime flag to false.
- New header validation: Envoy now validates header names and values before a request is forwarded to the upstream. Header validations are performed after transformation filters are applied to the request. If you use transformation policies to alter header names or values, and an incorrect format is introduced by this transformation, your request might not be forwarded to the upstream. To temporarily revert this change, you can set the
Helm changes
Review the following summary of important new, deprecated, or removed Helm fields. For full details, see the changelogs.
New and updated Helm fields:
gateway.validation.webhook.skipDeleteValidationResources
: Skip using the validation webhook when you delete certain resources by specifying the resource types. This allows you to delete resources that were valid when created but are now invalid, such as short-lived upstreams.global.extensions.extAuth.namedExtAuth.NAME.name and .namespace
: Specify additional extauth servers.gloo-fed
: The following fields are added to thegloo-fed
section:- Custom securityContexts for pods:
gloo-fed.glooFed.podSecurityContext
- Custom securityContexts for deployments:
gloo-fed.glooFed.glooFed.securityContext
- Custom RBAC roles:
gloo-fed.glooFed.roleRules
- Volumes and volume mounts:
gloo-fed.glooFed.volumes
andgloo-fed.glooFed.glooFed.volumeMounts
- Custom securityContexts for pods:
gloo.headerSecretRefNsMatchesUs
: When set to true, any secrets that are sent in headers to upstreams viaheaderSecretRefs
are required to come from the same namespace as the destination upstream.gloo.settings.ratelimitServer
: Specify override settings for the rate limit server.redis
(Enterprise only): The following fields are added to theredis
section:- When you use a Redis solution other than the default deployment installed by Gloo, you can specify a database other than
0
by setting theredis.service.db
field. Note that this is field is ignored for clustered Redis or whenClientSideShardingEnabled
is set to true. - The
redis.tlsEnabled
field is added to enabled a Redis TLS connection for the rate limit server. The default value is false. - When you set both
redis.disabled
andglobal.extensions.glooRedis.enableAcl
to true, a Redis secret is not created.
- When you use a Redis solution other than the default deployment installed by Gloo, you can specify a database other than
mergePolicy
: This field is added to any security context fields (such asredis.deployment.initContainer.securityContext.mergePolicy
) to allow you to merge custom security context definitions with the default definition, instead of overwriting it.settings.devMode
: In non-production environments, set totrue
to enable a debug endpoint on the Gloo deployment on port 10010.settings.ratelimitServer
: Specify your external rate limit server configuration.
Updated Helm fields:
settings.regexMaxProgramSize
: The default value is changed to1024
. Envoy has a default of 100, which is typically not large enough for regex patterns.
CRD changes
New CRDs are automatically applied to your cluster when performing a helm install
operation, but are not applied when performing an helm upgrade
operation. This is a deliberate design choice on the part of the Helm maintainers, given the risk associated with changing CRDs. Given this limitation, you must apply new CRDs to the cluster before upgrading.
Review the following summary of important new, deprecated, or removed CRD updates. For full details, see the changelogs.
New and updated CRDs:
ExtAuthConfig
: In theUserSession
section, you can now apply symmetric encryption to cookie session tokens and values by using thecipherConfig
field.Gateway
: You can now use thehybridGateway.delegatedTcpGateways
field to configure multiple TCP gateways.Gateway
(Enterprise only): You can now use thehybridGateway.matchedGateway.matcher.passthroughCipherSuites
field to specify passthrough cipher suites.GraphQLApi
(Enterprise only): You can now define atimeout
for the REST or gRPC resolver.hcm
andoptions
: You can now set x-fowarded-host and x-forwarded-post headers by using theappendXForwardedPort
field in thehcm
CRD and theappendXForwardedHost
field in theoptions
CRD
Deprecated CRDs:
ExtAuthConfig
: In theoidcAuthorizationCode
andoauth2Config
sections, thesession
field is now deprecated. Use theuserSession
field instead.
CLI changes
You must upgrade glooctl
before you upgrade Gloo Edge. Because glooctl
can create resources in your cluster, such as with glooctl add route
, you might have errors in Gloo Edge if you create resources with an older version of glooctl
.
Review the following summary of important new, deprecated, or removed CLI options. For full details, see the changelogs.
New CLI commands or options:
glooctl create secret encryptionkey
: Create encryption secrets, such as to use in thecipherConfig
field of theExtAuthConfig
resource.
Frequently-asked questions
Review the following frequently-asked questions about the upgrade process. If you still aren’t sure about the version upgrade impact, or if your use case doesn’t quite fit the standard upgrade path, feel free to post in the #gloo
or #gloo-enterprise
channels of our public Slack.
How do I upgrade Gloo Edge in testing or sandbox environments?
If downtime is not a concern for your use case, you can follow the Quick upgrade guide to update your Gloo Edge installation.
Note that for sandbox or exploratory environments, the easiest way to upgrade is to uninstall Gloo Edge by running glooctl uninstall --all
. Then, re-install Gloo Edge at the desired version by the following one of the installation guides.
How do I upgrade Gloo Edge in a production environment, where downtime is unacceptable?
The basic helm upgrade
process is not suitable for environments in which downtime is unacceptable. Instead, you can follow the Canary upgrade guide to deploy multiple version of Gloo Edge to your cluster, and test the upgrade version before uninstalling the existing version.
Additionally, you might need to take steps to account for other factors such as Gloo Edge version changes, probe configurations, and external infrastructure like the load balancer that Gloo Edge uses. Consider setting up liveness probes and healthchecks in your environment.
What happens to my Gloo Edge CRs during an upgrade? How do I handle breaking changes?
A typical upgrade of Gloo Edge across minor versions should not cause disruptions to the existing Gloo Edge state. In the case of a breaking change, Solo will communicate through the upgrade guides, changelogs, or other channels if you must make a specific adjustment to perform the upgrade. Note that you can always use the glooctl debug yaml
command to download the current Gloo Edge state to one large YAML manifest.
Is the upgrade procedure different if I am not a cluster administrator?
If you are not an administrator of your cluster, you might be unable to create custom resource definitions (CRDs) and other cluster-scoped resources, such as cluster roles and cluster role bindings. If you encounter an error related to these resources, you can disable their creation by including the following setting in your Helm values:
global:
glooRbac:
create: false
Otherwise, you can try performing an installation of Gloo Edge that is scoped to a single namespace by including the following setting in your Helm values:
global:
glooRbac:
namespaced: true
Why do I get an error about re-creating CRDs when upgrading using helm install
or helm upgrade
?
Helm v2 does not manage CRDs well, and is not supported in Gloo Edge. Upgrade to Helm v3, delete the CRDs, and try again.
Why do I get an error about a gateway-certgen
job?
The upgrade creates a Kubernetes Job named gateway-certgen
to generate a certificate for the validation webhook. The job contains the ttlSecondsAfterFinished
value so that the cluster cleans up the job automatically, but because this setting is still in Alpha, your cluster might ignore this value. In this case, you might have an issue while upgrading in which the upgrade attempts to change the gateway-certgen
job, but the change fails because the job is immutable. To fix this issue, you can delete the job, which already completed, and re-apply the upgrade.