Improving zero downtime on Kubernetes

Date posted: 7 February 2020
Reading time: 10 minutes
Daryl Porter


A few months ago, we began a piece of work on my last project setting out to accomplish two things:

  1. Improve the speed of our release process
  2. Allow us to run multiple versions of the application concurrently

These were two limitations of our initial deployment and release design and so in sharing this blog I am hoping the lessons learnt will be able to help you avoid the same issues.

Before jumping into the solution, I'll quickly describe our starting point.

Please note: This is a simplified view of the infrastructure, we'll only describe elements relevant to the zero downtime aspect of the solution.

Configuration - before

[Diagram: configuration before]

Design decisions

  - Separate namespaces for the web tier (app-publishing) & the application tier (blue & green)
  - Application namespaces (blue/green) would be immutable and therefore would only ever contain a single release
  - Application deployed via a single Helm chart
  - Use the Nginx Ingress controller to manage Kubernetes Ingress resources

Issues with this configuration

1. Ingress controller reloads cause 1 second of dropped traffic
The redirection mechanism, illustrated in red in the diagram above, redirected live traffic between the 'blue' and 'green' namespaces by updating the Ingress resource in app-publishing to point at the redirector service resource in the cold blue/green namespace, as shown below.
[Diagram: Ingress resource update during redirection]

Updating the Ingress resource to point at a new service means the Ingress controller would need to perform a reload of the Nginx configuration, which incurred on average one second of dropped connections, resulting in 500 errors (this and other causes of reloads can be found here).

This meant, in order to perform zero downtime releases, we had to remove Region1 from Traffic Manager, update it while offline, add it back into Traffic Manager & repeat for Region2. This led to the next issue with this setup.

2. Unable to support multiple versions
The application needs to be able to route user traffic in two scenarios:
  1. New users landing on the homepage should be routed to the latest stable version of the application.
  2. Existing user sessions requesting version specific assets should be routed to the requested version.

In the current design, we have no way of routing based on requested version and only have one Ingress flow for both scenarios of traffic.

This meant that during the window in which Region1 was updated and re-introduced to Traffic Manager, and Region2 was being drained (typically between 5 and 10 minutes), existing user sessions on Region2 would have no route to the version specific assets on Region1, resulting in a 404.

3. Slow release process
The last issue with this setup was the amount of time it took to perform a release. Although the only manual step in the release process, as mentioned above, was removing and re-adding regions in Traffic Manager, waiting for connections to drain before switching to the new version drastically slowed down our release process: redirecting to a new, already tested release could take up to 20 minutes.

The fix
The plan for resolving this was quite simple and can be summarised by three key points:

1. Remove the need for Nginx Ingress controller reloads during redirection.
The root of our redirection issues is the need for Nginx Ingress controller reloads, and the 1 second of downtime this incurs. We therefore need to ensure all Ingress resources remain static, meaning their downstream config cannot change after initial configuration. In doing this, we need to find another way to redirect traffic. This is where label selectors come in.

Label selectors allow us to route traffic to target pods within a single namespace without incurring a connection drop. This was the key to unlocking zero downtime deployments without Traffic Manager manipulation.

In order to facilitate this, we had to move away from blue/green namespaces to a single production namespace, meaning our immutable unit of release was no longer the entire namespace but instead the Helm release.
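
As a rough sketch of what this looks like (the names below are illustrative, not taken from our actual charts), the 'live' Service selects pods by a release label, so several Helm releases can coexist in the namespace while only one receives live traffic:

```yaml
# Hypothetical example: the live Service in the single production namespace.
# Every Helm release labels its pods with 'app' plus a unique 'release' value;
# only pods matching this selector receive live traffic.
apiVersion: v1
kind: Service
metadata:
  name: myapp-live
  namespace: production
spec:
  selector:
    app: myapp
    release: myapp-1-2-3   # updated at redirection time (see the patch further below)
  ports:
    - port: 80
      targetPort: 8080
```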

2. New default Ingress & service resources
We would introduce a new 'default' Ingress flow to handle new user traffic. The Ingress resources within this flow would remain static and we would instead utilise label selectors for zero downtime redirection.
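
A minimal sketch of that static default Ingress, assuming the hypothetical myapp-live Service above (host and class are illustrative):

```yaml
# Hypothetical default Ingress: sends new users landing on the homepage to
# whichever release the myapp-live Service currently selects.
# This resource never changes during redirection, so no Nginx reload occurs.
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: myapp-default
  namespace: production
spec:
  ingressClassName: nginx
  rules:
    - host: myapp.example.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: myapp-live
                port:
                  number: 80
```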

3. Restructure Ingress resources
To facilitate the second use case of traffic routing, we would utilise URL path based routing to introduce new version specific Ingress flows for existing user sessions.
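
For example (again with illustrative names and paths, not our real ones), a version specific Ingress can match a version prefix in the URL and route it to a Service pinned to that Helm release:

```yaml
# Hypothetical version-specific Ingress: requests for assets under /1.2.3/
# always reach the pods of release 1.2.3, regardless of which release is live.
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: myapp-1-2-3
  namespace: production
spec:
  ingressClassName: nginx
  rules:
    - host: myapp.example.com
      http:
        paths:
          - path: /1.2.3
            pathType: Prefix
            backend:
              service:
                name: myapp-1-2-3   # Service whose selector is pinned to release 1.2.3
                port:
                  number: 80
```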

Configuration - after

[Diagram: configuration after]

Key points:

  - Dual region & Traffic Manager still exist; they are not shown above as they are now irrelevant to zero downtime
  - Single production namespace
  - Simplified design, with no services referencing resources in different namespaces
  - Default Ingress flow (left) handles the first use case: routing traffic landing on the service's homepage
  - Version specific Ingress handles the second use case: routing traffic requesting version specific assets, based on the URL path
  - Ingress resources remain static and no longer update during redirection
  - Redirection (in red) is now a label selector change using the kubectl patch command, run via a CI job (shown below)

[Diagram: CI job that patches the label selector]
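
As a rough sketch of what that CI step might run (service name, namespace and label values are illustrative, not taken from our actual pipeline):

```shell
# Hypothetical redirection step: point the live Service's selector at the pods
# of the newly released Helm chart. No Ingress change, so no Nginx reload.
kubectl --namespace production patch service myapp-live \
  --type merge \
  --patch '{"spec": {"selector": {"app": "myapp", "release": "myapp-1-2-4"}}}'
```

Because only the Service's label selector changes, the Nginx configuration generated by the Ingress controller stays the same and no connections are dropped.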



The result

  1. The manual step in the redirection process has been removed.
  2. Redirecting to a new version now takes under 1 second with no connection drop (the CI job which runs this takes under 5 seconds).
  3. New versions can be deployed to the production namespace at any time and tested offline before any live traffic is directed to them.
  4. We can support an unlimited number of concurrent versions.
