Feature flags for blue-green deployment: getting started

August 19, 2024

Article by Michael Ferranti

A blue-green deployment is a well-established software delivery technique. It’s meant to help you minimize the risk of downtime and errors when deploying new code to production.

Together, blue-green deployments and feature flags enable teams to test, validate, and release software with greater flexibility, less risk, and more control.

In this guide, we’ll explore how blue-green deployments work, their trade-offs, and how feature flags can complement or even replace blue-green deployments.

What is a blue-green deployment?

Blue-green deployment is a technique for releasing new software updates with minimal downtime and risk. It involves running two identical production environments, called “blue” and “green”, in parallel. The green environment is used to test and validate new software versions, while the blue environment continues to run the previous one.

To release a new version of the software, the green environment is brought online and the blue environment is taken out of rotation. Because green is a real production environment, the new version can be validated there before all users depend on it. If any issues are encountered, the blue environment can be brought back online while the issues in the green environment are fixed.

Once the new version has been tested in the green environment and proven stable, it can be deployed to the blue environment and the process can be repeated for future releases. This approach enables fast and flexible deployment of software updates while minimizing the risk of downtime or other issues.

How does a blue-green deployment work?

Once the two environments are set up, a blue-green deployment starts by deploying new code to the idle green environment. Then a portion of traffic is directed to the green environment via a load balancer, and the new version is tested thoroughly. If everything looks good, you gradually shift more traffic from blue to green.

If all goes well, you eventually move all your traffic to the green environment and can retire the old blue environment. Your clients then see the new version with no downtime between the blue environment running your old code and the green environment running your new code.

If you find a problem, you can quickly switch back to the blue environment while you work on a fix. Once the fixed version has been tested in the green environment and proven stable, it can be deployed to the blue environment and the process can be repeated for future releases.
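The gradual shift and the rollback described above can be sketched in a few lines. This is a toy model, not a real load balancer: the `BlueGreenRouter` class, the `rollout` helper, and the 25% step size are all hypothetical names and values chosen for illustration.

```python
import random


class BlueGreenRouter:
    """Toy stand-in for a load balancer that splits traffic by weight."""

    def __init__(self, seed=None):
        self.green_weight = 0.0  # fraction of requests sent to green
        self._rng = random.Random(seed)

    def route(self):
        """Pick the environment that serves the next request."""
        return "green" if self._rng.random() < self.green_weight else "blue"

    def shift(self, step=0.25):
        """Move another slice of traffic from blue to green."""
        self.green_weight = min(1.0, self.green_weight + step)

    def rollback(self):
        """Send all traffic back to the known-good blue environment."""
        self.green_weight = 0.0


def rollout(router, green_is_healthy, step=0.25):
    """Shift traffic step by step; roll back as soon as a check fails."""
    while router.green_weight < 1.0:
        router.shift(step)
        if not green_is_healthy():
            router.rollback()
            return False
    return True
```

In practice the health check would look at error rates or latency in the green environment, and the weights would live in your load balancer’s configuration rather than in application code.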

The advantages of blue-green deployment

As we’ve just determined, blue-green deployments are great for testing new versions in a real production environment, and they let you quickly roll back if something goes wrong. That ability to quickly roll back to a known good state is a big safety net. But there are other advantages:

Some teams use it as part of a disaster recovery strategy because it gives you a standby, production-ready environment ready to go.
Some teams make use of the idle green deployment as part of their load testing strategy. You can copy all production traffic and send it to the green environment (also known as traffic shadowing). As it receives the copied traffic, you can observe how it handles the load, while responses are not sent back to your actual users. The blue environment continues to serve all real user requests.
It allows easy rollback to the previous version if there are problems with the new one.
It minimizes downtime, because the switch from one environment to the other is usually quick and seamless.
It allows a new version to be tested in a production-like environment before it is made available to users.
It enables continuous deployment of new software versions.
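To make the traffic shadowing idea from the list above more concrete, here is a minimal sketch. It assumes both environments can be called as plain functions, which is a simplification: real setups usually mirror requests asynchronously at the proxy or load balancer. The names `handle_with_shadow`, `blue_handler`, `green_handler`, and `observations` are hypothetical.

```python
def handle_with_shadow(request, blue_handler, green_handler, observations):
    """Serve from blue; mirror the same request to green and record the result."""
    response = blue_handler(request)  # only this response reaches the user
    try:
        shadow = green_handler(request)
        # Record whether green agreed with blue for later inspection.
        observations.append((request, "ok", shadow == response))
    except Exception as exc:  # a green failure must never affect the user
        observations.append((request, f"error: {exc}", False))
    return response
```

The key property is that the user only ever receives the blue response, while the green result is kept purely for observation.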

The blue-green deployment process is often used in conjunction with automated deployment and testing tools, such as CI/CD pipelines, to make testing and releasing new software more efficient and reliable.

The challenges with blue-green deployment

Like everything in software, there are no solutions, only trade-offs. Here are some of the challenges with this approach.

Replicating a production environment can get complicated, especially when working with microservices.
There can be costs associated with maintaining a copy of your production environment.
Changes to databases. Most applications can only run against one version of a database at a time, which makes two environments with two separate databases challenging.
Schema changes are also a challenge. If your database schema has to change between deployments, a good practice to follow is the expand/contract pattern. This approach supports both old and new code versions during the transition, making it compatible with blue-green deployments and allowing for safe rollbacks if needed. We have more tips to help you deal with this in our documentation.
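As a rough illustration of the expand/contract pattern, the sketch below renames a column in an in-memory SQLite database. The `users` table, the `name` and `display_name` columns, and the `create_user` helper are made up for this example; a real migration would go through your migration tooling.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users (name) VALUES ('Ada Lovelace')")

# Expand: add the new column alongside the old one. Old (blue) code that
# only reads `name` keeps working unchanged.
conn.execute("ALTER TABLE users ADD COLUMN display_name TEXT")

# Migrate: backfill existing rows so new (green) code can read display_name.
conn.execute("UPDATE users SET display_name = name WHERE display_name IS NULL")


def create_user(name):
    """Transitional write path: fill both columns while both versions are live."""
    conn.execute(
        "INSERT INTO users (name, display_name) VALUES (?, ?)", (name, name)
    )


create_user("Grace Hopper")

# Both versions of the application see consistent data during the transition.
old_reads = [r[0] for r in conn.execute("SELECT name FROM users ORDER BY id")]
new_reads = [r[0] for r in conn.execute("SELECT display_name FROM users ORDER BY id")]

# Contract: only after blue is retired and rollback is no longer needed,
# drop the old column (left as a comment because it cannot be undone):
#   ALTER TABLE users DROP COLUMN name
```

Because the old column stays in place until the contract step, switching traffic back to blue remains safe for the whole transition.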

When to use a feature flag instead of blue-green deployment

A blue-green deployment lets you test a new version of your code on a subset of users. That sounds a lot like a feature flag. So it raises the question: when should I use a blue-green deployment, and when should I use a feature flag?

Here are some scenarios where using feature flags makes more sense than blue-green deployments. Like everything else in software, it’s about trade-offs, but feature flags make your life a lot easier when you need to deal with the following:

Fine-grained control: When you need to enable or disable specific features at a very granular level, down to individual users, subsets of users or requests.
Long-term toggles: When operational features such as log levels, rate limits, and kill switches need to be switchable over extended periods.
Complex rollback scenarios: In cases where rolling back involves more than just switching traffic, like reversing migrations.
Complex testing: For complex A/B or multivariate testing that requires you to maintain multiple variants of a feature at the same time.
Frequent production updates: When you’re releasing small changes multiple times a day, and want to control their release without having to copy your production environment.
Restricting features by geography: It’s significantly easier to enable features only for users in specific countries or regions with feature flags. It’s possible at the load balancer level, but feature flags make this process more transparent to other people in your company.
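Several of the scenarios above (individual users, kill switches, geographies, gradual rollouts) come down to evaluating a few rules per request. The following is a simplified sketch of that idea and not the Unleash SDK or its API: the flag dictionary layout and the `bucket` and `is_enabled` functions are assumptions made for illustration.

```python
import hashlib


def bucket(flag_name, user_id):
    """Map a user to a stable number from 0 to 99 for a given flag."""
    digest = hashlib.sha256(f"{flag_name}:{user_id}".encode()).hexdigest()
    return int(digest, 16) % 100


def is_enabled(flag, user_id, country=None):
    """Evaluate one flag for one user."""
    if not flag.get("enabled", False):  # kill switch overrides everything
        return False
    if user_id in flag.get("allow_users", ()):  # individually targeted users
        return True
    countries = flag.get("countries")
    if countries is not None and country not in countries:  # geography
        return False
    # Gradual rollout: the same user always lands in the same bucket.
    return bucket(flag["name"], user_id) < flag.get("rollout_percent", 100)


new_checkout = {
    "name": "new-checkout",
    "enabled": True,
    "allow_users": {"qa-1"},
    "countries": {"NO", "SE"},
    "rollout_percent": 100,
}
```

Hashing the flag name together with the user ID keeps each user’s experience stable across requests, and it means different flags do not all roll out to the same set of users.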

Using feature flags with blue-green deployment

Feature flags can also be used as an enhancement to blue-green deployments. The idea is to connect your feature management service to both environments. You can then enable or disable feature flags in either environment.

That lets you gradually activate and test new features in the green environment before the full traffic switch. If there are problems, you can quickly disable specific flags without rolling back the entire deployment.

This combined approach lets you:

Maintain the safety net of blue-green deployments
Get the flexibility of feature flags
Reduce the risk associated with releasing multiple changes simultaneously
Perform more granular testing in production with real users
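Here is a small sketch of what connecting one flag service to both environments could look like. It uses a hypothetical in-memory `FlagStore` rather than a real feature management service, and the `new-checkout` flag and `render_checkout` function are invented for the example.

```python
class FlagStore:
    """Hypothetical in-memory flag service shared by both environments."""

    def __init__(self):
        self._state = {}  # (environment, flag name) -> bool

    def set(self, environment, flag, enabled):
        self._state[(environment, flag)] = enabled

    def is_enabled(self, environment, flag):
        # Flags default to off in any environment where they were never set.
        return self._state.get((environment, flag), False)


def render_checkout(environment, flags):
    """Both code paths ship in the same build; the flag picks one at runtime."""
    if flags.is_enabled(environment, "new-checkout"):
        return "new checkout"
    return "old checkout"


flags = FlagStore()
flags.set("green", "new-checkout", True)   # test the feature in green only
before = render_checkout("green", flags)
flags.set("green", "new-checkout", False)  # problem found: turn off one flag
after = render_checkout("green", flags)
```

Turning the flag off changes behavior in green without redeploying or switching traffic back to blue, which is the finer-grained alternative to a full rollback.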

The Unleash approach to feature flags

Feature flags enable development teams to manage features in a dynamic, flexible, and controlled manner. Like any tool, you need to use them the right way—you don’t want to build a spaceship out of bricks.

Unlike proprietary software where users are bound to the product roadmap determined by the company (and its shareholders), an open-source feature management system allows you to modify and improve the software based on your specific use cases. Our users are not bound or dependent on the limitations of our code.

Unleash open source is available as a Docker container, or as a click-to-deploy option on Heroku and DigitalOcean. Choose your preferred deployment and get started in minutes.

While there are proprietary tools such as LaunchDarkly, we believe there are a lot of benefits to using an open-source system like ours. See for yourself.
