Three reasons why Ops Composition works: Cluster Linking, Services and Configuration (pt 2)

Posted on October 7, 2016 by Rob H

In part pt 1, we reviewed the RackN team’s hard won insights from previous deployment automation. We feel strongly that prioritizing portability in provisioning automation is important. Individual sites may initially succeed building just for their own needs; however, these divergences limit future collaboration and ultimately make it more expensive to maintain operations.

If it’s more expensive isolate then why have we failed to create shared underlay? Very simply, it’s hard to encapsulate differences between sites in a consistent way.

What makes cluster construction so hard?

There are a three key things we have to solve together: cross-node dependencies (linking), a lack of service configuration (services) and isolating attribute chains (configuration). While they all come back to thinking of the whole system as a cluster instead of individual nodes. let’s break them down:

Cross Dependencies (Cluster Linking) – The reason for building a multi-node system, is to create an interconnected system. For example, we want a database cluster with automated fail-over or we want a storage system that predictably distributes redundant copies of our data. Most critically and most overlooked, we also want to make sure that we can trust cluster members before we share secrets with them.

These cluster building actions require that we synchronize configuration so that each step has the information it requires. While it’s possible to repeatedly bang on the configure until it converges, that approach is frustrating to watch, hard to troubleshoot and fraught with timing issues. Taking this to the next logical steps, doing upgrades, require sequence control with circuit breakers – that’s exactly what Digital Rebar was built to provide.

Service Configuration (Cluster Services) – We’ve been so captivated with node configuration tools (like Ansible) that we overlook the reality that real deployments are intertwined mix of service, node and cross-node configuration. Even after interacting with a cloud service to get nodes, we still need to configure services for network access, load balancers and certificates. Once the platform is installed, then we use the platform as a services. On physical, there are even more including DNS, IPAM and Provisioning.

The challenge with service configurations is that they are not static and generally impossible to predict in advance. Using a load balancer? You can’t configure it until you’ve got the node addresses allocated. And then it needs to be updated as you manage your cluster. This is what makes platforms awesome – they handle the housekeeping for the apps once they are installed.

Digital Rebar decomposition solves this problem because it is able to mix service and node configuration. The orchestration engine can use node specific information to update services in the middle of a node configuration workflow sequence. For example, bringing a NIC online with a new IP address requires multiple trusted DNS entries. The same applies for PKI, Load Balancer and Networking.

Isolating Attribute Chains (Cluster Configuration) – Clusters have a difficult duality: they are managed as both a single entity and a collection of parts. That means that our configuration attributes are coupled together and often iterative. Typically, we solve this problem by front loading all the configuration. This leads to several problems: first, clusters must be configured in stages and, second, configuration attributes are predetermined and then statically passed into each component making variation and substitution difficult.

Our solution to this problem is to treat configuration more like functional programming where configuration steps are treated as isolated units with fully contained inputs and outputs. This approach allows us to accommodate variation between sites or cluster needs without tightly coupling steps. If we need to change container engines or networking layers then we can insert or remove modules without rewriting or complicating the majority of the chain.

This approach is a critical consideration because it allows us to accommodate both site and time changes. Even if a single site remains consistent, the software being installed will not. We must be resilient both site to site and version to version on a component basis. Any other pattern forces us to into an unmaintainable lock step provisioning model.

To avoid solving these three hard issues in the past, we’ve built provisioning monoliths. Even worse, we’ve seen projects try to solve these cluster building problems within their own context. That leads to confusing boot-strap architectures that distract from making the platforms easy for their intended audiences. It is OK for running a platform to be a different problem than using the platform.
In summary, we want composition because we are totally against ops magic. No unicorns, no rainbows, no hidden anything.

Basically, we want to avoid all magic in a deployment. For scale operations, there should never be a “push and prey” step where we are counting on timing or unknown configuration for it to succeed. Those systems are impossible to maintain, share and scale.

I hope that this helps you look at the Digital Rebar underlay approach in a holistic why and see how it can help create a more portable and sustainable IT foundation.

Fast Talk: Creating Operating Environments that Span Clouds and Physical Infrastructures

Posted on April 14, 2016 by Rob H

This short 15-minute talk pulls together a few themes around composability that you’ll see in future blogs where I lay out the challenges and solutions for hybrid DevOps practices. Like any DevOps concept – it’s a mix of technology, attitude (culture) and process.

Our hybrid DevOps objective is simple: We need multi-infrastructure Amazon equivalence for ops automation.

Here’s the summary:

Hybrid Infrastructure is new normal
Amazon is the Ops benchmark
Embrace operations automation
Invest in making IT composable

Want to listen to it? Here’s the voice over:

Problems with the “Give me a Wookiee” hybrid API

Posted on April 13, 2016 by Rob H

Greg Althaus, RackN CTO, creates amazing hybrid DevOps orchestration that spans metal and cloud implementations. When it comes to knowing the nooks and crannies of data centers, his ops scar tissue has scar tissue. So, I knew you’d all enjoy this funny story he wrote after previewing my OpenStack API report.

“APIs are only valuable if the parameters mean the same thing and you get back what you expect.” Greg Althaus

The following is a guest post by Greg:

While building the Digital Rebar OpenStack node provider, Rob Hirschfeld tried to integrate with 7+ OpenStack clouds. While the APIs matched across instances, there are all sorts of challenges with what comes out of the API calls.

The discovery made me realize that APIs are not the end of interoperability. They are the beginning.

I found I could best describe it with a story.

I found an API on a service and that API creates a Wookiee!

I can tell the API that I want a tall or short Wookiee or young or old Wookiee. I test against the Kashyyyk service. I consistently get a 8ft Brown 300 year old Wookiee when I ask for a Tall Old Wookiee.

I get a 6ft Brown 50 Year old Wookiee when I ask for a Short Young Wookiee. Exactly what I want, all the time.

My pointy-haired emperor boss says I need to now use the Forest Moon of Endor (FME) Service. He was told it is the exact same thing but cheaper. Okay, let’s do this. It consistently gives me 5 year old 4 ft tall Brown Ewok (called a Wookiee) when I ask for the Tall Young Wookiee.

This is a fail. I mean, yes, they are both furry and brown, but the Ewok can’t reach the top of my bookshelf.

The next service has to work, right? About the same price as FME, the Tatooine Service claims to be really good too. It passes tests. It hands out things called Wookiees. The only problem is that, while size is an API field, the service requires the use of petite and big instead of short and tall. This is just annoying. This time my tall (well big) young Wookiee is 8 ft tall and 50 years old, but it is green and bald (scales are like that).

I don’t really know what it is. I’m sure it isn’t a Wookiee.

And while she is awesome (better than the male Wookiees), she almost froze to death in the arctic tundra that is Boston.

My point: APIs are only valuable if the parameters mean the same thing and you get back what you expect.

Hybrid DevOps: Union of Configuration, Orchestration and Composability

Posted on March 8, 2016 by Rob H

Steven Spector and I talked about “Hybrid DevOps” as a concept. Our discussion led to a ‘there’s a picture for that!’ moment that often helped clarify the concept. We believe that this concept, like Rugged DevOps, is additive to existing DevOps thinking and culture. It’s about expanding our thinking to include orchestration and composability.

Hybrid DevOps 3 components (1) Here’s our write-up: Hybrid DevOps: Union of Configuration, Orchestration and Composability

Is Hybrid DevOps Like The Tokyo Metro?

Posted on March 8, 2016 by Rob H

I LOVE OPS ANALOGIES! The “Hybrid DevOps = Tokyo Metro” really works because it accepts that some complexity is inescapable. It would be great if Tokyo was a single system, but it’s not. Cloud and infrastructure are the same – they are not a single vendor system and going to converge.

With that intro…Dan Choquette writes how DevOps at scale like a major city’s subway system? Both require strict processes and operational excellence to move a lot of different parts at once. How else? If you had …

Source: Is Hybrid DevOps Like The Tokyo Metro?

Composability & Commerce: drivers for #CloudMinds Hybrid discussion

Posted on February 24, 2016 by Rob H

Last night, I had the privilege of being included in an IBM think tank group called CloudMinds. The topic for the night was accelerating hybrid cloud. cb81gdhukaetyga

During discussion, I felt that key how and why aspects of hybrid computing emerged: composability and commerce.

Composability, the discipline of creating segmenting IT into isolated parts, was considered a primary need. Without composability, we create vertically integrated solutions that are difficult to hybrid.

Commerce, the acknowledgement that we are building technology to solve problems, was considered a way to combat the dogma that seems to creep into the platform wars. That seems obvious, yet I believe it’s often overlooked and the group seemed to agree.

It’s also worth adding that the group strongly felt that hybrid was not a cloud discussion – it was a technology discussion. It is a description of how to maintain an innovative and disruptive industry by embracing change.

The purpose of the think tank is to create seeds of an ongoing discussion. We’d love to get your perspective on this too.

Hybrid & Container Disruption [Notes from CTP Mike Kavis’ Interview]

Posted on February 23, 2016 by Rob H

Last week, Cloud Technology Partner VP Mike Kavis (aka MadGreek65) and I talked for 30 minutes about current trends in Hybrid Infrastructure and Containers.

Mike Kavis

Three of the top questions that we discussed were:

Why Composability is required for deployment? [5:45]
Is Configuration Management dead? [10:15]
How can containers be more secure than VMs? [23:30]

Here’s the audio matching the time stamps in my notes:

00:44: What is RackN? – scale data center operations automation
01:45: Digital Rebar is… 3^rd generation provisioning to manage data center ops & bring up
02:30: Customers were struggling on Ops more than code or hardware
04:00: Rethinking “open” to include user choice of infrastructure, not just if the code is open source.
05:00: Use platforms where it’s right for users.
05:45: Composability – it’s how do we deal with complexity. Hybrid DevOps
06:40: How do we may Ops more portable
07:00: Five components of Hybrid DevOps
07:27: Rob has “Rick Perry” Moment…
08:30: 80/20 Rule for DevOps where 20% is mixed.
10:15: “Is configuration management dead” > Docker does hurt Configuration Management
11:00: How Service Registry can replace Configuration.
11:40: Reference to John Willis on the importance of sequence.
12:30: Importance of Sequence, Services & Configuration working together
12:50: Digital Rebar intermixes all three
13:30: The race to have orchestration – “it’s always been there”
14:30: Rightscale Report > Enterprises average SIX platforms in use
15:30: Fidelity Gap – Why everyone will hybrid but need to avoid monoliths
16:50: Avoid hybrid trap and keep a level of abstraction
17:41: You have to pay some “abstraction tax” if you want to hybrid BUT you can get some additional benefits: hybrid + ops management.
18:00: Rob gives a shout out to Rightscale
19:20: Rushing to solutions does not create secure and sustained delivery
20:40: If you work in a silo, you loose the ability to collaborate and reuse other works
21:05: Rob is sad about “OpenStack explosion of installers”
21:45: Container benefit from services containers – how they can be MORE SECURE
23:00: Automation required for security
23:30: How containers will be more secure than containers
24:30: Rob bring up “cheese” again…
26:15: If you have more situational awareness, you can be more secure WITHOUT putting more work for developers.
27:00: Containers can help developers worry about as many aspects of Ops
27:45: Wrap up

What do you think? I’d love to hear your opinion on these topics!

Deployment Fidelity – reducing tooling transistions for fun and profit

Posted on January 11, 2016 by Rob H

At the OpenStack Tokyo summit, I gave a short interview on Deployment Fidelity. I’ve come to see the fidelity problem more broadly as the hybrid DevOps challenge that I described in my 2016 Predictions post as the end of mono-clouds. Thanks Ken Hui from OpenStack Superuser TV for resurfacing this link!

Rob Hirschfeld

On Computing, Containers, Cloud & Tech Culture

Tag Archives: Hybrid DevOps