Skip to main content
Cloud

What Is a Cloud Managed Service Provider? Two Models and How to Pick

Cloud MSPs come in two very different flavors, and the one you pick determines what you get. Here's how they actually differ and how to choose between them.

John Lane 2024-11-06 6 min read
What Is a Cloud Managed Service Provider? Two Models and How to Pick

"Cloud managed service provider" is a vague term that gets used to describe at least two different businesses. Both are legitimate. Both have their place. But they are not interchangeable, and confusing them is one of the most common mistakes we see during vendor selection. A customer hires an MSP expecting one thing and gets the other, and six months later everyone is unhappy.

Here is what a cloud managed service provider actually is, the two main models that exist under that label, and how to pick the one that fits what you actually need.

The basic definition

At its simplest, a cloud managed service provider is an organization that operates cloud infrastructure on behalf of a customer. "Operates" is doing a lot of work in that sentence, because what it means varies. At minimum it includes monitoring, incident response, patching, backup verification, and access management. At maximum it includes architecture, engineering, security, compliance, cost optimization, and 24/7 operations across multiple clouds.

The question is where on that spectrum your MSP sits, and the honest answer is that most MSPs fall into one of two categories. Once you understand the two categories, picking between them is much easier than trying to compare twelve vendors on a feature matrix.

Model 1: The operations-first MSP

The first type of MSP is operations-first. They exist to keep the lights on. They are good at monitoring, alerting, ticketing, incident response, patching, and the boring-but-critical work of keeping infrastructure healthy. They usually have a 24/7 NOC, a well-defined escalation process, an SLA, and a deep bench of engineers who have seen a lot of failure modes.

What they are not, typically, is architects. An operations-first MSP takes your infrastructure as it is and runs it. They will tell you if something is broken, they will fix it when it breaks, and they will give you advice about operational improvements. They will generally not propose a major architectural change, will not lead a cloud migration, and will not design new systems for you. That is not their job and not their skillset.

This model is a good fit if:

  • You already have engineering capability in-house and you need operational coverage.
  • Your architecture is settled and working, and you mostly need someone to run it reliably.
  • You want 24/7 incident response without staffing your own NOC.
  • You care about predictable operational SLAs and measurable response times.
  • Your cost pressure is on operations, not on infrastructure design.

This model is a bad fit if:

  • Your architecture is a mess and needs to be rethought. An operations-first MSP will keep running the mess. They will not fix it.
  • You are in the middle of a cloud migration and need someone to lead it. They will not.
  • You need ongoing engineering work — new services, new integrations, new capabilities. They will point you at a different partner.
  • You want a single vendor to own the outcome, not just the uptime.

The operations-first MSP is the right answer for many customers, especially mature organizations with stable workloads and their own engineering leadership. You just need to know that is what you are buying.

Model 2: The engineering-led MSP

The second type of MSP is engineering-led. They exist to build the right thing and then run it. Their staff looks more like a consulting firm than a NOC — engineers, architects, security specialists, and a smaller but deep operations bench. They will design your cloud environment, migrate you onto it, improve it over time, and operate it once it is running. The operations are a byproduct of the engineering, not the main product.

Logical Front sits on this side of the line. Most of the customers we work with have either a complicated migration to do, an existing environment that needs substantial rework, or a set of applications that need engineering attention in addition to operations. We can staff the NOC part because we have to, but the value we provide is primarily the design and improvement of the environment over time.

This model is a good fit if:

  • You are migrating to or between clouds and need someone to own the migration.
  • Your current environment has technical debt that is causing operational pain, and you want someone who will actually fix it instead of just running around it.
  • You need ongoing engineering capacity beyond just operations.
  • You want a partner who will make architectural recommendations and help you evaluate them.
  • You are willing to pay for engineering quality, not just shift coverage.

This model is a bad fit if:

  • You just need overnight coverage and your environment is already well-designed.
  • You have your own architects and do not want outside opinions on your design.
  • You are extremely cost-sensitive and only want to pay for the minimum operational footprint.
  • Your organization cannot absorb engineering recommendations because the decision-making is slow or political.

The engineering-led MSP is more expensive per hour than the operations-first MSP, but for the customers who need engineering, it is usually cheaper in total — because the improvements the MSP makes over time reduce the operational load. A well-designed environment generates fewer incidents, which means the operational cost comes down even as the engineering bill stays stable.

How to pick between them

The honest test is to look at the problem you are actually trying to solve.

If you can describe your problem as "we need someone to watch our infrastructure and respond when it breaks," you want an operations-first MSP. If you can describe your problem as "we need someone to improve and run our infrastructure over time," you want an engineering-led MSP. If your problem is some of both — which it often is — the engineering-led MSP is usually the right starting point, because they can do operations, but an operations-first MSP usually cannot do engineering.

A few diagnostic questions that help:

  • Is your current environment the environment you want to run for the next three years? If yes, operations-first is probably fine. If no, you need engineering.
  • Do you have internal architects and cloud engineers, or do you need to rent them? If you have them, operations-first fills the gap you have. If you need to rent them, engineering-led is the model.
  • Are incidents in your environment caused by bad architecture or by normal failure modes? Bad architecture is an engineering problem. Normal failure modes are an operations problem.
  • Is your leadership expecting the MSP to improve things, or just to maintain them? Different expectations require different models, and mismatched expectations produce unhappy engagements.

What both models get wrong

For honesty: both models have common failure modes and you should check for them during selection.

The operations-first MSP will sometimes cover up symptoms that are really architectural problems, because fixing the architecture is out of scope. You end up with a stable-but-expensive environment that nobody ever improves. If you hire an operations-first MSP, make sure you have someone — internal or external — paying attention to the architecture.

The engineering-led MSP will sometimes over-engineer things that did not need to be re-engineered. Consultants like interesting problems, and sometimes the interesting work is not the most valuable work. If you hire an engineering-led MSP, make sure the engagement has clear priorities tied to business value, not just "improve the infrastructure."

The short version

A cloud managed service provider is someone who operates cloud infrastructure for you. The two main models are operations-first (run what you have) and engineering-led (improve and run what you have). Pick based on whether your problem is primarily operational coverage or primarily a combination of engineering and operations. Be honest about which one you are, and insist your vendor be honest about which one they are. That alone will avoid most of the disappointment we see in the MSP category.

Talk with us about your infrastructure

Schedule a consultation with a solutions architect.

Schedule a Consultation
Talk to an expert →