VMMIG - Module02 - Discover - Assess Phase
VMMIG - Module02 - Discover - Assess Phase
https://cloud.google.com/solutions/best-practices-migrating-vm-to-compute-engine
Assess/Discover Plan/Foundation Migrate! Optimize your
your application Create a landing Pick a path, operations and
landscape zone to the cloud and get started save on costs
TCO/ROI estimates were quite possibly completed in the sales process, but we will
refine them as part of discovery.
Agenda
Project Phases
Phase 3: Analysis
Phase 4: Deliverables
Project phases
Make the point that there’s the major, up front discovery/foundation phase you do to
build the foundation you need before you start migrating VMs to the cloud. Then,
there’s the short, more focused discover/foundation you’ll do each wave, to determine
how the foundation needs to be fine tuned. Plan/Foundation is Module 2, and then
we’ll be talking about the migration sprints in Module 3.
Agenda
Project Phases
Phase 3: Analysis
Phase 4: Deliverables
Discover and assess
The goal here is to understand what you have, and what VMs can be migrated. We’ll
typically look to gather inventory data for the whole estate in one go using an
automated tool. This may have been done already as part of the initial ROI/TCO
analysis used to make the argument for moving to GCP. Once the inventory is
gathered, then consultants work with application owners and SMEs to help assign
meaning to what the tools are finding. This phase is the starting point of your VM
migration journey
Discovery estimated Full Time Equivalents (FTEs)
● Typically discovery is 30%-50% of migration effort
● Rough estimates for assessment/discovery timing
○ Small – 200 VMs, up to 10 apps, 17 FTE days
○ Medium – 1000 VMs, up to 25 apps, 25 FTE days
○ Large – 2500 VMs, up to 60 apps, 50 FTE days
○ Extra Large, 4+ months
● Discovery for Migration Timing Details
● Discovery FTE Calculator
These numbers are based on Google’s experience with many VM migrations for
clients. The linked documents are used by Google as starting estimates for a
particular VM project.
The migration discovery team
Partner Client
Let’s start with the team. The VM migration team will include partner consultants,
client contacts, and perhaps Google personnel.
Executive sponsorship is also critical for “making things happen” within the client
organization. They can, for instance, ensure you get the cooperation and support of
the key people within the organisation who you are relying upon for information
Start!
● Start with a Kickoff event
○ Explain expectations and responsibilities to client
○ Migration Discovery Kickoff Deck
○ Migration Discovery Kickoff Agenda
● Create/contact GCP Center of Excellence team
○ Client is creating a Center of Excellence team
● Identify communication channels and key personnel
○ Track for easy access (Google sheet)
○ Who’s good at solving what kind of problem?
The linked documents are Google presentations you can tailor for your particular
project.
As your project progresses, you will need to document key personnel e.g. who knows
about what within the organisation.
Agenda
Project Phases
Phase 3: Analysis
Phase 4: Deliverables
Phase 1: automated discovery
● Number of tools that can help
○ CloudPhysics
○ StratoZone
○ RISC Networks, Cloudamize, others…
● When partners register the opportunity with Google,
CloudPhysics and StratoZone are free
○ May have optional commercial features
● RISC Networks and Cloudamize might require licensing
○ Keep checking!
Register to use:
● CloudPhysics: https://app.cloudphysics.com/partner/GCP/register
● Stratozone: https://gogcp.stratozone.com/
● RISC Networks: https://riscnetworks.com/google
● Cloudamize: https://app.cloudamize.com/setup .
Currently CloudPhysics and Stratozone are free for Google partners to use. The
other two tools may become free to use in the future, subject to agreement between
Google and the respective vendors
Goals of automated discovery
● Shorten the sales cycle
● Understand existing infrastructure costs
○ On-prem or other cloud providers
● Predict TCO based on various configurations for GCP
● Spot migration challenges and obstacles
○ vSphere and OS versions
○ Machine and OS types
Automated tools can firm up the ROI figures for moving to the cloud, and can thus
shorten the sales cycle. It can also help you understand the current environment, and
even spot challenging VMs, all before you start trying to move it.
Tools collect data in three main ways
● 1) From vSphere or AWS
○ Core infrastructure information
● 2) From machines using SSH/WMI
○ Add processes, installed applications, network connections, etc.
● 3) From agents installed node by node
○ Add database structures, resource usage breakdowns
● Data then compiled, encrypted, and passed back to tool hub
○ Usually cloud-based
CloudPhysics and StratoZone can both gather using the direct collection technique,
talking directly to AWS or vSphere. They can get a good deal of information with little
access.
StratoZone, RISC Networks, and Cloudamize all have options where an app or virtual
appliance is installed on a machine in the data center and then granted SSH/WMI
access to the other machines so they can gather more detailed data. RISC Networks
even has a version that can be installed all locally so no inventory or personalized
data passes out through firewall, if you’re willing to pay :-)
Cloudamize has a version that can get even more detailed information by installing an
agent on any inventoried machines.
Note that the same tool can use multiple information-gathering techniques, with
different access/security requirements. You will need to negotiate the latter with the
client.
CloudPhysics VS StratoZone
Both these tools are currently free to use for Google partners, so these are the two we
will be concentrating on in this class.
When to use:
Nothing to install. Makes secure connection to VSphere or AWS and strictly pulls
infrastructure information. No agent to install, no internal machine metrics.
Sample report:
https://drive.google.com/open?id=1zM2rMKaH94z3nl_pOR7L0EKqYj3RlApY
CloudPhysics operation
● Install the CloudPhysics Observer virtual appliance
○ Installing the Cloudphysics Observer
○ CloudPhysics Security
● Collect data from vSphere/AWS through read-only API access
○ No direct machine access
○ Updates every 20 seconds
● Compile data and send to CloudPhysics via SSL
Partners can register and set up their CloudPhysics partner portal here:
https://app.cloudphysics.com/partner/GCP/register
VMs typically won’t be moved in isolation, you’ll want to move a group of related VMs
as a unit. A wave is a group of machines that we move together.
API gateway +
Data Center
VPN/Interconnect
Users
Dependency Dependency
So the StratoProbe VM is installed into the data center. Firewalls are configured to
allow it to see the other machines. The probe is given credentials to get access to
vCenter, SSH access to Linux machines, and WMI access to Windows machines.
Using a combination of the inventory data coming out of vSphere, interactions
between machines tracked with netstat, and services inventories from the machines
themselves, StratoZone builds a map of different VMs and how they interact with
others. It also assigns labels to machines with common services.
https://www.riscnetworks.com/use-casescloud-and-data-center-migration/
Can get flat rate partner or transactional pricing.
Partner licensing starts at $14,400 for 100 node licenses.
Transactional: $30 per node for 30 days, a node is a VM/device/server
Each collector can handle up to 15,000 VMs. Most tools top out at about 500 per.
Similar capabilities to Stratozone, but easier to use. Downside is cost (at time of
writing, partners have to pay to use this tool)
Cloudamize
Assess
Make the business case of moving
to the cloud via TCO analysis across Data-backed
cloud service providers, and see answers to your
exactly what your infrastructure questions
would look like in the cloud
Plan
Quickly build an accurate, $100M in
executable migration plan based on end-client cost
application dependencies, savings
suitability, rightsizing, and in last 18 Months
complexity
Migrate 75%
Use to build more automated time reduction of
processes for migrating to the cloud migration
lifecycle
Cloudamize can get a lot of specific details using their installed agent approach, but
that means an agent being installed on every VM, which can be a challenge on many
fronts. Without the local agent, it takes a snapshot every 5 min and operates using
SSH/WMI and netstat.
Cloudamize can also break down R (Rehost, Replace, …) scores and generate a
matrix for you to help make decisions. Can also generate Migrate for Compute Engine
runbooks.
27
Tool summary
● CloudPhysics
○ Good with initial TCO/ROI, “Should we move to GCP”
○ Requires access to vSphere, AWS, or manually data upload
○ Free
● StratoZone
○ Good with automated discovery (including ROI) and dependency
mapping
○ Needs SSH/WMI access to machines
○ Poor documentation
○ UI could be more user friendly
○ Free
Phase 3: Analysis
Phase 4: Deliverables
Phase 2: surveys and interviews
● Phase 1 delivered good information
● What’s missing?
Context! Through tools like StratoZone we see the machines with names, can
determine the running processes, and then the network interactions between can
identify dependency patterns, but even with all of that, we can’t know for sure exactly
what we are seeing.
What kind of people should we talk to?
●
●
●
●
●
Why start with questionnaires and not jump right to in person interviews?
What if there are 100 apps. How many meetings before you find the right people, and
then find the answers that you need? Let the questionnaires filter and do initial
information gathering. Figure out who will be the helpful few for each application.
Keep those names for when you start migration (as opposed do discover/foundation)
sprints.
But! People tend to ignore questionnaires. Make sure there’s a good executive
sponsor who can wield the carrot/stick to keep the answers flowing.
Lab 3
Set up and use the application discovery template
Questionnaire review
● In the questionnaire, what were the major question categories?
● Any questions surprise you?
● Any questions you would add?
Remember, there’s a fine line between asking enough and asking too many. Not
everyone will know the answers to all questions, but this will help find who can help
and in which areas.
Agenda
Project Phases
Phase 3: Analysis
Phase 4: Deliverables
Coming into phase 3: analysis
● Prerequisites for analysis:
○ Automated inventory and dependency groupings
○ Domain experts have added context
■ But they are still available to answer questions!
○ Better understanding of application architectures
Phase 3: analysis
Limitation Analysis
Migration Approach
Effort Estimation
Wave Grouping
First Mover
1. Limitation analysis
○ Hard or impossible to move
2. Migration approach selection
○ How to get it into the cloud
3. Effort/complexity estimation
○ S/M/L complexity per app
4. Wave grouping (categorization; not waves)
5. First mover identification (nomination)
Limitation Analysis
Limitation Analysis
Migration Approach
Remember, most of our migrations will be somewhere on the lift and shift/improve and
move spectrum
3) Effort/complexity estimation
Effort Estimation
One approach here is to get each of the team to assign a complexity score to each of
the applications that need to be moved. High score and low score can explain how
they came up with the assigned score. Then the team leader can pick a score and
assign it to that particular application.
The linked document on the slide shows some of the dimensions of an application
that Google have found need to be considered when assigning a complexity score to
a migration.
Sprints, backlogs, and waves
Once you have categorised your applications, you can decide in which sequence you
are going to migrate your application waves. As per Agile methods, we can organise
these waves into sprints, backlogs and waves.
Sprint: https://www.atlassian.com/agile/scrum/sprints
Backlog: https://www.atlassian.com/agile/scrum/backlogs
Wave is a migration-specific construct, that is defined by a runbook in Migrate for
Compute Engine
Migrate
4) Wave grouping Prepare
Build target
environments and
Execute move of
apps and
services to GCP
select migration
candidates from
● Move groups of apps in waves backlog
Wave Grouping
© 2019 Google LLC. All rights reserved.
So look at the two main considerations: business and technical. First and foremost
there’s Technical. It would be preferable to move applications that are connected or
dependent at the same time, in the same wave. But from the business perspective,
you might not want to take out two particular critical systems at the same time while
they are being moved, or might not like the risk of the dual simultaneous move.
Also, don’t forget that with the right team, there can’t be two different sub-teams
working on moving waves simultaneously.
Backlog
● Each major discovery and foundation component
goes on the backlog
○ Name some?
● Initial sorting should consider:
○ Low hanging fruit (quick wins)
○ Business impact vs tolerance of downtime
● Format up to team
○ Post-its on a board
○ Google Doc
○ Jira
Examples of application sorting
Legend:
As the migration proceeds, you will want to refine these initial classifications
5) First mover identification
First Mover
Never underestimate the importance of the confidence a good first move will bring
both the client and the migration team.
First mover sweet spot
Business Criticality
Sensitivity to Downtime
First Mover
An ideal first mover is both reasonably important to the business (i.e. so people care
when the move is successful), and relatively insensitive to downtime.
Poor first movers are:
The more of these characteristics a particular application has, the less suitable it may
be as a first mover.
Architecture: good first mover
Middleware clusters Log storage
Mobile application
Logger pool MongoDb
Users Javascript
WebApp
Metadata storage
Comments
pool MongoDb
Cluster monitoring
Infrastructure Monitoring/
logging
Mobile application. It’s stand alone, interacts with other systems through the load
balancer making service calls, not tightly tied to anytihng it couldn’t access from GCP.
Help minimize risk
● Make the move in low/no usage periods
○ Dark migration
● Have a clear rollback strategy
○ May be as easy as updating DNS
● Are internal facing
● Plan for the worst case
● Have architectural and application support
personnel on hand to help troubleshoot
All of the above factors can help mitigate the risk that a cloud migration represents to
the client.
Agenda
Project Phases
Phase 3: Analysis
Phase 4: Deliverables
Discover/Assess Plan/Foundation Migrate! Optimize your
your application Create a landing Pick a path, operations and
landscape zone to the cloud and get started save on costs
Time to move on
Deliverables
● Summary report
○ Everything done up to this point
● First mover/wave recommendation
● Technical landscape requirements
○ Need a place for the VMs to land, that’s next
Lab 4
Analyze four applications, check for limitations,
estimate complexity of migration, sort migration
waves, and pick a first mover