Download as pdf or txt
Download as pdf or txt
You are on page 1of 96

BRKCRS-2112

Serviceability of SD-WAN

Chandrabalaji Rajaram & Ali Shaikh


Cisco Spark
Questions?
Use Cisco Spark to communicate
with the speaker after the session

How
1. Find this session in the Cisco Live Mobile App
2. Click “Join the Discussion”
3. Install Spark or go directly to the space
4. Enter messages/questions in the space

cs.co/ciscolivebot#BRKCRS-2112

© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco SD-WAN learning Journey at Cisco Live
Monday Tuesday Wednesday Thursday Friday

BRKCRS-2110 BRKCRS-2111
Delivering Cisco Next Migration to Next-Gen
Generation SD-WAN SD-WAN
with Viptela

Deep Dive SP
Serviceability
orchestration
Architecture Migration
and solution and vQOE
TECCRS-20004 BRKRST-2557 BRKCRS-2112
Cisco SD-WAN SD-WAN and NFV Serviceability for
Technical Deep Dive BRKCRS-2113 BRKRST-2514 Orchestration for Next Generation
Managed Service SD-WAN
Cloud-Ready WAN for Next Gen SDWAN with
Providers
IAAS and SAAS with application
Cisco Next-Gen SD- acceleration/optimization
WAN
Agenda

• SDWAN Components overiew


• Day 0 – Deployment and troubleshooting
• Day N – Deployment and troubleshooting
• System Maintenance
• Tech Support
• Demo
SDWAN Components Overview
SDWAN Components overview
vManage
NMS

vSmart
vEdge
Controller
Cloud Router

SDWAN
Components

vBond
vEdge
Orchestrator
Router

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 7
SDWAN Components overview
Orchestration Plane Orchestration Plane
vManage
Cisco vBond
APIs

3rd Party
vAnalytics  Orchestrates Connectivity
Automation

vBond  First point of authentication

vSmart Controllers (white-list model)

 Facilitates NAT traversal


MPLS 4G

INET
vEdge Routers

Cloud Data Center Campus Branch SOHO

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 8
SDWAN Components overview
Management Plane Management Plane
vManage
Cisco vManage
APIs

3rd Party  Single pane of glass


vAnalytics
Automation
 Policies and Templates
vBond
 Troubleshooting and
vSmart Controllers
Monitoring
MPLS 4G  Programmatic interfaces
INET
vEdge Routers

Cloud Data Center Campus Branch SOHO

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 9
SDWAN Components overview
Control Plane Control Plane
vManage
Cisco vSmart
APIs

3rd Party
vAnalytics
Automation  Handles all the Overlay-network
routing
vBond
 Facilitates the DP encryption
vSmart Controllers
between vEdges

MPLS 4G  Propagates the policies for


INET handling DP traffic
vEdge Routers

Cloud Data Center Campus Branch SOHO

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 10
SDWAN Components overview Data Plane
Physical/Virtual
Data Plane
vManage
vEdge vEdge Cloud
APIs

3rd Party  WAN edge router


vAnalytics
Automation
 Provides secure data plane with
vBond
remote vEdge routers
vSmart Controllers
 Implements data plane and

4G application aware routing


MPLS
INET policies
vEdge Routers

Cloud Data Center Campus Branch SOHO

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 11
Cisco SD-WAN Cloud-Delivered Architecture
Multitenant, Cloud-Operated and Cloud-Delivered

Cloud
GUI
vSmart Data Center
Secure
REST API vManage Controllers
SD-WAN Fabric
Analytics

Private/Hosted/Managed
Cloud
Data Center
MPLS 4G

INET
Secure
Control Plane
Small Office
Home Office
vEdge Router Campus

Branch
BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 12
Fabric Operation Walk-Through
OMP Update:
OMP
vSmart Reachability – IP Subnets, TLOCs
Security – Encryption Keys
DTLS/TLS Tunnel
Policy – Data/App-route Policies
IPSec Tunnel
OMP OMP
BFD Update Update
Policies
OMP OMP
Update Update

vEdge vEdge
Transport1
TLOCs TLOCs

VPN1 VPN2 Transport2 VPN1 VPN2


BGP, OSPF, BGP, OSPF,
Connected, Connected,
Static A B C D Static

Subnets Subnets
BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 13
Secure Segmentation
 Security Zoning
Interface  Compliance
 Guest WiFi
VLAN  Multi-Tenancy
 Extranet
Per-VPN Topology

Full-Mesh Hub-and-Spoke Partial Mesh Point-to-Point

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 14
Day 0

Deployment and troubleshooting


Zero Touch Provisioning – vEdge Appliance
Control and Policy
Zero Touch Provisioning
Elements
Server

2 3
5
1 Full Registration and
Configuration
4
Assumption:
• DHCP on Transport Side (WAN)
• DNS to resolve ztp.viptela.com*

vEdge
* Factory default config  Delivered as-a-Service

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 16
Zero Touch Provisioning – vEdge Cloud
Control and Policy
vManage Elements

1
Cloud-Init

VM
Provisioning 3
5
Tool
2 Full Registration and
Configuration
4
Assumption:
• DHCP on Transport Side (WAN)

vEdge Cloud
* Factory default config

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 17
Building basic overlay network

1) Perform initial bring-up and do


basic configuration.
2) Enable host or service-side
interfaces and routing.
3) Enable overlay routing over OMP.
4) Check the automatic setup of the
IPsec data plane.
5) Enforce policies.

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 18
Checking Control connections

 Control Up: Total number of


devices with the required number
of operational control plane
connections to a vSmart controller.

 Partial: Total number of devices


with some, but not all, operational
control plane connections to
vSmart controllers.

 Control Down: Total number of


devices with no control plane
connection to a vSmart controller.

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 19
Checking Control connections for a single device

 If the device has multiple


interfaces, vManage NMS displays
a graphical topology of all control
connections for each color.

 Click the arrow to the left to view


the control connections for that
TLOC color.

 Click the checkbox to the left to


select and deselect control
connections.

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 20
Checking Data connections

 Down: Non-operational
connections with other vEdge
routers in the network.

 Init: Connections that are


reachable but not up yet.

 Up: Operational connections

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 21
Checking OMP Summary
OMP Summary of the vEdge router

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 22
Checking OMP Summary
OMP Summary of the vEdge router

Field Explanation
admin-state Administrative state of the OMP session. It can be UP or DOWN.

omp-uptime How long the OMP session has been up and operational.

oper-state Operational status of the OMP session. It can be UP or DOWN.

routes-installed Number of routes installed over the OMP session.

routes-received Number of routes received over the OMP session.

tlocs-installed Number of TLOCs installed that were learned over OMP sessions.

tlocs-received Number of TLOCs received over OMP sessions.

tlocs-sent Number of TLOCs advertised over OMP sessions.

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 23
Checking OMP Peers detail
OMP Peers of the vEdge router

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 24
Checking OMP Peers detail
OMP Peers of the vEdge router

Field Explanation
Peer IP address of the connected Edge device.

Type Type of SDWAN device

State  down—The connection is not functioning.


 init—The connection is initializing.
 up—The connection is operating.
Domain ID Identifier of the domain that the device is a member of.

Site ID Identifier of the administrative site where the connect Edge device is
located.
R/I/S Number of routes received, installed, and sent over the OMP session.

routes-installed Number of routes installed over the OMP session.

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 25
Checking Device bring-up

- Indicates control plane connections are successful

- Indicates ZTP is disabled. Seen during SW upgrade only

- Indicates control plane connection failure

- Indicates that the reason for device bring-up failure is


Unknown

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 26
Troubleshooting Control connections
Possible causes for control connection failure

Connectivity Issues Certificate Issues

 DTLS Connection Failure  Serial number(s) not present

 TLOC Disabled  Certificate revoked/invalidated

 Transient Conditions  Certificate Verification Failed

 Org. Name Mismatch

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 28
DTLS connection failure

Probable causes Debugging steps:

 NH not reachable PING Def-GW


 Def-GW not installed in RIB Ping vBond if ICMP is allowed on the
 DTLS port not open in the vBond
Controllers Traceroute to vBond DNS Address

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 29
TLOC disabled

Probable causes

 Clearing of Control Connections


 Changing the color on TLOC
 Change in System IP
 Change in any of the configs mentioned in the system block or in the tunnel properties

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 30
Transient Conditions

Following are some Transient conditions where the control connections flap.
System-IP change on the vEdge
Tear-down msg. to vBond [control connection to vBond is transient]

This can be verified using the “show control connections” output as shown below

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 31
Transient Conditions

Following are some Transient conditions where the control connections flap.
System-IP change on the vEdge
Tear-down msg. to vBond [control connection to vBond is transient]

This can be verified using the “show control connections” output as shown below Disconnect vBond after register
reply

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 32
Transient Conditions

Following are some Transient conditions where the control connections flap.
System-IP change on the vEdge
Tear-down msg. to vBond [control connection to vBond is transient]

This can be verified using the “show control connections” output as shown below

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 33
Transient Conditions

Following are some Transient conditions where the control connections flap.
System-IP change on the vEdge
Tear-down msg. to vBond [control connection to vBond is transient]

This can be verified using the “show control connections” output as shown below
System-IP Changed

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 34
Transient Conditions

Following are some Transient conditions where the control connections flap.
System-IP change on the vEdge
Tear-down msg. to vBond [control connection to vBond is transient]

This can be verified using the “show control connections” output as shown below

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 35
Serial Number(s) NOT present

 If the serial number is not present on the controllers for a given vEdge, the control Challenge response rejected by peer
connections
will fail

 Verify this by “send to controllers” option from vManage and / or ‘show controllers [ valid-
vsmarts | valid-vedges ]’.

Peer Board ID Cert not verified

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 36
Serial Number(s) NOT present….Contd

Serial Number is NOT present

Challenge response rejected


by peer

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 37
Certificate revoked/Invalidated

 The certificate will be revoked in case of controllers or vEdge serial number is invalidated

vSmart Certificate
revoked

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 38
Certificate installation failed

 Certification verification failure is when certificate cannot be verified with the root cert installed.

Fail to verify Peer Certificate

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 39
Organization-name Mismatch

 For a given a overlay, the Org. Name has to match across all the controllers and vEdges so that
control connections can come up.
 If not, you will see “Certificate Org. name mismatch” as seen below in the “show control
connections” output.

Certificate Org name mismatch

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 40
Day ”N”

Monitoring and troubleshooting


Health Status Check on vEdge
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 43
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 44
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 45
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 46
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 47
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 48
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 49
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 50
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 51
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 52
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 53
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 54
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 55
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 56
Checking System Status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 57
Checking System status….contd

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 58
Checking the Circuit-Utilization

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 59
Checking the Circuit-Utilization

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 60
Checking Transport Quality
WAN > TLOC status

© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Checking Transport Quality
WAN > TLOC status

© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Checking Tunnel Quality
WAN > Tunnel status

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 63
Checking DPI stats

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 64
Checking DPI stats

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 65
Checking DPI stats

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 66
Checking DPI stats

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 67
Checking DPI stats

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 68
Checking App flows

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 69
Checking Events

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 70
Service-side to Service-side Troubleshooting

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 71
Service-side to Service-side Troubleshooting

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 72
Service-side to Service-side Troubleshooting

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 73
Service-side to Service-side Troubleshooting

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 74
App route visualization

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 75
Simulate Flows

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 76
Simulate Flows

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 77
Debug Logs

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 78
System Maintenance
Configuration roll-back
Configuration roll-back using vManage

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 81
Software Upgrade
Software Upgrade
Upgrade software version of the vEdge router

NOTE: If the software upgrade is NOT successful and the device loses its connectivity after upgrade, it
will automatically roll-back to the previous Software version

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 83
Device Reboot
Device Reboot

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 85
Generic Alarms/Notifications
System Alarm - Types

Major Alarm - RED

 One or more hardware components on the router has failed.


 One or more hardware components on the router has exceeded the
temperature threshold.

Minor Alarm - YELLOW


 Indicates a warning on the router that, if left unattended, might result in
an interruption in router operation or degradation in router performance.

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 87
Checking Alarms

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 88
Tech-Support
Collecting Show Admin Tech Generate Show-admin Tech

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 90
Demo
Cisco Spark
Questions?
Use Cisco Spark to communicate
with the speaker after the session

How
1. Find this session in the Cisco Live Mobile App
2. Click “Join the Discussion”
3. Install Spark or go directly to the space
4. Enter messages/questions in the space

cs.co/ciscolivebot#BRKCRS-2112

© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
• Please complete your Online Complete Your Online
Session Evaluations after each
session
Session Evaluation
• Complete 4 Session Evaluations
& the Overall Conference
Evaluation (available from
Thursday) to receive your Cisco
Live T-shirt
• All surveys can be completed via
the Cisco Live Mobile App or the
Communication Stations
Don’t forget: Cisco Live sessions will be available
for viewing on-demand after the event at
www.ciscolive.com/global/on-demand-library/.

© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Continue Your Education
• Demos in the Cisco campus
• Walk-in Self-Paced Labs
• Tech Circle
• Meet the Engineer 1:1 meetings
• Related sessions

BRKCRS-2112 © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public 94
Thank you

You might also like