Professional Documents
Culture Documents
NetBackup Troubleshooting Refresher Introduction
NetBackup Troubleshooting Refresher Introduction
• Netbackup is not a program but rather a collection of processes that work together
• Process name prefixes – why are they different?
– Bp_ = legacy process (bp comes from Backup plus the original product). Examples:
• bprd= request daemon.
• bpdmn = Database Manager.
– Nb_ = newer processes. Multithreaded. ( made their debut in 6.x) Always running. Examples:
• Nbrb = NetBackup Resource Broker. Allocates and tracks resources.
• Nbproxy = NetBackup Proxy used to talk to legacy processes.
• As a service
– Some processes run as a service. They are started when NetBackup starts and stay running.
– Bprd, bpcd, bpdbm, nbpem, nbrb, and nbemm are exampls of these.
• Per job processes
– Bpbkar = Backup Archive and Restore. A misnomer since it does not handle restores.
– Tar.. The oddball. A legaxy process without the bp prefix.
– These processes are called per job, with start, perform their job, and exit. A good examples is bpcd (Communication
daemon).
How Process Share Information
• Storage Server
– MSDP, Advanced Disk, OST Plugin for Dxi, or Data Domain.
• Client Media
– Sends the Data Server
• API
– NDMP, Vmware and Hyper-V
• Upgrading
– Netbackup should always be updated from the top down. Opscenter, Master Server, Media Server, Client.
– They do not all need to be done at the same time Client
– A master can work with mixed media server versions and mixed client versions with some limitations and exceptions.
– Opscenter must ALWAYS be the highest level or at least match the master server.
Master Server UNIX Client
nbproxy bpbackup or
nbpem
bpcd EMM Database bparchive
nbjm
nbemm nbrb
up
ag
mount i
Shared mag
im
Memory bpbrm
u
ck
Ba
6
Sample Process Flow
7
Sample Process Flow II
8
Capturing and Logging
9
Logging Types
• Legacy logging.
– bp_ processes.
– These are single threaded which means each launch has a new PID.
• Robust logging
– An enhancement to legacy logging which allows for better log management including breaking
logs into smaller logs, recycling logs to save space, and intelligent pruning options.
• Vxlogging
– Nb_ processes use these logs.
– Logging is still stored in the same logging locations and uses the legacy log folder structure.
However the logs stored in a different format requiring vxlogview to “query” the logs and extract
the details.
– Far more manageable.
– Allows for gathering specific details that previously required a lot of time.
• PID parsing, timframe, etc.
10
Logging levels for Legacy Processes
11
Logging Levels for nb processes
• Robust Logging
– Enable within the GUI. Options are all controlled in either the java console or the BAR GUI.
• Vxlogging
– Can be configured by the command line or in the GUI. Some processes such as NBEMM,
NBPROXY, and PBX have to be configured through the command line using vxlogcfg.
– Take effect right away as long as the folder exists.
– Use vxlogview to retrieve there logs.
• /NetBackup/bin vxlogview –p 51216 –o 117 –t 01:30:00 >c:\temp\NBJM-117.txt NBJM log
• -p is the program ID. NetBackup, OpsCenter, BMR, Storage Foundation, etc.
• -o is the originator ID. This is the process that create the message.
• https://www.veritas.com/content/support/en_US/doc/86063237-127664549-0/v40600850-
127664549is
a current list of OIDs (Originator IDs)
12
Gathering up logs
13
Cleaning Up
• Cleaning up
– Vxlogs
• Netbackup\bin
• Vxlogmgr –F tells NetBackup to purge all vxlogs except those currently in
use
– Legacy logs
• Use the GUI or edit bp.conf to lower the logging level
• Delete the folder. Some processes will log the log and the folder cannot be
removed. Either bring down services or wait for the process to end
depending on the process. Bpbkar for example is per job.
14
Reading Logs and Understanding them
15
Sample Job Details
16
Knowing Where to Look
• When tracking down an issue it is important to know where the breakdown is happening.
– A backup failure status code 71 means not all of the files were backup up. This is typically a client side issue.
– The Master told the media and client what was to be backed up but the client did not present all the data.
– Reviewing the job details from the admin console to identify the errors followed by reviewing the client logs.
– The job details in activity monitor provides direction on what to look for.
• Job ID is passed process to process. It is often a great way to find on what to look for.
• Job details in activity monitor will usually show the pid of a process.
– Pulling out all lines with a specific PID will show the entire “life” of the PID for per job processes.
• Timestamp are another useful way to look through a log
• Process Exit Status.
– 22:41:39.980 [11544.9748] <16> bpcd exit_server: FTL – BPCD EXIT STATUS 46
• Line Code.
– 22:41:39.730 [11544.9748] <16> bpcd valid_server: MEDIA1 is not a media server either
• Other useful information.
– Job Overview tab.
• File list
• Policy Name, Start Time, Elapsed, End Time.
• Schedule name, Type, and Retentions.
• Client name and Master name.
– Details Tab.
• Attempt number
• Job PID
• Media Server and Storage Unit
• Files and KB Written.
17
Logs at a Glance
18
Logs at a Glance II
• Interesting Notes
19
Sample Job Details
20
Other Tools: NBSU and DataCollect
21
NBSU
22
NBSU II
• When troubleshooting issues Engineers often will request a NBSU and for NetBackup appliances and
updated DataCollect.
• These help support engineers by providing a in depth view of the configuration and design of the
environment as it relates to NetBackup. This data provides a “jump start” on the troubleshooting
process.
• Application specific logs. These logs are typically internal to the application and contain “the other side”
of the issue. Often times there are two views of the issue. And example would be a client loses
connection to a media server. The client says the media server wnt away.. The media server thins the
client went away.
– MS SQL Enterprise Manager Logs.
– Exchange Server logs from Event Viewer.
– SharePoint specific logs from withing SharePoint Admin Portal.
– NDMPD logs from NDMP filers.
– Oracle internal logs.
– Active Directory Logging.
23
How to run the NBSU
• NBSU is supported on almost every NetBackup install type available including master,
media, and clients
• Is included in the installation
• Typically is just run from the command line using nbsu with no switches. It will typically
detect the installation type
• Location
– /usr/openv/netbackup/bin/support for Non windows
– <install folder>\netbackup\bin\support form windows
– On NetBackup Appliances it can be found under support in the CLISH
• Non Intrusive
– It will take anywhere from 2 minutes to 15 minutes to run and does not interfere with
NetBackup.
24
What is DataCollect
25
What data to collect?
26