B.

GOWTHAM RAJ
9581364202 | gowtham.b.raj91@gmail.com | Hyderabad | LinkedIn
Boosts availability, scalability, and efficiency of systems by implementing architecture improvements and automating processes.
Site Reliability Engineering | DevOps Engineering | Cloud Infrastructure | Systems Availability Monitoring | Agile/Scrum

Select Accomplishments at Wells Fargo:


● Reduced services monitoring time from 2 hours to a few minutes and enabled automated, ITRS-integrated alerts in case of Windows Services failure by creating scripts to generate server status reports, thereby streamlining change management.
● Facilitated company-wide transition of on-premises infrastructure, data, and applications to the Azure cloud platform by supporting change releases; collaborated with the development team to oversee configuration and testing during sprint reviews.
● Maintained optimum staffing levels during periodic change releases by deploying and leading junior and systems operations engineers to support the business. Trained over 10 team members on monitoring applications and service failure indicators.

PROFESSIONAL EXPERIENCE

Lead DevOps/Site Reliability Engineer, Wells Fargo 10/2020 – 05/2023
Led various projects to optimize overall system health; oversaw systems performance monitoring, quarterly technology refreshes, biannual disaster recovery exercises, and automation building. Ensured 24/7 availability of critical and complex production systems and applications by alleviating performance risks and troubleshooting issues during releases; executed change, incident, and problem management as part of ITIL. Sustained interaction of multiple applications by maintaining service reliability.
Technology Transformation Support
● Minimized application downtime by plugging in a self-healing feature, enabling instant service restart during critical periods.
● Automated server-to-server file transfer by developing Windows batch scripts, replacing a process that took 20 minutes to monitor each file with monitoring of AutoSys jobs only; collaborated with cross-functional teams.
● Enhanced application reliability by identifying recurring issues and implementing targeted automation solutions as a key member of the toil reduction team, resulting in significant noise reduction and continuous system optimization.
● Ensured no business-critical system emails were missed by simplifying email notifications as part of the email reduction program.
● Streamlined knowledge sharing by defining the structure to adopt Confluence for content management; developed a rule book and onboarded team members; served as the point of contact for setting up and updating application data and maintaining SOPs.
Cloud Migration
● Ensured high availability, scalability, and cost optimization by deploying secure architecture to migrate on-premises resources.
● Enabled continuous integration and delivery (CI/CD) for migrated applications by using Azure DevOps to streamline deployment.
● Sustained business continuity during the migration by implementing Azure Site Recovery for disaster recovery.
● Successfully migrated multi-tier applications and ensured seamless integration by collaborating closely with application owners, architects, and IT security teams.
Systems Health Maintenance
● Maintained 99.9% Service-Level Agreements (SLAs) and 99% Service-Level Indicators (SLIs) and Service-Level Objectives (SLOs) by ensuring on-time cross-system transfer of encrypted files using IBM Sterling Connect, sustaining a smooth application lifecycle.
● Ensured on-time delivery of application sprints while handling product deliverables by ensuring seamless change releases.
● Onboarded and trained new hires on application performance monitoring through training sessions and hands-on support.
● Supported load tests for performance tuning; triaged and fixed production issues within defined SLAs and escalation matrices.
● Adhered to the Center of Risk and Reliability (CRR) SOPs by overseeing release management, base image/OS upgrades, and ad hoc changes. Handled escalations as the first point of contact; monitored SLO/SLI reports.
Sr. DevOps Cloud Engineer, Wells Fargo 08/2017 – 09/2020
Provided 24/7, 365-day support for complex and advanced systems by managing end-to-end project deliverables and resources. Oversaw toil management, capacity planning, report generation, documentation, vendor collaboration, and shift rotation planning.
● Generated real-time insights on infrastructure (servers) and service (APIs) performance status by developing an ITRS-based dashboard to enable service endpoint checks; established a holistic view of processes using a color coding feature.
● Cut data retrieval time during disaster recovery from 4 hours to 40 minutes by aiding the implementation of a customized tool; configured and plugged the tool into the CI/CD pipeline in a structured manner to ensure seamless application communication.
● Ensured on-time delivery of automated alerts to the SMEs and the Core SRE Team by monitoring SLAs and SLOs. Stayed within the error budget by ensuring systems stability to avoid potential customer impact.
● Sustained good score levels for Mean Time to Detect (MTTD), Mean Time to Repair (MTTR), and Mean Time to Failure (MTTF) by conducting post-incident review meetings and driving troubleshooting triage calls to avoid similar incidents.
● Streamlined loophole and code leakage identification by setting up a Windows-based application.
● Participated in the implementation of advanced DevOps tools, including Docker, Kubernetes, and Terraform.

DevOps Engineer, Wells Fargo 08/2015 – 07/2017
Managed day-to-day technical and production support for applications installed at client sites. Conducted daily scrum meetings and standup calls and interacted with the onsite team to plan the sprint releases. Supported all technical aspects of the clients using distributed environments, primarily Linux/Unix and Windows. Onboarded applications onto Pivotal Cloud Foundry (PCF), AppDynamics (AppD), and Splunk to analyze production environment data. Conducted sprint demos for client knowledge building.

Build & Release Engineer, Wells Fargo 02/2013 – 07/2015
Built, managed, and continuously improved the build infrastructure for global software development engineering teams by implementing build scripts and tools, such as Jenkins; continuous integration infrastructure; and deployment tools. Monitored, identified, and supported alerts for Linux/Unix servers. Reduced downtime by keeping systems running smoothly.
● Documented post-deployment issues in a log and assisted in resolution; updated issue status within the log.
● Identified changes in SVN and Git and executed a sequence of targets by creating Jenkins jobs.

TECHNICAL SKILLS

Key Skills: Automation Building, Performance Monitoring, Sprint Reviews, Systems Configuration
Management, Disaster Recovery, Troubleshooting, Team Management, Stakeholder Coordination
Cloud Platforms: Microsoft Azure, Amazon Web Services (AWS)
Monitoring & Alerting: AppDynamics, Splunk, ITRS, Grafana, Prometheus, ELK Stack
OS & Programming Languages: Linux, Windows, Python, Shell Scripting, PowerShell
Networking & Schedulers: TCP/IP, DNS, Load Balancing, AutoSys
RDBMS & Reporting: Oracle, MS SQL Server, ServiceNow, Jira
Collaboration & CI/CD: Confluence, Jenkins, GitLab

PROFESSIONAL DEVELOPMENT

M. Tech in Data Analytics & Engineering, BITS Pilani 2023
Post-Graduation Program (PGP) in Data Science, INSOFE 2018
ITIL® Foundation Certificate in IT Service Management 2018
Microsoft Certified: Azure Fundamentals 2023
Certification in Site Reliability Engineering: Measuring and Managing Reliability 2023

Bachelor of Engineering Information Technology, JNTU 2012


Awards & Recognitions: Wells Fargo Champion Award; Shared Success Awards for Individual and Team

Gowtham Raj Resume | Page 02 | 832.935.2288 | gowtham.b.raj91@gmail.com | LinkedIn
