Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 19

A Technical Seminar report entitled

On

DATA SCIENCE FOR DEVOPS


In partial fulfilment of the requirements for the award of
BACHELOR OF TECHNOLOGY
In
Computer Science and Engineering (Data science)
Submitted by
V. Varsha (20E51A6750)

Under the Esteemed guidance of


Dr. P. MADHURI
Associate Professor

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

HYDERABAD INSTITUTE OF TECHNOLOGY AND MANAGEMENT


Gowdavelly (Village), Medchal, Hyderabad, Telangana, 501401

(UGC Autonomous, Affiliated to JNTUH, Accredited by NAAC (A+) and NBA)

2023-2024

1
HYDERABAD INSTITUTE OF TECHNOLOGY AND
MANAGEMENT

(UGC Autonomous, Affiliated to JNTUH, Accredited by NAAC (A+) and NBA)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CERTIFICATE

This is to certify that the Technical Seminar entitled “Data science for DevOps" is being
submitted by V. Varsha bearing hall ticket number 20E51A6750 in partial fulfilment of
the requirements for the degree BACHELOR OF TECHNOLOGY in COMPUTER
SCIENCE AND ENGINEERING (DATA SCIENCE ) by the Jawaharlal Nehru
Technological University, Hyderabad, during the academic year 2023-2024. The matter
contained in this document has not been submitted to any other University or institute for
the award of any degree or diploma.

Internal Supervisor Head of the Department


Dr. P. Madhuri Dr. Ila Chandana Kumari
Associate Professor

2
HYDERABAD INSTITUTE OF TECHNOLOGY AND
MANAGEMENT
(UGC Autonomous, Affiliated to JNTUH, Accredited by NAAC (A+) and NBA)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING


(DATA SCIENCE)

DECLARATION

I “V. Varsha” student of ‘Bachelor of Technology in CSD’, session: 2023 - 2024,


Hyderabad Institute of Technology and Management, Gowdavelly, Hyderabad, Telangana
State, hereby declare that the work presented in this Technical Seminar entitled ‘Data
science for devOps’ is the outcome of our own bonafide work and is correct to the best of
our knowledge and this work has been undertaken taking care of engineering ethics. It
contains no material previously published or written by another person nor material
which has been accepted for the award of any other degree or diploma of the university
or other institute of higher learning, except where due acknowledgment has been made in
the text.

V. Varsha(20E51A6750)

3
ACKNOWLEDGEMENT
An endeavor of a long period can be successful only with the advice of many
well-wishers. We would like to thank our chairman, SRI. ARUTLA
PRASHANTH, for providing all the facilities to carry out Technical Seminar
successfully. We would like to thank our Principal Dr. P. RAJESH KUMAR, who
has inspired lot through their speeches and providing this opportunity to carry out
our technical Seminar successfully. We are very thankful to our Head of the
Department, Dr Ila Chandana Kumari and B-Tech Technical Seminar Coordinator
Mrs.Dr. P. Madhuri We would like to specially thank my internal supervisor
Mrs.Dr. P. Madhuri, ASSOCIATE PROFESSOR for Technical Guidance. We
wish to convey our gratitude and express sincere thanks to all D.C
(DEPARTMENTAL COMMITTEE) and T.R.C (TECHNICAL REVIEW
COMMITTEE) members, non-teaching staff for their support and Co-operation
rendered for successful submission of our Technical Seminar. We also want to
express our sincere gratitude to all my family members and my friends for their
individual care and everlasting moral support.

V. Varsha(20E51A6750)

4
TABLE OF CONTENTS

ABSTRACT……………………………………………………………………………....i
LIST OF FIGURES……………………………………………………………………...ii

1. CHAPTER - 01…………………………………………………………………….....1

● INTRODUCTION

2. CHAPTER - 02……………………………………………………………………….1

● IMPORTANCE OF DEVOPS

2.1 Definition
2.2 Key principle of devops.
2.3 How does devOps works?

3. CHAPTER - 03………………………………………………………………………2

● TOOLS AND TECHNIQUES FOR DEVOPS


3.1 tools
3.1.1 puppet
3.1.2 ansible
3.1.3 Docker
3.1.4 Nagios
3.1.5 CHEF
3.2 Techniques
3.2.1 Devops toolchain
3.2.2 Agile practices
3.2.3 DevSecOps
5
4 CHAPTER - 04…………………………………………………………………….8
 BEST PRACTICES FOR DEVOPS
4.1 Collaborative teams
4.2 Version Control
4.3 Feedback Loop

5 CHAPTER - 05…………………………………………………………………….9

 DEVOPS PROCESS AND LIFE CYCLE


5.1 Process
5.2 Life Cycle
6 CHAPTER - 06…………………………………………………………………….11

 BENEFITS AND CHALLENGES


6.1 Data driven- decision making
6.2 data quality and availabilty
6.3 skill gap
6.4 integration complexity
6.5 security enhancement
6.6 predictive analytics

7 CONCLUSION ……………………………………………………………………15

8 REFERENCES…………………………………………………………………….17

6
LIST OF FIGURES

Sl.no CAPTION

1. DevOps
2. Principles of DevOps
3. Tools and techniques
4. Version control
5. Phases of DevOps
6. Quality and security

7
i

ABSTRACT

Title: Data Science in DevOps

The integration of data science within the DevOps framework has emerged as a pivotal
catalyst for enhancing software development practices. By amalgamating statistical
analysis, machine learning, and predictive modelling , this synergy optimizes the realms
of development, deployment, and operations. Data science empowers DevOps teams by
fostering data-driven decision-making, thereby elevating the efficiency of the entire
software development lifecycle. This topic encapsulates the profound benefits and
challenges inherent in the confluence of data science and DevOps. It delves into how
data science fortifies DevOps practices, enabling organizations to deliver superior
software products while expounding upon the challenges and viable solutions for their
seamless integration. Also, elucidates the strategic imperatives for success: setting
business objectives, engendering a feedback loop between data science and DevOps, and
cultivating a culture of data-driven collaboration. In summary, the integration of data
science into DevOps crystallizes a paradigm shift in software development.
Key words: DevOps, process, models, Techniques

1. INTRODUCTION

8
The term “DevOps” has been around for a while, but it was only in the early
2000s that people started using it to describe a new way of working.
In 2005, at the International Consortium of Automation and Measurement
(I-CAM) annual conference, Joe Banister presented his keynote speech titled
“DevOps: A Key to Improving Software Project Success”. In this talk, he
explained how DevOps is not just about tools or technologies but focuses on
organisational cultural change.
History of DevOps
The history of DevOps is a short one. It was born out of the need to deal
with the speed of software development and increased agility, not to mention
effective software delivery. The rise of Agile development in recent years
has been largely attributed to companies like Amazon, Google and Netflix,
who have embraced this new way of working with their employees. Many
companies have now moved beyond Agile into Continuous Delivery (CD).
This means that rather than releasing new features every few weeks or
months via an annual release cycle as was common before then, they iterate
on those changes continuously throughout their products’ lifecycles so that
each release can be tested against performance metrics before becoming
available for use by customers as part of their regular use case scenarios –
which makes things much simpler when it comes down right down what
exactly constitutes “reads” versus “not reads”

2. IMPORTANCE OF DEVOPS
2.1. Definition

9
 DevOps(“development” and “Operation”) is the combination of
practices and tools designed to increase an organization’s ability to
deliver applications and services faster than traditional software
development processes.
 Which provides a starting point for understanding exactly.

2.2. key principles of DevOps

 DevOps is based o a set of principles that guide its


implementation. These principles include automation,
collaboration, and communication.
 Automation is essential for devops, as it allows teams to reduce
the time and effort needed to build, test, and deploy applications.
 Collaboration ensures that everyone is on the same page and
understands their role in the development process.

2.3. How does DevOps works?

 The DevOps model relies on effective tooling to help teams


rapidly and reliable deploy and innovate for their customers.
These tools automate manual tasks, help teams manage complex
environment at scale, and keep engineers In control of the high
velocity that is enabled by DevOps.

3. TOOLS AD TECHNIQUES
10
3.1TOOLS
3.1.1 Puppet
 puppet is the most widely used DevOps tool.
 It allows the delivery and release of the technology changes
quickly and frequently.
3.1.2 Ansible
 Ansible is leading DevOps tool.
 Ansible is a tool open-source IT engine that automates application
deployment, cloud provisioning, intra service orchestration, and
other IT tools.
3.1.3 Docker
 Docker is a high-end DevOps tool that allows building, ship, and
run distributed applications on multiple system.
3.1.4 Nagios
 Nagios is one of the more useful tools for DevOps. It can determine
the errors and rectify them with the help of network, infrastructure,
server, and log monitoring systems.
3.1.5 CHEF
 A chef is a useful tool for achieving scale, speed, and consistency.
The chef is a cloud-based system and open source technology.

3.2 TECHNIQUES

3.2.1 DevOps toolchain

11
 it creates a cohesive toolchain of integrated DevOps tools that
streamline the development, testing, deployment, and monitoring
processes.
3.2.2 Agile Practices
 the Agile methodology is a project management approach that
involves breaking the project into phases and emphasizes
continuous collaboration and improvement.
3.2.3 DevSevOps
 DevSecOps build tools focus on automated security analysis against
the build output artifact.
 DevsecOps stands for development, security, and operations.

12
4.BEST PRACTICES FOR DATA SCIENCE IN DEVOPS

4.1 Collaborative teams


 Foster a culture of data-driven decision-making and collaboration
across teams.
 It allows teams to share information and resources, and to work
together to improve processes and resolve problems.
4.2 Version Control
 Apply version control to data science code and models using tools
like git to track changes, enable reproducibility, and facilitate
collaboration.
4.3Feedback Loop
 Establish a feedback loop between data science and DevOps teams
for continuous improvement.
 Encourage a culture of continuous improvement by regularly
collecting feedback from users and stakeholders and using it to refine
data science processes.

13
5. DEVOPS PROCESS AND LIFE CYCLE

5.1 Financial market Data

 Data science utilizes historical and real-time market data such as stock
prices, interest rates, exchange price.
 Data science is used in the finance industry in areas such as
algorithmic trading, customer analytics, and many more.

5.2 Economic Data

 Data science leverages economic indicators such as GDP, inflation


rates, employment data, and consumer sentiment surveys.
 Economic data are the data describing an actual economy, past or
present.

5.3 Alternative data

 Data science incorporates non-traditional data sources like social


media sentiment, web scraping to gain a competitive edge.
 Examples of alternative data includes credit card transactions, social
medial commentary, product reviews, and satellites imagery.

14
6. FUTURE TRENDS IN DATA SCIENCE FOR
FINANCE AND ECONOMICS

The emerging trends in data science are data analytics, artificial intelligence,
big data, data science. Businesses want to streamline their business
processes by adopting data-driven models.
AI and machine learning are becoming increasingly important in data
science, and trend is expected to continue.

6.1 Artificial intelligence

 The Integration of AI technology like natural language processing and


computer vision will further enchance data analysis capabilities.
 AI capabilities allow companies to reengineer their business
processes, significantly increase their productivity and reduce cost.

6.2 Explainable Artificial Intelligence

15
 There will be growing emphasis on developing transparent and
interpretable AI models to address regulatory and ethical concerns.
 A set of tools and framework to help you understand and interpret
predictions made by your machine learning models.

6.3 Real- time Analytics

 The ability to process and analyze data in real-time will become


increasingly important for making timely decisions in finance and
economics.

16
7. CHALLENGES AND APPLICATIONS

Despite obvious advantages, data science, unfortunately, might bring you


some hard times. The truth is that operating vast volumes of data rarely
means an effortless tasks.

6.1 Data quality and Security


o Certainly, there are plenty of ways of collecting financial data.
Another vital concern for financial data science is accessing the
data sets appropriately according to complicate requirements.
It’s crucial to ensure permanent security and compliance with
data protection regulations.
o Financial and economic data can be incomplete, inconsistent, or
contain errors, Requiring careful data cleaning and
preprocessing.

6. 2 Ethical Consideration

o Data science in finance and economics raises concerns about


privacy, security, and algorithmic bias that need to be
addressed.

6.3 Risk management

o Data science techniques Can identify and analyze potential


risks, allowing for better risk assessment and Strategies.

6.4 Fraud Detection

o Data Science can detect Fraudulent activities by analyzing


patterns and Anomalies in financial transaction

17
8. Conclusion

Data science is transforming the finance and economics industry by enabling data-driven
decision-making and improving predictive capabilities. The application of data science in
finance and economics extends beyond traditional models, allowing for more accurate
risk assessment, fraud detection. It is about taking calculated risks to maximize profits. It
offers useful information that businesses may use to improve decision-making,
streamline processes, and maintain competitiveness in a constantly changing market.
Data science has revolutionized the fields of finance and economics by providing
powerful tools and techniques for data analysis, prediction, and decision-making.
Through the use of big data, machine learning, and advanced analytics, professional in
these domain can extract valuable insights, reduce risk, optimize investments, and
enhance economic modelling. As data continues to play a central role in shaping
financial strategies and economic policies, the integration of data science methodologies
is expected to remain pivotal for driving innovation and informed decision-making in
these critical sectors.

18
9.References

1. Aruoba, S. B., Diebold, F. X., & Scotti, C. (2009). Real-time


measurement of business conditions. Journal of Business & Economic
Statistics, 27(4), 417–427.
2. Babii, A., Chen, X., & Ghysels, E. (2019). Commercial and residential
mortgage defaults: Spatial dependence with frailty. Journal of Econometrics,
212, 47–77.
3. Baesens, B., Van Vlasselaer, V., & Verbeke, W. (2015). Fraud analytics
using descriptive, predictive, and social network techniques: a guide to data
science for fraud detection. Chichester: John Wiley & Sons.
4. Barbaglia, L., Consoli, S., & Manzan, S. (2020). Monitoring the business
cycle with fine-grained, aspect-based sentiment extraction from news. In V.
Bitetta et al. (Eds.), Mining Data for Financial Applications (MIDAS 2019),
Lecture Notes in Computer Science (Vol. 11985, pp.101–106). Cham:
Springer. https://doi.org/10.1007/978-3-030-37720-5_8
5. Barra, S., Carta, S., Corriga, A., Podda, A. S., & Reforgiato Recupero, D.
(2020). Deep learning and time series-to-image encoding for financial
forecasting. IEEE Journal of Automatica Sinica, 7, 683–692.
6. Benidis, K., Rangapuram, S. S., Flunkert, V., Wang, B., Maddix, D. C.,
Türkmen, C., Gasthaus, J., Bohlke-Schneider, M., Salinas, D., Stella, L.,
Callot, L., & Januschowski, T. (2020). Neural forecasting: Introduction and
literature overview. CoRR, abs/2004.10240.
7. Berners-Lee, T., Chen, Y., Chilton, L., Connolly, D., Dhanaraj, R.,
Hollenbach, J., Lerer, A.,
& Sheets, D. (2006). Tabulator: Exploring and analyzing linked data on the
semantic web. In Proc. 3rd International Semantic Web User Interaction
Workshop (SWUI 2006).

19

You might also like