Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

In simple way we can say that data science is a field which helps the business to extract useful

information out of the data so that business head can make some important decision for his company.

Let’s give an Example.

Let's say you work for a small online bookstore. Your manager wants to understand which genres of
books are most popular among customers to better the inventory.

As a data analyst/data scientist, your tasks might include:


Data Collection: Gather data from the bookstore's sales records, which includes information on each
book sold, including its genre.

Data Cleaning and Preprocessing: Clean the data to remove any errors or inconsistencies. Ensure that
each book's genre is correctly categorized and standardized.

Exploratory Data Analysis (EDA): Analyze the sales data to identify trends and patterns. You might create
a bar chart showing the distribution of book sales by genre.

Analysis and Insights: Based on your analysis, you find that mystery and romance novels are the best-
selling genres, followed by fantasy and science fiction.

Reporting and Visualization: Create a report or presentation summarizing your findings, including
visualizations like charts or graphs to illustrate the popularity of different genres.

Recommendations: Provide recommendations to your manager on how to adjust inventory levels or


marketing strategies based on the popularity of different genres. For example, you might suggest
increasing the stock of mystery and romance novels and promoting them more prominently on the
website.

By analyzing sales data, you help the bookstore make data-driven decisions to optimize its inventory and
improve customer satisfaction.

Now after understanding Data science lets understand different branches of data science.

1 Data engineer: A data engineer is like the architect and builder of a sturdy bridge that connects
different places. Instead of roads and rivers, they deal with data from various sources like databases,
applications

In short data engineer job is to construct a pathway for others branches.

Now what kind of pathway?????


Let’s understand in terms of an example>> now assume a data engineer is like a construction worker

Their job involves:

Building Data Highways: They create pathways (data pipelines) that safely move data from one place to
another, ensuring it reaches its destination smoothly and on time.

Constructing Data Warehouses: They set up storage spaces (data warehouses) where all the data can be
stored neatly, making it easy to find and use when needed.

Organizing the Traffic: They design systems to organize and structure data in a way that makes it easy to
understand and work with, just like lanes and signs keep traffic flowing smoothly.
Cleaning Up the Mess: They develop tools and processes to clean and tidy up messy data, so it's
accurate and reliable for analysis, just like cleaning a cluttered room.

Connecting the Dots: They integrate data from different sources and systems, making sure they all work
together seamlessly, like connecting roads and highways.

Making Sure Everything Runs Smoothly: They keep an eye on the data systems, fixing any potholes
(performance issues) or roadblocks (errors) that might slow things down or cause problems.

2. Data scientist: Think of a data scientist as a detective with a magnifying glass, searching for clues and
solving mysteries, but instead of investigating crimes, they're exploring data to uncover valuable insights
and solve complex problems.

Here's how a data scientist works:

Investigating Data Mysteries: Like a detective, a data scientist dives deep into data sets, searching
for patterns, trends, and correlations that others might overlook.

Gathering Evidence (Data Collection): They collect data from various sources, just like a detective
collects evidence from crime scenes. This could include customer transactions, website
interactions, or sensor readings.

Analyzing Clues (Exploratory Data Analysis): Using statistical tools and techniques, they analyze
the data to understand its characteristics and find important clues. Visualizations, like maps or
timelines, help them see the big picture.

Building Hypotheses (Modeling): Just as a detective forms theories about how a crime was
committed, a data scientist develops models to explain and predict behavior based on the
evidence they've gathered.

Testing Theories (Evaluation): They test their models against new data to see how well they
perform, just like a detective might test a theory by gathering more evidence.

Solving the Case (Insights and Recommendations): Finally, they draw conclusions from their
analysis and provide recommendations based on their findings, helping businesses make informed
decisions and solve real-world problems.

Overall, data scientists are like modern-day detectives, using their analytical skills and tools to
uncover hidden insights and drive innovation.

3. Data Analyst: Imagine you're a chef in a bustling restaurant kitchen. Your job as a data analyst is
similar to that of a sous chef who prepares ingredients, follows recipes, and ensures dishes are cooked
to perfection.

Here's how a data analyst works in this scenario:


Preparing Ingredients (Data Collection): Just as a sous chef gathers fresh ingredients from the
pantry, fridge, and suppliers, a data analyst collects data from various sources such as databases,
spreadsheets, and surveys.

Following Recipes (Data Cleaning and Preprocessing): Like a sous chef who washes, chops, and
organizes ingredients before cooking, a data analyst cleans and preprocesses data, ensuring it's
consistent, accurate, and ready for analysis.

Cooking Techniques (Exploratory Data Analysis): Using cooking techniques to enhance flavors, a
data analyst employs statistical methods and visualization tools to explore the data, uncovering
patterns, trends, and relationships between different variables.

Tasting and Adjusting (Analysis and Insights): Just as a sous chef tastes dishes during cooking to
adjust seasoning and flavors, a data analyst continuously evaluates their analysis, refining their
approach and insights based on feedback and new discoveries.

Plating and Presentation (Reporting and Visualization): Finally, like a sous chef who plates dishes
beautifully for presentation, a data analyst presents their findings in visually appealing reports,
dashboards, or presentations, making it easy for others to understand and digest the insights.

Overall, a data analyst is like a sous chef in the kitchen of data, meticulously preparing, analyzing,
and presenting information to help decision-makers create delicious outcomes for their
businesses.

4.Business Analyst: Imagine you're on a large ship, sailing through uncharted waters toward a distant
destination. The business analyst is like the navigator, responsible for plotting the course, guiding the
ship, and ensuring a smooth journey.

Here's how a business analyst works in this analogy:

Charting the Course (Understanding Business Goals): Just as a navigator studies maps and charts to plan
the ship's route, a business analyst works closely with stakeholders to understand the organization's
goals, strategies, and challenges.

Navigating the Journey (Data Collection and Analysis): Using a compass and navigation tools, the
navigator tracks the ship's progress and adjusts the course as needed. Similarly, a business analyst
gathers data from various sources, analyzes trends, and identifies opportunities or risks that could
impact the organization's success.

Monitoring Conditions (Market Research): The navigator keeps an eye on weather conditions, currents,
and other factors that could affect the ship's journey. Likewise, a business analyst conducts market
research to understand industry trends, competitor activities, and economic factors that could impact
the business.

Communicating with the Crew (Stakeholder Engagement): The navigator communicates regularly with
the captain and crew, providing updates on the ship's position and any navigational adjustments.
Similarly, a business analyst collaborates with stakeholders across the organization, sharing insights and
recommendations to support decision-making.
Making Course Corrections (Recommendations and Strategy): Based on their analysis and insights, the
navigator advises the captain on course corrections to optimize the ship's route and avoid obstacles.
Similarly, a business analyst provides recommendations to business leaders, suggesting strategic
initiatives or operational improvements to achieve business objectives.

Ensuring a Safe Arrival (Business Impact): Ultimately, the navigator's goal is to guide the ship safely to its
destination, on time and within budget. Similarly, the business analyst aims to drive positive business
outcomes by helping the organization navigate challenges, capitalize on opportunities, and achieve its
goals.

In summary, a business analyst serves as a strategic navigator, using data and insights to guide the
organization toward success in a dynamic and ever-changing business environment.

Now we are clear about the different branches of data science. Now let’s understand what it takes to
get into these branches.
Let’s talk about Data analyst

Normally DA works on
1.data manipulation.
2.data visualization.

1. Data manipulation refers to the process of altering, transforming, or organizing data to make it
more useful for analysis or presentation. This can involve a variety of operations, including:

Filtering: Selecting specific rows or columns of data based on certain criteria or conditions.

Sorting: Rearranging data in a particular order, such as alphabetical or numerical.

Joining: Combining data from multiple sources or tables based on common fields or keys.

Aggregating: Summarizing or grouping data to calculate statistics such as sums, averages, counts, or
percentages.

Cleaning: Removing or correcting errors, inconsistencies, or missing values in the data.

Transforming: Converting data from one format or structure to another, such as converting dates to a
standard format or creating new variables based on existing ones.

Data manipulation is a fundamental step in the data analysis process, as it allows analysts to prepare
and shape the data in a way that facilitates insights and decision-making. It often involves using tools
such as SQL queries, scripting languages like Python or specialized data manipulation software like Excel
2. Data visualization is the process of representing data graphically to provide insights, identify
patterns, and communicate findings effectively. It involves creating visual representations of
data that are easy to understand and interpret. Some basic concepts and techniques in data
visualization include:

Charts and Graphs: Using various types of charts and graphs, such as bar charts, line graphs, pie charts,
histograms, scatter plots, and heatmaps, to visually display different aspects of the data.

Color Coding: Using colors strategically to differentiate categories, highlight trends, or encode additional
information in the visualizations.

Interactivity: Adding interactive elements to visualizations, such as tooltips, zooming, filtering, or drill-
down capabilities, to allow users to explore the data in more detail.

Dashboard Design: Creating dashboards that combine multiple visualizations and key performance
indicators (KPIs) to provide a comprehensive view of the data and facilitate decision-making.

Data Storytelling: Structuring visualizations in a narrative format to tell a compelling story or convey a
specific message to the audience.

Choosing the Right Visualization: Selecting the most appropriate type of visualization based on the data
characteristics, the analysis goals, and the audience's needs and preferences.

Tools that are used for data viz are Tableau and power BI we can also use tools like excel for basic
dashboarding

What are the important Tools that we use in our Data Analyst
1. For data manipulation we use tools like Excel, SQL, Python etc.
2. For visualization we use tools like Tableau, Power BI and Excel.
Now why we use Visualization Tools????

Now let’s say we have a DATA for Amazon.


Here we can see we have some order id, date, city, sale, profit etc.

If we look at this data and we want to know that what is the total sales for Florida it will be difficult for
us to answer this question if the data is long.
So, we need some kind of visualization tools to represent this data into graph.
Now we know why we us visualization tools
But what about data manipulation tools>>>>

We manipulate Data using different Tools like Excel, Python, SQL etc. We know what is Excel but let’s
talk about SQL and python

You might also like