Professional Documents
Culture Documents
Asm1 1641
Asm1 1641
Asm1 1641
Student declaration
I certify that the assignment submission is entirely my own work and I fully understand the consequences of plagiarism. I understand
that making a false declaration is a form of malpractice.
Student’s signature Km
Grading grid
P1 P2 M1 M2 D1 D2
❒ Summative Feedback: ❒ Resubmission Feedback:
List of Figures
KKC company is built by 3 members in group 5, this member is students at Greenwich's university. KKC company built with
purpose evaluate, analyze amount of salary of data science in recent years. We will analyze the salary of a staff based their
work experience, role worked in during the year, year start works for company, etc. Then we will guess their current salary,
compare it with dataset and evaluate if this salary is reasonable or not.
2. Business Intelligence
a) Definition
Business intelligence (BI) is a technology-driven process for analyzing data and delivering actionable information that helps
executives, managers and workers make informed business decisions. As part of the BI process, organizations collect data
from internal IT systems and external sources, prepare it for analysis, run queries against the data and create data
visualizations, BI dashboards, and reports to make the analytics results available to business users for operational decision-
making and strategic planning.
The goal of BI initiatives is to drive better business decisions that enable organizations to increase revenue, improve
operational efficiency and gain competitive advantages over business rivals. To achieve that goal, BI incorporates a
combination of analytics, data management and reporting tools, plus various methodologies for managing and analyzing
data.
Business intelligence architecture includes more than just BI software. Business intelligence data is typically stored in a data
warehouse built for an entire organization or in smaller data marts that hold subsets of business information for individual
departments and business units, often with ties to an enterprise data warehouse. In addition, data lakes based on Hadoop
clusters or other big data systems are increasingly used as repositories or landing pads for BI and analytics data, especially
for log files, sensor data, text, and other types of unstructured or semi structured data.
BI data can include historical information and real-time data gathered from source systems as it's generated, enabling BI
tools to support both strategic and tactical decision-making processes. Before it's used in BI applications, raw data from
different source systems generally must be integrated, consolidated, and cleansed using data integration and data quality
management tools to ensure that BI teams and business users are analyzing accurate and consistent information.
BI initiatives also provide narrower business benefits among them, making it easier for project managers to track the
status of business projects and for organizations to gather competitive intelligence on their rivals. In addition, BI, data
management and IT teams themselves benefit from business intelligence, using it to analyze various aspects of
technology and analytics operations.
3. Business Process
a) Definition
A business process is defined as a collection of business tasks and activities that, when performed by people or systems in a
structured course, produce an outcome that contributes to the business goals. A business process includes at least one of, but
not limited to, the following elements:
• Task/ Activity
• System
• Employee(s)
• Workflow
• Data
Business processes are invented to derive and contribute to organizational goals. The continuous and repeated execution of
business processes is pivotal to successful business operations and business growth.
Business process structures can be simple or complex, based on the elements involved in the process. Through every business
process, a business strives to achieve certain goals.
Step 1: Define Your Goals - At this stage, you will first define the basic purpose of the process as well as the reasons for
creating it.
Step 2: Plan and Map Your Process - Here you ruminate on effective strategies you can adopt to achieve the set goals and
what resources you need for this.
Step 3: Set Actions and Assign - People that will handle it Who are the best individuals to handle the tasks? What are the
machines they will need to execute the plan? These and more are what you will make decisions about here.
Step 4: Test the Process - The goal here is to first scrutinize the process at a low-key level and see how it fared.
Step 5: Implement the Process - If you are convinced with the results you get after testing the process, then it is time to
run it on a global level. At this stage, present the process to all those who will be handling the tasks and train them for
better results.
Step 6: Monitor the Results - Try reviewing the process and checking for any threats it may later bring and find a way to
mitigate the risk.
Step 7: Repeat - If you were able to achieve the set goals with the process, continue to replicate it and always track its
effectiveness and progress along the line.
The processes that plan, organize, coordinate and control all the functions of the business fall under management
processes. These processes are goal oriented. It includes helping and motivating your team to achieve their targets.
These processes also help in forming a direction for the further growth of your business. Regulating day to day tasks,
teaching employees how to complete a task effectively, launching a new product, etc. are some of the processes under
this category.
These are the main functions of the business. These processes are directly related to the fundamental values, vision,
and mission of the business. They are also known as primary processes. You need to give extra attention to these
processes as they are the primary revenue streams of the company. Filling the gaps and adding necessary improvements
in these processes is essential. Once you complete the evaluation of these processes, it will have a direct impact on the
growth of your business.
E.g., McDonald’s restaurant’s operating process would be taking orders, making food, and serving it to the customers.
These are the processes that are not directly related to the delivery of the service or product to the customer. But they
help the business create an environment where the primary processes can work better. That is where the name
“supporting process” comes from. These are the processes under the accounting department, human resource
management department, and any other department that supports the main functions of the business.
4. Support Process
4.1. Definition
Support processes provide essential backing to the core business processes. They may not directly contribute to the creation
or delivery of the organization's key products or services, but without them, the core operations could falter. Support processes
typically encompass functions that maintain the smooth running of the organization, enabling other processes to deliver
optimal results.
Information Technology (IT) and Human Resources (HR) processes are the quintessential examples of support processes. They
exist not to directly drive the primary purpose of the company, but to facilitate and enhance the performance of those core
processes that do. IT processes ensure the seamless operation of the technological infrastructure, while HR processes focus on
staff recruitment, retention, development, and wellbeing – all crucial elements in ensuring that the company has the right
people, in the right roles, performing at their best.
Other examples of support processes include:
• Financial processes: These processes ensure that the organization's finances are managed effectively, including
accounting, budgeting, and forecasting.
• Procurement processes: These processes ensure that the organization has the necessary goods and services at
the right time and price.
• Logistics processes: These processes ensure that goods and services are delivered to customers in a timely and
efficient manner.
• Customer service processes: These processes ensure that customers receive the highest possible level of service.
4.3. Steps of Support Process
II. Differentiate between unstructured and semi-structured data within an organization. (M1)
The process of assessing structured or unstructured data and categorizing it based on file type, contents, and other information is
known as data classification. Data categorization enables companies to answer critical questions about their data, which in turn
influences how they reduce risk and manage data governance regulations. It can inform you where your most essential data is stored
or what kind of sensitive data your users generate the most frequently. To comply with contemporary data privacy standards,
comprehensive data categorization is required (but not sufficient).
Figure 4: Type of Data.
1. Structured Data
Structured data is known as quantitative data and is objective facts and numbers that analytics software can collect -- this type
of data is easy to export, store, and organize in a database such as Excel or SQL. Structured data is valuable because you can
gain insights into overarching trends by running the data through data analysis methods, such as regression analysis and pivot
tables.
Alternatively, semi-structured data does not conform to relational databases such as Excel or SQL, but nonetheless contains
some level of organization through semantic elements like tags. For instance, consider HTML, which does not restrict the
amount of information you can collect in a document, but enforces a certain hierarchy:
3. Unstructured Data
From 80% to 90% of data generated and collected by organizations is unstructured, and its volumes are growing rapidly —
many times faster than the rate of growth for structured databases. Unstructured data stores contain a wealth of information
that can be used to guide business decisions. With the help of AI and machine learning, new software tools are emerging that
can search through vast quantities of it to uncover beneficial and actionable business intelligence.
Figure 7: Unstructured Data types.
LO2: Compare the tools and technologies associated with business intelligence functionality.
III. Compare the types of support available for business decision-making at varying levels within an organization.(P2)
1. The tools and technologies associated with business intelligence functionality.
a) Why use business intelligence tools?
To begin with, data discovery, which was previously restricted to the skills of advanced analytics professionals, is now
something that anybody can perform using these tools. Not only that, but these technologies provide you with the
information you need to achieve things like growth, handle pressing concerns, collect all of your data in one place,
estimate future results, and much more.
Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level
built in data structures, combined with dynamic typing and dynamic binding, make it very attractive for Rapid
Application Development, as well as for use as a scripting or glue language to connect existing components together.
Python's simple, easy to learn syntax emphasizes readability and therefore reduces the cost of program
maintenance. Python supports modules and packages, which encourages program modularity and code reuse. The
Python interpreter and the extensive standard library are available in source or binary form without charge for all
major platforms and can be freely distributed.
o Advantages
• Easy to Read, Learn and Write - Python is a high-level programming language that has English-like syntax. This makes
it easier to read and understand the code. Python is easy to pick up and learn, that is why a lot of people recommend
Python to beginners. You need less lines of code to perform the same task as compared to other major languages
like C/C++ and Java.
• Improved Productivity - Python is a very productive language. Due to the simplicity of Python, developers can focus
on solving the problem. They don’t need to spend too much time understanding the syntax or behavior of the
programming language. You write less code and get more things done.
• Interpreted Language - Python is an interpreted language which means that Python directly executes the code line
by line. In case of any error, it stops further execution and reports back the error which has occurred. Python shows
only one error even if the program has multiple errors. This makes debugging easier.
• Dynamically Typed - Python doesn’t know the type of variable until we run the code. It automatically assigns the
data type during execution. The programmer doesn’t need to worry about declaring variables and their data types.
• Free and Open-Source - Python comes under the OSI approved open-source license. This makes it free to use and
distribute. You can download the source code, modify it and even distribute your version of Python. This is useful
for organizations that want to modify some specific behavior and use their version for development.
• Vast Libraries Support - The standard library of Python is huge, you can find almost all the functions needed for your
task. So, you don’t have to depend on external libraries. But even if you do, a Python package manager (pip) makes
things easier to import other great packages from the Python package index (PyPi). It consists of over 200,000
packages.
• Portability - In many languages like C/C++, you need to change your code to run the program on different platforms.
That is not the same with Python. You only write once and run it anywhere. However, you should be careful not to
include any system-dependent features.
o Disadvantages:
• Slow Speed - We discussed above that Python is an interpreted language and dynamically typed language. The line-
by-line execution of code often leads to slow execution. The dynamic nature of Python is also responsible for the
slow speed of Python because it must do extra work while executing code. So, Python is not used for purposes
where speed is an important aspect of the project.
• Not Memory Efficient - To provide simplicity to the developer, Python has to do a little tradeoff. The Python
programming language uses a large amount of memory. This can be a disadvantage while building applications
when we prefer memory optimization.
• Weak in Mobile Computing - Python is generally used in server-side programming. We don’t get to see Python on
the client-side or mobile applications because of the following reasons. Python is not memory efficient, and it has
slow processing power as compared to other languages.
• Database Access - Programming in Python is easy and stress-free. But when we are interacting with the database,
it is behind. The Python’s database access layer is primitive and underdeveloped in comparison to the popular
technologies like JDBC and ODBC. Huge enterprises need smooth interaction of complex legacy data and Python is
thus rarely used in enterprises.
• Runtime Errors - As we know Python is a dynamically typed language so the data type of a variable can change
anytime. A variable containing integer number may hold a string in the future, which can lead to Runtime Errors.
Therefore, Python programmers need to perform thorough testing of the applications.
Power BI is a collection of software services, apps, and connectors that work together to turn your unrelated sources
of data into coherent, visually immersive, and interactive insights. Your data might be an Excel spreadsheet, or a
collection of cloud-based and on-premises hybrid data warehouses. Power BI lets you easily connect to your data
sources, visualize and discover what's important, and share that with anyone or everyone you want.
o Pros of Power BI
• Affordability - A major advantage of using Power BI for data analysis and visualization is that it is affordable and
relatively inexpensive. The Power BI Desktop version is free of cost. You can download and start using it to make
reports and dashboards on your computer. However, if you want to use more Power BI services and publish your
reports on the cloud, you can take the Power BI Cloud service solution for $9.99 per user per month. Thus, Power
BI is offered at a fair price as compared to other BI tools.
• Custom Visualizations - Power BI offers a wide range of custom visualizations i.e. visualizations made by developers
for a specific use. Custom visuals are available on Microsoft marketplace. In addition to the general set of
visualizations available you can use Power BI custom visualizations in your reports and dashboards.
• Excel Integration - In Power BI, you also have the option to upload and view your data in Excel. You can
select/filter/slice data in a Power BI report or dashboard and put it on Excel. You can then open Excel and view the
same data in tabular form in an Excel spreadsheet.
• Data Connectivity - Another major advantage of using Power BI as your data analysis tool is that you can import
data from a wide range of data sources. It offers data connectivity to data files (such as XML, JSON), Microsoft Excel,
SQL Server databases, Azure sources, cloud-based sources, online services such as Google Analytics, Facebook, etc.
• Prompt Updates - Power BI gets upgrades from Microsoft every month. Microsoft has made a user community
where users can upload their suggestions and tweaks about Power BI. The suggestions that get the most rating by
other users have a good chance of getting included in the next update of Power BI.
o Cons of Power BI
• Table Relationships - Power BI is good with handling simple relationships between tables in a data model. But, if
there are complex relationships between tables, that is, if they have more than one links between tables, Power
BI might not handle them well.
• Configuration of Visuals - In most cases, you might not feel the need to configure and optimize visualizations in
Power BI. But even if you do, Power BI does not provide many options to configure your visualizations as per your
requirements. Thus, users have limited options for what they can change in visuals.
• Crowded User Interface - The user interface of Power BI is often found crowded and bulky by the users. It is in the
sense that there are many icons of options that block the view of dashboard or report.
• Handling Large Data Volumes - Power BI has a limit of ingesting data at a time which is approximately 2 GBs of
data. If you wish to import and use data of even greater volumes, you need to extend your free version to a paid
version of Power BI.
• Complex to Understand and Master - Usually, Power BI is the easiest to use BI tool if you are using it simply to
import data and create reports. But Power BI is an entire suite having a lot of other interrelated tools.
iii. Tableau
Tableau is the most powerful, secure, and flexible end-to-end analytics platform for your data. Elevate people with
the power of data. Designed for the individual, but scaled for the enterprise, Tableau is the only business intelligence
platform that turns your data into insights that drive action.
o Advantages of Tableau
▪ Data visualization - Tableau is a data visualization tool first and foremost. Therefore, it’s
technology is there to support complex computations, data blending and dashboarding
for the purpose of creating beautiful visualizations that deliver insights that cannot easily
be derived from staring at a spreadsheet. It has climbed to the top of the data visualization
heap because of it’s dedication to this purpose
▪ Quickly Create Interactive visualizations: Using drag-n-drop functionalities of Tableau, the
user can create a very interactive visual within minutes. The interface can handle endless
variations while also limiting you from creating charts that are against data visualization
best practices.
▪ Ease of Implementation: There are many different types of visualization options available
in Tableau which enhance the user experience. Also, Tableau is very easy to learn
compared to Python, Business Objects and Domo, anyone without having knowledge of
coding can easily learn Tableau.
▪ Tableau can handle large amounts of data: Tableau can handle millions of rows of data
with ease. Different types of visualization can be created with a large amount of data
without impacting the performance of the dashboards.
▪ Use of other scripting languages in Tableau: To avoid the performance issues and to do
complex table calculations in Tableau, users can incorporate Python or R. Using Python
script can take the load off the software by performing data cleansing tasks with
packages.
▪ Mobile Support and Responsive Dashboard: Tableau Dashboard has a great reporting
feature that allows you to customize dashboard specifically for a certain device such as a
mobile phone or laptop.
o Disadvantages of Tableau
▪ Scheduling or notification of reports: Tableau does not provide the feature of automatic
refreshing of the reports with the help of scheduling. There is no option of scheduling in
Tableau. Therefore, there is always some manual effort required when users need to
update the data in the back end.
▪ No Custom Visual Imports - Tableau is not a completely open tool. Unlike other tools like
Power BI, developers can create custom visuals that can be easily imported from Tableau.
So, any new visuals need to be recreated instead of imported.
▪ Custom formatting in Tableau: Tableau’s conditional formatting and limited 16 column
table displays are pain points for users. Also, to implement the same formatting to
multiple fields there is no way a user can do that for all fields directly.
▪ Static and single value parameters: Tableau’s parameters are static and always single
value can be selected using a parameter. Whenever the data gets changed, these
parameters need to be updated manually every time.
▪ Screen Resolution on Tableau Dashboards: The layout of the dashboards gets disturbed if
the tableau developer’s screen resolution is different from end user’s screen resolution
for example if the dashboards have been created in a screen resolution of 1920X1080 and
are being viewed in 2560X1440 then the layout of the dashboards will get distorted a little
bit.
2. Levels of Management Decision Making
2.1. Definition
Decision making is the process of making choices by identifying a decision, gathering information, and assessing alternative
resolutions. Using a step-by- step decision-making process can help you make more deliberate, thoughtful decisions by
organizing relevant information and defining alternatives.
Strategic decision-making refers to when a business bases its shorter-term decisions on the longer-term vision for the
direction of the organization. This method typically entails using small, quantifiable goals that contribute to its overall
mission. Managers typically make strategic decisions regarding high-level factors that are crucial to the success of an
organization, for instance, budget, structure, and potential risk. Successful strategic decision-making is usually a
collaborative process that involves a leadership team and even team members and company employees, depending on
the context.
Example of Strategic decision making: A dog food company's mission is to sell the healthiest dog food in the country.
The company director notices their customers prefer high-quality freshly prepared food instead of cheaper dog food
sold in larger amounts, although they're similar in terms of nutritional value. To guide the future of the company to suit
the evolving tastes of their customers, the director alters their product line to focus on high-quality, freshly prepared
dog food. While the products have a shorter shelf-life, they also result in an increased profit margin because dog owners
will pay a higher price for freshly prepared food.
➢ This is an example of a strategic decision. The director of the dog food company decided to alter the
company's product line to focus on higher-quality foods that cost more, considering the evolving needs
of its customers. If the company's mission was to be the top supplier of low-priced dog food, then it
wouldn't make strategic sense to prioritize fresh ingredients. Sourcing cheaper ingredients with a longer
shelf life would align more with the company mission.
2.3.2. Tactical decision making
Tactical decisions translate strategic decisions into action. Tactical decisions are more straightforward and less complex
than strategic-level decisions. They are decisions taken by middle level management that are usually semi-structured.
When they are in alignment with your company’s core values or its overall mission, tactical decisions add even more
value to the outcomes of the implementation. Conversely, if tactical decisions become detached from the company’s
direction, you and your employees end up expending a lot of effort on tasks that don’t help the company achieve its
goals or vision.
Operational decisions are short-term choices that are typically made on a weekly, daily, or hourly basis. They are
primarily concerned with operational details, daily resource allocation, inventory control, and delivery routing to
maximize product flow along biomass-based production chains. Due to the changing internal and external conditions
of supply chains and related activities, operational decisions might be changed and amended often.
Examples of Operational Decisions that Businesses make:
• Monthly and daily forecasts to determine the demand from end users.
• Determining the deadline for satisfying demand and analyzing the variances between lead time and deadline.
• Assigning a cargo of biomass products or sources to a certain method of transportation and vehicle.
• Routes and timetables for daily and weekly deliveries, fleet management.
• Keeping track of supply shortfalls and backlogs, planning additional daily and weekly operations to address the
shortages.
3. Compare the various degrees of assistance provided for business decision making inside a company.
3.1. Strategic Decision
Strategic decision making requires determining the optimal path to achievement. For example, if you are starting a new
company, you must consider factors such as cost, time, and target market. How do you sort through decisions to find the
best solution? Strategic decision making will assist you in developing an action plan and aligning your short-term goals with
the big picture. Strategic decision making is different from day-to-day decisions from a management standpoint. As a
manager, you must allocate tasks, communicate goals to team members or external stakeholders, and account for
uncertainties. The judgments you make apply not only to you but also to the entire company. You should practice making
objective decisions, free from bias and prejudice.
According to Blue Collar University, tactical decision making is an approach to business in which decisions are made with
the aim of ensuring the success of the company. This implies that the company makes decisions that contribute to its long-
term viability, profitability, and continued progress in all areas of its operations. Tactical decision making involves the big
picture. the whole scene as well as the here and now. According to Chron.com Small Business, while making tactical
decisions, the entire company is involved. Tactical decision making allows a company to evaluate its infrastructure and
operations and make changes when necessary. A tactical decision is made when the board considers the facts presented
and selects an immediate course of action with the intention of accomplishing a specific objective that will provide direct
benefits. directly and immediately for most people. According to Business Case Studies, despite the fact that they involve
many individuals and departments, tactical options have limited impact in the event of failure because they can be
corrected, and fixed or modified relatively quickly.
A form of short-term option exercised by a company instead of long-term plans while acquiring the company's assets. These
issues relate to the day-to-day operations of the company and therefore must be handled before any major strategic issues.
Activity options are often quite detailed and clear. Although individual operational decisions have little lasting or significant
impact on the organization, they direct the implementation of strategic and tactical strategies. The operations manager is
responsible for responding to the company's executives regarding the operational choices made by the operations
manager.
IV. Justify, with specific examples, the key features of business intelligence functionality.
1. Key Business Intelligence Features
a) Executive dashboards
A personalized business intelligence dashboard delivers relevant, easily understood real-time data to business leaders,
enabling faster and better decision-making whilst lowering response times to internal and external events. Executives need
access to personalized dashboards that deliver easy-to-understand KPIs and summary information on a regular, scheduled
basis. Why? Because personalized BI dashboards allow executives and business users alike to quickly and easily see the
data that is important to them, and make critical decisions based on that information. More importantly, they can
consistently stay up-to-date on their company's performance, all without having to sift through a pile of reports.
b) Location intelligence
Location intelligence is the ability to map and visualize data in geographical formats. It provides a real-time data stream
that tracks the geographic location of customers, which can be a powerful tool for businesses. With location intelligence,
BI users can understand the patterns and trends in customer behavior, which can help them make better decisions about
marketing and sales. Exploring and visualizing data sets based on spatial elements enables organizations to understand
their business operations from new perspectives, such as sales per region.
“What if” analysis allows businesses to assess the potential effects of critical business decisions before they’re actually
made. Using existing data, users can formulate strategies to achieve business targets and avoid the default “hit and miss”
approach. This helps management undertake accurate strategic planning. Artificial intelligence (AI) can be leveraged to
simulate future scenarios and determine the potential effects of different actions. This type of analysis can help businesses
make informed decisions about their operations and future. Predictive analytics can also help businesses understand
customer behavior and predict future trends.
d) Interactive reports
Interactive reports help users convert data into knowledge. They allow users to better understand the analysis within
reports, and the underlying data those reports are based on, to support better decision-making. User should be able to:
e) Ranking reports
This feature allows you to create reports that order specific categories of information, from across multiple dimensions, by
selecting specific criteria. Ranking reports let you view the best and worst performing aspects of your business. For
example, you could create a report that ranks your top salaries. This is where ranking reports can allow businesses to
automatically rank different data sets in order of specific importance and enable their end-users to identify which data sets
need more attention quickly and easily.
f) Cross-platform compatibility
Not only should a BI solution be compatible with a variety of different platforms, but a key feature of BI solutions is the
ability to easily share data between different platforms. This allows businesses to more easily share data between different
departments and collaborate on projects.
2. Tools for applying for scenario.
Applying Python for BI
2.1. Pandas and Numpy.
Pandas is a very popular library for working with data (its goal is to be the most powerful and flexible open-source tool, and in
our opinion, it has reached that goal). DataFrames are at the center of pandas. A DataFrame is structured like a table or
spreadsheet. The rows and the columns both have indexes, and you can perform operations on rows or columns separately. A
pandas DataFrame can be easily changed and manipulated. Pandas have helpful functions for handling missing data,
performing operations on columns and rows, and transforming data. If that wasn’t enough, a lot of SQL functions have
counterparts in pandas, such as join, merge, filter by, and group by. With all of these powerful tools, it should come as no
surprise that pandas are very popular among data scientists.
NumPy is an open-source Python library that facilitates efficient numerical operations on large quantities of data. There are a
few functions that exist in NumPy that we use on pandas DataFrames. For us, the most important part about NumPy is that
pandas is built on top of it. So, NumPy is a dependency of Pandas.
Matplotlib is a Python library used for plotting graphs with the help of other libraries like Numpy and Pandas. It is a powerful
tool for visualizing data in Python. It is used for creating statical inferences and plotting 2D graphs of arrays. It was first
introduced by John D. Hunter in 2002. It uses Pyplot for providing MATLAB like interface free and open-source. It is capable of
dealing with various operating systems and their graphical backends.
Seaborn is also a Python library used for plotting graphs with the help of Matplotlib, Pandas, and Numpy. It is built on the roof
of Matplotlib and is considered as a superset of the Matplotlib library. It helps in visualizing univariate and bivariate data. It
uses beautiful themes for decorating Matplotlib graphics. It acts as an important tool in picturing Linear Regression Models. It
serves in making graphs of statical Time-Series data. It eliminates the overlapping of graphs and also aids in their beautification.
Figure 16: Matplotlib and Seaborn.
Scikit-learn (Sklearn) is the most useful and robust library for machine learning in Python. It provides a selection of efficient
tools for machine learning and statistical modeling including classification, regression, clustering and dimensionality reduction
via a consistence interface in Python. This library, which is largely written in Python, is built upon NumPy, SciPy and Matplotlib.