MACHINE LEARNING, BIG DATA AND IMAGE PROCESSING
MINTIC / ICETEX SCHOLARSHIP

ANDRES DARIO SUAREZ GUARNIZO

MECHATRONICS ENGINEER

10.27.2020
AGENDA
INTRODUCTION TO PYTHON

SUPERVISED LEARNING

PYSPARK AND BIG DATA

IMAGE PROCESSING

INTRODUCTION TO PYTHON
WHAT IS PYTHON

Python is an interpreted programming language whose philosophy emphasizes the readability of its code.
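
As a small illustration of that readability, here is a minimal sketch of a word counter. The function name `word_frequencies` and the sample sentence are made up for this example; they are not part of the original slides.

```python
def word_frequencies(text):
    """Count how often each word appears, ignoring case."""
    counts = {}
    for word in text.lower().split():
        # dict.get returns 0 the first time a word is seen
        counts[word] = counts.get(word, 0) + 1
    return counts


# Hypothetical sample sentence, chosen only for illustration
frequencies = word_frequencies("Readability counts and simple code counts")
print(frequencies)
```

The code runs line by line through the interpreter, with no separate compile step, and reads close to plain English.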

WHICH LANGUAGE IS BETTER?

Team R or Team Python?, Genevieve Hayes, 2019


NOTEBOOKS AND IDE

LET’S CODE IT
INTRODUCTION TO PYTHON

QUESTIONS

SUPERVISED LEARNING
MACHINE LEARNING

Supervised learning is a technique for deducing a function from training data. The training data consists of pairs of objects: one component of each pair is the input data and the other is the desired result.
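
A minimal sketch of that idea, assuming the function to deduce is a straight line and using ordinary least squares in plain Python. The training pairs and the helper name `fit_line` are invented for illustration.

```python
# Each pair is (input, desired result); the values are made up for this sketch
training_pairs = [(1.0, 3.1), (2.0, 4.9), (3.0, 7.2), (4.0, 8.8), (5.0, 11.0)]


def fit_line(pairs):
    """Deduce f(x) = slope * x + intercept from (input, result) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    variance = sum((x - mean_x) ** 2 for x, _ in pairs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


predict = fit_line(training_pairs)
# Apply the learned function to an input that was not in the training data
print(predict(6.0))
```

The learned function can then be applied to inputs it has never seen, which is the purpose of training.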

TRAIN_TEST_SPLIT
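
A minimal sketch of holding out part of the data for evaluation with scikit-learn's `train_test_split`. The toy arrays, the 30% test size and the random seed are assumptions made for this example.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data: 10 samples with 2 features each, and two balanced classes
X = np.arange(20).reshape(10, 2)
y = np.array([0] * 5 + [1] * 5)

# Hold out 30% of the samples; stratify keeps both classes in each part
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y
)

print(X_train.shape, X_test.shape)
```

The model is fitted on the training part only, so the test part gives an estimate of how it behaves on unseen data.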

K-FOLD CROSS-VALIDATION
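
A minimal sketch of how k-fold cross-validation partitions the data, written in plain Python so the mechanics are visible. The helper `k_fold_indices` is hypothetical; in practice scikit-learn's `KFold` provides this kind of split.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of the k folds."""
    indices = list(range(n_samples))
    # Spread the remainder so fold sizes differ by at most one
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


folds = list(k_fold_indices(n_samples=10, k=3))
for train, validation in folds:
    print("train:", train, "validation:", validation)
```

Every sample is used for validation exactly once, and the k validation scores are usually averaged.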

K NEAREST NEIGHBOR

KNN is a non-parametric, lazy learning algorithm. Non-parametric means it makes no assumption about the underlying data distribution; in other words, the model structure is determined from the dataset. Lazy means it builds no explicit model during training and defers the work to prediction time. This is very helpful in practice, where most real-world datasets do not follow theoretical mathematical assumptions.

https://www.datacamp.com/community/tutorials/decision-tree-classification-python
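
A minimal from-scratch sketch of KNN classification, assuming Euclidean distance and a majority vote. The function `knn_predict` and the toy points are invented for illustration. There is no training step: the lazy learner simply keeps the data and does the work at prediction time.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Label the query by majority vote among its k nearest training points."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Toy 2-D data: class "A" clusters near (1, 1), class "B" near (5, 5)
train_points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (5.0, 5.0), (5.5, 4.5), (6.0, 5.5)]
train_labels = ["A", "A", "A", "B", "B", "B"]

print(knn_predict(train_points, train_labels, (1.2, 1.4)))
print(knn_predict(train_points, train_labels, (5.2, 5.0)))
```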
DECISION TREE
A decision tree is a flowchart-like tree structure in which each internal node represents a feature (or attribute), each branch represents a decision rule, and each leaf node represents an outcome.

https://www.datacamp.com/community/tutorials/decision-tree-classification-python
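
A minimal sketch of that structure using scikit-learn's `DecisionTreeClassifier` on the bundled iris dataset. The dataset choice, `max_depth=2` and the random seed are assumptions made for this example. `export_text` prints the fitted tree as nested rules, which makes the nodes, branches and leaves visible.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()

# A shallow tree keeps the printed flowchart short and readable
tree = DecisionTreeClassifier(max_depth=2, random_state=0)
tree.fit(iris.data, iris.target)

# Internal nodes test a feature, branches are the rules, leaves are classes
rules = export_text(tree, feature_names=list(iris.feature_names))
print(rules)

# Accuracy on the same data the tree was fitted on (not a held-out score)
print(tree.score(iris.data, iris.target))
```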
UNDERFITTING AND OVERFITTING
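
A minimal sketch of both effects, assuming noisy samples of a sine curve fitted with polynomials of increasing degree using NumPy. The data, noise level and degrees are invented for illustration. A model that is too simple tends to have high error everywhere (underfitting), while a very flexible one can drive the training error down without improving on unseen points (overfitting).

```python
import numpy as np

rng = np.random.default_rng(0)

# Noisy training samples of a sine curve, plus noise-free points for testing
x_train = np.linspace(0.0, 1.0, 12)
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0.0, 0.2, x_train.size)
x_test = np.linspace(0.02, 0.98, 50)
y_test = np.sin(2 * np.pi * x_test)


def mse(coeffs, x, y):
    """Mean squared error of the polynomial given by coeffs on (x, y)."""
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))


results = {}
for degree in (1, 3, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    results[degree] = (mse(coeffs, x_train, y_train), mse(coeffs, x_test, y_test))
    print(degree, results[degree])
```

Comparing the training and test errors for each degree is the usual way to tell which regime a model is in.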

LET’S CODE IT
INTRODUCTION TO PYTHON

QUESTIONS

BIG DATA
SPARK

When we talk about Big Data, we refer to data sets, or combinations of data sets, whose size (volume), complexity (variability) and growth rate (velocity) make them difficult to capture, manage, process or analyze using conventional technologies and tools.

RDD
The central abstraction of Apache Spark, and its starting point, is the Resilient Distributed Dataset (RDD).

LET’S CODE IT
INTRODUCTION TO PYTHON

QUESTIONS

IMAGE PROCESSING
SKIMAGE

Digital image processing is the set of techniques applied to digital images with the aim of improving their quality or facilitating the search for information.
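
A minimal scikit-image sketch of both goals, assuming a synthetic noisy image built with NumPy instead of a real photograph. Gaussian smoothing stands in for quality improvement, while Otsu thresholding and the Sobel filter stand in for extracting information (the object's region and its edges).

```python
import numpy as np
from skimage import filters

rng = np.random.default_rng(0)

# Synthetic grayscale image: a bright square on a dark background, plus noise
image = np.zeros((64, 64), dtype=float)
image[16:48, 16:48] = 1.0
noisy = np.clip(image + rng.normal(0.0, 0.1, image.shape), 0.0, 1.0)

# Improve quality: Gaussian smoothing reduces the pixel noise
smoothed = filters.gaussian(noisy, sigma=2)

# Extract information: separate the object from the background, find edges
threshold = filters.threshold_otsu(smoothed)
mask = smoothed > threshold
edges = filters.sobel(smoothed)

print(threshold, int(mask.sum()), float(edges.max()))
```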

LET’S CODE IT
INTRODUCTION TO PYTHON

WHAT’S NEXT
LOOKING AHEAD

THANK YOU

Andrés Darío Suárez +57 3173548881 andres.dario1906@gmail.com

UNIVERSIDAD DE CÓRDOBA
