Professional Documents
Culture Documents
CHI For Student About MLops
CHI For Student About MLops
2
Kirill Krinkin, Yulia Shichkina Cognitive Architecture for Co-Evolutionary Hybrid Intelligence – presented at AGI-2022
Пример CHI
Example
Pipeline concept
A pipeline is a description of
a machine learning (ML)
workflow, including all of
the components in the
workflow and how the
components relate to each
other in the form of a graph.
Pipeline concept
A pipeline is a description of a machine learning (ML) Important:
workflow, including all of the components in the workflow and • data can be passed between the components
how the components relate to each other in the form of a graph. • components can execute multiple times in loops
• conditionally after resolving an if/else
Pipeline configuration includes:
• definition of the inputs Small data
• the inputs and outputs of Component 1
each component Output data -> serialized
component Docker
A pipeline component is self-contained set image Component 2
of code that performs one step in the ML
Input data -> deserialized
workflow (pipeline), such as:
• data preprocessing Big data
component Docker
• data transformation Component 1
image
• model training Output data -> file
• …
‘OutputPath’
Run a pipeline: Data file
component Docker
• system launches one or more ‘InputPath’
image
Kubernetes Pods
• Pods start Docker containers Component 2
• containers in turn start programs File -> Input data
Pipeline examples
https://qiita.com/oguogura/items/32fcaaa7ece2ab868e81 https://cloud.google.com/blog/products/ai-machine-learning/getting-started-kubeflow-pipelines
Factors impacting the structure of the Pipeline
same and different input data can be executed in parallel. Code Make a Code
» Components can be executed depending on a certain condition group?
P2(p1,…,pk)
Code Code
Code
Code Code
Example:
p1 = code execution time Conditional execution is
p2 = frequency of code changes possible?
p3 - necessity of execution for different data
Code
Tasks
Lightweight containers
Step 0: Step 1:
1. Install kubeflow 1. Explore built-in optimization techniques
2. Using simple examples to learn combinations 2. Explore existing approaches
of component execution:
1. Independent
2. Sequential Step 2 and following steps:
3. Parallel 1. Consolidating sequential containers
4. Grouped 2. Organizing parallel containers
5. Conditional 3. Organizing group containers
3. Use simple examples to learn how to transfer 4. ……
9
data
Install Kubeflow
Kubeflow – is an open-source platform for machine learning and MLOps on Kubernetes introduced by Google.
Contacts:
shichkina@co-evolution.ai
89819636645 - WhatsApp