Professional Documents
Culture Documents
Tugas Forum 13 - Achmad Aldo Alfalus - 41817120056
Tugas Forum 13 - Achmad Aldo Alfalus - 41817120056
Tugas Forum 13 - Achmad Aldo Alfalus - 41817120056
NIM : 41817120056
There are a few data mining techniques that come close to process mining. Examples are
sequence and episode mining. However, these techniques do not consider end-to-end processes. Through
process mining, it becomes easier to apply data mining techniques to event data. For example, decision
rules can be learned using standard data mining tools after the control-flow backbone (e.g., a Petri net)
has been learned using a process mining tool. RapidProM, available through the RapidMiner Marketplace,
shows that process mining and data mining can be combined in various ways.
• Transportation waste: Each time a product is moved, it encounters the risk of being damaged,
lost, delayed, etc. Transportation does not make any transformation to the product that the
consumer is willing to pay for (except for the final delivery).
• Inventory waste: Inventory may exist in the form of raw materials, work-inprogress, or finished
goods. Inventory that is not being actively processed can be considered as waste because it
consumes capital and space.
• Motion waste: Resources (equipment and people) that are used in the production processes
suffer from “wear and tear”. Unnecessary activities (e.g., transformation and double work) result
in additional degradation of resources and increase the risk of incidents (e.g., accidents).
• Unnecessary waiting: Whenever goods are not in transport or being processed, they are waiting.
In traditional processes, a large part of an individual product’s life is spent waiting to be worked
on. The total flow time of a case is often orders of magnitude larger than the sum of all service
times.
• Over-processing waste: All additional efforts done for a product not directly required by the
customer are considered as waste. This includes using components that are more precise,
complex, of higher quality, and thus more expensive than absolutely required.
• Overproduction waste: Producing more than what is required by the customers at a particular
time is a potential form of waste. Overproduction may lead to excess inventory and the
customer’s preferences may change over time making products outdated or less valuable.
• Defects: Rework, scrap, missing parts, poor work instructions, and correction activities are defects
that can increase the costs of a product drastically
BPR is process-centric, i.e., the focus is on the process just like in process mining. However, BPR
is not data-driven. It promotes “thinking outside the box” rather than analyzing data in great detail.
Process mining helps to identify the problems and assists shareholders in defining improvement actions.
However, process mining cannot come up with completely different ways of working (unless event data
is enriched with domain knowledge).
The term Business Process Intelligence (BPI) is used in different ways. It is often used as a
synonym for process mining (perhaps with less emphasis on process models). See, for example, the annual
International Workshop on Business Process Intelligence that has been running since 2005. Almost all
papers presented at these BPI workshops use or propose process mining techniques. BPI can also be
understood as BI with a focus on analyzing operational processes. Some of the products positioned as BPI
tools do not support discovery, i.e., performance data are mapped onto hand-made models. These tools
assume a stable and known process model. Terms comparable to BPI are used by a range of vendors
The term Workflow Mining (WM) was a precursor for process mining. It dates from a time where
the main aim of process mining was the automatic configuration of a WFM system. Ideally, one would like
to observe an existing process and automatically generate the corresponding executable workflow model.
This view turned out to be too narrow and often unrealistic. Process mining has a much wider applicability,
also in areas unrelated to WFM/BPM systems.
2.5.9 How Process Mining Compares to Big Data
Big Data does not focus on a particular type of analysis and is not limited to process-related data.
However, Big Data infrastructures enable us to collect, store, and process huge event logs. Process mining
tools can exploit such infrastructures to distribute large analysis tasks over multiple computing resources.
The data is used to reason about the process and for decision making within the process. Moreover, the
acronyms express a clear desire to get more insight into the actual processes, to improve them, and to
make sure that they are compliant.
3.3.1 Verification
Another verification task is the comparison of two models. For example, the implementation of a
process needs is compared to the high-level specification of the process. As indicated before, there exist
different equivalence notions (trace equivalence, branching bisimilarity, etc.). Moreover, there are also
various simulation notions demanding that one model can “follow all moves” of the other but not vice
versa. There are various tools to verify process models. A classical example is Woflan that is tailored
towards checking soundness. Also workflow systems such as YAWL provide verification capabilities.
• The lead time (also referred to as flow time) is the total time from the creation of the case to the
completion of the case. In terms of a WF-net, this is the time it takes to go from source place i to
sink place o. One can measure the average lead time over all cases. However, the degree of
variance may also be important, i.e., it makes a difference whether all cases take more or less two
weeks or if some take just a few hours whereas others take more than one month. The service
level is the percentage of cases having a lead time lower than some threshold value, e.g., the
percentage of cases handled within two weeks.
• The service time is the time actually worked on a case. One can measure the service time per
activity, e.g., the average time needed to make a decision is 35 minutes, or for the entire case.
Note that in case of concurrency the overall service time (i.e., summing up the times spent on the
various activities) may be longer than the lead time. However, typically the service time is just a
fraction of the lead time (minutes versus weeks).
• The waiting time is the time a case is waiting for a resource to become available. This time can be
measured per activity or for the case as a whole. An example is the waiting time for a customer
who wants to talk to a sales representative. Another example is the time a patient needs to wait
before getting a knee operation. Again one may be interested in the average or variance of waiting
times. It is also possible to focus on a service level, e.g., the percentage of patients that has a knee
operation within three weeks after the initial diagnosis.
• The synchronization time is the time an activity is not yet fully enabled and waiting for an external
trigger or another parallel branch. Unlike waiting time, the activity is not fully enabled yet, i.e.,
the case is waiting for synchronization rather than a resource. Consider, for example, a case at
marking [c2,c3] in the WF-net shown in Fig. 3.2. Activity e is waiting for check ticket to complete.
The difference between the arrival time of the token in condition c4 and the arrival time of the
token in condition c3 is the synchronization time