Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/354770875

Digital twin data: methods and key technologies

Article · September 2021


DOI: 10.12688/digitaltwin.17467.1

CITATIONS READS
4 1,123

7 authors, including:

Meng Zhang Fei Tao


Beihang University (BUAA) Beihang University (BUAA)
36 PUBLICATIONS   4,171 CITATIONS    255 PUBLICATIONS   18,738 CITATIONS   

SEE PROFILE SEE PROFILE

Ang Liu Lihui Wang


UNSW Sydney KTH Royal Institute of Technology
108 PUBLICATIONS   3,953 CITATIONS    745 PUBLICATIONS   17,468 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

SYMBIO-TIC View project

Next Generation Condition-based Maintenance View project

All content following this page was uploaded by Ang Liu on 21 November 2021.

The user has requested enhancement of the downloaded file.


Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

METHOD ARTICLE

Digital twin data: methods and key technologies [version 1;


peer review: 1 approved, 3 approved with reservations]
Meng Zhang 1, Fei Tao 2, Biqing Huang1, Ang Liu 3, Lihui Wang 4,

Nabil Anwer5, A. Y. C. Nee6


1Department of Automation, Tsinghua University, Beijing, 100084, China
2School of Automation Science and Electrical Engineering, Beihang University, Beijing, 100191, China
3School of Mechanical and Manufacturing Engineering, University of New South Wales, Sydney, NSW, 2052, Australia
4Department of Production Engineering, KTH Royal Institute of Technology, Stockholm, SE-10044, Sweden
5Automated Production Research Laboratory, Paris-Saclay University, ENS Paris-Saclay, LURPA, 91190, Gif-sur-Yvette, France
6Department of Mechanical Engineering, National University of Singapore, Singapore, 117576, Singapore

v1 First published: 22 Sep 2021, 1:2 Open Peer Review


https://doi.org/10.12688/digitaltwin.17467.1
Latest published: 22 Sep 2021, 1:2
https://doi.org/10.12688/digitaltwin.17467.1 Reviewer Status

Invited Reviewers
Abstract
As a promising technology to converge the traditional industry with 1 2 3 4
the digital economy, digital twin (DT) is being investigated by
researchers and practitioners across many different fields. The version 1
importance of data to DT cannot be overstated. Data plays critical 22 Sep 2021 report report report report
roles in constructing virtual models, building cyber-physical
connections, and executing intelligent operations. The unique
1. Huibin Sun, Northwestern Polytechnical
characteristics of DT put forward a set of new requirements on data.
Against this background, this paper discusses the emerging University, Xi'an, China
requirements on DT-related data with respect to data gathering,
2. Pai Zheng , The Hong Kong Polytechnic
mining, fusion, interaction, iterative optimization, universality, and on-
demand usage. A new notion, namely digital twin data (DTD), is University, Hong Kong, Hong Kong
introduced. This paper explores some basic principles and methods
for DTD gathering, storage, interaction, association, fusion, evolution 3. Yicha Zhang, Universite de Technologie de
and servitization, as well as the key enabling technologies. Based on Belfort-Montbeliard, Sevenans, France
the theoretical underpinning provided in this paper, it is expected that
more DT researchers and practitioners can incorporate DTD into their 4. Aydin Nassehi , University of Bristol,
DT development process. Bristol, UK

Keywords Any reports and responses or comments on the


digital twin (DT), digital twin data (DTD), principles, methods, key
article can be found at the end of the article.
technologies

 
Page 1 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Corresponding author: Fei Tao (ftao@buaa.edu.cn)


Author roles: Zhang M: Investigation, Methodology, Visualization, Writing – Original Draft Preparation; Tao F: Conceptualization,
Funding Acquisition, Methodology, Project Administration, Supervision, Writing – Review & Editing; Huang B: Methodology, Supervision,
Writing – Review & Editing; Liu A: Methodology, Writing – Review & Editing; Wang L: Methodology, Visualization, Writing – Review &
Editing; Anwer N: Methodology, Writing – Review & Editing; Nee AYC: Methodology, Writing – Review & Editing
Competing interests: No competing interests were disclosed.
Grant information: This work is supported by the National Key Research and Development Program of China (2020YFB1708400) and
Beijing Municipal Natural Science Foundation (JQ19011). The grants were assigned to Prof. Fei Tao.
Copyright: © 2021 Zhang M et al. This is an open access article distributed under the terms of the Creative Commons Attribution License
, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
How to cite this article: Zhang M, Tao F, Huang B et al. Digital twin data: methods and key technologies [version 1; peer review: 1
approved, 3 approved with reservations] Digital Twin 2021, 1:2 https://doi.org/10.12688/digitaltwin.17467.1
First published: 22 Sep 2021, 1:2 https://doi.org/10.12688/digitaltwin.17467.1

 
Page 2 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

From data to digital twin data uncertainties25. A successful DT solution should be powered
With the rapid development of new information technologies by comprehensive data obtained from both the physical and
(e.g., Internet of Things, cloud computing, big data, and artifi- virtual worlds.
cial intelligence), the digital economy continues to flourish on a
global scale. As an innovative way to converge the traditional (2) Requirements on knowledge mining
industry and digital economy, digital twin (DT) is attracting To build insightful virtual models that can reflect a physical
continued attention from different fields1, such as aerospace2,3, entity’s internal mechanisms and rules, it is necessary to
automotive4,5, shipping6, smart grid7,8, and smart city9. Espe- derive knowledge behind raw data through data mining. The
cially in the field of smart manufacturing, DT has been in-depth mining of massive data (e.g., physical entity data,
widely applied in control and management for shop-floor10,11, virtual model data, and information system data) towards new
rapid configuration of production line12, product lifecycle knowledge remains a challenge. On the one hand, not every
management13, intelligent logistics14, dynamic scheduling15,16, robot data is equally useful for extracting information and knowl-
operation optimization17, product quality assurance18,19, machine edge, in particular, concerning the irrelevant data, abnormal data,
tool maintenance20, and human-robot interaction21,22. Practi- and redundant data. On the other hand, it is difficult to fully
cal applications of digital twin involve a set of key enabling explore various relations hidden behind the data (e.g., causal,
technologies23, concerning the construction and verifica- analogous, and complementary relations) to further support
tion of virtual models, the construction and management in-depth knowledge mining.
of intelligent services, the real-time sensing and control of
physical entities, cyber-physical interaction and integration, (3) Requirements on seamless data fusion
multi-modal data association and fusion, and so forth. Since Since DT-related data comes from multiple sources (e.g., physi-
data is indispensable in empowering all these technologies, it cal entity, virtual model, and service), there exists data noise,
can be argued that the success of DT lies in the availability of inconsistency, and conflict. For the data collected from a
high-quality data source. physical entity, various factors such as sensor malfunction,
environmental fluctuation, and human interference would affect
Early-stage data acquisition is typically conducted by manual information entropy (an indicator of the degree of informa-
means, which suffers from low efficiency and high cost. Since tion uncertainty). For the data simulated by virtual models,
the collected data is less in quantity and poor in real-time per- deviations from the physical reality due to unsatisfac-
formance, they could only reflect features of a physical entity tory model effectiveness would reduce the data reliability.
(e.g., machine tool and process) for a limited period of time In addition, neither the collected data nor the simulated data
with inevitable delays. Benefiting from the emergence of new is sufficient to derive global perspectives. Data fusion is there-
information technologies in recent years, a huge volume of data fore required, through which, data obtained from diversified
can be collected by sensors, IoT devices, mobile devices, and sources is integrated synthetically. By doing so, data can be
wearable devices in real time and processed through an inte- verified, corrected, and supplemented by each other, hence
grated computing infrastructure (e.g., cloud computing, edge leading to more accurate and consistent information extraction.
computing, and fog computing). As a result, it is possible to
obtain a complete data record and conduct timely analysis (4) Requirements on real-time data interaction
throughout the whole lifecycle of a physical entity. On the Real-time data interaction is required to enable coordinated
other hand, the growth of DT-related research and application operations. Firstly, real-time data from the physical entity can
puts forward new requirements on data. be used to dynamically update parameters of the virtual mod-
els, whereas simulation data from the virtual models is fed back
(1) Requirements on comprehensive data gathering to the physical entity in order to align its behaviors with a simu-
Comprehensive data is required to enhance the accuracy, effi- lation plan. Secondly, data from DT-based services should be
ciency, and adaptability of DT-based services (e.g., performance communicated with the physical entity for timely diagnosis,
prediction, process optimization, and quality assurance). maintenance, and control, whereas real-time data from the
Comprehensive data refers to a wide spectrum of data includ- physical entity can be used to update the services and make
ing, for example, data on both normal state and abnormal them more adaptable to the changing physical realities. Thirdly,
state, data on both common event and rare event, data on both since service availability should be verified before execution,
certain scenario and uncertain scenario, and so forth. DT appli- real-time simulation results by the virtual models should be
cations developed upon incomprehensive data suffer from communicated with service providers to illustrate deficiencies,
various challenges. On the one hand, some existing work whereas service data can be used to calibrate the virtual models
focuses mainly on data gathered from the physical world, for and improve their accuracy.
which, it is difficult to include low-probability data (e.g., fail-
ure data and extreme environment data) and data that could not (5) Requirements on iterative optimization
be measured directly (e.g., deformation data, stress distribu- In the paper, iterative optimization refers to a cyclic process of
tion, and airflow distribution)24. On the other hand, some other “data increase - data fusion - information increase”, through
work focuses primarily on data gathered from virtual models, which, new data is fused with historical data to generate new
for which, it is difficult to accurately simulate disturbance data information continuously. Iterative optimization plays a critical
caused by abrupt disturbances and time-varying data with high role in improving the virtual models and services. In practice,

Page 3 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

iterative optimization faces two challenges. Firstly, for data domain knowledge. Figure 1 illustrates how these parts are
fusion, the lack of autonomy and continuity makes it difficult interrelated towards a whole DTD.
to iterate effectively. Secondly, even if an iterative process
can be sustained, the continued information growth cannot Physical entity refers to an object existing in the physical world
always be guaranteed, because data fusion from time to time can with specific functions, behaviors, and structures26. Physical
result in information loss. entity-related data can be used to describe the static property
(e.g., size, position, attribute, and capability) and reflect the
(6) Requirements on data universality dynamic state (e.g., health status, abrupt disturbance, and
Low universality of data is a main obstacle for DT applications. working conditions) concerning a physical entity. Physical
It is difficult to transfer DT across different application scenar- entity-related data is the basis of constructing DTD.
ios, which enclose different requirements and constraints on data
acquisition and accumulation. The difficulty of data exchange Virtual models reproduce and describe a physical entity in the
and parsing is affected by various facets of a certain applica- digital space with respect to different aspects, such as geomet-
tion scenario, such as different physical entities (e.g., robot, ric property, physical parameter, dynamic behavior, operation
machine tool, and autonomous vehicle), data types, structures, and maintenance rules, and so forth26. Virtual model-related data
interfaces and communication protocols, etc. Against different include model parameters and simulation data (e.g., simulation
application scenarios (e.g., design, production, and main- conditions, simulation process data, and simulation results).
tenance), it is difficult to achieve smooth data integration Virtual model-related data is naturally coupled with the physi-
and sharing due to various data formats, classifications, and cal entity-related data, as virtual models are essentially built
encapsulations. To cope with these difficulties, it is necessary upon a physical entity.
to unify data transformation towards high universality of data.
DT services can be classified into application services and func-
(7) Requirements on on-demand data usage tional services27. Application services are provided, based
Since DT users play different roles and undertake different on the physical entity-related data and virtual model-related
responsibilities, they tend to demand different types of data. data, to directly solve problems in a certain application sce-
For examples, field operators demand on-site operational nario, such as equipment prognosis, resources scheduling,
data (e.g., assembly sequence, maintenance steps, and control and product quality assurance. Therefore, the service-related
order); technicians demand process data (e.g., real-time data mainly includes performance data, scheduling data, and
condition monitoring data, equipment performance data, and quality data. On the other hand, functional services are provided
diagnosis data); senior managers demand market data (e.g., mate- to support normal operations of DT by realizing relevant
rial cost, market dynamics, and product benchmarking)26. It is functions such as model management, data processing, data
difficult to implement generic data operations (e.g., searching, connection, etc. Related data therefore includes model
matching, combination, invocation and visualization) in con- configuration data, algorithm configuration data and service
sideration of significant user discrepancies in terms of demand, encapsulation data, etc.
professional skill, knowledge level, etc. To address this issue,
it is necessary to encapsulate data provision as on-demand Domain knowledge includes common knowledge such as
services. expert experience, predefined rules, and industry standards26.
Besides, domain knowledge also involves new knowledge
Considering the above-mentioned requirements, the authors being generated by data mining. Domain knowledge can be
proposed a new notion, namely digital twin data (DTD), which used to guide model construction, service optimization, data
constitutes an important part of the five-dimension DT, in the processing, etc.
previous work26. Based on this, this paper aims to provide the
theoretical underpinning of DTD with respect to its structure Fusion data can be considered as a result of merging the physi-
and processing. The remainder of this paper is organized cal entity-related data, virtual model-related data, service-related
as follows. Section 2, ‘Composition of digital twin data’, data and knowledge. Through data fusion, multi-source data
introduces the composition of DTD, followed by a set of with underlying relations are made complementary with
DTD principles in Section 3, ‘Principles for digital twin data’. each other, towards forming a unified description26. Compared with
Section 4, ‘Methodology for digital twin data’, presents a single-source data, fusion data can combine data from multiple
structured methodology for DTD gathering, storage, interaction, perspectives, thus, including more abundant information.
association, fusion, evolution and servitization. Section 5, ‘Key
enabling technologies for digital twin data’ elaborates a set of Connection data refers to data derived from and transferred
key technologies concerning seven aspects. The final section among the physical entity-related data, virtual model-related
concludes this work and outlines future work. data, service-related data, fusion data and domain knowledge26.
Different from raw data of the five parts, connection data is
Composition of digital twin data typically compressed through data processing algorithms to
According to previous studies26, DTD can be classified into six streamline data traffic and reduce communication cost. Connection
parts, i.e., physical entity-related data, virtual model-related data makes it possible to compare data from different parts
data, service-related data, fusion data, connection data, and of DTD towards higher consistency.

Page 4 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Figure 1. Composition of digital twin data (DTD).

Principles for digital twin data data universality, thus to support data exchange between
In this section, a set of basic principles of gathering and process- different application objects, data migration and reuse under
ing DTD are proposed by the authors. They are developed different application conditions, as well as data sharing and
in order to fulfil the requirements raised in Section 1 (‘From integration against different application scenarios.
data to digital twin data’), i.e., comprehensive data gathering,
knowledge mining, seamless data fusion, real-time data interac- (3) Timeliness principle
tion, iterative optimization, data universality, and on-demand The timeliness principle emphasizes that the connection data
data usage, as shown in Figure 2. between any two parts of DTD (i.e., physical entity-related
data, virtual model-related data, service-related data, fusion
(1) Complementary principle data and domain knowledge) should be transferred and adjusted
The complementary principle of DTD corresponds to the in a timely manner, as a way to enable real-time data inter-
requirement of comprehensive data gathering. It emphasizes action. Towards this goal, the connection data should be
the simultaneous gathering of data from both the physical and efficiently compressed in the interest of data transmission.
virtual worlds, which supplement each other and make up Besides, when the connection data between two parts is discov-
respective deficiencies. On one hand, the physical entity-related ered to be inconsistent (i.e., the discrepancy exceeds a predefined
data can truly reflect the dynamic changes of physical real- value), timely adjustment is triggered to diagnose inconsistency
ity, including uncertainty, fuzziness, and complexity, which and restore consistency.
are difficult to simulate. On the other hand, the virtual mod-
els can generate a great variety of rare event data, unmeasurable (4) Association principle
data, and multi-physics coupling data through simulation at low The association principle corresponds to the requirement on
cost, which cannot be directly collected from the physical world. knowledge mining. It emphasizes extracting various associa-
tion relations among different parts of DTD, such as the causal,
(2) Standardization principle analogous, and complementary relations among the physical
As much as possible, DTD obtained from various objects, entity-related data, virtual model-related data, service-related
models, conditions, and scenes should be transformed data, and domain knowledge. The extracted associations can
into standardized data with unified structure, format, type, reflect valuable information such as behavioral patterns of the
encapsulation, interface, etc. A template that describes the data physical entity, simulation mechanisms of the virtual mod-
with respect to different aspects would be built, to help identify els, and behavior-performance mappings between different
their differences with the standard version. The standardization parts of DT. Underlying knowledge can be further induced based
principle is intended for DTD to fulfil the requirement on on such information.

Page 5 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Figure 2. Principles for digital twin data (DTD).

(5) Fusion principle which can be treated as black boxes with respect to the corre-
The fusion principle corresponds to the requirement on sponding input, output, control, and mechanism. When a user
seamless data fusion. It emphasizes realizing the data fusion proposes a demand on data, which is decomposed into
by fully merging data with different relations. For examples, sub-demands, suitable services are searched, matched, and
by fusing analogous data from the physical and virtual worlds, combined to provide the required data. This principle is intended
which could be verified by each other, it can minimize the uncer- to lower the requirements on a user’s professional skills in
tainty, randomness, and fuzziness of the physical entity-related virtue of those on-demand data services.
data or improve the reliability of the virtual model-related
data. Besides, by fusing complementary data from different Methodology for digital twin data
sources, more comprehensive data can be obtained to achieve Based on the above principles, a methodology for DTD gath-
richer information. ering, storage, interaction, association, fusion, evolution, and
servitization is developed by authors, as shown in Figure 3.
(6) Information growth principle According to the complementary principle, with respect to data
The information growth principle corresponds to the require- gathering, data from both the physical and virtual worlds should
ment on iterative optimization. It means that, with the be collected simultaneously to ensure data comprehensive-
continuous accumulation of data, DTD should be optimized ness. According to the standardization principle, with respect to
iteratively through continued data fusion between the new data data storage, data with different formats, structures, encap-
with the historical data. Through iterative fusions, the rules of sulations, and interfaces should be transformed towards
information extraction should be mined to guide the subse- standard shapes. The real-time interaction is realized through
quent data fusions. After each fusion, the increase or decrease in timely transmission and adjustment for the connection data
information quantity can be evaluated to decide whether accept based on the timeliness principle. According to the association
the fusion or not, as a way, to ensure the lasting growth of principle, various relations between different DT compo-
information. nents should be extracted to support further knowledge min-
ing. Based on the fusion principle, fusion data is generated by
(7) Servitization principle merging similar data and complementary data for mutual cor-
The servitization principle means encapsulating DTD-related rection and supplementation. The information growth principle
resources (e.g., data, data models, data processing algorithms, is followed to interpret the progressive data evolution. Based
and data visualization methods) into on-demand services, on the servitization principle, the data-related resources are

Page 6 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Figure 3. Methodology for digital twin data (DTD).

encapsulated into services to enable the on-demand usage of data Digital twin data storage
by different users. The relevant methods are elaborated as The physical entity-related data, virtual model-related data,
follows. and service-related data collected from different applica-
tion objects, conditions, and scenarios should be transformed
Digital twin data gathering into a unified mode and then stored for sharing, reuse, and
DTD gathering refers to the gathering of physical entity- exchange. Towards this goal, data should be represented in a
related data, virtual model-related data, service-related data, standardized fashion first, which involves the description of data
and domain knowledge. These four kinds of data constitute the format, structure, encapsulation, sampling frequency, historical
foundation of DTD, upon which, the fusion data and connec- data accumulation, interface, communication protocol, etc.
tion data can be derived. For the physical entity-related data, Next, necessary constraints are defined in terms of histori-
the dynamic data (e.g., entity state, environment parameters, cal data accumulation, data type, sampling frequency, etc.,
and abrupt disturbances) can be collected in real time by means to filter those unqualified application conditions, where DT
of sensors, embedded systems, radio frequency identification is inapplicable due to the lack of adequate data. If certain
(RFID), and so forth; while the static data (e.g., entity constraints cannot be complied with, it indicates that the cor-
attribute, performance, and functions) can be obtained through responding application conditions are not qualified. For the
off-line measurement and sampling inspection. The virtual qualified ones, data from different scenarios (e.g., design,
model-related data is typically generated by the process of production, and maintenance) with various formats, structures
modeling, simulating, and verifying those geometric, physi- and encapsulations are converted based on a unified tem-
cal, behavioral, and rule models. The data can be gathered plate. Data from different objects (e.g., robot and machine
through manual books, simulation logs, and real-time simu- tool) are transferred by means of unified interface and com-
lation outputs. In particular, since the physical model-based munication protocol. Then the data can be modeled based
simulation (e.g., computational fluid dynamics simulation) tends on a suitable modeling language. There are a wide variety of
to be time-consuming and even inaccurate, a hybrid method modelling languages and methods used in the literature
that combines physical models and data-driven approaches can to handle data and information modeling for products and
be adopted to improve the simulation efficiency and accuracy19. systems, such as unified modeling language (UML)28, systems
Service-related data is generated based on application services modeling language (SysML)29, ontology language30, etc. How-
and functional services. Such data can be obtained throughout ever, each modeling language has its own semantics which
the service construction, operation, and maintenance. Domain limits data and format compatibility and exchanges. New sound
knowledge is typically collected from experts, crowdsourced, mathematical approaches based on Category theory31 could
knowledge bases, and historical data, etc. provide a new way to create a comprehensive foundation for

Page 7 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

modeling, interoperability and integration. Based on this, the multiple sources, which can describe the same attribute or
data can be further stored. behavior from different perspectives.

Digital twin data interaction Digital twin data fusion


Connectivity among different parts of DTD should be estab- Existing work on data fusion focuses primarily on the merg-
lished to support real-time interactions. Firstly, it is necessary ing of data from the physical world (e.g., fusing sensor data
to select the suitable data that carries useful information with manual data), while few efforts have been devoted to
from different parts of DTD to support the message transmis- fusing data from both the physical and virtual worlds35–37. In
sion between any pair of two parts. Taking an equipment for contrast, DTD fusion involves a holistic merging of physical
example, the actual states collected by sensors, which would entity-related data, virtual model-related data, service-related
reflect the real performances of the equipment, and the simulated data and domain knowledge. DTD fusion includes the follow-
states yielded by virtual models, which indicate the expected ing aspects. In the case that the physical entity-related data
states, can be preliminarily selected for message transmission is disturbed by environment perturbations, sensor failures,
between the physical entity-related and virtual model-related and human interferences, methods such as the weighted aver-
data. Secondly, in the interest of data transmission, the data age method38, Dempster–Shafer theory39, and Kalman filter40
is further processed by cleaning algorithms, dimensional- can be used to fuse the physical entity-related data with
ity reduction algorithms, and compaction algorithms, which similar virtual model-related data and service-related data
are intended to remove data noise, duplicate and redundancy. to minimize the information entropy. By doing so, the data
Data is then transferred through communication interfaces of uncertainty, randomness, and fuzziness can be reduced. Like-
sensing devices, software, and database in DT. Thirdly, the wise, in the case that the virtual model-related data and
consistency of connection data between two parts is evalu- service-related data deviate from the reality, methods such as
ated by calculating their Euclidean distance in real time. When the Bayesian method41 and neural network42 can be used to
the distance exceeds a predefined threshold, which indicates merge the data with similar physical entity-related data, as a
inconsistency/contradiction between connected parts, nec- way, to improve accuracy and reliability. For the complementary
essary measures should be taken to restore consistency by multi-modal data from different parts of DTD, methods
correcting parameters of the virtual model, updating service such as neural network42, and weighted average method38
configurations, or adjusting behaviors of the physical entity. can be used to increase information diversity.
In short, DTD interaction plays important roles in driving the
synchronization among the physical entity, virtual model, and Digital twin data evolution
service of DT. DTD evolution is characterized by a dynamic process, through
which, new data is continuously added, processed, and then
Digital twin data association fused with historical data. This process can be described by
Association relations among DTD are mined to support knowl- the data relation network built based on the complex network.
edge discovery. Firstly, data obtained from the physical entity, When new data is incorporated into the network, new asso-
virtual model, and service is preprocessed through data filter- ciation relations are built between new data and historical data
ing, data reduction, and feature extraction in order to remove through preprocessing new data, aligning new data with histori-
irrelevant and useless data. Secondly, temporal and spatial cal data, and then extracting association relations from aligned
alignments are conducted. For example, the least squares method data by using data mining algorithms, following by fusing
can be used to synchronize data in time, and transform data the new data with similar or complementary relations for data
into the same coordinate system in space. Next, the relations correction and supplement. This process can be guided by an
(e.g., causality, similarity, and complementation) among data are automatic fusion mechanism to reduce the dependency on
mined by means of Pearson correlation analysis32, K-means33, manual labor. The notions of information entropy43 and network
Apriori algorithm34, etc. A complex network can be built to fully features44 are employed to evaluate the effectiveness of data evo-
express these relations. In the network, various data variables lution. For example, the quantity of information contained in
are regarded as nodes, whereas data association relations are a network node can be measured by the information entropy.
treated as edges. Based on this, further knowledge can be By comparing the information entropy before and after the
deduced through, for instance, statistical method, clustering fusion, the gain or loss of information can be evaluated. Mean-
method, and classification method, and the deduced knowl- while, the structure change of the network after each fusion can
edge can be expressed in the form of knowledge graph. To sup- be analyzed with respect to degree distribution, edge between-
port the subsequent data fusion, two kinds of data associations ness, clustering coefficient, etc. In light of the evaluation
are especially important, i.e., the similar relation and comple- results over time, certain DTD evolution rules can be mined,
mentary relation. The former refers to the relation between such as the rules for information transfer and the relations
data with similar attributes, values or changing trends, whereas between network structure and information distribution. In turn,
the latter refers to the relation between multi-modal data from such rules can be used to guide the subsequent data fusions with

Page 8 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

respect to the selection of fusion data, methods, and mechanisms, (1)   DTD gathering deals with rare-event data, extreme
to ensure iterative optimization and information growth. environment data, multi-physics coupling data, etc. Since
such data is difficult to be measured directly, data gen-
Digital twin data servitization eration based on simulation plays a crucial role. Multiple
DTD servitization aims to enable users to access data through technologies can be used to support the data generation.
on-demand services, including multiple aspects as follows45. The multidimensional modeling technology can be used to
Firstly, various digital resources are encapsulated into services, build various virtual models that can generate data
which have functions (e.g. data searching, processing, and concerning geometry, physics, behaviors, and rules. For
virtualization), inputs (e.g. data type, volume, and feature), the scenario that the modeling data is insufficient, trans-
outputs (e.g. processed data and visual diagram), quality fer learning technology can be employed to enable the
(e.g. time, cost, and reliability), and states (e.g. working and modeling based on small sample data. The high-efficiency
idle). Given a user demand (e.g., the query of equipment simulation technology (e.g., metamodel49) can be used to
status data, part remaining life data, and maintenance guidance improve the efficiency of data generation, whereas it is
data), relevant services can be prescribed for data extraction, necessary to verify the accuracy of simulation data.
processing, and visualization, etc., respectively. The separate
services can be then combined, in compliance with relevant (2)   DTD storage deals with data with different structures,
constraints (e.g., time and cost), towards an integrated service formats, types, encapsulations, interfaces, etc. Related
solution. Related methods include demand decomposition, technologies include unified data modeling technology
similarity matching, multi-objective optimization, and so that can support data modeling and interoperability;
forth45. With respect to the DTD visualization (i.e., a particular database management technology that can perform basic
form of DTD service), on top of the traditional visual data functions such as adding, deleting, changing, and
diagrams, virtual reality (VR)46 and augmented reality (AR)47 checking; data space expansion technology that can adapt
can be exploited to visualize the mapping relations between the continuously growing DTD; data integration technol-
the physical entity and virtual models. Finally, the integrated ogy that can bridge data silos between different parts of
services provide demanded results to users. DTD; as well as various data security technologies.

Key enabling technologies for digital twin data (3)   With respect to data interaction, data collection technol-
As illustrated in Figure 4, the key enabling technologies for ogy is required to collect connection data from differ-
DTD are explored based on the enabling technologies for DT ent parts of DTD by various means such as sensors and
in general23 as well as the enabling technologies for digital data crawling. Dimensionality reduction technology can
twin shop-floor (DTS) in particular48. Some main technologies be used to remove the redundant data. Data middleware
are explained as follows. technology50 completes the transformations for data

Figure 4. Key enabling technologies for digital twin data (DTD). AR, augmented reality; VR, virtual reality.

Page 9 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

formats, interfaces, and communication protocols, and ensures integrate multiple services in light of conflicting objectives
synchronous data transmission. Besides, data consistency (e.g., time, cost, and reliability). Data visualization based
evaluation technology can be used to support data align- on VR and AR can be used to present the relations among
ment, data distance calculation, and threshold selection for data, virtual models, and physical entities in a more
consistency evaluation. intuitive manner.

(4)   With respect to data association, related technologies include Conclusions and future work
spatial-temporal data alignment, data mining, knowledge Data is a core driver for DT. This paper focuses on the DTD
reasoning, knowledge representation, etc. Spatial-temporal that can be classified into physical entity-related data, vir-
data alignment technology makes DTD synchronous in tual model-related data, service-related data, domain knowl-
time and in the same coordinate system in space. Data edge, fusion data, and connection data. The advent of DT
mining algorithms (e.g., Pearson correlation analysis32, puts forward some new requirements on data in terms of data
K-means33, Apriori algorithm34) can make visible cluster- gathering, mining, fusion, interaction, iterative optimization,
ing groups and association relations among DTD. Knowl- universality, and on-demand usage. Triggered by these require-
edge reasoning technology can be used to extract new ments, seven basic principles are proposed to support the DTD
knowledge from the extracted relations and groups, or from organization and processing. Guided by these principles, related
existing knowledge. Knowledge representation technology methods for DTD gathering, storage, interaction, association,
(e.g., knowledge graph) can visually describe the knowledge, fusion, evolution, and servitization are discussed. Finally, the
knowledge carriers, and knowledge relations. key enabling technologies are discussed.

(5)   With respect to data fusion, related technologies include This paper provides some theoretical underpinnings for
anomaly detection, granularity transformation, heterogene- DTD, which are imperative for the further promotion and
ous data fusion, fault-tolerant technology, etc.51. Anomaly application of DT, to support more DT researchers to incor-
detection technology can be employed to remove abnormal porate the DTD into their DT development process. When
data before fusion. Granularity transformation technology it comes to a certain DT application scenario, the research-
can be used to transform data with different granularities ers are expected to analyze the data requirements according to
(e.g., sparse data and dense data, raw data and extracted fea- Section 1 (‘From data to digital twin data’) first. Then to sat-
tures, abstract symbols and concrete data)51 into the same isfy the requirements, related principles for DTD proposed
level. Multisource heterogeneous data fusion technology can in Section 3 (‘Principles for digital twin data’) should be
converge data with different attributes such as types, dimen- observed. Under the guideline of the principles, researchers can
sions, and sampling frequencies. Fault-tolerant technology employ the corresponding methods and key enabling tech-
can be used to enhance the robustness of data fusion. nologies explored in Section 4 (‘Methodology for digital twin
data’) and Section 5 (‘Key enabling technologies for digital
(6)   Data evolution deals with the iterative process of data twin data’) for DTD gathering and processing. The above proc-
fusion. Complex network modeling technology can be used esses facilitate achieving more comprehensive and consistent
to build the data network and update the network in light data, more abundant and deeper information, as well as
of emerging data. Information measurement technology more convenient and standard data usage, which could bring
can be used to evaluate information quantity, information countless benefits to DT applications. For example, the
distribution, and information accumulation, based on DTD-based prognosis would have higher accuracy in virtue
which, to evaluate the efficiency of data fusion. Propa- of the comprehensive data support. The DTD-based produc-
gation dynamics modeling technology44 can analyze the tion control can better align the practical process with the
information transfer from individual data to fusion data, simulated plan through the real-time data interaction. The
which dynamically describes the information spread DTD-based product design could have higher efficiency due to
process. In addition, rule extraction technology is used to the efficient knowledge mining of key information for designers.
mine rules for selecting fusion data type, method and mech-
anism from the information growth, change and transfer In the future, this work will continue to be enriched by
processes, which can facilitate the subsequent data fusion. (1) applying the generic principles and methods to guide DTD
gathering and processing in practical applications, (2) refining
(7)   With respect to data servitization, related technologies the principles and methods based on lessons learned from practi-
include resource encapsulation, demand decomposition, cal applications, for instance, further improving related process-
multi-objective combinational optimization, data visuali- ing algorithms to better adapt to DTD, and (3) introducing
zation based on VR and AR, etc. Resource encapsulation big data-related tools and platforms to support large-scale
technology can transform digital resources into services, DT applications, since in this case, the DTD, which is closely
which can be described by tuples. Demand decomposition related to real-time dynamic DT, also has the “4V” character
technology can be used to decompose a complicated user of big data (i.e. volume, variety, value and velocity).
demand into specific sub-demands, which can facili-
tate the search of a specific service. Multi-objective Data availability
combinational optimization technology can be used to No data are associated with this article.

Page 10 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

References

1. Tao F, Qi Q: Make more digital twin. Nature. 2019; 573(7775): 490–491. 24. Wang Y, Tao F, Zhang M, et al.: Digital twin enhanced fault prediction for the
PubMed Abstract | Publisher Full Text autoclave with insufficient data. J Manuf Syst. 2021; 60: 350–359.
2. Liu W, Tao F, Cheng J, et al.: Digital twin satellite: concept, key technologies Publisher Full Text
and applications. Comput Integr Manuf Syst. 2020; 26(3): 565–588. 25. Uhlmann E, Barth E, Seifarth T, et al.: Simulation of metal cutting with
Publisher Full Text cutting fluid using the Finite-Pointset-Method. Procedia CIRP. 2021; 101:
3. Mandolla C, Petruzzell A, Percoco G, et al.: Building a digital twin for additive 98–101.
manufacturing through the exploitation of blockchain: A case analysis of Publisher Full Text
the aircraft industry. Comput Ind. 2019; 109: 134–152. 26. Tao F, Zhang M, Liu Y, et al.: Digital twin driven prognostics and health
Publisher Full Text management for complex equipment. CIRP Ann Manuf Technol. 2018; 67(1):
4. Rajesh PK, Manikandan N, Ramshankar CS, et al.: Digital twin of an 169–172.
automotive brake Pad for predictive maintenance. Procedia Comput Sci. Publisher Full Text
2019; 165: 18–24. 27. Tao F, Liu W, Zhang M: Five-dimension digital twin model and its ten
Publisher Full Text applications. Computer Integrated Manufacturing System. 2019; 25(1): 1–18.
5. Zheng Y, Chen L, Lu X, et al.: Digital twin for geometric feature online Publisher Full Text
inspection system of car body-in-white. Int J Comput Integr Manuf. 2020; 28. Uke S, Thool R: UML Based modeling for data aggregation in secured
34(7–8): 752–763. wireless sensor network. Procedia Comput Sci. 2016; 78: 706–713.
Publisher Full Text Publisher Full Text
6. Coraddu A, Oneto L, Baldi F, et al.: Data-driven ship digital twin for 29. Brahmi R, Hammadi M, Aifaoui N, et al.: Interoperability of CAD models and
estimating the speed loss caused by the marine fouling. Ocean Eng. 2019; SysML specifications for the automated checking of design requirements.
186: 106063. Procedia CIRP. 2021; 100: 259–264.
Publisher Full Text Publisher Full Text
7. Zhou M, Yan J, Feng D: Digital twin framework and its application to power 30. Manaa M, Akaichi J: Ontology-based modeling and querying of trajectory
grid online analysis. CSEE J Power Energy Syst. 2019; 5(3): 391–398. data. Data & Knowledge Engineering. 2017; 111: 58–72.
Publisher Full Text Publisher Full Text
8. Peng Y, Zhao S, Wang H: A digital twin based estimation method for health 31. Legatiuk D: Mathematical modelling by help of category theory: models
indicators of DC-DC Converters. IEEE Trans Power Electron. 2020; 36(2): 2105– and relations between them. Mathematics. 2021; 9(16): 1946.
2118. Publisher Full Text
Publisher Full Text 32. Jebli I, Belouadha F, Kabbaj M, et al.: Prediction of solar energy guided by
9. Deng T, Zhang K, Shen Z: A systematic review of a digital twin city: A new pearson correlation using machine learning. Energy. 2021; 224: 120109.
pattern of urban governance toward smart cities. J Manag Sci Eng. 2021; Publisher Full Text
6(2): 125–134. 33. Wang X, Wang Z, Sheng M, et al.: An adaptive and opposite K-means
Publisher Full Text operation based memetic algorithm for data clustering. Neurocomputing.
10. Tao F, Zhang M: Digital twin shop-floor: a new shop-floor paradigm towards 2021; 437: 131–142.
smart manufacturing. IEEE Access. 2017; 5: 20418–20427. Publisher Full Text
Publisher Full Text 34. Tian M, Zhang L, Guo P, et al.: Data dependence analysis for defects data of
11. Tao F, Zhang M, Nee AYC: Digital twin driven smart manufacturing. Elsevier, relay protection devices based on Apriori algorithm. IEEE Access, 2020, 8:
2019. 120647–120653.
Reference Source Reference Source
12. Leng J, Liu Q, Ye S, et al.: Digital twin-driven rapid reconfiguration of the 35. Segreto T, Caggiano A, Teti R: Neuro-fuzzy system implementation in
automated manufacturing system via an open architecture model. Robot multiple sensor monitoring for Ni-Ti alloy machinability evaluation.
Comput Integr Manuf. 2020; 63: 101895. Procedia CIRP. 2015; 37: 193–198.
Publisher Full Text Publisher Full Text
13. Tao F, Cheng J, Qi Q, et al.: Digital twin-driven product design, 36. Wang J, Xie J, Zhao R, et al.: Multisensory fusion based virtual tool wear
manufacturing and service with big data. Int J Adv Manuf Technol. 2018; sensing for ubiquitous manufacturing. Robot Comput Integr Manuf. 2017; 45:
94(9–12): 3563–3576. 47–58.
Publisher Full Text Publisher Full Text
14. Zhang L, Liu W, Cheng J, et al.: Just-in-time material distribution method for 37. Yang R, Er PV, Wang Z, et al.: An RBF neural network approach towards
satellite assembly digital twin shop-floor. Computer Integrated Manufacturing precision motion system with selective sensor fusion. Neurocomputing.
System. 2020; 26(11): 2897–2914. 2016; 199: 31–39.
15. Zhang M, Tao F, Nee AYC: Digital twin enhanced dynamic job-shop Publisher Full Text
scheduling. J Manuf Syst. 2021; 58(Part B): 146–156. 38. Wang S, Guo X, Tie Y, et al.: Weighted hybrid fusion with rank consistency.
Publisher Full Text Pattern Recognit Lett. 2020; 138: 329–335.
16. Fang Y, Peng C, Lou P, et al.: Digital twin-based job shop scheduling toward Publisher Full Text
smart manufacturing. IEEE Trans Industr Inform. 2019; 15(12): 6425–6435. 39. Mourtzis D, Vlachou E, Doukas M, et al.: Cloud-based adaptive shop-floor
Publisher Full Text scheduling considering machine tool availability. ASME 2015 International
17. Xu W, Cui J, Li L, et al.: Digital twin-based industrial cloud robotics: Mechanical Engineering Congress and Exposition, November 13–19, 2015,
framework, control approach and implementation. J Manuf Syst. 2021; Houston, Texas, USA 2015, 57588: V015T19A017.
58(Part B):196–209. Publisher Full Text
Publisher Full Text 40. Zheng Z, Qiu H, Wang Z, et al.: Data fusion based multi-rate Kalman filtering
18. Söderberg R, Wärmefjord K, Carlson JS, et al.: Toward a digital twin for real- with unknown input for on-line estimation of dynamic displacements.
time geometry assurance in individualized production. CIRP Ann Manuf Measurement. 2019; 131: 211–218.
Technol. 2017; 66(1): 137–140. Reference Source
Publisher Full Text 41. Taylor C, Bishop A: Homogeneous functionals and Bayesian data fusion
19. Zhang M, Tao F, Huang B, et al.: A physical model and data-driven hybrid with unknown correlation. Information Fusion. 2019; 45: 179–189.
prediction method towards quality assurance for composite components. Publisher Full Text
CIRP Ann Manuf Technol. 2021; 70(1): 115–118. 42. Guan D, Cao Y, Yang J, et al.: Fusion of multispectral data through
Publisher Full Text illumination-aware deep neural networks for pedestrian detection.
20. Luo W, Hu T, Ye Y, et al.: A hybrid predictive maintenance approach for CNC Information Fusion. 2019; 50: 148–157.
machine tool driven by digital twin. Robot Comput Integr Manuf. 2020; 65: Publisher Full Text
101974. 43. Cover T, Thomas J: Elements of information theory. Wiley, 2006.
Publisher Full Text Reference Source
21. Bilberg A, Malik A: Digital twin driven human-robot collaborative assembly. 44. Bu Z, Cao J: Complex network and big data analysis. Tsinghua University
CIRP Ann Manuf Technol. 2019; 68(1): 499–502. Press, 2019.
Publisher Full Text 45. Tao F, Cheng J, Cheng Y, et al.: SDMSim: A manufacturing service supply-
22. Malik AA, Brem A: Digital twins for collaborative robots: A case study in demand matching simulator under cloud environment. Robot Comput Integr
human-robot interaction. Robot Comput Integr Manuf. 2021; 68: 102092. Manuf. 2017; 45: 34–46.
Publisher Full Text Publisher Full Text
23. Qi Q, Tao F, Hu T, et al.: Enabling technologies and tools for digital twin. 46. Nee AYC, Ong SK: Virtual and Augmented Reality Applications in
J Manuf Syst. 2019; 58(Part B): 3–21. Manufacturing. IFAC Proceedings Volumes. 2013; 46(9): 15–26.
Publisher Full Text Publisher Full Text

Page 11 of 18
Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

47. Ong SK, Yew AWW, Thanigaivel NK, et al.: Augmented reality-assisted robot designs in simulation: A review. Eur J Oper Res. 2017; 256(1): 1–16.
programming system for industrial applications. Robot Comput Integr Manuf. Publisher Full Text
2020; 61: 101820. 50. Tao F, Cheng J, Qi Q: IIHub: An industrial Internet-of-Things hub toward
Publisher Full Text smart manufacturing based on cyber-physical system. IEEE Trans Industr
48. Tao F, Zhang M, Cheng J, et al.: Digital twin workshop: a new paradigm for Inform. 2017; 14(5): 2271–2280.
future workshop. Computer Integrated Manufacturing System, 2017, 23(1): 1–9. Publisher Full Text
Publisher Full Text 51. Han C, Zhu H, Duan Z, et al.: Multi-source information fusion. Tsinghua
49. Kleijnen J: Regression and Kriging metamodels with their experimental University Press, 2010.

Page 12 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Open Peer Review


Current Peer Review Status:

Version 1

Reviewer Report 22 October 2021

https://doi.org/10.21956/digitaltwin.18742.r26810

© 2021 Nassehi A. This is an open access peer review report distributed under the terms of the Creative
Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium,
provided the original work is properly cited.

Aydin Nassehi
Department of Mechanical Engineering, University of Bristol, Bristol, UK

The paper presents an excellent overview of challenges and potential solutions to the issues
related to digital twin data. The authors should consider including references to the ISO23247
standard which would complement the presented work very nicely. In particular, Part 3 of the
document provides a standardised framework for digital representation of the elements in a
digital twin and Part 4 provides the framework to underpin information exchange in Digital Twins.

There are a few other well-cited references which contain digital twin data issues that should be
used in the paper to position the work better. These include:
○ Wagner et al. (2017)1 - Provides an overview of the Asset Administration Shell as an
interoperability wrapper that enables data transfer between the physical and digital
counterparts in a manner that virtualises the asset and allows various twins to
communicate in a similar manner.
 
○ Jones et al. (2020)2 - Provides a contextual framework for Virtual-Physical and Physical-
Virtual twinning to underpin the various purposes of data transfer and storage in Digital
Twins (an article of mine).
 
○ Boschert et al. (2016)3 - Provides a view of the Digital Twin and its data across the lifecycle of
a product and its production system that provides scope for the topic of discussion in the
presented paper.

References
1. Wagner C, Grothoff J, Epple U, Drath R, et al.: The role of the Industry 4.0 asset administration
shell and the digital twin during the life cycle of a plant. 2017 22nd IEEE International Conference on
Emerging Technologies and Factory Automation (ETFA). 2017. Publisher Full Text
2. Jones D, Snider C, Nassehi A, Yon J, et al.: Characterising the Digital Twin: A systematic literature
review. CIRP Journal of Manufacturing Science and Technology. 2020; 29: 36-52 Publisher Full Text
3. Boschert S, Rosen R: Digital Twin—The Simulation Aspect. 2016. 59-74 Publisher Full Text

 
Page 13 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Is the rationale for developing the new method (or application) clearly explained?
Yes

Is the description of the method technically sound?


Yes

Are sufficient details provided to allow replication of the method development and its use
by others?
No

If any results are presented, are all the source data underlying the results available to
ensure full reproducibility?
Yes

Are the conclusions about the method and its performance adequately supported by the
findings presented in the article?
Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: AI in Manufacturing, Agent Based Modelling of Distributed System

I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard, however I have
significant reservations, as outlined above.

Reviewer Report 18 October 2021

https://doi.org/10.21956/digitaltwin.18742.r26809

© 2021 Zhang Y. This is an open access peer review report distributed under the terms of the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the
original work is properly cited.

Yicha Zhang
Mechanical Engineering and Design, Génie Mécanique et Conception, Campus de Sevenans,
Universite de Technologie de Belfort-Montbeliard, Sevenans, France

This paper tries to propose a new concept, DTD, to describe the related requirements, principles
and enabling techs. It is interesting in the attempt to draw a new DT data processing framework
or guideline. However, from reviewer's perspective on operation in practice, there are still some
places that are unclear and hard to follow.

1. There is no clear definition about DTD, but more about data classification in the context of DT. It
is possible to give a new definition to show the difference of data in the context of DT? The

 
Page 14 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

definition difference would focus on more about contents or format or application?

2. Is there an exact logic for the listing of DTD requirements? Mining is before or after fusion, or
both directions are possible?

3. It is hard to understand 'data interaction'. It is more similar to service as we often encountered


in IOT or CPS systems.

4. What exactly is 'iterative optimization'? As you mentioned in the paper: iterative optimization
refers to a cyclic process of “data increase - data fusion - information increase”, through which,
new data is fused with historical data to generate new information continuously. This means to
generate new data sets. It is more about data evolution? However, as you also mentioned in the
later section of the paper, 'optimization of DTD'. So, how to optimize DTD? What are the objective,
process and constraints?

5. The requirement of data universality seems an ideal solution in every domain. However, DT is
usually customized system. Hence, is there any conflict between this requirement with the
application? What does it mean by universality, contents or format?

6. What is fusion data, data of data (well structured and associated) or 'green data' (intermediate
data)?

7. Are the components of DTD organized in a hierarchical way or not?

Is the rationale for developing the new method (or application) clearly explained?
Yes

Is the description of the method technically sound?


Partly

Are sufficient details provided to allow replication of the method development and its use
by others?
Partly

If any results are presented, are all the source data underlying the results available to
ensure full reproducibility?
No source data required

Are the conclusions about the method and its performance adequately supported by the
findings presented in the article?
Partly

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Digital design, planning & manufacturing, additive manufacturing, product-
service-system engineering

 
Page 15 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard, however I have
significant reservations, as outlined above.

Reviewer Report 11 October 2021

https://doi.org/10.21956/digitaltwin.18742.r26808

© 2021 Zheng P. This is an open access peer review report distributed under the terms of the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the
original work is properly cited.

Pai Zheng
Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University, Hong
Kong, Hong Kong

In this paper, the authors provide a future prospect and couple of requirements of data
applications in the digital twin domain from structure, principle, and methodology views. With the
illustration, the potential coming research directions are summarized and pointed out. This topic
and corresponding learning areas are promising and worth investigating deeper.  

Here are some minor issues that may be considered for improving this paper:
1. In the third requirement of digital twin data, could you explain more on the seamless mean?
Such requirement is too blurry and what kind of performance do you want to achieve?
 
2. In the composition of digital twin data section, the explanation of the connection data is
provided theoretically. However, it lacks the intuitive example demonstration, therefore,
could more cases or concrete technologies be provided to help better understanding?
 
3. In the principle part, could more explanation of figure 2 be provided? What’s the serial
numbers and arrows in the figure mean? What do the different levels stand for in the
figure?
 
4. In figure 3, the quality of figures and expression way may be refined.
 
5. In key enabling technologies, some technologies concepts are adopted to enhance the
development of DTD. However, could the authors provide more technical/algorithm details
to achieve the corresponding goal/proposed performances?  

Is the rationale for developing the new method (or application) clearly explained?
Yes

Is the description of the method technically sound?


Yes

Are sufficient details provided to allow replication of the method development and its use

 
Page 16 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

by others?
Yes

If any results are presented, are all the source data underlying the results available to
ensure full reproducibility?
No source data required

Are the conclusions about the method and its performance adequately supported by the
findings presented in the article?
Partly

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Smart manufacturing system, product-service systems, engineering design

I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard, however I have
significant reservations, as outlined above.

Reviewer Report 07 October 2021

https://doi.org/10.21956/digitaltwin.18742.r26807

© 2021 Sun H. This is an open access peer review report distributed under the terms of the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the
original work is properly cited.

Huibin Sun
School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an, China

Data plays critical roles in constructing virtual models, building cyber-physical connections, and
executing intelligent operations in digital twins. This paper explores some basic principles and
methods for digital twin data gathering, storage, interaction, association, fusion, evolution and
servitization, as well as the key enabling technologies. The theoretical underpinnings for DTD
proposed in this paper, are imperative for the further promotion and application of DT.

1. The barriers or challenges to digital twin data could be addressed.

2. If possible, an example or a scenario could be used to enhance the paper’s contributions.

Is the rationale for developing the new method (or application) clearly explained?
Yes

Is the description of the method technically sound?


Yes

 
Page 17 of 18
Digital Twin Digital Twin 2021, 1:2 Last updated: 22 OCT 2021

Are sufficient details provided to allow replication of the method development and its use
by others?
Yes

If any results are presented, are all the source data underlying the results available to
ensure full reproducibility?
No source data required

Are the conclusions about the method and its performance adequately supported by the
findings presented in the article?
Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Digital twin

I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard.

 
Page 18 of 18

View publication stats

You might also like