Healthcare Data Warehousing


RESEARCH PAPERS

HEALTHCARE DATA WAREHOUSING

By
ARUNACHALAM S. * TOM PAGE ** THORSTEINSSON G. ***
* Senior Lecturer, Department of Mechanical Engineering, University of East London, London.
** Senior Lecturer, Electronic Product Design, Loughborough University, UK.
*** Professor, Department of Design and Craft, University of Iceland, Iceland.

Date Received: 08/02/2017 Date Revised: 09/03/2017 Date Accepted: 08/05/2017

ABSTRACT
The term data warehouse was first used by Inmon in 1990, who defined it as “A warehouse is a subject oriented,
integrated, time-variant and nonvolatile collection of data in support of management decision making process”
[Inmon, 1990]. Ralph Kimball, on the other hand, provided a much simpler definition of the data warehouse. In the book
‘The Data Warehouse Toolkit’ he said that a data warehouse is “a copy of transaction data specifically structured for query
and analysis” [Inmon and Kimball, 1996]. This paper provides a theoretical description of data warehousing.
Implementations of clinical data warehousing are analyzed and then evaluated. The prototype considered for
implementation has been collected from the East Hospital London Clinical Data Warehouse Project.
Keywords: Healthcare, Data Warehousing, Clinical Data Warehouse.

i-manager’s Journal on Computer Science, Vol. 4, No. 4, December 2016 - February 2017

INTRODUCTION
A data warehouse can be normalized or denormalised, and it can be a relational database, multidimensional database, flat file, hierarchical database, object database, etc. With the widespread availability of ever-changing computer technologies, telecommunication technologies and internet access, most businesses handle huge amounts of data. Business organizations collect and examine data and then decide their next move. The information they collect from different sources helps the decision makers to take the right decision at the right time. A data warehouse, which acts as a decision support system, extracts, integrates, and stores relevant information to support the needs of information workers and decision makers. Finally, “Data warehouse is a process, not a product, for assembling and managing data from various sources for the purpose of gaining a single, detailed view of part or all of a business” [Stephen, 1998].
Today data warehousing is part and parcel of the whole decision support solution for a business organization. Because of the huge potential of the data warehouse, most corporations have discovered that a data warehouse is a must-have strategic weapon in today's business world. Most big corporations have now built or are building data warehouses to support business analysis activities and decision making tasks.

The Clinical Data Warehouse
The Clinical Data Warehouse (CDW) is a place where healthcare providers can gain access to clinical data gathered in the patient care process. It is also anticipated that such a data warehouse may provide information to users in areas ranging from research to management [Sen, and Jacob, 1998]. A clinical data warehouse is a data warehouse tailored to the needs of users in a clinical environment. It combines information from a variety of legacy healthcare databases and extracts operational data to form a centralized repository that answers the informational needs of all clinical users. The data in a clinical warehouse are used not only by patients, nurses and doctors, but also by researchers, scientists, and medical students. The clinical data warehouse is all the above for clinical administrators: “for example, a clinical responsible for utilization management may retrieve data on all patients with a specific DRG/CMG and determine how many lab tests of
which type were ordered, to see if services appear to be under or over utilized. An administrator may want to look at the average length of stay over all patients or selected groupings. A department manager may want to review the number of paid staff hours that is required for each type of test performed for those tests that actual turnaround times within a certain range during the summer versus the winter” [Covvy, 2000].
The clinical data warehouse can provide significant improvements in patient care, hospital management, organisational effectiveness, etc. A clinical data warehouse can facilitate efficient storage, enable timely analysis and increase the quality of real-time decision making. The clinical data warehouse is a place where health care providers can gain access to clinical data as a part of the patient care process. It can produce improved patient care service and improved clinical research. It can reduce medical errors and enhance quality measurement. It can also decrease costs and, at times, increase revenue.
Clinical data are much more complex than the business facts held in an average data warehouse. Furthermore, a clinical data warehouse demands even more security measures and less downtime than a conventional data warehouse. But a conventional data warehouse and a clinical data warehouse do not differ so much. After all, they use the same technology, but in the case of a clinical data warehouse everything gets more complex and complicated.

The Project Objectives
The primary objective of this paper is to evaluate a data warehouse in a clinical context. Data warehouses in a clinical context have traditionally been administrative in nature, focusing on patient-billing and patient-care management. The clinical data warehouse combines information from a variety of legacy healthcare databases and extracts operational data to form a centralized repository. The data are then used for making crucial decisions. The data in a clinical warehouse are used not only by patients, nurses and doctors, but also by researchers, scientists, and medical students. To design a clinical data warehouse it is necessary to find out the requirements of the client. Since the clinical data warehouse is used by various medical organizations, it is necessary to choose the correct vendor or methodologies to avoid complications.

Research Methodology
The research for the paper started with a study of literature and journals covering data warehouses, data warehouse architecture, data warehouse methodologies, and the importance of data warehousing in a clinical context. This was followed by reading books in the related field and taking notes on important points for future use in the clinical data warehouse.

The Project Scope
This research describes the data warehouse as well as the clinical data warehouse in detail. It also describes the problems that arise while designing a clinical data warehouse and gives possible solutions. It also contains a case study based on the University of Virginia Clinical Data Warehouse, which helps the reader to understand the clinical data warehouse. Those who want to build a clinical data warehouse can get a picture of the problems and, at the same time, of the possible solutions for those problems. This paper also uses some data warehouse and clinical terminology which is assumed to be understandable to the interested readers of this project.

1. An Overview of Data Warehousing
1.1 The Definition
The term data warehouse was first used by Inmon in 1990, who defined it as “A warehouse is a subject oriented, integrated, time-variant and nonvolatile collection of data in support of management decision making process” [Inmon, 1990]. Inmon (1990) defined the terms in the sentence as follows:
·Subject oriented: The data in the data warehouse are organised by subject. All the relevant data about a subject is collected and stored in a useful format. The data give information about a particular subject instead of about a company's ongoing operations.
·Integrated: Integration is the process of storing data in a format that everybody can understand. Data in a data warehouse are gathered from a variety of sources and merged into a single database, where they are made consistent.
·Time-variant: All data in the data warehouse is identified with a particular time period. The data warehouse contains a history of the subject. As opposed to operational databases, historical data is of high importance in the data warehouse world. The changes to the data in the data warehouse are tracked and recorded, and these changes are shown in reports.
·Non-volatile: Non-volatility means that data in data warehouses are stable. The information is consistent regardless of when the warehouse is accessed. The data is static, read-only, and retained for future reporting.
A single-subject data warehouse is typically referred to as a data mart, whereas data warehouses are generally enterprise in scope. Data warehouses can also be volatile. Due to the large amount of storage required for a data warehouse, only a certain number of periods of history are kept in the data warehouse.
On the other hand, Ralph Kimball provided a much simpler definition of the data warehouse. In his book, ‘The Data Warehouse Toolkit’, he said that a data warehouse is “a copy of transaction data specifically structured for query and analysis” [Kimball and Inmon, 1996].
Both Bill Inmon and Ralph Kimball agree that a data warehouse integrates data from various operational source systems. In Inmon's approach, the data warehouse is physically implemented as a normalized data store. In Kimball's approach, the data warehouse is physically implemented as a dimensional data store.

2. The Clinical Data Warehouse (CDW)
This section provides an overview of the Clinical Data Warehouse. Section 2.1 provides a brief introduction to the Clinical Data Warehouse. Section 2.2 describes the fields of application of the Clinical Data Warehouse. Section 2.3 compares the Clinical Data Warehouse with the conventional data warehouse.

2.1 The Introduction of the Clinical Data Warehouse
Data warehouses in a clinical context have traditionally been administrative in nature, focusing on patient-billing and patient-care management. Patient-billing systems manage financial data and are primarily concerned with providing information for financial optimization, managing costs, and coordinating health insurance providers. They typically contain demographic information about the patient, length of stay in the hospital, costs, etc. Patient-care systems, on the other hand, generally manage information about individual patients such as the patient's location in the hospital, medical tests, prescribed medicines, etc. They are mainly concerned with optimizing administrative efforts within the hospital. These systems were mainly operational databases where large quantities of data were collected. Later, sorting, combining and analysis of the data were also required. As a result, the clinical data warehouse came into the limelight. A clinical data warehouse is a data warehouse tailored to the needs of users in a clinical environment. It combines information from a variety of legacy healthcare databases and extracts operational data to form a centralized repository that answers the informational needs of all clinical users. The data in a clinical warehouse are used not only by patients, nurses and doctors, but also by researchers, scientists, and medical students.

2.2 Fields of Application
The clinical data warehouse has various fields of application. Some of them are as follows:

2.2.1 Clinical Research
·Case Finding: for example, identify patients for clinical trials.
·Exploratory Analysis: for example, “assess preoperative changes in neuroendocrine function in patients undergoing pituitary microsurgery” (Einbinder, et al. 1999).
·Specific Research: for example, trialling a new kind of medicine for a specific disease on different patients.

2.2.2 Patient Service
·Give patients advice about the disease.
·Help patients find the right doctors.
·In the case of emergency, give patients temporary solutions before an ambulance arrives.

2.2.3 Quality Improvement
·Outcome Evaluation and Assessment: for example, “assess mortality of children with severe neurodevelopmental disabilities who receive tube feeding” [Einbinder, et al. 1999].
·Treatment Patterns: for example, evaluate before and after treatment results.
·Medical Management: for example, improve patient services, improve the knowledge of the doctors and nurses, medical staff activities, etc.

2.3 A Comparison between Data Warehouse and Clinical Data Warehouse
As mentioned earlier, the data warehouse is a data structure that is optimized for distribution, mass storage, and complex query processing. Data warehousing methodologies share a common set of tasks, including business requirements analysis, data design, implementation, and deployment. On the other hand, the clinical data warehouse is a place where healthcare providers can gain access to clinical data gathered in the patient care process. It is also anticipated that such a data warehouse may provide information to users in areas ranging from research to management (Sen and Jacob, 1998). The clinical data warehouse is all the above for clinical administrators: “for example, a clinical responsible for utilization management may retrieve data on all patients with a specific DRG/CMG and determine how many lab tests of which type were ordered, to see if services appear to be under or over utilized. An administrator may want to look at the average length of stay over all patients or selected groupings. A department manager may want to review the number of paid staff hours that is required for each type of test performed for those tests that actual turnaround times within a certain range during the summer versus the winter” [Covvy, 2000]. At the same time, the clinical data warehouse is a tool for care providers and researchers. Care providers check the data of an individual patient and compare it to previous data. They also check the medical records, diagnosis, medications, etc. Researchers, on the other hand, are interested in many patients, just like conventional data warehouse users. Their interest lies in the retrieval of complex clinical data of patients that is organized in more complex networks. In these cases, exploratory knowledge discovery and advanced data mining techniques are used.
Clinical data are much more complex than the business facts held in an average data warehouse. Sometimes physicians or researchers have to dig deep to get a patient's original healthcare record. For this, fast computing power and massive storage are required. Furthermore, a clinical data warehouse demands even more security measures and less downtime, but a conventional data warehouse and a clinical data warehouse do not differ so much. After all, they use the same technology, but in the case of a clinical data warehouse everything gets more complex and complicated.

3. The Challenges and Issues of Clinical Data Warehouse
From the previous section, we have learned that clinical data warehouses are complex in nature in comparison to a conventional data warehouse. It is important to build a clinical data warehouse successfully. Clinical data warehouses require extensive analysis of data design, architectural design, implementation, and deployment. When designing a clinical data warehouse, the data integration tasks of the medical data store are challenging. As clinical data warehouse systems are much more complex, they must operate reliably to deliver continuous patient care service. Any fault in any medication or test result could be fatal to a patient. We should keep this in mind to avoid mistakes and oversights. For these reasons and others, it is important to train people who understand healthcare, information technology and health information regulation, and who know how to use a clinical data warehouse, to work in a clinical data warehouse. Only a
successfully implemented clinical data warehouse can provide patient safety, quality and organizational efficiency, but there are many challenges and risks involved in implementing a clinical data warehouse. Among the many challenges and issues, the most prominent are architectural issues, data quality, patient privacy, report consistency, scalability, and user involvement.

3.1 Architectural Issue
While designing a clinical data warehouse, several technical issues are to be considered, but among them the architectural challenge is the most important. There are several data warehouse technologies available in the market, but not every one of them is suitable for designing a clinical data warehouse. Due to the confidentiality of healthcare data and the privacy policy of the patient, it is important to choose the right technology for designing a clinical data warehouse. During the design and planning process, it is important to study the functionality of different departments. For example, patient management scenarios may differ from mental health management scenarios. Inpatient care management analysis is conducted for admissions, discharges and transfers, length of stay, patient's diagnosis, severity of illness, etc., but for mental health management, the scenarios are different. In this context, it is essential to monitor health service activities and investigate patient outcomes. Otherwise, middle or senior management can neither effectively monitor levels of team activities nor determine the predictive factors of the clinical outcomes of mental health patients. As a result, reporting on the behaviour of mental health patients is an important element for analysis while designing a clinical data warehouse for mental health care.
Some issues need to be addressed while designing a clinical data warehouse. Among them, data location, technical platforms, and data formats are important. Organisational behaviour in processing the data is also important. If these factors are not dealt with properly, the required levels of quality decision making and analysis cannot be achieved. Data integration is another factor to be dealt with while designing a clinical data warehouse. In a clinical data warehouse scenario, data is collected from various departments, laboratories, and related administrative processes. Not every department or laboratory uses the same platform or the same vendor to store data. A clinical data warehouse should be able to collect data from different sources, so it is important that it has the ability to integrate data.

3.2 Data Quality
One of the major issues in the development of the clinical data warehouse is data quality. Decision support is one of the main aspects of the data warehouse, and the clinical data warehouse is no exception. Poor data quality can lead to poor decisions, which could even be fatal for the patient. Poor data quality can destroy the whole clinical data warehouse project. Data quality in a clinical data warehouse is critical because strategic decisions are made on the basis of the data available in it. Improved data quality is one of the main requirements for running a clinical data warehouse project successfully. Improved data quality:
·reduces the risk from bad decisions,
·enables better customer service,
·reduces costs and at the same time adds better value to the services,
·enhances strategic decision making, and
·improves productivity.
It is important that data in a clinical data warehouse is of good quality. Bad data can destroy the whole clinical data warehouse project. Bad data stored in the clinical data warehouse may lead to a bad decision or incorrect medication, which could be fatal for a patient. Bad quality data:
·increases the risk of bad decisions which could be fatal,
·reduces the quality of customer service,
·reduces productivity, and
·creates problems in strategic decision making.
Errors in data are the main reason for bad quality data. There are mainly five kinds of data errors:
·Design errors.
·Collection errors.
·Strategic errors.
·Integration errors.
·Query errors.
Design, collection, integration and query errors are technical in nature. Data in a clinical data warehouse is collected from different sources. Integrating these kinds of data is an important task and also an important problem to tackle, but the main problem of a clinical data warehouse is data collection. “Incorrectly recorded values, mixed records, dropped data, incorrect units, inconsistent standards or untimely collection practices can contaminate the data with scant evidence of error” [Berndt, et al., 2001].

3.3 Patient Privacy
Another major issue for the clinical data warehouse is the protection of patient privacy. As the personal data as well as the medical data of a patient are stored in the clinical data warehouse, any user who has the authority to access the clinical data warehouse can see the personal data of a patient. A clinical data warehouse is used not only by doctors or nurses, but also by researchers or medical students for their research. So the priority of patient privacy should be high while designing a clinical data warehouse. It is important to keep a patient's data private. If a patient's privacy is broken, the patient might lose confidence in coming to the health care centre. The patient might also be put in an embarrassing situation if, for example, a sexually transmitted disease becomes public knowledge. The clinical data warehouse contains sensitive data, and so security measures are essential.

3.4 Report Consistency
Report consistency is another issue for the clinical data warehouse. In a clinical data warehouse, data are collected from different sources. The sources of the data that is collected may be several disparate mainframe environments. Collection may be time-consuming and, at the same time, it increases the chances of inaccuracies and conflicts in the data. Sometimes it leaves little time to analyze the data and produce a correct report. Usually a doctor decides based on the reports obtained from different data analyses. Based on the test report, diagnosis report and medication report, the doctor decides whether the patient needs an operation or further medication. For example, suppose the situation of a patient is critical and doctors need to take a quick decision about the patient. If the collection of data takes a long time, it leaves only a little time to analyze the data. This may delay decisions, which may be fatal for the patient. So it is important that data is collected accurately, which helps make reports consistent.

4. Implementation: Designing a Clinical Data Warehouse (Prototype)
This section presents an implementation of a clinical data warehouse project. Section 4.1 describes the conventional way of designing a data warehouse. Section 4.2 gives the project overview. Sections 4.3 and 4.4 give a brief description of MySQL and SQLyog, respectively. Section 4.5 describes the implementation process of the prototype clinical data warehouse.

4.1 The Conventional Data Warehouse Design Process
Data warehouse design is complex. It requires extensive survey and analysis. There are several steps in designing a data warehouse. Before designing a data warehouse, it is very important to understand and represent the user requirements accurately. The strategies used for this are interviews, document analysis, surveys, company activities, etc. After the requirements are gathered, the data model is created. The logical design is then created using an Entity Relationship Diagram (ERD) or some other type of conceptual model. Then, in the process of physical design, this logical design is translated into a star schema or snowflake schema. Next comes the architecture design procedure. There are mainly four strategies available to
design data warehouse architectures. They are centralized data warehouse, data warehouse and data mart, distributed data warehouse, and hybrid deployment strategy. Based on their demands, organizations choose the most suitable methodology. This project has been developed by following the centralized data warehouse methodology. The next step is the implementation process. The implementation can be facilitated by RAD tools. The SQLyog tool has been used for implementation. One of the important aspects of designing a data warehouse is metadata management. Almost all vendors focus on metadata management. Building a data warehouse will be successful only if all the procedures are followed.

4.2 Project Overview
The aim of the project is to build a clinical data warehouse for the proposed “East London Hospital”. The authors have designed and created a clinical data warehouse, and also created a centralized database which will reside in the hospital. Administrators and doctors in the hospital can collect data from different sources. For example, when a patient is referred to the hospital from a clinic, the doctor in the hospital need not go to the clinic to collect data. The doctor or administrator can extract the data from the clinic. Similarly, if a doctor wants to see a report for a patient, the doctor does not have to go to the laboratory. The doctor can easily collect data from the laboratory. At the same time, a doctor can see the patient's past record to check whether the patient was suffering from any major illness. After collecting all the information, the doctor can prescribe the best possible medication for the patient. If it is a serious condition, the hospital administration can create a medical board to analyse all the records, test reports, diagnosis, previous medication, etc. After analysis, the medical board can take its decision. The data warehouse here acts as a decision support system for them. This makes the doctors' work easier than before. Not only that, because of the massive calculating power and reliability, errors are less likely. A clinical data warehouse requires massive storage support because it stores patients' history and medical data. The more patients there are, the more data storage is required. A clinical data warehouse typically supports more than 10 terabytes. To record data in a clinic, doctor's chamber, laboratory, etc., front end tools like Microsoft Excel, Microsoft Access or any other customised software can be used. MySQL supports all these features. That is why MySQL is becoming more and more popular. By using tools like SQLyog Enterprise, we can create a database, create table design schemas, etc.

4.3 MySQL
MySQL is one of the leading open source database solutions in the market. It provides consistently fast performance and high reliability. Furthermore, it is easy to use. MySQL has powered many online, enterprise, embedded, and business intelligence applications for over twelve years.

4.4 SQLyog
SQLyog is a Graphical User Interface (GUI) for MySQL. It is one of the most powerful MySQL manager and admin tools. It combines the features of the MySQL query browser, administrator and various other MySQL front ends and MySQL clients in a single interface. It helps in creating databases and tables, and generates schemas. SQLyog is an 'All in One' tool. We do not have to switch between different applications to perform database related tasks.

4.5 Project Implementation
The proposed East London Clinical Data Warehouse is now implemented. The implementation consists of several steps, which the following sub-sections describe.

4.5.1 Installing MySQL
From the previous sections, the authors find that to build a clinical data warehouse one needs to plan, analyse and choose the right database vendor. In this case, MySQL is used, which is open source and free. It is reliable and easy to maintain. It supports almost all the features of other, expensive database vendors. Many well recognized companies use MySQL. So for this project, the authors chose MySQL and downloaded it to their laptop. For the project they used MySQL server version 6.0.

4.5.2 Installing SQLyog
To build a data warehouse, first we need to create tables and design a schema. For creating tables effectively and
easily, we will use SQLyog Enterprise Edition (30 day trial ·After clicking, the initial page will open.
version). This software also helps to create tables and ·We will press 'continue' to proceed. Then the
insert data. To install the SQLyog Enterprise, they need to connection wizard will appear.
'click' on the download link from the vendor's website.
·We will 'click' connect to connect to the MySQL host
4.5.3 Schema Design and to go to the main page of the software.
To design a data warehouse, they need to design a ·On the left hand side of the page, we can see the
schema. There are mainly two schemas available to database that the authors have created for this
design a data warehouse. One is the star schema and the other is the snowflake schema.

4.5.3.1 Star Schema
The star schema is the simplest form of data warehouse schema and contains one or more dimension tables and fact tables. The central table of the star schema is called the fact table. The entity-relationship diagram of the fact and dimension tables resembles a star, which gives the schema its name. In this schema, one fact table is connected to multiple dimension tables: a large fact table sits at the centre of the star and points towards the dimension tables. This project uses the star schema. Figure 1 shows the diagram of a star schema.

4.5.4 Schema Diagram of the Project
The star schema is used to design the schema of the project, as it meets the design goals. The project has one fact table, called hospital, and four dimension tables, namely private clinic, lab, patient, and doctor.

4.5.5 Creating Tables
The next step is to create tables in the database. This project concerns a clinical database, so the tables are created to store clinical data. The data warehouse will reside in the hospital. From the schema design, five tables are needed: hospital, lab, private clinic, patient, and doctor. To create the tables, we use SQLyog, which is a GUI for MySQL. Tables can be created easily with this software. It is easy to use, improves performance, and saves a significant amount of time. The steps to create tables in the database are:
·From the start menu, we select SQLyog Enterprise to start SQLyog.
project. If we click on the database, the table folder will appear. From the table drop-down menu, we click 'create table' to create the tables.
·When completed, the tables will appear in the 'el hospital' database.
·Now we can insert data, update data, run queries, etc. Figure 2 shows the data inserted in the doctor table. Figure 3 shows the data in the patient table. Figure 4 shows the data in the private clinic table. Figure 5 shows the data in the lab table. Figure 6 shows the data in the hospital table, which is the fact table.
·After creating the tables, it is possible to query them. Data can be added and updated using this GUI.

5. Evaluation
The title of this research is “An Evaluation of the Clinical Data Warehouse”, so the primary task is to evaluate a clinical data warehouse. In this process, the research shows the benefits and applications of the clinical data warehouse in theory and practice. The design process and the methodologies are also described. These

Figure 1. Star Schema Design of the Project
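The star schema of Figure 1 and the table-creation step of Section 4.5.5 can be illustrated with a minimal sketch. It is not the project's actual DDL: the paper creates its tables in MySQL through the SQLyog GUI and does not list the columns, so every column name below is a hypothetical placeholder, and Python's built-in sqlite3 module stands in for MySQL only to keep the example self-contained. Only the five table names (hospital as the fact table; doctor, patient, lab, and private clinic as dimensions) come from the text.

```python
import sqlite3

# All column names are assumptions made for illustration; the paper names
# only the five tables. SQLite is used here in place of MySQL.
DDL = """
CREATE TABLE doctor (
    doctor_id  INTEGER PRIMARY KEY,
    name       TEXT NOT NULL
);
CREATE TABLE patient (
    patient_id INTEGER PRIMARY KEY,
    name       TEXT NOT NULL
);
CREATE TABLE lab (
    lab_id     INTEGER PRIMARY KEY,
    test_name  TEXT NOT NULL
);
CREATE TABLE private_clinic (
    clinic_id   INTEGER PRIMARY KEY,
    clinic_name TEXT NOT NULL
);
-- Fact table: one row per recorded event, pointing at every dimension.
CREATE TABLE hospital (
    record_id  INTEGER PRIMARY KEY,
    doctor_id  INTEGER NOT NULL REFERENCES doctor(doctor_id),
    patient_id INTEGER NOT NULL REFERENCES patient(patient_id),
    lab_id     INTEGER NOT NULL REFERENCES lab(lab_id),
    clinic_id  INTEGER NOT NULL REFERENCES private_clinic(clinic_id),
    cost       REAL
);
"""


def create_star_schema(conn):
    """Create the four dimension tables and the central fact table."""
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only on request
    conn.executescript(DDL)


def list_tables(conn):
    """Return the names of all tables in the database, sorted alphabetically."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    create_star_schema(connection)
    print(list_tables(connection))
```

The fact table carries one foreign key per dimension, which is what produces the star shape: every join path starts at hospital and is exactly one step long.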

i-manager’s Journal on Computer Science, Vol. 4, No. 4, December 2016 - February 2017

descriptions, the authors believe, would be very beneficial for interested readers and for those who would like to develop a clinical data warehouse.

Another aspect of this research is to identify the challenges and issues that clinical data warehousing currently faces. There are many such challenges and issues. After studying the major ones, the authors tried to work out some possible solutions, which are based on their research in this field. Reading the published articles gave the authors considerable knowledge that will be useful in future work. The challenges, the issues, and their possible solutions should be very beneficial for clinical data warehouse developers.

Today, the data warehouse is considered a must-have strategic business weapon, so choosing the appropriate product for the business is important. Many vendors, such as Microsoft, Oracle, and Sybase, provide data warehouse solutions. Some vendors provide DBMS engines, while others provide infrastructure solutions. The infrastructure tools typically work with database engines. Different vendors support different attributes and features. These vendors, as well as some of the vendors who provide data warehouse solutions, are analyzed based on different attributes. It is believed that this would be a great help for interested readers and data warehouse developers.

To understand the overall activity of a clinical data warehouse, a case study was examined. The case study concerns a clinical data warehouse built for academic purposes at the University of Virginia. It provides the design and architectural background as well as user statistics. This case study was of great benefit to the authors' understanding and helpful for their implementation.

The East London Hospital Clinical Data Warehouse (ELHCDW) project prototype is implemented to depict the architectural and design aspects that are discussed throughout the theoretical work in the paper. At the same time, the ELHCDW is intended as a clinical data warehouse design solution for the interested reader. Building a clinical data warehouse in a real-world environment is a massive undertaking. There are design steps, data modelling steps, schema design steps, architectural steps, implementation steps,

Figure 2. The Doctor Table


Figure 3. The Patient Table

Figure 4. The Private Clinic Table

etc. Three months is therefore not enough to design a clinical data warehouse. During this research period, the authors gathered knowledge about the data warehouse and the clinical data warehouse. They also learned the architectural designs and the methodologies behind it. The benefits of the data warehouse and the clinical data


Figure 5. The Lab Table

Figure 6. The Hospital Table
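Figures 2 to 6 show data held in the five tables. As a rough sketch of how such data might be inserted and then queried for decision support, the example below loads a few rows and runs two joins from the fact table to its dimensions. The rows, the column names, and the two helper functions are invented for illustration and are not taken from the figures; sqlite3 again stands in for the MySQL database used in the project.

```python
import sqlite3

# Hypothetical columns; only the five table names come from the paper.
SCHEMA = """
CREATE TABLE doctor (doctor_id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE patient (patient_id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE lab (lab_id INTEGER PRIMARY KEY, test_name TEXT NOT NULL);
CREATE TABLE private_clinic (clinic_id INTEGER PRIMARY KEY, clinic_name TEXT NOT NULL);
CREATE TABLE hospital (
    record_id  INTEGER PRIMARY KEY,
    doctor_id  INTEGER NOT NULL REFERENCES doctor(doctor_id),
    patient_id INTEGER NOT NULL REFERENCES patient(patient_id),
    lab_id     INTEGER NOT NULL REFERENCES lab(lab_id),
    clinic_id  INTEGER NOT NULL REFERENCES private_clinic(clinic_id),
    cost       REAL
);
"""


def build_demo_warehouse():
    """Create the schema in memory and load a handful of made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO doctor VALUES (?, ?)", [(1, "Dr. A"), (2, "Dr. B")])
    conn.executemany("INSERT INTO patient VALUES (?, ?)", [(10, "Patient X"), (11, "Patient Y")])
    conn.executemany("INSERT INTO lab VALUES (?, ?)", [(100, "Blood test"), (101, "X-ray")])
    conn.executemany("INSERT INTO private_clinic VALUES (?, ?)", [(200, "Clinic P")])
    # Fact rows: (record_id, doctor_id, patient_id, lab_id, clinic_id, cost)
    conn.executemany(
        "INSERT INTO hospital VALUES (?, ?, ?, ?, ?, ?)",
        [(1, 1, 10, 100, 200, 50.0), (2, 1, 11, 100, 200, 70.0), (3, 2, 10, 101, 200, 30.0)],
    )
    conn.commit()
    return conn


def cost_per_doctor(conn):
    """Aggregate the fact table by the doctor dimension."""
    return conn.execute(
        """
        SELECT d.name, COUNT(*) AS records, SUM(h.cost) AS total_cost
        FROM hospital AS h
        JOIN doctor AS d ON d.doctor_id = h.doctor_id
        GROUP BY d.name
        ORDER BY d.name
        """
    ).fetchall()


def patient_history(conn, patient_id):
    """Join the fact table to the other dimensions for a single patient."""
    return conn.execute(
        """
        SELECT d.name, l.test_name, c.clinic_name, h.cost
        FROM hospital AS h
        JOIN doctor AS d ON d.doctor_id = h.doctor_id
        JOIN lab AS l ON l.lab_id = h.lab_id
        JOIN private_clinic AS c ON c.clinic_id = h.clinic_id
        WHERE h.patient_id = ?
        ORDER BY h.record_id
        """,
        (patient_id,),
    ).fetchall()


if __name__ == "__main__":
    warehouse = build_demo_warehouse()
    print(cost_per_doctor(warehouse))
    print(patient_history(warehouse, 10))
```

Both queries start from the fact table and reach each dimension in a single join, which is the main practical appeal of the star layout for the kind of analysis by doctors and administrators that the paper describes.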


warehouse were also learnt.

A data warehouse extracts data from different sources, integrates them, and stores them in a centralized data repository to support decision making. The implementation tries to show this process. In the project, the proposed East London hospital has a centralized data repository in which all the data are collected from different sources, such as a private clinic or a laboratory. This data is then analysed so that the doctors and administrators of the hospital can make a decision for a patient. For this prototype clinical data warehouse design, the authors use MySQL, which is one of the most popular databases available. To facilitate the creation of tables and queries, they also use SQLyog, which is a GUI for MySQL. This software is useful, easy to use, and saves time.

The ideas given in this research are based on extensive research, prior knowledge, and a good understanding of data warehouses. Due to time constraints, only 40% of the overall development was completed. The remaining 60% could be completed as future work, along with possible modifications and upgrades. The design and implementation of the project closely follow the rules for developing a clinical data warehouse presented in the previous sections. Had the prototype been completed, it should have been able to benefit the healthcare sector. Interested readers and data warehouse developers will gain knowledge of designing a clinical data warehouse, which will help them to build and develop a data warehouse in a clinical context.

Conclusion and Future Work
The paper provides a good theoretical description of data warehousing. As mentioned earlier, because of the short period of time, only 40% of the overall project implementation was completed. The remaining 60% could be completed as future work, along with possible modifications and upgrades. Future research could also focus on developing and extending the prototype and incorporating more features to make it more applicable to the healthcare sector. Finally, further research is suggested to analyse the clinical data warehouse in more depth and to identify new challenges and issues, together with the activities needed to resolve them.

References
[1]. Berndt D. J., Fisher J. W., Hevner A. R., and Studnicki J., (2001). “Healthcare Data Warehousing and Quality Assurance”. IEEE Computer Society, Vol.34, No.12, pp.56-65.
[2]. Covvy D., (2000). “What is a Data Warehouse?” Healthcare Information Management in Canada. Vol.14, No.2, pp.36-38.
[3]. Einbinder J. S., Scully K., Pates R. D., Schubart J. R., Reynolds R. E., Spraggings T. A. and Krumholz R. M., (1999). “Web-Accessible Patient data Warehouse at the University of Virginia”. AMIA Symposium, pp.12-16.
[4]. Inmon W. H., (1990). Building the Data Warehouse. QED Information Sciences.
[5]. Sen A. and Jacob V. S., (1998). “Industrial Strength Data Warehousing”. Communications of the ACM, Vol.41, No.9, pp.28-31.
[6]. Stephen R., (1998). “Building the Data Warehouse”. Communications of the ACM, Vol.41, No.9, pp.52-60.
[7]. Kimball and Inmon, (1996). The Data Warehouse Toolkit. John Wiley: New York.


ABOUT THE AUTHORS


Dr. Subramaniam Arunachalam is a Senior Lecturer in the Mechanical Engineering Group in the University of East London. He
holds a Doctorate in Mechanical and Manufacturing Engineering. He has many years of experience in teaching and developing
course materials at undergraduate and postgraduate levels in Mechanical Engineering and Manufacturing Engineering
programmes. He has published 203 articles in high profile journals, edited books and international conferences in the area of
Mechanical and Manufacturing Engineering. His papers have been cited more than 3387 times. He serves on the editorial
advisory board of many international journals. Dr. Arunachalam is a visiting fellow at several universities abroad and serves as an
academic advisor to promote international research and academic links.

Dr. Tom Page is a Senior Lecturer in Electronic Product Design in the Department of Design and Technology at Loughborough
University. He obtained his BSc. and M.Phil in Engineering Design from Napier University, in 1988 and 1992. He obtained a PhD
from the University of Hertfordshire in 2002. He is an external examiner on Engineering and Manufacturing programmes at
Sheffield Hallam University. He is a visiting scholar at Iceland University and the University of Lapland in Finland and has been an
external examiner on undergraduate fields in Product Design and Manufacturing Engineering at the University of East London
(2008 to present). His main research areas include the Research and Development of Computer Applications for Design
and Technology Education, Logistics and Supply Chain Management, and Electronic Product Design.

Dr. Gisli Thorsteinsson is a Professor in the Department of Design and Craft at Iceland University of Education. He completed his
PhD study at Loughborough University, where he explored the values of using Virtual Learning Environment for ideation in general
school education. Dr. Gisli has been the Chairman of the Association of Icelandic Industrial Arts Teachers since 1995 and is
associated with the NST Coalition of Industrial Arts Teachers in Scandinavia. From 2000 he has been on the Board of 'Nordfo', the
Pan Scandinavian co-operative researching art and design projects in Scandinavia. In 1999 he was involved in the National
Curriculum development for Technology Education in Iceland and wrote the curriculum part for Design and Craft. Gisli has
written numerous articles concerning Design and Craft education and has published several textbooks about Innovation
Education.

Reproduced with permission of copyright owner.
Further reproduction prohibited without permission.
