Professional Documents
Culture Documents
Distribution, Data, Deployment: Software Arhitechure Convergence in Big Data System
Distribution, Data, Deployment: Software Arhitechure Convergence in Big Data System
Distribution, Data, Deployment: Software Arhitechure Convergence in Big Data System
DEPLOYMENT: SOFTWARE
ARHITECHURE
CONVERGENCE IN BIG DATA
SYSTEM
INTRODUCTION:
Over the last two and a half years we have designed, implemented, and deployed a
distributed storage system for managing structured data at Google called Big table. Big table
is designed to reliably scale to petabytes of data and thousands of machines. Big table has
achieved several goals: wide applicability, scalability, high performance, and high
availability. Big table is used by more than sixty Google products and projects, including
Google Analytics, Google Finance, Orkut, Personalized Search, Writely, and Google Earth.
These products use Big table for a variety of demanding workloads, which range from
throughput-oriented batch-processing jobs to latency-sensitive serving of data to end users.
The Big table clusters used by these products span a wide range of configurations, from a
handful to thousands of servers, and store up to several hundred terabytes of data. In many
ways, Big table resembles a database: it shares many implementation strategies with
databases. Parallel databases and main-memory databases have achieved scalability and high
performance, but Big table provides a different interface than such systems. Big table does
not support a full relational data model; instead, it provides clients with a simple data model
that supports dynamic control over data layout and format, and allows clients to reason about
the locality properties of the data represented in the underlying storage. Data is indexed
using row and column names that can be arbitrary strings. Big table also treats data as
uninterrupted strings, although clients often serialize various forms of structured and semistructured data into these strings. Clients can control the locality of their data through careful
choices in their schemas. Finally, Big table schema parameters let clients dynamically control
whether to serve data out of memory or from disk.
LITERATURE SURVEY
1. Bigtable: A Distributed Storage System for Structured Data
Authors: Fay Chang, Jeffrey Dean, Sanjay Ghemawat
Year: 2007
Description:
Bigtable is a distributed storage system for managing structured data that is designed to scale
to a very large size: peta bytes of data across thousands of commodity servers. Many projects
at Google store data in Bigtable, including web indexing, Google Earth, and Google Finance.
These applications place very different demands on Bigtable, both in terms of data size (from
URLs to web pages to satellite imagery) and latency requirements (from backend bulk
processing to real-time data serving). Despite these varied demands, Bigtable has successfully
provided a _exible, high-performance solution for all of these Google products. In this paper
we describe the simple data model provided by Bigtable, which gives clients dynamic control
over data layout and format, and we describe the design and implementation of Bigtable. Over
the last two and a half years we have designed, implemented, and deployed a distributed
storage system for managing structured data at Google called Bigtable. Bigtable is designed to
reliably scale to petabytes of data and thousands of machines. Bigtable has achieved several
goals: wide applicability, scalability, high performance, and high availability. Bigtable is used
by more than sixty Google products and projects, including Google Analytics, Google Finance,
Orkut, Personalized Search, Writely, and Google Earth. These products use Bigtable for a
variety of demanding workloads, which range from throughput-oriented batch-processing jobs
to latency-sensitive serving of data to end users. The Bigtable clusters used by these products
span a wide range of con_gurations, from a handful to thousands of servers, and store up to
several hundred terabytes of data.
5 Big Data and Cloud Computing: Current State and Future Opportunities
Authors: Divyakant Agrawal Sudipto Das Amr El Abbadi
Year: 2010
Description:
Scalable database management systems (DBMS)both for update intensive
application workloads as well as decision support systems for descriptive and deep
analyticsare a critical part of the cloud infrastructure and play an important role
in ensuring the smooth transition of applications from the traditional enterprise
infrastructures to next generation cloud infrastructures. Though scalable data
management has been a vision for more than three decades and much research has
focussed on large scale data management in traditional enterprise setting, cloud
computing brings its own set of novel challenges that must be addressed to ensure
the success of data management solutions in the cloud environment. This tutorial
presents an organized picture of the challenges faced by application developers and
DBMS designers in developing and deploying internet scale applications. Our
background study encompasses both classes of systems: (i) for supporting update
heavy applications, and (ii) for ad-hoc analytics and decision support. We then
focus on providing an in-depth analysis of systems for supporting update intensive
web-applications and provide a survey of the state-of-the art in this domain. We
crystallize the design choices made by some successful systems large scale
database management systems, analyze the application demands and access
patterns, and enumerate the desiderata for a cloud-bound DBMS.
PROJECT IMPLEMENTATION
Modules:
ADMIN
Authentication.
View Patient Records.
Preference Setting.
Generate EMR.
Generate MIS Report.
USER
Authentication.
Diagnose Schedule
EMR Details
View MIS Report
Login
Check
status
Database
Proceed to
hierarchy
next
View Patient
information
Login
Admin
BDS
Preference Setting
The admin will be view the patient information and set the priority as
based on the treatment. This is based on the type of treatment which they
mention in the document.
Admin
Login
View
Patient
informatio
n
Set
preferenc
e
BDS
Admin
Maintain
Patient
informatio
n
Generate
MIS
Report
BDS
Generate EMR
The admin will collect the patient details and stored in the
BDS. This EMR contains all the records from enquiry to till in patient
discharge information has been collected and stored in the BDS.
Admin
Login
Collect
Patient
informatio
n
Generate
Single
EMR
BDS
Login
The user has to provide exact username and password which was
provided at the time of registration, if login success means it will take up
to main page else it will remain in the login page itself.
LOGIN
CHECK
STATUS
Proceed To next
stage
Hierarchy
DB
Diagnose Schedule
The user after the successful login goes to view the schedule for take
initial treatment that provides by the admin. It contains records and
scheduling information.
Patien
Patien
t1
t1
User
Login
Patien
Patien
t2
t2
Patien
Patien
t3
t3
BDS
User
View
Diagnose
Schedule
Check
Prefere
nce
View EMR
records
BDS
User
View
Diagnose
Schedule
View EMR
records
View MIS
Details
BDS
2. USER
Authentication
Input: Provide username and password to get permission for access.
Output: Became authenticated person to request and process the request.
Diagnose Schedule
Input: Admin will give the schedule for the consultation
Output: It will show consultation time and date.
View EMR Detail
Input: Admin generated EMR report is send to the patient
Output: It will show the patient treatment report and scanning report detail.
View MIS Report
Input: The retrieval data will send to the corresponding patient.
Output: It will show the MIS details
: Windows7
IDE
Front End
: ASP.NET
Coding Language
Backend
: C#
: SQL Server 2008
HARDWARE REQUIREMENTS
Processor
Hard disk
: 40 GB
Mouse
: Logitech.
RAM
: 2GB(minimum)
Keyboard
SYSTEM DESIGN:
USE CASE DIAGRAM:
A use case diagram is a type of behavioral diagram created
from a Use-case analysis. The purpose of use case is to present overview
of the functionality provided by the system in terms of actors, their goals
and any dependencies between those use cases.
Login
USER
View Patient information
Generate EMR
View EMR
Generate MIS
ADMIN
CLASS DIAGRAM:
A class diagram in the UML is a type of static structure diagram that
describes the structure of a system by showing the systems classes, their
attributes, and the relationships between the classes.
Private visibility hides information from anything outside the class
partition. Public visibility allows all other classes to view the marked
information.
Protected visibility allows child classes to access information they
inherited from a parent class.
Patient
name
Address
Profession
mail
mob
Administrator
name
address
mob no
mailid
EMR()
MIS()
Schedule()
view()
registration()
OBJECT DIAGRAM:
An object diagram in the Unified Modeling Language (UML) is
a diagram that shows a complete or partial view of the structure of a
modeled system at a specific time.
An Object diagram focuses on some particular set of object instances
and attributes, and the links between the instances. A correlated set of
object diagrams provides insight into how an arbitrary view of a system is
expected to evolve over time.
Object diagrams are more concrete than class diagrams, and are often
used to provide examples, or act as test cases for the class diagrams. Only
those aspects of a model that are of current interest need be shown on an
object diagram.
Register Details
User Id=xxxx
Name=yyyy
View information
User Id=xxxx
Name=yyyy
Treatment= t1,t2..
Generate EMR
User Id= xxx
Login
Username= Admin/
User
PassWord=*****
Treatment=t1,t2
Report=L1,L2..
View EMR Details
User Id= xxx
Treatment=t1,t2
Report=L1,L2..
Generate MIS
Record
Payment opt=O1 or
O2
Test report=r1,r2
STATE DIAGRAM:
A state diagram is a type of diagram used in computer science and
related fields to describe the behavior of systems. State diagrams require
that the system described is composed of a finite number of states;
sometimes, this is indeed the case, while at other times this is a
reasonable abstraction. There are many forms of state diagrams, which
differ slightly and have different semantics.
Registration
User Login
Admin Login
View Patient
details
View Schedule
details
Generate EMR
Report
View EMR
Record
Generate MIS
details
View MIS
detail
Logout
ACTIVITY DIAGRAM
Activity diagram are a loosely defined diagram to show workflows of
stepwise activities and actions, with support for choice, iteration and
concurrency. UML, activity diagrams can be used to describe the business
and operational step-by-step workflows of components in a system. UML
activity diagrams could potentially model the internal logic of a complex
operation. In many ways UML activity diagrams are the object-oriented
equivalent of flow charts and data flow diagrams (DFDs) from structural
development.
No
Register
Login
User
Admin
View Diagnose
Schedule
Set preference
View EMR
Record
Generate EMR
record
Generate MIS
detail
Logout
SEQUENCE DIAGRAM:
A sequence diagram in UML is a kind of interaction diagram that
shows how the processes operate with one another and in what order.
It is a construct of a message sequence chart. Sequence diagrams are
sometimes called Event-trace diagrams, event scenarios, and timing
diagrams.
The below diagram shows the sequence flow shows how the
process occurs in this project.
User
Admin
Big Data
System
Login User
Register Details
Admin login
Setting Priority
Generate EMR
Admin Logout
User Logout
COLLABORATION DIAGRAM:
A collaboration diagram show the objects and relationships involved
in an interaction, and the sequence of messages exchanged among the
objects during the interaction.
The collaboration diagram can be a decomposition of a class, class
diagram, or part of a class diagram. It can be the decomposition of a use
case, use case diagram, or part of a use case diagram.
The collaboration diagram shows messages being sent between
classes and object (instances). A diagram is created for each system
operation that relates to the current development cycle (iteration).
User
1: Login User
2: Register Details
3: View Diagnose Schedule
9: View EMR detail
10: View MIS report
11: User Logout
Big Data
System
4: Admin login
5: Setting Priority
6: Generate EMR
7: Generate MIS report
8: Admin Logout
Admin
COMPONENT DIAGRAM:
Components are wired together by using an assembly connector to
connect the required interface of one component with the provided
interface of another component. This illustrates the service consumer service provider relationship between the two components.
An assembly connector is a "connector between two components that
defines that one component provides the services that another component
requires. An assembly connector is a connector that is defined from a
required interface or port to a provided interface or port.
When using a component diagram to show the internal structure of a
component, the provided and required interfaces of the encompassing
component can delegate to the corresponding interfaces of the contained
components.
View Diagnose
Schedule
View EMR
Record
Logout
Login
Set Preference
Setting
Generate
EMR
Generate MIS
Details
LEVEL 0:
LEVEL 1:
LEVEL 2:
ALL LEVEL:
E-R DIAGRAM:
In software engineering, an Entity-Relationship Model (ERM) is
an abstract and conceptual representation of data. Entity-relationship
modeling is a database modeling method, used to produce a type
of conceptual schema or semantic data model of a system, often
a relational database, and its requirements in a top-down fashion.
Diagrams created by this process are called Entity-Relationship
Diagrams, ER diagrams, or ERDs.
Userna
me
Passwor
d
Login
Admin
User
View
Patient
informati
on
Vie
w
Preferen
ce
Setting
View
MIS
details
Genera
te EMR
Genera
te MIS
Report
Registe
r
Patient
details
View
Diagnose
Schedule
View
EMR
Record
SYSTEM ARCHITECTUER.
Architecture diagram shows the relationship between different
components of system. This diagram is very important to understand the
overall concept of system. Architecture diagram is a diagram of a system,
in which the principal parts or functions are represented by blocks
connected by lines that show the relationships of the blocks. They are
heavily used in the engineering world in hardware design, electronic
design, software design, and process flow diagrams
User
Global
Networ
k
Admin
Big Data
System
FUTURE ENHANCEMENT:
Description:
In the future, we plan to conduct an empirical evaluation to assess
how the replicate can store in the Big data system.
Module Diagram:
User
View
Files
Request a
file
Verify the
key and
count
Retrieve
a file
ADVANTANGES
High availability
Write heavy workloads
Variable request loads
APPLICATIONS
Online Banking System
Financial Management system
Health Care System
CONCLUSION
We have described Big table, a distributed system for storing
structured data at Google. Our users like the performance and high
availability provided by the Big table implementation, and that they can
scale the capacity of their clusters by simply adding more machines to
the system as their resource demands change over time.
REFERENCES OR BIBLIOGRAPHY:
1.P. Ciuccarelli, M.I. Sessa, and M. Tucci, Code: A Graphic Language for Complex
System Visualization, Proc. Italian Assoc. for Information Systems (ItAIS), 2010.
2. J. Bertin, Semiology of Graphics: Diagrams, Networks, Maps/Jacques Bertin;
Translated by William J. Berg., Univ. of Wisconsin Press, 1983.
3. S.K. Card, J.D. Mackinlay, and B. Shneiderman, Readings in Information
Visualization: Using Vision to Think. Morgan Kaufmann Publishers Inc., 1999.
4. E.R. Tufte, The Visual Display of Quantitative Information, second ed. Graphics
Press, May 2001.
5. A. Unger, P. Muigg, H. Doleisch, and H. Schumann, Visualizing Statistical
Properties of Smoothly Brushed Data Subsets, Proc. Fourth Intl Conf. Information
Visualisation, pp. 233-239, 2008.
6. J. Heer and G. Robertson, Animated Transitions in Statistical Data Graphics,
IEEE Trans. Visualization and Computer Graphics, vol. 13, no. 6, pp. 1240-1247,
Nov. 2007.
7. E. Papageorgiou, Review Study on Fuzzy Cognitive Maps and Their
Applications during the Last Decade, Proc. IEEE Intl Conf. Fuzzy Systems
(FUZZ), pp. 828-835, 2011.
THANK YOU