DISC 112 Computer and Problem Solving: Session 9 February 20, 2018


Recap
• System Development Lifecycle (SDLC)
• Analysis
• Design
• Implementation (+ Testing)
• Maintenance
• SDLC Approaches
• Waterfall
• Agile
Overview
• Information (and IT) to gain competitive advantage
• Understanding information needs of an organization
• [Information] Value chain
• Decision requirements at different levels within the organization
• Information Systems
• Functional
• Cross-functional (Enterprise-wide)
• [Information] Systems Development Lifecycle
• Mapping business requirements to information [storage/management]
specifications
Importance of Data Management?
• Foundation of Business Intelligence and Analytics
• BI is any system (manual or IT-based) that benefits business
professionals in making better decisions, especially if it allows
them to compete better
• Business Analytics is essentially BI that relies on statistical and/or
machine learning tools and techniques
• Big Data Analytics is a term that has evolved (and is evolving)
• Volume – scale of data
• Velocity – analysis of streaming data
• Veracity – uncertainty of data
• Variety – different forms of data
Data Resource Management
• Basic elements of data
• Bits, bytes, fields, records, files, databases
• Data [storage] models
• Relational [old school]
• Object-oriented
• Multi-dimensional and more …
• Determined by (and affects) the kind of [business] questions you
wish to address
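The hierarchy of bits, bytes, fields, records, and files can be made concrete with a small sketch. The fixed-width layout below is an assumption chosen for illustration, and the second student is hypothetical; nothing here is prescribed by the course.

```python
import struct

# Hypothetical fixed-width layout: 4-byte unsigned ID, 20-byte name, 12-byte major.
RECORD_FORMAT = "<I20s12s"
RECORD_SIZE = struct.calcsize(RECORD_FORMAT)  # bytes per record


def pack_record(sid, name, major):
    """Combine three fields into one record (a fixed-size run of bytes)."""
    return struct.pack(RECORD_FORMAT, sid, name.encode("utf-8"), major.encode("utf-8"))


def unpack_record(raw):
    """Recover the fields from one record, stripping the zero padding."""
    sid, name, major = struct.unpack(RECORD_FORMAT, raw)
    return sid, name.rstrip(b"\x00").decode("utf-8"), major.rstrip(b"\x00").decode("utf-8")


# A "file" is a sequence of records; here it is held in memory as bytes.
file_bytes = pack_record(20202020, "Skywalker, Anakin", "Dark Side")
file_bytes += pack_record(20202021, "Example, Student", "Undeclared")  # hypothetical row

records = [
    unpack_record(file_bytes[i:i + RECORD_SIZE])
    for i in range(0, len(file_bytes), RECORD_SIZE)
]
bits_per_record = RECORD_SIZE * 8  # each byte is 8 bits
```

Each record occupies the same number of bytes, so the program has to know the layout in order to read the file. That coupling is the program-data dependency that a DBMS is meant to remove.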
Data Hierarchy
• Traditional storage format
• Flat file – think of a ledger
• Redundancy
• Program-data dependency
• Inconsistency
• Solution
• Single source for data
• “Managed” access to all users
(applications)
Database management system (DBMS)
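A minimal sketch of why redundant flat files drift apart, and how a single managed source avoids it. The two "applications" (registrar and library), the helper names, and the new address are all hypothetical.

```python
# Hypothetical flat files kept by two applications; each holds its own copy of the address.
registrar_file = [{"SID": 20202020, "SName": "Skywalker, Anakin", "SAddress": "Tatooine"}]
library_file = [{"SID": 20202020, "SName": "Skywalker, Anakin", "SAddress": "Tatooine"}]

# The registrar updates its copy; the library's copy is never told.
registrar_file[0]["SAddress"] = "Coruscant"
flat_files_agree = registrar_file[0]["SAddress"] == library_file[0]["SAddress"]

# Single source: one shared store, and every application goes through the same access functions.
students = {20202020: {"SName": "Skywalker, Anakin", "SAddress": "Tatooine"}}


def get_address(sid):
    """Managed read access to the single copy of the data."""
    return students[sid]["SAddress"]


def set_address(sid, address):
    """Managed write access; there is no second copy to forget."""
    students[sid]["SAddress"] = address


set_address(20202020, "Coruscant")
registrar_view = get_address(20202020)
library_view = get_address(20202020)
```

After the update the two flat files disagree, while both views of the single store return the same value.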
Data Models
• Organizing data
• Relational data model
• Defining relationships between data elements
• Example
• Students
• Student’s name
• Student’s ID
• Student’s major
• Student’s date of birth
[Diagram: Student entity with attributes SID, SName, SDoB, SMajor]
SID       SName              SDoB    SMajor     SAddress
20202020  Skywalker, Anakin  43 BBY  Dark Side  Tatooine
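The student table above could be declared in a relational database roughly as follows. This is a sketch using Python's built-in sqlite3 module with an in-memory database; the column types and the choice of SID as primary key are assumptions, since the slide only gives the column names and one row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Column types are assumed; the slide only lists the attribute names.
conn.execute("""
    CREATE TABLE STUDENT (
        SID      INTEGER PRIMARY KEY,
        SName    TEXT NOT NULL,
        SDoB     TEXT,
        SMajor   TEXT,
        SAddress TEXT
    )
""")

student = (20202020, "Skywalker, Anakin", "43 BBY", "Dark Side", "Tatooine")
conn.execute("INSERT INTO STUDENT VALUES (?, ?, ?, ?, ?)", student)

row = conn.execute(
    "SELECT SName, SMajor FROM STUDENT WHERE SID = ?", (20202020,)
).fetchone()

# The primary key makes SID unique, so a second row with the same SID is refused.
try:
    conn.execute("INSERT INTO STUDENT VALUES (?, ?, ?, ?, ?)", student)
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True
```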
[Entity] Relationship Diagram
Course (CID, CCode, CTitle, CCredits)
Student (SID, SName, SDoB, SMajor)
• Student can enroll in multiple courses
• Course is taken by multiple students
SemesterID  CID   SID       InstructorID  Enrollment
1602        1107  20202020  123           286
1601        1106  20202020  123           80
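A many-to-many relationship like the one between Course and Student is usually stored in a third, linking table. The sketch below assumes a linking table named REGISTRATION (a hypothetical name), placeholder course titles, and one extra hypothetical student; the Enrollment count from the slide is left out for brevity.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE COURSE  (CID INTEGER PRIMARY KEY, CTitle TEXT);
    CREATE TABLE STUDENT (SID INTEGER PRIMARY KEY, SName TEXT);

    -- Linking table: one row per (semester, course, student) combination.
    CREATE TABLE REGISTRATION (
        SemesterID   INTEGER,
        CID          INTEGER REFERENCES COURSE(CID),
        SID          INTEGER REFERENCES STUDENT(SID),
        InstructorID INTEGER,
        PRIMARY KEY (SemesterID, CID, SID)
    );

    -- Course titles and the second student are placeholders, not from the slides.
    INSERT INTO COURSE VALUES (1106, 'Placeholder course A'), (1107, 'Placeholder course B');
    INSERT INTO STUDENT VALUES (20202020, 'Skywalker, Anakin'), (20202021, 'Example, Student');
    INSERT INTO REGISTRATION VALUES
        (1602, 1107, 20202020, 123),
        (1601, 1106, 20202020, 123),
        (1602, 1107, 20202021, 123);
""")

# One student, many courses.
courses_for_student = [
    r[0] for r in conn.execute(
        "SELECT CID FROM REGISTRATION WHERE SID = ? ORDER BY CID", (20202020,)
    )
]

# One course, many students.
students_in_course = [
    r[0] for r in conn.execute(
        "SELECT S.SName FROM REGISTRATION R JOIN STUDENT S ON S.SID = R.SID "
        "WHERE R.CID = ? ORDER BY S.SID", (1107,)
    )
]
```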
[Entity] Relationship Diagram
Customer (CID, CName, CAddress, CCredits)
Credit Card (CID, CNumber, CTitle, CType, CCode, CExpDate)
• Customer can have multiple cards
• Credit card belongs to one customer
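A one-to-many relationship such as Customer to Credit Card can be stored by putting the customer's CID on each card row as a foreign key. The sketch below uses sqlite3 with assumed column types and made-up card numbers and types, and keeps only a few of the attributes from the diagram.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when this is on

conn.executescript("""
    CREATE TABLE CUSTOMER (CID INTEGER PRIMARY KEY, CName TEXT, CAddress TEXT);
    CREATE TABLE CREDIT_CARD (
        CNumber TEXT PRIMARY KEY,
        CType   TEXT,
        CID     INTEGER NOT NULL REFERENCES CUSTOMER(CID)  -- each card has exactly one owner
    );

    -- All values below are hypothetical.
    INSERT INTO CUSTOMER VALUES (1, 'XYZ', 'Hypothetical address');
    INSERT INTO CREDIT_CARD VALUES ('0000-0001', 'TypeA', 1), ('0000-0002', 'TypeB', 1);
""")

# One customer can have multiple cards.
cards_for_customer = conn.execute(
    "SELECT COUNT(*) FROM CREDIT_CARD WHERE CID = 1"
).fetchone()[0]

# A card that points at a customer who does not exist is refused.
try:
    conn.execute("INSERT INTO CREDIT_CARD VALUES ('0000-0003', 'TypeA', 99)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True
```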
Retrieving Data
• Show credit card information for customer “XYZ”

SELECT CUSTOMER.Cname, CUSTOMER.Caddress, CREDIT_CARD.CNumber, CREDIT_CARD.CType
FROM CUSTOMER, CREDIT_CARD
WHERE CUSTOMER.CID = CREDIT_CARD.CID AND CUSTOMER.Cname = 'XYZ'

• Where is the data coming from? The FROM clause names the source tables: CUSTOMER and CREDIT_CARD
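To see what FROM and WHERE each contribute, the query can be run against a small in-memory database. The sample rows, column types, and the added ORDER BY are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE CUSTOMER (CID INTEGER PRIMARY KEY, Cname TEXT, Caddress TEXT);
    CREATE TABLE CREDIT_CARD (CNumber TEXT PRIMARY KEY, CType TEXT, CID INTEGER);

    -- Hypothetical sample data.
    INSERT INTO CUSTOMER VALUES (1, 'XYZ', 'Address 1'), (2, 'ABC', 'Address 2');
    INSERT INTO CREDIT_CARD VALUES
        ('0001', 'TypeA', 1), ('0002', 'TypeB', 1), ('0003', 'TypeA', 2);
""")

# FROM alone pairs every customer with every card: 2 customers x 3 cards.
all_pairs = conn.execute("SELECT COUNT(*) FROM CUSTOMER, CREDIT_CARD").fetchone()[0]

# WHERE keeps only the pairs whose CIDs match, then only customer "XYZ".
query = """
    SELECT CUSTOMER.Cname, CUSTOMER.Caddress, CREDIT_CARD.CNumber, CREDIT_CARD.CType
    FROM CUSTOMER, CREDIT_CARD
    WHERE CUSTOMER.CID = CREDIT_CARD.CID AND CUSTOMER.Cname = ?
    ORDER BY CREDIT_CARD.CNumber
"""
rows = conn.execute(query, ("XYZ",)).fetchall()
```

Passing the name as a parameter (the `?` placeholder) rather than pasting it into the SQL text is a common way to avoid quoting mistakes.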


Data Organization Matters …
• Each data model has some advantages as well as limitations
• For example, how will you change the “type” of data in fields in a relational
table?
• How do you add attributes? Re-define relationships (because business
requirements have changed)?
• What if more data needs to be added?
• Data Warehouse
• Different from “operational” databases
• Stores current and historical data
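The questions on this slide about adding attributes and changing a field's type can be tried out in SQLite. This is only a sketch: it assumes SID was originally stored as text, and it shows SQLite's behavior, which differs from other relational systems (many support changing a column's type directly).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Starting point (assumed): SID stored as text.
    CREATE TABLE STUDENT (SID TEXT PRIMARY KEY, SName TEXT);
    INSERT INTO STUDENT VALUES ('20202020', 'Skywalker, Anakin');

    -- Adding an attribute: existing rows get NULL in the new column.
    ALTER TABLE STUDENT ADD COLUMN SMajor TEXT;

    -- Changing a field's type (SID from TEXT to INTEGER): SQLite cannot alter
    -- a column's type in place, so rebuild the table and copy the rows across.
    CREATE TABLE STUDENT_NEW (SID INTEGER PRIMARY KEY, SName TEXT, SMajor TEXT);
    INSERT INTO STUDENT_NEW SELECT CAST(SID AS INTEGER), SName, SMajor FROM STUDENT;
    DROP TABLE STUDENT;
    ALTER TABLE STUDENT_NEW RENAME TO STUDENT;
""")

row = conn.execute("SELECT SID, SName, SMajor, typeof(SID) FROM STUDENT").fetchone()
```

Adding a column is cheap, while a type change means rewriting every row, which is one reason the choice of data model and schema matters up front.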
Data Warehouse
Multi-dimensional analysis
Business Intelligence Tools
• Online Analytical Processing
• Enables multidimensional data analysis (only if data is stored using
dimensions!)
• Facilitates ad-hoc querying of data (remember the pyramid of decision-making and information requirements?)
• Data Mining
• Predictive analysis
• Pattern recognition
• Text mining
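Multidimensional analysis of the OLAP kind amounts to summing a measure along whichever dimension the question calls for. The sketch below assumes a fact table named ENROLLMENT_FACT (a hypothetical name) with SemesterID, CID, and InstructorID as dimensions and Enrollment as the measure; two rows reuse figures from the enrollment table earlier in the deck and the third is made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE ENROLLMENT_FACT (
        SemesterID INTEGER, CID INTEGER, InstructorID INTEGER, Enrollment INTEGER
    );
    INSERT INTO ENROLLMENT_FACT VALUES
        (1602, 1107, 123, 286),
        (1601, 1106, 123, 80),
        (1602, 1108, 456, 40);   -- hypothetical row
""")

DIMENSIONS = {"SemesterID", "CID", "InstructorID"}


def roll_up(dimension):
    """Total enrollment along one dimension (an ad-hoc roll-up)."""
    if dimension not in DIMENSIONS:
        raise ValueError(f"unknown dimension: {dimension}")
    sql = (
        f"SELECT {dimension}, SUM(Enrollment) FROM ENROLLMENT_FACT "
        f"GROUP BY {dimension} ORDER BY {dimension}"
    )
    return dict(conn.execute(sql).fetchall())


by_semester = roll_up("SemesterID")
by_instructor = roll_up("InstructorID")
```

The same facts answer different questions depending on the dimension chosen, which only works because the data is stored with those dimensions in the first place.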
Questions??
