Professional Documents
Culture Documents
INFO607: Applied Database Technology: Dimensional Modeling, 2
INFO607: Applied Database Technology: Dimensional Modeling, 2
Fall 2012
Online
Course Website: http://learn.dcollege.net
Instructor:
Dr. Il-Yeol Song
Office: Rush 332, 215-895-2489 (Office), 895-2494 (Fax)
Office Hours: W: 5:00-6:00 or by appointment
Email: song@drexel.edu; (Best way to reach me; Put INFO607 in the email subject)
Home page: http://www.ischool.drexel.edu/faculty/song/
Prerequisite:
Info606 (Database Management II) and Info620 (Information Systems Analysis & Design) or
permission of the instructor
Text Books:
• (Mandatory) Kimball, R., The Data Warehouse Toolkit : The Complete Guide to
Dimensional Modeling, 2nd. Ed., Wiley, 2002. (Note that this book is still THE best data
warehousing book in the market)
• (Optional) Urman, S., Oracle 10g PL/SQL Programming, McGraw Hill, 2004.
(Recommended; but you may use other books or Web resources.) Source code of the book:
http://www.mhprofessional.com/getpage.php?c=oraclepress_downloads.php&cat=4222
Website: Learning materials will be available online every Monday morning by 8AM:
To-do List: A list of tasks for each week (under “Discussion” folder).
Lecture Notes: Lecture slides are the most important materials (under “Course Material”
folder).
Exercises: Problems for exercises only. Not to be graded. Solution will be provided
afterwards (under “Discussion” folder).
Homework: Several homework problems to be graded will be assigned (Submission link
will be provided under “Assignments” folder).
Mandatory discussion item: I expect you participate in the discussion within a week, from
Monday to Sunday midnight of your local time. Not graded. Just count participation.
Optional discussion item: You may participate in the discussion any time, but you don’t
have to.
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University
References:
Data Warehousing
• Kimball, R., Ross, M., Thornthwait, W., etc., The Data Warehouse Lifecycle Toolkit: Expert
Methods for Designing, Developing, and Deploying Data Warehouses, Wiley, 2nd. Ed., 2008.
• Kimball, R. and Caserta, J., The Data Warehouse ETL Toolkit: Practical Techniques for
Extracting, Cleaning Data, Wiley, 2004.
• Mundy, J. and Thornthwaite, W. with Kimball, R., The Microsoft Data Warehouse Toolkit:
with SQL Server 2008 R2 and the Microsoft Business Intelligence Toolset, Wiley, 2011.
• Golfarelli, M. and Rizzi, S., Data Warehouse Design: Modern Principles and Methodologies,
McGraw-Hill, 2009.
• Adamson, C., Star Schema: The Complete Reference, McGraw-Hill, 2010.
• Imhoff, C., Galemmo, N., and Geiger J., Mastering Data Warehouse Design: Relational and
Dimensional Techniques. Wiley, 2003.
• Inmon, W.H., et al., DW 2.0: The Architecture for the Next Generation of Data warehousing,,
Morgan Kaufman, 2008.
• Inmon, W.H., et al., Tapping into Unstructured Data: Integrating Unstructured Data and
Textual Analysis into Business Intelligence, Prentice Hall, 2007.
• Inmon, W.H., et al., Business Metadata: Capturing Enterprise Knowledge, Morgan
Kauffman, 2008.
• Adamson, C. and Venerable, M., Data Warehouse Design Solutions, John Wiley, 1998.
• Kimball, R. and Merz, R., The Data Webhouse Toolkit: Building the Web-Enabled Data
Warehouse. Wiley, 2000.
• Sweiger, M., Madson, M.R., Langston, J., and Lombard, H., ClickStream Data Warehousing.
Wiley, 2002.
Data Mining
• Han, J. and Kamber, M., Data Mining: Concepts and Techniques, 3rd ed., Morgan
Kauffman, 2011.
• Witten, I. and Frank, E., Data Mining: Practical Machine Learning Tools and Techniques
3rd ed.., Morgan Kauffman, 2011.
• Lindoff, G. and Berry, M, Data Mining Techniques for Marketing, Sales, and Customer
Relationship Management, 3rd ed.., Wiley, 2011.
ORACLE
SQL:
Loney, K. and Koch, Oracle Database 11g: The Complete Reference. McGraw-Hill
Osborne Media, 2008.
• Bryla, B. and Loney, K. and Theiriault, M., Oracle11g DBA handbook, Oracle Press, 2007.
2
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University
Course Objectives:
This course covers principles and techniques related to data warehousing & OLAP
technologies as well as Oracle PL/SQL programming. We will cover PL/SQL programming
including stored procedures & functions, packages, and triggers. Topics for data warehousing
include dimensional modeling and design, OLAP, ROLAP/MOLAP, aggregation, ETL, physical
data warehouse design, optimization techniques such as partitioning, bit map indexing, multi-table
join indexing, star schema query optimization, and materialized views.
Visio: Get a recent copy of MS Visio; You will automatically receive an email from
“DreamSpark MSDNAA” account that you can download a recent copy of Visio
software. The email will be sent you on Monday afternoon of the first week. You will use
Visio to develop Entity-Relationship Diagrams (ERDs) and dimensional schemas for
exercises, assignments, and term project. (If you already have a Visio, the old version is
sufficient for this class purpose.) From the website, you may also download other MS
software such as SQL Server. If you require any assistance for downloading Visio, contact
the iCommon by emailing ihelp@drexel.edu by sending your name, email address, and
course (INFO 605) information.
If this is their first time getting an account, The email will come from “Drexel University -
The iSchool - DreamSpark Premium noreply@e-academy.com” with the subject of “An
Account has been created for you”.
Alternately if you already had an account, they will get the following message from
“Drexel University - The iSchool - DreamSpark Premium noreply@e-academy.com”
3
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University
Mandatory Discussion The Class participation to mandatory discussion items (but neither
optional discussion nor exercises) will be graded as follows:
2 missing : full credit
3 missing: -1
4 missing: -2
5 missing: -3
6 missing: -5
7 missing: no credit
If you miss more than 7, you will get zero percent for the Class Participation part.
Certain assignments are graded, while others are only for practice. I will make it clear
which one will be graded.
Letter Grade:
A+ 97-100 Walks on water. Professional level work of the highest caliber.
A 93-96 Outstanding achievement. Student performance demonstrates full command of the course materials
and evinces a high level of originality and/or creativity that far surpasses project and course
expectations.
A- 90-92 Excellent achievement. Student performance demonstrates thorough knowledge of the course
materials and exceeds project and course expectations by completing all requirements in a
superior manner.
A key difference between a B and an A is the degree to which the work turned in is original, creative, and shows
mastery of the concepts. Doing what is expected and no more, or simply following my suggestions for improvement,
however well done, earns, at best, a B+.
B+ 87-89 Very good work. Student performance demonstrates above-average comprehension of the
course materials and exceeds project and course expectations on all tasks as defined in the
course syllabus.
B 84-86 Student performance meets designated project and course expectations and demonstrates
understanding of the course materials at an acceptable level.
B- 80-83 Marginal work. Student performance demonstrates incomplete understanding of course materials.
In graduate school, a C does not mean satisfactory, even though it is considered a “passing” grade. It’s an indication
that there is something definitely wrong that needs to be addressed
4
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University
C+ 77-79 Unsatisfactory work. Student performance demonstrates incomplete and inadequate understanding
C 74-76 of course materials.
C- 70-73 Unacceptable work at varying levels of unacceptability.
D+ 68-69
D 64-67
D- 60-63
F 0.0 Failing—or dropped the course and forgot to tell anyone
All the graded assignments are usually due within one week, unless otherwise stated. I expect 4
assignments throughout the term. Assignments must be turned in on the due date before the
midnight. No late assignments will be accepted for grading once the assignment material is
discussed and answers are posted. Late assignments will be allowed on the condition that (1) it is
pre-arranged before the due date (2) there is unavoidable reason such as business trip, important
family matters, sickness, or accidents. All late assignments, however, will receive 10% off
regardless of the reasons for the lateness. No make-up assignments will be given.
Individual completion of assignments and tests is mandatory. Note that copying sentences
directly from the textbook, lecture notes, or other’s work is plagiarism and will not be accepted.
Write with your own words. You can site one or a few sentences with clear citation. Do not
give/show a soft/hard copy of assignment answers or send via email your assignment answer
files to your classmates. Once caught, both parties will suffer. You can discuss lecture
materials, but you cannot discuss assignments with your classmates. You cannot co-develop a
part or the whole solution of any assignments. Both parties will suffer for doing this. Ask me in
advance if you have any questions on what is allowed and what is not allowed. For this course,
remember that all assignments and final exams are individual efforts and only the project is a joint
effort venture.
I am well aware that you have a life and responsibilities outside of this course. If you encounter serious
personal difficulties (e.g., death in the family) that preclude you from completing an assignment or
attending an examination, please contact me in advance of the due date to discuss alternatives.
Plagiarism
Plagiarism is the representation of another’s words, ideas or work as your own. The standards for
academic honesty are described in Student Handbook and on Drexel website. All students are
responsible for reading and understanding these rules.
All submissions should be original work, any exchange, reuse or direct submission of other’s work
(documents, code etc) as your own or as your team’s is Plagiarism.
For this course, students found guilty of plagiarism can expect one or more or all of -
(a) Incident being reported to the University Judicial Office, where a permanent record is
maintained, and
(b) Fail the course.
5
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University
For group project’s, all the team members are equally responsible to observe this policy. If a team
member does not faithfully collaborate, I recommend you talk to me about the nature of
collaboration. Different members of a group may receive a different grade, if such an instance is
noticed or suspected.
Code of conduct
http://www.drexel.edu/provost/policy_reminders_fall_2006.asp#conduct
If you have any type of disability which will interfere with your participation and comprehension
in this class, or with completion of assignments, please contact the Office of Disability Services as
soon as possible. Student with disabilities requesting accommodations and services at Drexel
University need to present a current accommodation verification letter (“AVL”) to faculty before
accommodations can be made. AVL's are issued by the Office of Disability Services (“ODS”). For
additional information, contact the ODS at http://www.drexel.edu/oed/disability, 3201 Arch St.,
Ste. 210, Philadelphia, PA 19104, Phone 215-895-1401, or TTY 215-895-2299.
Submission of assignments, proposals, term projects, peer evaluation form, and the final exam
(1) All the submissions (except questions) must be submitted to digital dropbox.
(2) By the new college rule imposed by the dean to raise awareness about academic dishonesty and
reduce cases of academic dishonesty in the College. All the submissions must show the
following statements at the beginning of page 1:
---------------------------------------------------------------------------------------------------
I certify that:
I have not quoted the words of any other person from a printed source or a website without
indicating what has been quoted and providing an appropriate citation.
I have not submitted this submission to satisfy the requirements of any other course.
6
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University
8 ETL
Important Dates:
Project Proposal Due: Week3 (Wednesday, October 10, 11:59PM)
Final Exam: (take home) Given on Week 11 (Given on Thursday, December 6)
Final Due: Wednesday, December 12, 11:59PM
Project and Peer Evaluation Form Due: Friday, December 14, 11:59PM.
7
INFO607, Fall 2012, Il-Yeol Song, IST, Drexel University