ADMS 2511
Management Information Systems
Session 4
Copyright I. Splettstoesser, 2017 1
Why is data management so
important?
What is Big data? From: The Economist 2012http://
[Link]/watch?v=ahZGEusG13A
Microsoft illustrates what can be done with its business
intelligence tools, including SQL Server (query language)
[Link]
Copyright I. Splettstoesser, 2017 2
Session 4 (and Chapter 5)
Learning Objectives
4.1 Describe the difficulties of managing data
4.2 Explain how data governance is facilitated by master data
management
4.3 Types of data; Use the data hierarchy; build E-R (entity
relationship) diagrams
4.4 Explain characteristics of relational DBMS and its tools
4.5 Explain nature of big data, data warehouses and data
marts, advantages and disadvantages and role in data mining
4.6 Describe the KMS cycle and discuss types of knowledge
Copyright I. Splettstoesser, 2017 3
Data source examples
Credit card swipes RFID tags Digital video
surveillance
E-mails Radiology Blog
scans 4 s
Copyright I. Splettstoesser, 2017
What can “go wrong” with
data?
It can be ….
Lost
Copied
Erased
Changed
Have multiple copies (that are
slightly different!)
Hard to find
Overwhelming!
Copyright I. Splettstoesser, 2017 5
Data quality problems almost cost Library
and Archives Canada $200,000
Library and Archives Canada wanted to buy an old
map worth $200,000
Then, they found that they already had two copies!
Culprits: translation errors, incomplete data entry
and data entry errors
6
Copyright I. Splettstoesser, 2017
Difficulties of managing data: provide
an example for each
1. Amount of data increases exponentially with time
2. Data is scattered due to collection and storage methods
3. It is generated from multiple sources (existing and new)
4. Data degrades over time
5. Data media rots
Copyright I. Splettstoesser, 2017 7
Difficulties of managing data:
(continued) provide an example for each
6. Data security, quality or integrity may be compromised
7. Legal requirements need to be met; updates to data storage
methods or management procedures result from legal changes
8. Legacy IT systems may result in repetition (redundancy) or
data conflicts
Solutions to these difficulties include effective data
governance.
Copyright I. Splettstoesser, 2017 8
Session 4 (and Chapter 5)
Learning Objectives
4.1 Describe the difficulties of managing data
4.2 Explain how data governance is facilitated by
master data management
4.3 Types of data; Use the data hierarchy; build E-R (entity
relationship) diagrams
4.4 Explain characteristics of relational DBMS and its tools
4.5 Explain nature of big data, data warehouses and data marts,
advantages and disadvantages and role in data mining
4.6 Describe the KMS cycle and discuss types of knowledge
Copyright I. Splettstoesser, 2017 9
Data includes: transaction
data and master data
Transaction data represents activities or events, such as
payroll cheque, customer invoice; transaction data is stored
in transaction files or as tables in a database
Master files (master data) are semi-permanent data, such
as employee name, address, customer name, customer
credit limit. Master data is stored in a master file, in a
database, or in tables. Video explanation:
[Link]
Copyright I. Splettstoesser, 2017 10
Role of data governance
Data governance (a subset of IT governance)
Planned approach to data management for all
types of data
Master data management is a subset of this. It
focuses on consistent use of master data.
Involves the entire organization
It is management of the creation, collection,
handling and protection of information
Copyright I. Splettstoesser, 2017 11
Legislation can affect data archiving requirements
Legislative that has an effect on data storage and security:
In Canada, PIPEDA (Personal Information Protection and
Electronic Documents Act) came into effect January 1,
2004
In the U.S. Sarbanes-Oxley July 2002
Canada’s anti-spam legislation in effect July 1, 2014
means tracking specific consent to send emails
[Link]
12
Copyright I. Splettstoesser, 2017
Let’s do Practice Question
#1
Read Case 5.1, Data problems at Target Canada, p. 116
For best results, do the practice problem before looking at
the answers.
Answers will be posted after the classroom sections are
completed working on the problems.
13
Copyright I. Splettstoesser, 2017
Session 4 (and Chapter 5)
Learning Objectives
4.1 Describe the difficulties of managing data
4.2 Explain how data governance is facilitated by master data
management
4.3 Types of data; Use the data hierarchy; build E-R (entity
relationship) diagrams
4.4 Explain characteristics of relational DBMS and its tools
4.5 Explain nature of big data, data warehouses and data marts, advantages
and disadvantages and role in data mining
4.6 Describe the KMS cycle and discuss types of knowledge
Copyright I. Splettstoesser, 2017 14
Data Hierarchy, Fig 5.2, p. 121
Bit (pixel for images)
Byte = character
Field = attribute
Record = entity
File = table
Database = groups of tables or groups of files
[Note, these are not really equal, but are similar.]
15
Copyright I. Splettstoesser, 2017
Data design for database and file-based
systems (Refer to p. 121-123)
Data term for Examples Data term for file-
database systems based systems
Attribute Carolyn Field
Primary key OR 111222333 (unique Same: Primary key
identifier (a special student number) (a special kind of
kind of attribute) field)
Secondary key (a Jones (last name) Same: Secondary
special type of key (a special kind
attribute) of field)
16
Copyright I. Splettstoesser, 2017
Data design for database and file-based
systems (continued)
Data term for Examples Data term for file-
database systems based systems
Entity or instance 111222333, Carolyn Record
(information about Jones, 41 Alpha
one student) Street, Toronto,
M3S 6T9
Entity class (or (all of the File or flat file (or
table) information for a table)
particular entity
type)
17
Copyright I. Splettstoesser, 2017
E-R Modeling: Fig. 5A.1,
Cardinality symbols
Copyright I. Splettstoesser, 2017 18
E-R Modelling: Fig. 5A.2 One-to-one
and Fig 5A.3 One-to-many relationships
Copyright I. Splettstoesser, 2017 19
E-R Modelling, Fig. 5A.4 Many-to-
many relationship
Copyright I. Splettstoesser, 2017 20
E-R diagram models document what is
happening with data
21
Copyright I. Splettstoesser, 2017
E-R diagram steps
Read the description of the entity classes (table names,
attribute names, relationships, e.g. 1:1 or 1:M or M:M)
Rectangles lable the entity class
Diamonds link classes
Show the relationship (e.g. 1:M) on both sides of the
diamonds and with narrative
Copyright I. Splettstoesser, 2017 22
Entity-relationship diagram practice
for a movie database
Movies have: title, year of release, length (minutes), genre (e.g. comedy,
action), directors, actors, rating (e.g. Pg, A), studio
Studios (name and address) produce one or more movies
Actors (name and date of birth): roles in one or more movies
Directors (name and date of birth) direct one or more movies
Videos explaining how to do E-R diagrams
[Link] or
[Link]
A. List the attributes
B. Indicate the relationships (1:1, 1:M, M:1 or M:M)
C. Draw the E-R Diagram 23
Copyright I. Splettstoesser, 2017
Session 4 (and Chapter 5)
Learning Objectives
4.1 Describe the difficulties of managing data
4.2 Explain how data governance is facilitated by master data
management
4.3 Types of data; Use the data hierarchy; build E-R (entity
relationship) diagrams
4.4 Explain characteristics of relational DBMS and its tools
4.5 Explain nature of big data, data warehouses and data marts,
advantages and disadvantages and role in data mining
4.6 Describe the KMS cycle and discuss types of knowledge
Copyright I. Splettstoesser, 2017 24
Traditional (Functional)
File Environment
Data in older functional (legacy) systems is organized
functionally (by application), usually sequentially and “flat”
Advantages include:
Simplicity of operation and implementation
Faster processing speeds
Example: Monthly payroll
Copyright I. Splettstoesser, 2017 25
Advantages of Database
Management Systems (Fig. 5.1, p. 120)
Reduces data redundancy Increases data integrity
Provides data independence Security over data can be
improved
Reduces data inconsistency
Copyright I. Splettstoesser, 2017 26
Disadvantages of Database Management
Systems
More complex (and costly) to set up and
maintain.
Complex structures may be slower for
processing high volume periodic
transaction updates
How Information Builders is leading
York Regional Police into the 21st
Century, by Emin Wood, March 2, 2017,
[Link], explains the
importance of data consistency
27
Copyright I. Splettstoesser, 2017
Using the relational
database model (Appendix)
Based on linked two-dimensional tables
Comprises:
Query language (SQL or QBE)
Data dictionary
Normalization process
Database joins
How would our student information be organized in a
relational table?
28
Copyright I. Splettstoesser, 2017
Let’s do Practice Question
#2
Review Case 5.1, Data problems at Target Canada, p. 116
For best results, do the practice problem before looking at
the answers.
Answers will be posted after the classroom sections are
completed working on the problems.
Copyright I. Splettstoesser, 2017 29
Session 4 (and Chapter 5)
Learning Objectives
4.1 Describe the difficulties of managing data
4.2 Explain how data governance is facilitated by master data
management
4.3 Types of data; Use the data hierarchy; build E-R (entity
relationship) diagrams
4.4 Explain characteristics of relational DBMS and its tools
4.5 Explain nature of big data, data warehouses and data
marts, advantages and disadvantages and role in data
mining
4.6 Describe the KMS cycle and discuss types of knowledge
Copyright I. Splettstoesser, 2017 30
Big Data
Definition: diverse, high-volume, high velocity information assets that require new
forms of processing to enable enhanced decision-making, insight discovery and
process optimization
Characteristics: Volume, velocity, variety
Issues: untrusted sources, “dirtiness,” changes especially in
data streams
Managing: requires data integration (e.g. using data
warehouses), effective query and business intelligence tools
(Chapter 12, Session 11), and the ability to handle both
structured and unstructured data
Copyright I. Splettstoesser, 2017 31
What is a data warehouse?
Def’n: a repository of historical data that are organized by subject to
support decision makers
A specialized form of database
An architectural structure that defines how historical
data is stored
Linking components of the data warehouse via data
communications increases the scope of data available
Def’n of data mart: a lower cost, scaled down version of a data
warehouse that is designed for the needs of a strategic business unit
or individual department
32
Copyright I. Splettstoesser, 2017
Data warehouse framework, Fig. 5.4, p. 132
33
Copyright I. Splettstoesser, 2017
Advantages of data warehouses
Data is:
Organized and consistent
Integrated (and possibly ‘cleansed’)
Historical and nonvolatile
Optimized for access (for OLAP use,
multi-dimensional)
How does a data mart differ from a
data warehouse?
34
Copyright I. Splettstoesser, 2017
Disadvantages/constraints of data
warehouses
VERY costly and complex to establish (hardware, software and people)
Requires continual maintenance as supporting applications change
Requires high levels of security to ensure access to authorized users
Data is NOT the system of record
35
Copyright I. Splettstoesser, 2017
Session 4 (and Chapter 5)
Learning Objectives
4.1 Describe the difficulties of managing data
4.2 Explain how data governance is facilitated by master data
management
4.3 Types of data; Use the data hierarchy; build E-R (entity
relationship) diagrams
4.4 Explain characteristics of relational DBMS and its tools
4.5 Explain nature of big data, data warehouses and data marts,
advantages and disadvantages and role in data mining
4.6 Describe the KMS cycle and discuss types of knowledge
Copyright I. Splettstoesser, 2017 36
Big data in a relationship …
Big data, TEDxUofM - Jameson Toole - Big Data for
Tomorrow; unusual examples and purposehttp
://[Link]/watch?v=HSVQ5RDBEJs
37
Copyright I. Splettstoesser, 2017
Data can facilitate wisdom
(data life cycle triangle)
How do we move up the data life cycle triangle from the initial
data collection?
Is knowledge currently in emails? Video:
[Link]
38
Copyright I. Splettstoesser, 2017
Tacit and explicit
knowledge
Explicit knowledge is more objective, rational and
technical while
Tacit knowledge is cumulative, more subjective or
experiential
What is the role of knowledge management systems in
relation to explicit and tacit knowledge?
Copyright I. Splettstoesser, 2017 39
Knowledge management systems
(Fig. 5.8, p. 140)
Why have them? Musical video [Link]
Telus uses SAP Jam for KMS, see
http
://[Link]/news/telus-uses-sap-jam-as-glue-in-corporate-culture-transformation/47101?utm_source=enews&utm_campai
40
gn=itbother&utm_medium=
Copyright I. Splettstoesser, 2017
newsletter
Let’s do Practice Question
#3
Read IT’s About Business 5.2, Google’ Knowledge Graph,
p. 127
For best results, do the practice problem before looking at
the answers.
Answers will be posted after the classroom sections are
completed working on the problems.
41
Copyright I. Splettstoesser, 2017
Study tip # 4 – Start and
focus now
Start now, not later. If you have already started, great, do more
now.
Practice: Sit motionless. What part of you is tense? Relaxed?
Does any part of you want to move? Your body is a complex
system – start at the top of your head and work your way down to
your toes to be methodical. This is faster than going where your
attention is directed. Take ten breaths. Move in a comfortable
way.
Just as you focused on yourself, you are able to focus on studying.
Take it one step at a time. Start now.
Copyright I. Splettstoesser, 2017 42
What key topics did we
cover today?
Copyright I. Splettstoesser, 2017 43