
Data Warehouse MCQ 40 Questions

The document contains a set of 40 multiple-choice questions (MCQs) focused on data warehousing concepts. Topics covered include the purpose of data warehouses, data types, schemas, OLAP and OLTP systems, and data management processes. Each question presents four options, testing knowledge on the fundamental aspects of data warehousing.

Uploaded by

majha loka
Copyright
© All Rights Reserved

Data Warehousing MCQ – 40 Questions (Questions Only)

1. What is a Data Warehouse mainly used for?


A. Transaction processing
B. Data storage only
C. Data analysis and reporting
D. Data encryption

2. A Data Warehouse is primarily designed for:


A. Insert and update operations
B. Query and analysis
C. File storage
D. Application hosting

3. Data stored in a Data Warehouse is usually:


A. Current transactional data
B. Temporary data
C. Historical data
D. Unstructured data

4. Which of the following best defines a Data Warehouse?


A. Transaction system
B. Data management system for BI
C. File storage system
D. Application server

5. Which data is mostly present in a Data Warehouse?


A. Live transaction data
B. Historical summarized data
C. Temporary session data
D. Cache data

6. Which system is optimized for query performance?


A. OLTP
B. DBMS
C. Data Warehouse
D. File system

7. Data cleaning involves:


A. Combining databases
B. Removing inconsistencies and noise
C. Loading data
D. Creating schemas

8. Data integration means:


A. Removing duplicates
B. Combining multiple data sources
C. Data transformation
D. Data loading

9. Data transformation converts:


A. Warehouse data to source data
B. Host format to warehouse format
C. Structured data to unstructured data
D. OLAP data to OLTP data

10. Data loading includes:
A. Sorting and summarizing
B. Cleaning data
C. Removing outliers
D. Extracting data

11. Data refreshing is used to:


A. Delete old data
B. Propagate updates
C. Normalize tables
D. Clean data

12. A Data Mart is:


A. Entire enterprise data
B. Department-specific subset
C. Transaction database
D. Metadata repository

13. Data Marts are created mainly for:


A. Developers
B. End users
C. Specific departments
D. External partners

14. Data Mart size is generally:


A. Larger than warehouse
B. Same as warehouse
C. Smaller than warehouse
D. Unlimited

15. Data Warehouse is:


A. Transaction-oriented
B. Subject-oriented
C. Application-oriented
D. File-oriented

16. Which characteristic means historical data is stored?


A. Integrated
B. Subject-oriented
C. Time-variant
D. Non-volatile

17. Non-volatile data means:


A. Data changes frequently
B. Data is read-only
C. Data is temporary
D. Data is deleted often

18. Integrated data means:


A. Single source only
B. Data from multiple sources
C. Only structured data
D. Only current data

19. Metadata provides information about:
A. Transactions
B. Source and usage of data
C. Reports only
D. Indexes

20. Which tool allows interaction with warehouse data?


A. ETL tools
B. OLAP tools
C. Source systems
D. Metadata

21. OLAP stands for:


A. Online Logical Application Processing
B. Online Analytical Processing
C. Offline Analytical Processing
D. Online Large Application Processing

22. OLAP is used for:


A. Data entry
B. Transaction processing
C. Data analysis
D. Backup

23. OLTP stands for:


A. Online Logical Transaction Processing
B. Online Transaction Processing
C. Offline Transaction Processing
D. Online Technical Processing

24. OLTP systems handle:


A. Reports
B. Analysis
C. Transactions
D. Forecasting

25. ATM transactions are handled by:


A. OLAP
B. Data Mart
C. Data Warehouse
D. OLTP

26. The multidimensional model stores data in the form of:


A. Tables
B. Files
C. Data cubes
D. Trees

27. Dimensions are:


A. Numeric values
B. Measures
C. Text attributes
D. Calculations

28. Facts are:
A. Descriptive attributes
B. Numeric measures
C. Metadata
D. Indexes

29. Which schema is simplest in design?


A. Snowflake
B. Star
C. Fact constellation
D. Hybrid

30. Star schema contains:


A. Only dimensions
B. Only fact table
C. Fact table connected to dimensions
D. Dimension tables connected together

31. In Star Schema, dimension tables are connected using:


A. Indexes
B. Primary key to foreign key
C. Views
D. Triggers

32. Snowflake schema is:


A. Simpler than star
B. More complex than star
C. Same as star
D. No dimensions

33. Snowflake schema uses:


A. Denormalized dimensions
B. Normalized dimensions
C. No dimensions
D. Only fact tables

34. Fact constellation schema is also called:


A. Star schema
B. Snowflake schema
C. Galaxy schema
D. Hybrid schema

35. Fact constellation schema contains:


A. One fact table
B. No dimensions
C. Multiple fact tables
D. One dimension

36. Which schema has the most complex design?


A. Star
B. Snowflake
C. Fact constellation
D. Flat

37. Data Warehouse supports:
A. OLTP only
B. OLAP only
C. Both OLTP and OLAP
D. File processing

38. DBMS mainly focuses on:


A. Analysis
B. Historical data
C. Transactions
D. Reporting

39. Data Warehousing mainly focuses on:


A. Data capture
B. Data analysis
C. Data insertion
D. Data deletion

40. Which of the following is NOT a Data Warehouse component?


A. Central database
B. Metadata
C. Access tools
D. Compiler

Common questions


A Data Warehouse is designed mainly for analysis and reporting on historical data, whereas OLTP (Online Transaction Processing) systems handle current transactions and are optimized for insert and update operations. Data Warehouses are optimized for query performance and store integrated, non-volatile, time-variant data, primarily for business intelligence. OLTP systems, by contrast, focus on fast processing, consistency, and availability of transactions in daily operations.

Data integration matters in a Data Warehouse because it combines data from disparate sources into a single, unified view, which makes comprehensive analysis possible. The main challenges are inconsistent data, differing data formats, and keeping time-dependent data synchronized across the source systems. Overcoming them requires robust ETL processes that clean, transform, and load data efficiently, so that the integrated warehouse stays accurate and reliable.
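As a rough illustration of the format differences mentioned above, the sketch below merges two source extracts into one warehouse format. The source systems, field names, and date formats are all hypothetical and chosen only for the example; a real ETL pipeline would do considerably more.

```python
from datetime import datetime

# Hypothetical extracts: each source uses its own field names and date format.
crm_rows = [{"cust_id": "17", "signup": "2024-03-05"}]
billing_rows = [{"customer": 17, "first_invoice": "05/04/2024", "amount": "120.50"}]


def transform_crm(row):
    """Convert a CRM row (ISO dates, string ids) to the warehouse format."""
    return {
        "customer_id": int(row["cust_id"]),
        "event": "signup",
        "event_date": datetime.strptime(row["signup"], "%Y-%m-%d").date(),
        "amount": None,
    }


def transform_billing(row):
    """Convert a billing row (day/month/year dates) to the warehouse format."""
    return {
        "customer_id": int(row["customer"]),
        "event": "invoice",
        "event_date": datetime.strptime(row["first_invoice"], "%d/%m/%Y").date(),
        "amount": float(row["amount"]),
    }


def integrate(crm, billing):
    """Combine both sources into one list ordered by customer and date."""
    unified = [transform_crm(r) for r in crm] + [transform_billing(r) for r in billing]
    return sorted(unified, key=lambda r: (r["customer_id"], r["event_date"]))


warehouse_rows = integrate(crm_rows, billing_rows)
for row in warehouse_rows:
    print(row)
```

After integration, both rows share the same keys and date type, so they can be queried together regardless of where they came from.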

A Fact Constellation schema, also known as a Galaxy schema, contains multiple fact tables that share dimension tables, which allows complex data modeling and supports multiple business processes. This offers flexibility in multidimensional querying and can reduce data redundancy. However, its complexity makes the schema harder to understand and manage, potentially leading to longer development times and greater maintenance effort. It requires careful design so that the schema remains efficient and query performance does not degrade.

Metadata in a Data Warehouse provides crucial information about the data, including its source, transformation, loading processes, and usage. It allows users and tools within the warehouse environment to understand the structure and function of the data for more efficient management and querying. By documenting the data comprehensively, metadata supports data governance and lineage tracking, and helps maintain data quality and consistency across the warehouse.

The time-variant characteristic is crucial because it enables data to be stored and analyzed across multiple time periods, which is essential for trend analysis, forecasting, and historical reporting. Unlike operational databases that focus on current data, a Data Warehouse uses time-variant data to provide snapshots of information over time, supporting the historical comparison and longitudinal analysis needed for strategic decision-making and business intelligence.
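One simple way to picture time-variant storage is an append-only table of dated snapshots. The sketch below is an assumption-laden toy: the `SnapshotStore` class, its method names, and the customer balances are all invented for illustration and do not come from the quiz.

```python
from datetime import date


class SnapshotStore:
    """Append-only history: rows carry a snapshot date and are never updated in place."""

    def __init__(self):
        self._rows = []  # (snapshot_date, customer_id, balance)

    def load(self, snapshot_date, customer_id, balance):
        """Add a new dated snapshot; earlier snapshots stay untouched."""
        self._rows.append((snapshot_date, customer_id, balance))

    def as_of(self, customer_id, when):
        """Return the balance from the latest snapshot on or before `when`."""
        candidates = [r for r in self._rows if r[1] == customer_id and r[0] <= when]
        return max(candidates, key=lambda r: r[0])[2] if candidates else None

    def history(self, customer_id):
        """Return every (date, balance) pair for trend analysis."""
        return sorted((r[0], r[2]) for r in self._rows if r[1] == customer_id)


store = SnapshotStore()
store.load(date(2024, 1, 31), 17, 100.0)
store.load(date(2024, 2, 29), 17, 140.0)

print(store.as_of(17, date(2024, 2, 15)))
print(store.history(17))
```

Because the February load adds a row rather than overwriting January's, a query as of mid-February still sees the January balance, which is the kind of historical comparison the paragraph describes.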

Data cleaning is essential in Data Warehousing because it ensures the accuracy and quality of the data being stored and analyzed. It involves removing inconsistencies, errors, and noise from datasets, thus maintaining data integrity. Its major components are identifying and correcting errors, filling in missing values, removing duplicates, and resolving conflicts between datasets. By improving data quality, data cleaning makes the data more usable for business intelligence and analytical applications and helps ensure reliable, valid results.
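The cleaning steps listed above can be sketched in a few lines. The record layout and the cleaning rules here are assumptions made for the example (in particular, filling a missing quantity with 0 and a missing region with "unknown" is just one possible policy).

```python
def clean(records):
    """Drop duplicate orders, fill missing values, and normalize noisy text."""
    seen = set()
    cleaned = []
    for rec in records:
        order_id = rec["order_id"]
        if order_id in seen:
            continue  # remove duplicates: keep only the first row per order_id
        seen.add(order_id)
        # Normalize noise: trim whitespace and lowercase; fill a missing region.
        region = (rec.get("region") or "unknown").strip().lower()
        # Fill a missing quantity (assumed policy: treat it as 0).
        qty = rec.get("qty")
        qty = 0 if qty is None else int(qty)
        cleaned.append({"order_id": order_id, "region": region, "qty": qty})
    return cleaned


raw = [
    {"order_id": 1, "region": " North ", "qty": "3"},
    {"order_id": 1, "region": "north", "qty": "3"},  # duplicate order
    {"order_id": 2, "region": None, "qty": None},    # missing values
]
result = clean(raw)
print(result)
```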

OLAP tools are designed to let users perform interactive analysis of multidimensional data stored in a Data Warehouse. They provide functionality for complex calculations, trend analysis, and sophisticated data modeling, turning raw data into actionable insights for strategic business decisions. Through drill-down, slicing, and dicing, OLAP tools help users explore data from different perspectives and aggregate it at various levels, enhancing the overall business intelligence capabilities of the Data Warehouse.
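The operations named above can be imitated on a tiny in-memory cube. This is only a sketch: the fact rows, dimension names, and helper functions (`roll_up`, `slice_`, `dice`) are hypothetical, and real OLAP tools work on far larger, pre-aggregated structures.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, region, product, sales)
facts = [
    (2023, "east", "pen", 10),
    (2023, "west", "pen", 5),
    (2024, "east", "pen", 7),
    (2024, "east", "ink", 4),
    (2024, "west", "ink", 6),
]
DIMS = ("year", "region", "product")


def roll_up(rows, by):
    """Aggregate sales over the dimensions listed in `by`."""
    idx = [DIMS.index(d) for d in by]
    totals = defaultdict(int)
    for row in rows:
        totals[tuple(row[i] for i in idx)] += row[3]
    return dict(totals)


def slice_(rows, dim, value):
    """Slice: fix a single dimension to one value."""
    i = DIMS.index(dim)
    return [r for r in rows if r[i] == value]


def dice(rows, **allowed):
    """Dice: keep rows whose dimensions fall inside the given value sets."""
    return [r for r in rows
            if all(r[DIMS.index(d)] in vals for d, vals in allowed.items())]


by_year = roll_up(facts, by=("year",))
by_year_region = roll_up(facts, by=("year", "region"))  # drill down from by_year
east_only = slice_(facts, "region", "east")
diced = dice(facts, year={2024}, product={"ink"})
print(by_year, by_year_region, east_only, diced, sep="\n")
```

Moving from `by_year` to `by_year_region` is a drill-down (a finer level of aggregation); going the other way is a roll-up.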

A star schema comprises a central fact table connected to multiple dimension tables. Its dimensions are denormalized, which simplifies queries and improves performance. In contrast, a snowflake schema normalizes the dimension tables into additional related tables. This can improve storage efficiency, but it makes the design more complex and queries need more joins.
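A minimal star schema can be tried out with Python's built-in `sqlite3` module. The table and column names below (`fact_sales`, `dim_product`, `dim_date`) and the sample rows are assumptions made for this sketch, not part of the quiz.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: descriptive attributes, one primary key each.
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER, month INTEGER);

-- Fact table: numeric measure plus a foreign key to each dimension.
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id INTEGER REFERENCES dim_date(date_id),
    amount REAL
);

INSERT INTO dim_product VALUES (1, 'pen', 'stationery'), (2, 'ink', 'stationery');
INSERT INTO dim_date VALUES (1, 2023, 12), (2, 2024, 1);
INSERT INTO fact_sales VALUES (1, 1, 10.0), (1, 2, 7.0), (2, 2, 4.0);
""")

# One join per dimension, then aggregate the measure.
rows = conn.execute("""
    SELECT d.year, p.category, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_date d ON d.date_id = f.date_id
    GROUP BY d.year, p.category
    ORDER BY d.year
""").fetchall()
print(rows)
```

In a snowflake variant of this sketch, `category` would move out of `dim_product` into its own table, and the query would need one more join to reach it.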

A Data Mart is a subset of a Data Warehouse focused on a specific business line or department, which makes it smaller in scope. It is designed to meet the precise needs of a particular end-user group, allowing faster access to relevant information. In contrast, a Data Warehouse encompasses the entire organization's data, providing an integrated view across multiple departments and supporting broader analytical queries. Data Marts allow quicker access and easier query execution for targeted areas, while the Data Warehouse supports comprehensive analysis and broader data integration.

Non-volatile data storage ensures that data, once entered into the Data Warehouse, is not erased, allowing users to perform consistent and reliable analyses over time. It provides stable, unchanging data snapshots that are vital for accurate historical analysis. This stability supports trend analysis and decision-making, because it prevents data loss or changes that could skew analytical results, and so makes the reports and insights derived from the warehouse more reliable.
