0% found this document useful (0 votes)
38 views31 pages

Unit 1 and Unit 2

The document provides an overview of Business Intelligence (BI) and Data Warehousing, detailing their definitions, importance, tools, and methodologies. It contrasts traditional BI with modern BI, emphasizing the advantages of real-time data access and user-friendly interfaces. Additionally, it outlines various data warehouse architectures, including single, two, and three-tier models, and discusses the top-down and bottom-up approaches to data warehouse design.

Uploaded by

pdmthapa1992
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views31 pages

Unit 1 and Unit 2

The document provides an overview of Business Intelligence (BI) and Data Warehousing, detailing their definitions, importance, tools, and methodologies. It contrasts traditional BI with modern BI, emphasizing the advantages of real-time data access and user-friendly interfaces. Additionally, it outlines various data warehouse architectures, including single, two, and three-tier models, and discusses the top-down and bottom-up approaches to data warehouse design.

Uploaded by

pdmthapa1992
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Unit 1: Introduction to Business

Intelligence 5 LHs
Syllabus: Definition and scope of BI; Importance of BI in decision-making; Overview of
BI tools and technologies.

Business Intelligence:
Business Intelligence is a collection of procedures, mechanisms, and technologies that
modify raw data into significant information that drives cost-effective business services. It
is a collection of software and services to modify data into actionable intelligence and
recognition.

BI has a huge impact on an organizations methods and tactical and operational business
decisions. BI supports fact-based decision-making using historical data rather than
assumptions and gut feelings.

BI tools implement data analysis and make documents, summaries, dashboards, maps,
graphs, and charts to support users with detailed intelligence about the features of the
business.

Business Intelligence is one of the most dynamic tools many organizations use to know
their user base and industry better. It defines the business methodology in which the raw
information is transformed into useful data which support decision-making.

Business intelligence has wide software, and if talking about the advantage of business
intelligence in the retail sector, current business intelligence tools allow organizations to
take benefit of information not only to consider current sales but also to estimate future
potential, patterns, trends, and understand the demand of the user on a deeper level.

BI helps make data easily accessible and understandable for decision-makers so they can
make informed choices. This ongoing process is designed to be efficient and scalable,
ensuring businesses have the insights they need to improve performance and achieve their
goals.

BI was created to help businesses overcome the issue of "garbage in, garbage out," which
occurs when data analysis is based on inaccurate or incomplete information.

Business Intelligence Objective

The main objective of business intelligence is to make Business work effectively by ensuring
that information is easy to access, understand, and secure. Start by using intuitive tools
that present accurate and up-to-date data clearly, so users can make better decisions. Data
should come from reliable sources and be accessible only to those who need it.

Traditional BI Vs Modern BI
Heres a comparison to highlight their key differences −

Traditional BI Modern BI

Modern BI makes it easy for business users by giving


Traditional BI often depended on IT teams to
them access to data and the tools they need,
access data, which made it harder for business
especially AI tools, to quickly achieve the desired
users to get the results they needed.
results.

Traditional BI, business users often had to wait With modern BI, business users can access
for reports, which could mean the information accurate, up-to-date information whenever they
was outdated by the time they received it. need it.

Traditional BI is time-consuming and involves


Modern BI allows quick access to data.
delays.

Traditional BI often results in inconsistent data Modern BI ensures consistent data use, so everyone
usage, which can create confusion and errors. has access to the same reliable information.

Traditional BI platforms mostly focused on Modern BI focuses on real-time data analysis and
giving users detailed historical reports and advanced tools for in-depth insights and predictive
user-friendly ad-hoc analysis tools. analytics.

Traditional BI often required access from


Modern BI makes it easy for your team to access
specific locations or devices, making it harder
data and insights from anywhere, on any device.
to get data on the go.

Methods of Business Intelligence

The methods of business intelligence are as follows −


1. Data analysis visualization
Data analysis visualization is all about how it visualizes the data. It presents records on
dashboards and uses customized metrics associated with the business to create better
decisions based on facts.

2. Reporting

Business intelligence tools are used for reporting information gathering from all the sources
and processing it to enable better reporting and financial decision-making with a rational
mind.

3. Predictive Analytics

Predictive analytics is how you learn an action will work. The fact is you don't learn, and if
you learn not 100 percent. However, with business intelligence, it can make an evidence-
based decision to drive business further. Business intelligence allows us to create a
reasonable prediction of the current trends and user behaviors that impact the
organization's complete development.

4. Data Mining

Data mining is a computer-supported technique to reveal previously anonymous or


unnoticed relations between data entities. Data mining is the procedure of discovering
useful new correlations, designs, and trends by sharing a high amount of data saved in the
warehouse, using pattern recognition technologies such as statistical and numerical
approaches.

Types of BI Tools and Software


BI tools are software programs that help in gathering, processing, and analyzing a large
amount of data from different sources and these software turns these data into valuable
information, making it easier for businesses to understand and use the data for decision-
making.

Below are different BI software and solutions −

• Spreadsheets − In this software, the user inputs, stores, edits, organizes,


computes, and visualizes data.

Examples − MS Excel, Google Sheets, etc.

• OLAP − OLAP stands for Online Analytical Processing. OLAP solutions allow users
to view and analyze data from different perspectives by storing it in a
multidimensional format.

• Data visualization − It helps us to represent the data in the form of charts, graphs,
plots, maps etc.
Examples − Tableau Desktop, Power BI Desktop, etc.

• Data mining − In business, we handle large amounts of data. To search and analyze
this data to find valuable insights, we use data mining.

Example − Knime, RapidMiner, etc.

• Database − A database stores a large amount of data. There are various types of
databases such as relational, NoSQL, and distributed databases.

Examples

o Relational Database − Oracle DB, SQL Server, PostgreSQL.

o NoSQL Databases − MongoDB, Cassandra.

• ETL Tool − ETL stands for Extract, transform, and load. It is a process that is used for
data integration. It involves extracting data, transforming it, and loading it into a
destination.

Examples − Informatica, Ab Initio, IBM DataStage, etc.

• Project management Tools − JIRA software, MS Project, MS Excel.

• Data Modeling Tools − Oracle Data Modeler, Toad Data Modeler

• Reporting and Analytics − MicroStrategy, SAP BusinessObjects Business


Intelligence.

Uses of Business Intelligence


There are three main uses of Business Intelligence −

• Decision Making

• Business Performance Management

• Finding Business Opportunities and Identifying Problems


1. Decision Making
In this context, decision-making refers to the support that Business Intelligence offers to
decision-makers, such as managers. It helps them make informed business decisions
across all levels strategic, tactical, and operational by providing valuable information and
insights.

Strategic

Example − On which product we should invest for the long run output?

Tactical

Example − To meet yearly targets which promotion we should repeat?

Operational

Example − How many part-time staff should we hire for the weekend?

2. Business Performance Management


The second main use of business intelligence is business performance management. Every
successful company has managers at various levels who closely monitor the performance
of their individual sections. Constant monitoring keeps them updated on the state of the
business and guarantees that performance is in line with their objectives and the more
general expectations set by management, including SLAs and Targets.

BI supports two parts of business performance management −

• To understand whats happening in the business.


• To track whether the performance is in accordance with the set objectives and
goals.

3. Finding Business Opportunities and Identifying Problems


The third and the last most important part of using BI to find business opportunities identify
problems. The third key use of BI is to proactively discover opportunities and identify
problems that might have otherwise gone unnoticed. This involves either analyzing data
with a specific question in mind or exploring the data without a particular focus, aiming to
uncover hidden insights and potential issues.

Unit 2: Data Warehousing 8 LHs


Syllabus:

Introduction to data warehousing concepts; Data warehouse architecture; ETL


(Extract, Transform, Load) processes; Data modeling for BI

DATA WAREHOUSE ARCHITECTURE AND ITS TYPES


Data warehouse architecture is a data storage framework’s design of an organization. It
takes information from raw data sets and stores it in a structured and easily digestible
format. A data warehouse architecture plays a vital role in the data enterprise. As databases
assist in storing and processing data, and data warehouses help in analyzing that data. Data
warehousing is a process of storing a large amount of data by a business or organization.
The data warehouse is designed to perform large complex analytical queries on large multi-
dimensional datasets in a straightforward manner. Data warehouses extract data from
different resources, which are in different fonts, convert it into a unique form, and place
data in Data Warehouse.

Types of Data Warehouse Architectures


Data warehouse architecture defines the arrangement of the data in different databases. As
the data must be organized and cleansed to be valuable, a modern data warehouse
structure identifies the most effective technique of extracting information from raw data.
Using a dimensional model, the raw data in the staging area is extracted and converted into
a simple consumable warehousing structure to deliver valuable business intelligence.
When designing a data warehouse, there are three different types of models to consider,
based on the approach of number of tiers the architecture has.

(i) Single-tier data warehouse architecture (ii)


(ii) Two-tier data warehouse architecture
(iii) Three-tier data warehouse architecture

The details of each of the architecture are given below

Single-tier data warehouse architecture


The single-tier architecture (Figure 1) is not a frequently practiced approach. The main
goal of having such architecture is to remove redundancy by minimizing the amount of
data stored. Its primary disadvantage is that it doesn’t have a component that separates
analytical and transactional processing.

(ii) Two-tier data warehouse architecture


The two-tier architecture (Figure 2) includes a staging area for all data sources, before the
data warehouse layer. By adding a staging area between the sources and the storage
repository, you ensure all data loaded into the warehouse is cleansed and in the appropriate
format
(iii) Three-tier data warehouse architecture
The three-tier approach (Figure 3) is the most widely used architecture for data warehouse
systems. Essentially, it consists of three tiers:

1) The bottom tier is the database of the warehouse, where the cleansed and
transformed data is loaded. Data Warehouse Architecture
2) The middle tier is the application layer giving an abstracted view of the
database. It arranges the data to make it more suitable for analysis. This is
done with an OLAP server, implemented using the ROLAP or MOLAP model.
3) The top-tier is where the user accesses and interacts with the data. It
represents the front-end client layer. You can use reporting tools, query,
analysis or data mining tools.
Cloud-based Data Warehouse Architecture
Cloud-based data warehouse architecture is relatively new when compared to
legacy options. This data warehouse architecture means that the actual data warehouses
are accessed through the cloud. There are several cloud based data warehouses options,
each of which has different architectures for the same benefits of integrating, analyzing,
and acting on data from different sources. The difference between a cloud-based data
warehouse approach compared to that of a traditional approach include:

• Up-front costs: The different components required for traditional, on premises data
warehouses mandate pricey up-front expenses. Since the components of cloud
architecture are accessed through the cloud, these expenses don’t apply.

• Ongoing costs: While businesses with on-prem data warehouses must deal with upgrade
and maintenance costs, the cloud offers a low, pay-as-you-go model.

Speed: Cloud-based data warehouse architecture is substantially speedier than on-


premises options, partly due to the use of ELT — which is an uncommon process for on-
premises counterparts.
Flexibility: Cloud data warehouses are designed to account for the variety of formats and
structures found in big data. Traditional relational options are designed simply to integrate
similarly structured data.

Scale: The elastic resources of the cloud make it ideal for the scale required of big datasets.
Additionally, cloud-based data warehousing options can also scale down as needed, which
is difficult to do with other approaches.

Cloud-based platforms make it possible to create, share, and store massive data sets with
ease, paving the way for more efficient and effective data access and analysis. Cloud
systems are built for sustainable business growth, with many modern Software-as-a
Service (SaaS) providers separating data storage from computing to improve scalability
when querying data

Components of Data Warehouse Architecture


A data warehouse architecture consists of several key components that work together to
store, manage and analyze data.

• External Sources: Where data originates. Includes: Structured (databases,


spreadsheets), Semi-structured (XML, JSON) & Unstructured (emails, images)

• Staging Area: A temporary space where raw data is cleaned and validated before
moving to the warehouse. ETL tools manage this process: Extract (E) - Pulls raw data
from sources, Transform (T) - Standardizes and formats the data & Load (L) - Moves
the data into the data warehouse

• Data Warehouse: A central storage for organized, cleansed data, including both raw
data and metadata. Supports analysis, reporting and decision-making.

• Data Marts: Smaller, focused sections of the data warehouse for specific teams
(e.g., sales, marketing), enabling quick access to relevant data.

• Data Mining: Analyzing large datasets in the warehouse to find patterns, trends and
insights that support decisions and improve operations.

Top-Down Approach
The Top-Down Approach, introduced by Bill Inmon, is a method for designing data
warehouses that starts by building a centralized, company-wide data warehouse. This
central repository acts as the single source of truth for managing and analyzing data across
the organization. It ensures data consistency and provides a strong foundation for decision-
making.

Working of Top-Down Approach


• Central Data Warehouse: The process begins with creating a comprehensive data
warehouse where data from various sources is collected, integrated and stored.
This involves the ETL (Extract, Transform, Load) process to clean and transform the
data.

• Specialized Data Marts: Once the central warehouse is established, smaller,


department-specific data marts (e.g., for finance or marketing) are built. These data
marts pull information from the main data warehouse, ensuring consistency across
departments.

Advantages of Top-Down Approach

• Consistent View: Data marts built from a central warehouse ensure uniform data
across departments, reducing reporting discrepancies.

• High Data Consistency: Standardizing data through one source minimizes errors
and improves the reliability of insights.

• Simplified Maintenance: Updates in the central warehouse automatically reflect in


all data marts, saving time and effort.

• Scalable Architecture: New data marts can be added easily as business needs
grow or change.

• Stronger Governance: Centralized control improves data security, access


management and compliance.

• Less Data Duplication: Data is stored once in the warehouse, saving space and
avoiding redundant or conflicting records.

• Better Reporting: A unified data source enables faster, more accurate reporting and
decision-making.
• Improved Integration: Central sourcing makes it easier to combine data from
multiple systems for deeper analysis.

Disadvantages of Top-Down Approach

• High Cost & Time: Building a central data warehouse and data marts requires major
investment and long implementation time, making it hard for smaller organizations.

• Complex Setup: Designing and managing a centralized system is technically


complex and requires skilled resources and careful planning.

• Low Flexibility: Predefined structures make it hard to adapt quickly to changing


business needs or reporting requirements.

• Limited User Input: IT-led development can exclude business users, resulting in
solutions that may not meet their actual needs.

• Data Delays: Pulling data from various systems can cause processing delays,
affecting real-time reporting and insights.

• Unclear Data Ownership: Centralization can blur responsibility, making it unclear


who manages or maintains specific data.

Bottom-Up Approach
The Bottom-Up Approach, popularized by Ralph Kimball, takes a more flexible and
incremental path to designing data warehouses. Instead of starting with a central data
warehouse, it begins by building small, department-specific data marts that cater to the
immediate needs of individual teams, such as sales or finance. These data marts are later
integrated to form a larger, unified data warehouse.

Working of Bottom-Up Approach

• Department-Specific Data Marts: The process starts with creating data marts for
individual departments or specific business functions. These data marts are
designed to meet immediate data analysis and reporting needs, allowing
departments to gain quick insights.

• Integration into a Data Warehouse: Over time, these data marts are connected
and consolidated to create a unified data warehouse. The integration ensures
consistency and provides a comprehensive view of the organization’s data.
Advantages of Bottom-Up Approach

• Faster Reporting: Data marts allow quick insights and report generation.

• Step-by-Step Development: Enables gradual rollout with quick wins.

• User-Centric: Involves business users to meet actual needs.

• Highly Flexible: Easily customized for departments or evolving needs.

• Quick Results: Early setup gives immediate value.

Disadvantages of Bottom-Up Approach

• Inconsistent Views: Different structures can lead to inconsistent reporting.

• Data Silos: Independent marts may cause duplication and isolation.

• Integration Difficulty: Combining varied marts into one warehouse is hard.

• Redundant Efforts: Similar marts may be built by different teams.

• Harder to Manage: Multiple marts increase maintenance overhead. to Manage –


Multiple marts increase maintenance overhead.

Extraction, Transformation, and Loading Tools (ETL)


ETL tools are central components of enterprise data warehouse architecture. These tools
help extract data from different sources, transform it into a suitable arrangement, and load
it into a data warehouse.
The ETL tool you choose will determine:

• The time expended in data extraction

Approaches to extracting data

Kind of transformations applied and the simplicity to do so

• Business rule definition for data validation and cleansing to improve end product analytics
Filling mislaid data

Outlining information distribution from the fundamental depository to your BI applications

ETL (Extract, Transform, Load) is a key process in data warehousing that prepares data for
analysis. It involves:

• Extracting data from multiple sources

• Transforming it into a consistent format

• Loading it into a central data warehouse or data lake

Note: ETL helps businesses unify and clean data, making it reliable and ready for analysis. It
improves data quality, security and accessibility, enabling better insights and faster
decision-making in a world of diverse data sources.

ETL Process
The ETL process, which stands for Extract, Transform and Load, is a critical methodology
used to prepare data for storage, analysis and reporting in a data warehouse. It involves
three distinct stages that help to streamline raw data from multiple sources into a
clean, structured and usable form.
1. Extraction
The Extract phase is the first step in the ETL process, where raw data is collected from
various data sources. These sources can be diverse, ranging from structured sources
like databases (SQL, NoSQL), to semi-structured data like JSON, XML or unstructured
data such as emails or flat files. The main goal of extraction is to gather data without altering
its format, enabling it to be further processed in the next stage.

Types of data sources can include:

• Structured: SQL databases, ERPs, CRMs

• Semi-structured: JSON, XML

• Unstructured: Emails, web pages, flat files

2. Transformation
The Transform phase is where the magic happens. Data extracted in the previous phase is
often raw and inconsistent. During transformation, the data is cleaned, aggregated and
formatted according to business rules. This is a crucial step because it ensures that the
data meets the quality standards required for accurate analysis.

Common transformations include:

• Data Filtering: Removing irrelevant or incorrect data.

• Data Sorting: Organizing data into a required order for easier analysis.

• Data Aggregating: Summarizing data to provide meaningful insights (e.g., averaging


sales data).

Note: The transformation stage can also involve more complex operations such as currency
conversions, text normalization or applying domain-specific rules to ensure the data aligns
with organizational needs.

3. Loading
Once data has been cleaned and transformed, it is ready for the final step: Loading. This
phase involves transferring the transformed data into a data warehouse, data lake or
another target system for storage. Depending on the use case, there are two types of
loading methods:

• Full Load: All data is loaded into the target system, often used during the initial
population of the warehouse.

• Incremental Load: Only new or updated data is loaded, making this method more
efficient for ongoing data updates.

Pipelining in ETL Process


Pipelining in the ETL process involves processing data in overlapping stages to enhance
efficiency. Instead of completing each step sequentially, data is extracted, transformed and
loaded concurrently. As soon as data is extracted, it is transformed and while transformed
data is being loaded into the warehouse, new data can continue being extracted and
processed.

• This parallel execution reduces downtime, speeds up the overall process and
improves system resource utilization, making the ETL pipeline faster and more
scalable.

• In short, the ETL process involves extracting raw data from various sources,
transforming it into a clean format and loading it into a target system for analysis.

Importance of ETL
• Data Integration: ETL combines data from various sources,
including structured and unstructured formats, ensuring seamless integration for a
unified view.

• Data Quality: By transforming raw data, ETL cleanses and standardizes it, improving
data accuracy and consistency for more reliable insights.

• Essential for Data Warehousing: ETL prepares data for storage in data warehouses,
making it accessible for analysis and reporting by aligning it with the target system's
requirements.

• Enhanced Decision-Making: ETL helps businesses derive actionable insights,


enabling better forecasting, resource allocation and strategic planning.

• Operational Efficiency: Automating the data pipeline through ETL speeds up data
processing, allowing organizations to make real-time decisions based on the most
current data.
Challenges in ETL Process
The ETL process, while essential for data integration, comes with its own set of challenges
that can hinder efficiency and accuracy. These challenges, if not addressed properly, can
impact the overall performance and reliability of data systems.

• Data Quality Issues: Inconsistent, incomplete or duplicate data from multiple


sources can impact transformation and loading, leading to inaccurate insights.

• Performance Bottlenecks: Large datasets can slow down or cause ETL processes
to fail, particularly during complex transformations like cleansing and aggregation.

• Scalability Issues: Legacy ETL systems may struggle to scale with growing data
volumes, diverse sources and more complex transformations.

Solutions to Overcome ETL Challenges

• Data Quality Management: Use data validation and cleansing tools, along with
automated checks, to ensure accurate and relevant data during the ETL process.

• Optimization Techniques: Overcome performance bottlenecks by parallelizing


tasks, using batch processing and leveraging cloud solutions for better processing
power and storage.

• Scalable ETL Systems: Modern cloud-based ETL tools (e.g., Google BigQuery,
Amazon Redshift) offer scalability, automation and efficient handling of growing
data volumes.

ETL Tools and Technologies


ETL (Extract, Transform, Load) tools play a vital role in automating the process of data
integration, making it easier for businesses to manage and analyze large datasets. These
tools simplify the movement, transformation and storage of data from multiple sources to a
centralized location like a data warehouse, ensuring high-quality, actionable insights. Some
of the widely used ETL tools include:

• Apache Nifi: Open-source tool for real-time data flow management and automation
across systems.

• Talend: Open-source ETL tool supporting batch and real-time data processing for
large-scale integration.

• Microsoft SSIS: Commercial ETL tool integrated with SQL Server, known for
performance and scalability in data integration.

• Hevo: Modern data pipeline platform automating ETL and real-time data replication
for cloud data warehouses.
• Oracle Warehouse Builder: Commercial ETL tool for managing large-scale data
warehouses with transformation, cleansing and integration features.

Open-Source vs. Commercial ETL Tools


1. Open-Source ETL Tools:

• These tools, like Talend Open Studio and Apache Nifi, are free to use and modify.

• They offer flexibility and are often ideal for smaller businesses or those with in-
house technical expertise.

• However, open-source tools may lack the advanced support and certain features of
commercial tools, requiring more effort to maintain and scale.

2. Commercial ETL Tools:

• Tools like Microsoft SSIS, Hevo and Oracle Warehouse Builder are feature-rich, offer
better customer support and come with more robust security and compliance
features.

• These tools are generally easier to use and scale, making them suitable for larger
organizations that require high performance, reliability and advanced
functionalities. However, they come with licensing costs.

Choosing the Right ETL Tool for Your Data Warehouse


• Data Volume: Large enterprises dealing with massive datasets may prefer
commercial tools like Microsoft SSIS or Oracle Warehouse Builder for their
scalability and performance.

• Real-Time Processing: For real-time data integration and AI applications, tools like
Hevo or Talend are ideal, as they support both batch and streaming data processing.

• Budget: Smaller businesses or startups may benefit from open-source tools like
Apache Nifi or Talend Open Studio, as they provide robust features without the hefty
price tag of commercial tools.

• Ease of Use: If ease of use and a user-friendly interface are important, commercial
tools often provide more intuitive visual design and drag-and-drop interfaces.

Data Modeling in Power BI


Data modeling is the process of identifying, organizing and defining the types of data a
business collects and the relationships between them. It uses diagrams, symbols and
textual definitions to visually represent how data is captured, stored and used. A well-
designed data model helps:
• Understand data requirements

• Ensure proper structure for reporting

• Align with business goals

• Maintain data integrity

Power BI Workspace is a folder where all your workbooks, datasets, reports and
dashboards are stored. This is the very initial look of the workspace once the user opens
up the Microsoft Power BI desktop.

The user can see different panes in the Power BI dashboards with Report, Visualizations
and Data panes. Report pane show reports based on the data fields selected from the
Data pane in the right hand side. Visualizations pane helps to select the type of chart
used for showing the report.
The following image shows one simple example of Data Pane section in the right hand
side, once the data source from excel file is loaded in the Power BI desktop.

Power BI provides facility to extract data from one or many data sources. It also helps in
grouping and filtering data for detailed analysis. Data can be imported from cloud based
online sources and files in your system. The different type of data are excel, text/csv,
XML, JSON, oracle database, Azure SQL databases and many more. The following image
shows the list displayed when the user wants to select his data source type for
extraction.
As shown in the above image the data can be extracted from one or many data sources
like CSV files, excel sheets, datasets, databases or cloud online sources. The user can
select one of the data source as per the need and availability.
Once the data source like Excel in this case is selected by the user the navigator helps
to transform the data sheet and load it for further process as shown below.

Understanding Relationships in Data Modeling

Relationships are the main feature of data modelling defining all data types.
Relationships helps connect with multiple data sources using Cardinality. The following
image shows that we have different panes for Data, Properties, Visualizations and
Reports.

Data visualizations on multiple data source by analyzing data and defining


relationships between them.
You can also define data type, explain the usage of data, manage data, define data
needs, remove redundant data ( data integrity ), retrieve data and perform data
analytics, track key performance indicators and provide useful solutions for business.

1. Data view in Power BI: Let us understand the data view of the Power BI desktop. The
excel sheet file selected by the user is opened in data view which looks like the following
image.
Relationship between data attributes with in your data model helps in creating insights
for story telling needs. Using relationships we can access data from other data sources.

2. Model View in Power BI: It is the Data pane and its attributes to be taken into
consideration for Report generation. The following image shows the Model view of any
dataset selected by the user.
3. Report View: Report view with filter options for detailed reporting with other features
like drill down, cross reports and others. The Report can have have many parameters
which the user can choose for the analysis. The choice is made out of the requirement
analysis or user's need.
• Filter options: Other options are provided for the detailed reporting. The following
image shows the "Filters" pane of the Power BI desktop.

• Sorting: The search and sort filters are set by the user as per the need of reporting in
a particular order.
• Power BI autodetect feature: After loading all the tables still no relationship can be
seen between tables then the Autodetect tool can be used to detect any
relationship or connection.

• Modelling Feature in Power BI: There is also an option of creating and modelling
relationships between table manually.

Create relationships in Power BI

Managing and editing table relationships refers to the process of defining and
maintaining the connections between tables in a relational database. Table
relationships are important for maintaining data integrity and ensuring efficient data
retrieval. The below image shows a example showing the editing of relationship between
tables and its support for cardinality.

1. Many to one: This means that the column in a given table can have more than one
instance of a value while the other table will only have one instance of the value.

2. One to one: This means that the column in one table has only one instance of value
and the other table also has one instance of value.
3. One to many: In one-to-many relationship, the column in the given table has one
instance of value while as the other related table can have more than one instance of
value.

4. Many to many: This type of relationship is when a column in both tables has
duplicate values.
Cross filter direction: Power BI also supports directionality. This option determines the
direction of cross-filtering to be utilized for a two-column relation. DAX is a formula and
query language that is designed to work with tabular data models and is primarily used
to simplify data analysis and calculation tasks in Power BI.

More Feature of Power BI

Query Editor in Power BI is used to edit or format the data files before they are
loaded into the Power BI Model. The Query Editor plays the role of an intermediate data
container where you can modify data type or the way the data is stored by selecting the
particular rows and columns.

You might also like