AI Maturity Framework for Enterprises
AI maturity framework for enterprise applications
March 2021
Authors
Rishi Vaish is the Chief Technology Officer (CTO) for IBM AI
Applications, a market leading portfolio of enterprise applications
spanning Asset Management, Facilities Management, Supply
Chain Management, Engineering Lifecycle Management and Weather
Business solutions. He is responsible for driving innovation through
Machine Learning, Data Science, Site Reliability Engineering and Hybrid
Cloud architecture for the portfolio. Through his career Rishi has over
15 years of executive leadership experience in technology, product
strategy, product development, product operations and product
management from startups to large scale organizations across a variety
of industries and technologies. He has deep expertise in AI-based
applications, hybrid cloud technologies, cloud computing, software-as-
a-service and application middleware. He is passionate about driving
innovation, modernization and transforming product and technology
organizations to meet scale and growth demands.
vaish@[Link] | [Link]/in/rishivaish/
Ashish Agrawal leads the Strategy and Portfolio Planning team for
IBM AI Applications. He is responsible for collaborating with business
and technology leaders to establish the long-term growth strategy for
the business unit and is closely involved in executing that
strategy. He works closely with the CTO office as well as the product
teams in identifying new technologies like Artificial Intelligence (AI)
that should be leveraged to innovate the product portfolio. His 20+
years of Strategy and Operations experience at IBM, BCG, Deloitte,
and Shell spans across multiple industries like High Tech, Energy, Oil &
Gas, Retail, Healthcare and Life Sciences with a focus on helping clients
grow and overcome their complex business challenges.
aagrawa@[Link] | [Link]/in/ashish-agrawal-972283
Applications differ in their maturity curve and we observed that the inclusion of the above parameters impacted the application’s success. For example, an application that is low in improving the quality of the input data and has no bias detection would give lower value to end clients. Where you run your AI in enterprise applications adds another dimension to the maturity model. The next generation of enterprise applications will be based on a hybrid cloud software model. This means that the application containers could be running in a public cloud or in some private cloud behind a firewall. There might be a need to monitor the performance of AI models and this feature would need to be factored in the AI design itself.

In this paper, we share the IBM AI Maturity Framework for Enterprise Applications.

– Silver
– Gold
– Platinum

For a given AI capability, the ranking by the seven dimensions is also plotted on a six radii radar plot for easier visualization. For the radar plot, the “impact on your business” dimension (dimension A) is pulled out of the radar since the other six dimensions are specific to the AI capability vs. the business area. As a sample, one of the AI capabilities in our IBM AI Applications is shown in Figure 2 below. As you can see, this capability is on its way from gold to platinum.
C. Technology sophistication
D. Trustworthiness
E. Ease of use
F. AI operating model
G. Data
Dimensions: Criteria
Impact on our Business: Business Impact, Portfolio Impact
Value to Client: Business Process Outcome, Differentiators
Technology Sophistication: Appropriateness of the technology to the business problem, Learning Techniques, Reuse of Models, Use of Inner Source or Open Source
Trustworthiness: Integrity, Quality, Bias (Fairness), Explainability, Security
Ease of Use: Intuitive for use by the intended user
AI Operating Model: Deployment (Manual, Automated), Update Frequency, Infrastructure/Architecture Scale, Monitoring
Data: Data Acquisition & Instrumentation, Data Management
[Figure: AI maturity radar plot for Offering A. Axes shown include Data, Technological Sophistication, and Ease of Use; the rings mark the phases 1. Silver, 2. Gold, 3. Platinum.]
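The radar plot described above can be approximated in a few lines of code. The following is a minimal sketch rather than IBM's own tooling: it assumes each of the six plotted dimensions receives a numeric score from 1 (silver) to 3 (platinum), and it computes the vertex coordinates of the radar polygon. The names Phase, RADAR_DIMENSIONS, radar_vertices, and the sample scores are hypothetical and chosen only for illustration.

```python
import math
from enum import IntEnum


class Phase(IntEnum):
    """The three maturity phases, numbered as in the framework."""
    SILVER = 1
    GOLD = 2
    PLATINUM = 3


# The six dimensions on the radar; dimension A (impact on the business)
# is reported separately, as the text explains.
RADAR_DIMENSIONS = [
    "Value to client",
    "Technology sophistication",
    "Trustworthiness",
    "Ease of use",
    "AI operating model",
    "Data",
]


def radar_vertices(scores):
    """Return one (x, y) point per dimension for a six-radius radar plot."""
    missing = [d for d in RADAR_DIMENSIONS if d not in scores]
    if missing:
        raise ValueError(f"missing scores for: {missing}")
    points = []
    for i, dim in enumerate(RADAR_DIMENSIONS):
        # Spread the six radii evenly around the circle.
        angle = 2 * math.pi * i / len(RADAR_DIMENSIONS)
        radius = float(scores[dim])
        points.append((radius * math.cos(angle), radius * math.sin(angle)))
    return points


# Hypothetical scores for a capability moving from gold toward platinum.
sample_scores = {
    "Value to client": Phase.PLATINUM,
    "Technology sophistication": Phase.GOLD,
    "Trustworthiness": Phase.GOLD,
    "Ease of use": Phase.PLATINUM,
    "AI operating model": Phase.GOLD,
    "Data": Phase.PLATINUM,
}
vertices = radar_vertices(sample_scores)
```

The resulting points can be passed to any plotting library as a closed polygon; the scoring scale and the ordering of the axes are assumptions of this sketch.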
Maturity framework:
What the phases mean
1. Silver: AI capability that scores at this level will include factors that have been more recently introduced into the product to become AI ready. This is the first stage where you discover what AI is, how it impacts business, the tools and technologies required to implement AI, how to prepare data for usage in AI, etc. This is a capability level that enhances the experience for users but is not mission-critical to the business outcome that enterprise users are seeking.

2. Gold: AI capability that scores at this level delivers a meaningful business outcome to the users. It will deliver a competitive edge for the AI offering or application in the market. The capability provides recommendations based on optimization or offline training on data and provides basic explanations for why a particular decision or recommendation was made. The AI features are usable by line-of-business users without having to involve data scientists. In this capability level, a good data hygiene and good automation of the engineering processes producing the capability is demonstrated.

3. Platinum: AI capability that scores at this level is a sustainable differentiator. It is part of a mission-critical workflow for the enterprise users and they rely on it for automated decision making and only focus on exceptions. The AI capability is sophisticated and has mechanisms to adapt to incoming data and learn by feedback provided by end-users. The decisions made by the AI are clearly explained and understood by business users. These users can adjust business level dials and levers to tune the outcome they desire. There are very strong and automated data management and data governance measures in place.

Not every AI capability needs to graduate from one phase to the next; organizations will need to assess the cost and benefit of investing in the capability based on users’ business outcomes and the cost to graduate them from silver to gold to platinum.
Maturity framework:
Detailed criteria
A. Impact on business
– Increase in customer satisfaction measured by NPS scores

B. Value to client

B1. Business process outcome
Description: Defined and documented business outcome delivered by the AI capability. This is the measurable “business” outcome that would not have been delivered if AI was not infused. Every AI project must have clear metrics (short term, long term) by which to judge whether the project was a success (Metrics examples: NPS, Wins)
Silver: Business value or ROI for the AI project is unclear OR still being developed
Gold: Business value/ROI is documented but the metrics do not always tie-back to overall BU KPIs
Platinum: Business value/ROI is clearly documented, and metrics are tied to overall BU KPIs. Both short term and long-term metrics are documented and tracked
C. Technology sophistication
C1. Appropriateness of technology to the business problem
Description: Are the tools and technologies used appropriate based on the business challenge to be solved (e.g., ML, NLP, NLU, RPU, ChatBot)?
Silver: Simple analytics with predictive capability generated from ML
Gold: Prescriptive analytics, making recommendations based on optimization or rules
Platinum: Adaptive Learning and decision-making: Learning and/or decision-making processes that dynamically learn through feedback and adapt their strategies to new conditions

C3. Re-use of Data Science models
Description: How easy is it to re-use the DS model to allow for scaling and to gain operational efficiencies
Silver: Models are packaged into containers that must be manually modified and deployed for each new use case
Gold: Models are deployed as stand-alone, RESTful services that applications can easily call
Platinum: Models deployed as services, composed from re-usable building blocks that can be re-arranged to form different workflows for different use cases

C4. Use of Inner Source or Open Source
Description: How effectively does the technology make use of Inner Source and Open Source code to gain development efficiencies, build communities, mitigate ethical/risk concerns and encourage reproducible experiments?
Silver: Inner/Open Source are used for some components of the AI capability. Common examples would be data validation and transformation, feature engineering and visualization and reporting
Gold: Most AI capabilities were built on Inner/Open Source frameworks that have been packaged and shipped with the capability. Common examples would include model training, serving, monitoring and explainability. Experiments are internally reproducible and open to communities within IBM
Platinum: All AI capabilities were built on and shipped with Inner/Open Source frameworks. Inner/Open Source libraries are regularly updated to the latest versions. Key functionalities and tests of the developed AI have been contributed back to the Inner/Open Source project
D. Trustworthiness

D1. Integrity
Description: How do we ensure data integrity throughout its lifecycle? Data Provenance and Data Lineage are known and documented. Understanding the history and origins of a data set as well as what happened to data after it was collected and prior to its use
Silver: Data dictionary and lineage are known at time of ingest
Gold: Data definitions and lineage for multiple projects are documented and used consistently. Developing a canonical data model
Platinum: Data lineage is documented, tracked across transformations and published through the application

D2. Quality
Description: How do we ensure data quality throughout its lifecycle? Data quality is measured to understand common issues and information content, what kinds of corrections are made and how improvements are measured.
Silver: Basic type validation and density-based outlier detection are reported
Gold: More advanced outlier detection, normalization, interpolation and standardized quality reporting
Platinum: Automated detection and correction for a wide range of possible issues; Cross table and multivariate validations.

D3. Bias (Fairness)
Note: For complete list on measuring Bias in an application/enterprise, please see the link given in the conclusion section.
Description: How do we reject bias towards groups, sets of individuals, or data attributes? Ability to prove that the outcomes are fair and not skewed either due to the model or the data (like incomplete, limited/insufficient, missing, corrupt, biased, ambiguous)
Silver: Basic bias reporting relying on manual intervention for remediation
Gold: Bias assessment and remediation are done for key features of data.
Platinum: Bias assessment and remediation are done as a standard practice with proactive improvements in the approach and implementation of the techniques.

D5. Security
Description: How can we shield AI and AI infused services against cyber threats or adversarial attacks? Applications and algorithms are resistant to attacks from either data manipulation or direct security flaws
Silver: Risk and security management are minimal and reactive, mainly relying on key actors.
Gold: Policies, processes, and standards are defined and institutionalized for security and risk management at a consistent level across the enterprise and partners.
Platinum: Risk and security management are comprehensive across the enterprise and among partners and customers, allowing for continuous feedback and improvement.
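The silver-level checks named in the trustworthiness criteria above (basic type validation, density-based outlier detection, and basic bias reporting) can be illustrated with a small sketch. This is an assumption-laden example, not the method used in any IBM product: the function names, the neighbor-count definition of density, and the use of a favorable-outcome rate ratio as the bias measure are all choices made here for illustration, since the framework does not prescribe specific metrics.

```python
def type_violations(rows, schema):
    """Return (row_index, column) pairs whose value is missing or mistyped.

    rows is a list of dicts; schema maps column name to expected type.
    """
    bad = []
    for i, row in enumerate(rows):
        for column, expected in schema.items():
            if not isinstance(row.get(column), expected):
                bad.append((i, column))
    return bad


def density_outliers(values, eps, min_neighbors):
    """Flag indices that have fewer than min_neighbors other values within eps.

    A simple one-dimensional notion of density: points in sparse regions
    are reported as outliers.
    """
    flagged = []
    for i, v in enumerate(values):
        neighbors = sum(
            1 for j, w in enumerate(values) if j != i and abs(w - v) <= eps
        )
        if neighbors < min_neighbors:
            flagged.append(i)
    return flagged


def favorable_rate_ratio(outcomes, groups, group_a, group_b):
    """Ratio of favorable-outcome rates, group_a over group_b.

    outcomes holds 1 (favorable) or 0; a ratio near 1.0 suggests parity
    between the two groups on this one measure.
    """
    def rate(group):
        selected = [o for o, g in zip(outcomes, groups) if g == group]
        if not selected:
            raise ValueError(f"no records for group {group!r}")
        return sum(selected) / len(selected)

    rate_b = rate(group_b)
    if rate_b == 0:
        raise ValueError("reference group has no favorable outcomes")
    return rate(group_a) / rate_b


# Hypothetical sample data.
amounts = [10.0, 10.5, 9.8, 10.2, 55.0]
outlier_indices = density_outliers(amounts, eps=1.0, min_neighbors=2)
```

A report built from these three functions would still depend on manual remediation, which is what distinguishes the silver level from gold and platinum in the criteria above.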
E. Ease of use

E1. Intuitive for use by the intended user
Description: How well can the AI capability be used by the intended end-user? Does it require experts (like Data Scientists) to be able to use and interpret the outcomes?
Silver: Basic AI features or capabilities that increase usability and deliver value and insights to end users. AI provides little or no Explainability, which can lead to low confidence levels. Users may have to rely on external assistance to interpret outcomes, provide additional insights and/or create custom solutions.
Gold: More mature AI capabilities that provide significant value and insights to users and help them accomplish their goals more effectively. AI clearly conveys its value and reasoning, enabling users to build confidence in the insights delivered. AI delivers insights that increase user effectiveness, efficiency, and satisfaction.
Platinum: Advanced AI capabilities that are tightly integrated into the experience and leverage Watson Moments that not only significantly improve the ease of use and efficiency of the product, but also enable users to be more productive and make smarter decisions faster. Users have a high degree of confidence in the AI and the unique insights delivered through the offering.
F. AI operating model
F1. Deployment (manual, automated)
Description: Uses tools that automate model building, data cleaning, and other key processes. Use of CI/CD pipeline
Silver: Manual build, integration, containerization and testing tools and processes are used for deployment
Gold: Use of continuous integration tools to build and test artifacts and containers for deployment with widely available status reporting
Platinum: Deployment is automated using CI/CD and operational process is structured and enables dark launch in production/test environments

F2. Update frequency
Description: How frequently can models be updated or retrained? Is the process automated?
Silver: No established plan for ongoing training of the AI model
Gold: AI model is trained on a regular basis based on human understanding of model quality
Platinum: AI model update is automatic and is based on the automated assessment of model and data quality

F4. Monitoring
Description: Analyzing and evaluating the ongoing efficacy of the model (e.g. model drift analysis)
Silver: Basic logging of predictive accuracy and results
Gold: Reporting of how the model accuracy changes over time and the typical lifecycle
Platinum: Alerting and automatic triggering of remodeling based upon continuous monitoring
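The progression in the monitoring criterion, from logging accuracy to reporting its change over time to triggering a remodel automatically, can be sketched as follows. This is an illustrative example only: the AccuracyMonitor class, its rolling-window design, and the baseline and tolerance parameters are assumptions made here, not a description of how any IBM application monitors its models.

```python
from collections import deque


class AccuracyMonitor:
    """Track rolling prediction accuracy and signal when retraining is due."""

    def __init__(self, baseline, window=50, tolerance=0.05):
        self.baseline = baseline      # accuracy measured at deployment time
        self.tolerance = tolerance    # allowed drop before an alert fires
        self.recent = deque(maxlen=window)  # keeps only the latest results

    def record(self, prediction, actual):
        """Log whether one prediction matched the observed outcome."""
        self.recent.append(prediction == actual)

    def rolling_accuracy(self):
        """Accuracy over the current window, or None if nothing is logged."""
        if not self.recent:
            return None
        return sum(self.recent) / len(self.recent)

    def needs_retraining(self):
        """True once a full window falls below baseline minus tolerance."""
        if len(self.recent) < self.recent.maxlen:
            return False  # not enough evidence yet
        return self.rolling_accuracy() < self.baseline - self.tolerance


# Hypothetical usage with a small window.
monitor = AccuracyMonitor(baseline=0.9, window=4, tolerance=0.05)
for predicted, observed in [(1, 1), (0, 1), (1, 0), (1, 1)]:
    monitor.record(predicted, observed)
retrain = monitor.needs_retraining()
```

In this sketch, record corresponds to the silver level, rolling_accuracy to the gold level, and needs_retraining to the alerting half of the platinum level; wiring that signal to an actual retraining job is left out.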
G. Data

G1. Data acquisition & instrumentation
Description: How easy is it to acquire data and get it into the system for analysis?
Silver: Data acquisition and preparation are individuals' responsibility with no offering-wide guidelines.
Gold: Data acquisition procedures and platforms are in place; acquisition and preparation are documented for all data.
Platinum: Data acquisition and data preparation are provided as a service that supports real-time provisioning of all needed resources including data sets, expertise, and tools.

G2. Data management
Description: How robust is governance, provenance, and standardization of data throughout the lifecycle?
Silver: Rigid undefined processes with potentially unpredictable outcomes.
Gold: Agile processes are well defined, standardized and accepted to support early and continuous delivery of data to data scientists/engineers.
Platinum: Agile processes are continuously improved by quantitative feedback based on IT and business metrics to support early and continuous delivery of data to data scientists/engineers.
Illustrative example
In thinking about how to apply a maturity model, examples are often very useful. AI Applications uses anomaly detection in a number of different applications in its portfolio. Here we describe a fictitious ecommerce application called Anomaly Finder as an example. We assume that the application was recently launched and has had limited marketing activity. At a high level, this application actively looks at a stream of key performance indicators (KPIs) for an ecommerce business over time, finds any anomalies relative to the history of the KPI, alerts the user to the anomaly, and then suggests a potential cause of the issue and a potential action to take if one exists for this type of issue. In this section, we will walk through how we might have evaluated this application in terms of its AI maturity.

The impact on the business criteria is meant to be an evaluation of an application’s impact on both IBM revenue and on the brand of the product portfolio. Anomaly Finder was scored as silver for both business impact and portfolio impact. Since it was just launched, it does not yet contribute a significant amount of revenue to the portfolio, and while it has been mentioned by analysts in a couple of reports, it was not listed as a significant factor in the analyst’s rating for the portfolio. It has also only had a limited impact in pulling through sales or increasing interest in other parts of the product portfolio. Overall, its impact on business area has not been significant enough thus far to rate it above silver.

The value to client criteria tries to measure how the application has created business value in terms of the client’s defined business objectives for their AI journey for that specific business process. It also tries to estimate how distinct the value of the application for improving that business process is relative to competitive solutions from the client’s perspective. One of the key components of Anomaly Finder is Risk Avoidance Reporting. This module directly estimates the benefit of following the recommended actions relative to what the outcome likely would have been had no action been taken. The KPIs used are directly tied to the metrics that ecommerce companies use to evaluate their business performance. In its first few implementations, the solution has shown significant benefits to clients based on real-time tracking of these KPIs. Therefore, it is rated platinum for business process outcome. There are several features that clients and analysts have told us are novel in the market, but we believe that those moats could be challenged in the next few years. We rated Anomaly Finder gold in terms of its differentiators.

Technological sophistication tries to understand the maturity of the AI in the application based on the relevance of the techniques used relative to the business problem being solved, the sophistication of the learning methods used, and the ability to easily re-use or extend those methods used to enable continuous improvements in the AI in future releases. The appropriateness of technology criteria tries to understand the level of reasoning that was required for the application to effectively help the human in solving the business problem. A silver level provides the human with insights that they would not otherwise have had but relies on the person to make any decisions. Gold describes methods that enable the machine to make decisions based on learned insights and input from the human as to the goals and constraints of the business problem. Platinum applications apply learning to the decision process itself, modifying their reasoning in solving the problem based on learnings from previous actions they have taken. Anomaly Finder uses a proprietary set of algorithms to understand the behavior of other metrics at the time of the anomaly and uses its understanding of those behaviors to suggest potential causes of the anomaly and to recommend potential actions based on previous scenarios. It was rated gold for appropriateness of technology.

Learning techniques is an evaluation of the sophistication of the machine learning capabilities embedded in the application. Silver
Conclusion
As you apply AI to enterprise applications, establishing goals around the maturity of the AI is critical to ensuring that you deliver value to your
enterprise clients, use the appropriate level of technological sophistication, ensure that the AI is trustworthy and easy to use by your targeted
line-of-business users, and that you have an AI operating model in place to manage the AI you’ve deployed in the field along with strong
data management and data governance practices. We hope that this maturity framework will help you measure and progress your enterprise
applications on their AI journey.
Below you will find links to the enterprise-grade data and AI management stack:
[Link]
[Link]
[Link]
[Link]
[Link]
[Link]
[Link]
[Link]
[Link]
Below you can learn more about IBM’s Enterprise Applications infused with AI:
Supply Chain Management Sterling IBM Sterling Fulfillment Optimizer with Watson
Weather Solutions Weather Business Solutions Vegetation Management and Outage Prediction
All client examples cited or described are presented as illustration of the manner
in which some clients have used IBM products and the results they may have
achieved. Actual environmental costs and performance characteristics will vary
depending on individual client configurations and conditions. Contact IBM to see
what we can do for you.