Data Migration
Data Migration
Data Conversion
And Migration
Legacy System Data Conversion and Migration
Proprietary Notice
All trademarks, services marks, trade names, logos, icons and other intellectual property rights (the
―Intellectual Property‖) are proprietary to Key Management Group, Inc (KMG) or used by KMG as permitted
through contractual arrangements with respective proprietors. The content of this testing process shall at all
times continue to be the property of KMG/ proprietors and will be strictly protected by copyright and other
applicable laws. Any usage, reproduction, retransmission, distribution, dissemination, sale, publication or
circulation of the said content without the express prior written consent of KMG is strictly prohibited. Nothing
in this document shall be construed as granting by implication or otherwise a license or a right to use such
Intellectual Property, or indulge in reproduction or decompiling or sublicensing such Intellectual Property.
The users/clients expressly recognizes, admits and confirms that all the IPR rights in the development of this
testing process is the sole property of KMG and the user /client shall not claim or in any way assist, aid,
permit or abet others in claiming any right title or interest there in or thereto. If required by KMG the
user/client shall cooperate in having KMG register the IPR in this documentation, if deemed necessary.
© Unless otherwise acknowledged, the entire IPR (Intellectual Property Rights) in this testing process and
contents thereof absolutely vests in KMG. All rights reserved.
2
Legacy System Data Conversion and Migration
Table of Contents
1 Introduction .................................................................................................................. 4
2 About KMG .................................................................................................................... 5
3 Overview ...................................................................................................................... 6
3.1 Conversion and Migration Process ................................................................................... 6
3.2 Guiding Principles/Assumptions ..................................................................................... 7
4 Planning ....................................................................................................................... 8
4.1 Identify Migration Requirement ..................................................................................... 9
4.2 Identify Team and Stakeholders.................................................................................... 10
4.3 Identify System Environment ....................................................................................... 12
4.4 Create Project Schedule ............................................................................................. 13
4.5 Configuration Management Plan ................................................................................... 14
5 Analysis ...................................................................................................................... 15
5.1 Source and Target Data Profiling .................................................................................. 16
5.2 Data Mapping .......................................................................................................... 17
5.3 Data Cleansing ......................................................................................................... 18
5.4 Design Migration Architecture ...................................................................................... 19
5.5 Build Converted Data Test Plan .................................................................................... 20
5.5.1 Error Handling and Auditing Requirements................................................................ 21
6 Convert ...................................................................................................................... 22
6.1 Design & Develop Migration Tool ................................................................................... 22
6.2 Pre-test and Recalibration of Tool ................................................................................. 23
6.3 Configure Staging Area ............................................................................................... 24
6.4 Execute Data Conversion ............................................................................................ 25
6.5 Validate by Test Plans ............................................................................................... 26
7 Migrate ...................................................................................................................... 27
7.1 Develop Migration Statistics ........................................................................................ 27
7.2 DATA LOAD in Target System ....................................................................................... 27
Deployment phase ......................................................................................................... 28
7.3 Validation - Target Application ..................................................................................... 28
7.4 Target System Implementation ..................................................................................... 28
8 Risks and Mitigation ....................................................................................................... 29
3
Legacy System Data Conversion and Migration
1 Introduction
From time to time various Business organizations implement new Software Application System to replace the
functionality currently delivered by one or more legacy systems. Complications arise when there is an attempt
to take the information currently maintained by the legacy system and transform it to fit into the new system.
More often, the data structure of the legacy systems is different from the new application being
implemented, and that difference is not just limited to the table names, field names or attributes or sizes.
The types of databases are different and diverse, or the entity relationships definitions in the new system are
not compatible with the older legacy application. To the business organizations all the data being held in the
legacy system remains critical for their business functions and decision making.
To bring the legacy system data to the new application some Data Conversion must take place, where an
initiative, separate or concurrent with the implementation of the new application, is undertaken to convert
data from one structural form, used by the legacy application to the structural from required by the newer
application .
Often in a Data Conversion process, one would tend to think that any two similar systems that maintain the
same sort of data, as they are doing very similar functions should map from one to another without much
trouble. But that is not really the case as -
- In Legacy systems, historically, data integrity checks were not strictly enforced, leaving orphan data
- Theoretical design differences exist between hierarchical and relational systems.
- Legacy data may require some data cleansing.
Therefore it is important to have a sound, methodological approach by which organizations can undertake
Data Conversion projects, which will help to confront unpleasant surprises on later stages and resolve those
issues fast and effectively.
Key Management Group. Inc. (KMG) often helps various business organizations to implement the new
Software Application, especially in the area of Property and Casualty (P&C) Insurance and Enterprise
Resource Planning (ERP) package software. To help our business clients, KMG designed and developed a
methodology and adopted that for such Data Conversion projects.
This document attempts to document that methodology, which it does by stating various processes to
be performed at various stages of Conversion and Migration of Legacy System Data, and the Roles and
Responsibilities of various stakeholders and participants in the projects.
Though over the years, this document was reviewed and refined and republished by KMG, based on the
experience of many such projects, it is important to note that the methods and processes described
within this document is generic in nature. Each project presents its own challenges and opportunities and
KMG personnel engaged in the respective project design and develop, add or modify steps within the
broad framework of this methodology, for the benefit of that specific project.
4
Legacy System Data Conversion and Migration
Business
Analysis
2 About KMG
KMG was established in US in 1990 and is among Top 100 outsourcing companies in US. It is listed in the top
100 outsourcing companies in the world, top 10 fastest growing Indian-owned companies in the US &
among top 50 software companies in India. It has a Dun & Bradstreet rating of ―Good- 2A1‖.
Legacy
KMG provides software development and maintenance
Web-enabling solutions mainly for Insurance domain. The main emphasis
is on the P&C Insurance sector in the US. Almost 75% of its
revenue comes from maintenance of legacy applications.
New Application It is also quite capable for developing applications from
Development Support scratch using the latest state-of-art technologies &
architectures (Including SOA). Once again, most of the
development work is centered around the Insurance
Services vertical.
KMG has a large pool of Business Analysts with exposure to
BPO Testing all lines & facets of the Insurance industry. This group
helps us retain the knowledge & bridge the gaps between
the end-users & the development teams.
KMG has its headquarters in NY with 4 Offshore Development Centers in India (Bangalore, Delhi,
Chandigarh and Kolkata) and 3 sales-cum-development centers in US (New York, San Diego, & Chicago).
The offshore entity was established in 2000 and has grown at a very high rate over the last eight years. At
this time, KMG has around 100 professionals in the US who are supported by another 400 in India. KMG has
resource expertise that covers [Link] technologies, J2EE, Mainframe, IBM iSeries (AS/400) and
Software Testing.
KMG is building a 600-seat state-of-art development center near Chandigarh (Mohali). This center will also
house a large training center for providing training to in-house / external resources on legacy systems.
The center will be fully operational by end of 2010.
5
Legacy System Data Conversion and Migration
3 Overview
3.1 Conversion and Migration Process
Data Conversion project activities starts with Planning, leading to Analysis and Design, progressing to
Conversion of Data, finishing in Migration - where converted data loaded in the target system database.
Following figure provides an overview of all the processes involved in legacy data conversion process
Figure 1
All the tasks to be conducted at various stages of a Legacy Data Conversion projects and the guiding principles for
execution of those along with roles and responsibilities are described in the subsequent chapters.
6
Legacy System Data Conversion and Migration
The process of Automated Data Conversion could be through a pre-packaged Tool procured from one of the
software vendors in the market or could be developed by KMG, based on the specific requirement of the
Project. Benefits and costs of the both approach will be reviewed.
The Extraction and Conversion Process will be tested prior to the production conversion as per KMG
Methodology, described later in this document. This is to forecast precise conversion timeframes and to
validate/test programs/tools configuration.
This methodology is designed for Single tier and Single time Conversion and migration. Single-tier migration is
a migration in which all source data is migrated onto one or more devices within a single tier—that is, primary
storage.
This methodology does not define process for Continuous and policy-driven migration which, unlike all other
data migrations, is a continuous process (requiring the installation of an archival application) that uses a set
of user-defined policies to determine, in real time, when and where data needs to be moved.
7
Legacy System Data Conversion and Migration
4 Planning
To achieve the goal of a successful Legacy data migration through conversion, it is imperative that a lot
of upfront planning happens prior to that move of data, irrespective of the complexity of the task.
In this stage KMG, in conjunction with its client, prepares a plan, which is also intended to shorten
the duration of the Conversion/migration process and also reduces business impact and risk.
The Migration Plan, which is the end result of the planning process by KMG, defines
- Requirements i.e. what data is moved, where it is moved, how it is moved, when it is moved,
and approximately how long the move will take
- Team i.e. the Users, Business Analysts, System/Data Analysts, Testers, Target Application Users
- Environment under which the Legacy (Source) and the Target Application system operates, What will
be the Data Conversion Stage Area
- Schedules - and task distribution, staffing levels
- Configuration Management Plan – to maintain control over changes needed due to various reasons
The other objectives of this planning process by KMG is to design and document resolutions for events like
- Application downtime,
- Performance degradation,
- Technical incompatibilities, and
- Data corruption/loss.
KMG would initiate such project by deploying a Project Manager and an experienced Analyst to
gather information and develop the Migration Plan
8
Legacy System Data Conversion and Migration
KMG’s Project Manager and the Analysts, gather information about the proposed conversion/migration task
and define the Objectives and the Requirements for the project.
Following information is collected as part of Requirement definition -
a. Whether the project is part of the broader project of implementation of the target application or
independent project?
b. What data is to be converted and moved – like the nature and usage of the data – more likely by its
current Application / Function system and its environment?
c. How many source systems are involved?
d. What is the size of the legacy system data?
e. Where it is moved to – i.e. the target Application System?
f. Which are the more critical Data elements with respect to the Target Application System?
g. Whether any transformation or elimination will be part of this project?
h. How it is moved – if the client has already identified any tool and/or designed, developed any conversion
application system?
i. When it is to be moved and how long the conversion/move could take?
j. Whether the objective is to convert all the Legacy application data at once or the process will be
staggered phases?
k. Are there any production applications that may conflict with the migration?
l. Can the data in legacy system be migrated into the target system without data cleansing?
m. What is the level of data cleansing required to maximize the benefit of conversion and migration?
n. Whether Historical Data to converted or not – as it could be used by the Target system in future?
Historical data is any transactional data that has completed its business cycle and is generally needed for
inquiry purposes (statistical analysis, trending, comparison generation and legal implications). Historical
data is typically not converted from legacy systems.
o. What are the major complications anticipated in the conversion process?
There could be, and more likely would be, few more questions based on the response to some of the questions
listed above. It is also possible that all the questions listed above may not be answered in specific details (by
the Client representatives) and some response could be generic. However those answers could still be useful
to provide idea about the intended plan of the Client.
Responses to the above queries would be analyzed by KMG’s team and then the Team would define the
Requirements for the project and have that validated by the Client Representatives.
9
Legacy System Data Conversion and Migration
a. Client Project Owner of the proposed Conversion/Migration project, who would have the absolute power
to resolve all issues and take necessary decisions
b. Client Project Coordinator/ Manager – to assist KMG in all phases of the project and expected to be well
versed with either or both Application Systems - Legacy and the Target Application system
Then the KMG Project Manager, in consultation with Client Project Owner and Client Project Coordinator /
Manager, would identify various key personnel, who will be required to be part of the team for the project of
Conversion and Migration of the Legacy System Data.
The table below defines the roles and responsibilities of various individuals – participating in Legacy System
Data Conversion project from the Client side.
Roles Responsibilities
Business Analysts - Explains the Business Processes/Functions of the Legacy System and Target
System
- Identify Data elements / Groups related to various Business Process and
Functions
- Help to prepare / validate the Data Profile created by KMG’s Business /
Data Analysts
- Validate Data Mapping document – developed by KMG’s Business Analysts
and Technical Leads for the Conversion and related reporting.
- Analyze impacts of the Data Conversion processes and related actions
(including Freezing of operation) on the Business
10
Legacy System Data Conversion and Migration
Roles Responsibilities
Database - If RDBMS being used in either or both of the Legacy and Target system, DDL
Administrator could be generated to identify Table and filed properties
- Create and Maintain Test Database for the Target System and Staging
Environment for the actual conversion
-
System / Network - Provide help in understanding the Legacy System Environment and Target
Administrator System Environment
- Help to setup up Staging Area (Servers/ network)
- Connectivity between Legacy System and Staging Area
- Execute the process of DATA LOAD in target environment
Project Coordinator/ - Manages the Data Conversion Processes and repo/rting
Manager - Review and if required modify, develop the Migration Plan
- Coordinate and Provide necessary resources – as per the Migration Plan
- Assign the right Technical resources
In KMG - all projects are overseen by a Project Owner (Normally a AVP/VP of the company). Often in KMG
project the PM and the Technical Lead have interchangeable skills and act as backups for each other.
The backup developers continuously shadow the main team members. They are as good as any other
developer in the team and can take over from anyone at a day’s notice. These people are used in cases of
attrition as well handling spikes in the load (if any). These people also make sure that the work is not
disrupted if any member goes on vacation. A fresh person is added to the team as backup the day any
backup is absorbed into the team
11
Legacy System Data Conversion and Migration
KMG’s Technical Team Leads would gather information about the environment in which the Legacy
system operates
Same details would be gathered for the Target System. Such information helps to determine whether
the Legacy System data need to be converted to another format (like EBCIDIC to ASCII) which increases
the complexity.
Apart from the ones mentioned above technical information is gathered with respect to the Network in
which the Client organization hosts the Legacy System and Target System and operates those and other
Hardware, servers used by the organization.
a. How old is the operating system(s) under which data is to be migrated? Some migration tools do
not support legacy operating systems.
b. What staging area requirements are present, given current technologies and data migration requirements?
c. Whether Client would need or want the option to recover quickly from the source disk, or to fall back
to the original storage device as a fail-over? This is to determine/design both procedural and
technological ways to accomplish that.
d. Is a central console needed to manage data migrations across multiple servers?
e. Is there a need to control the data migration from a local server – or a remote server? If remote,
which protocols must be supported?
f. Is there a requirement to throttle or control data flows between servers?
g. Which storage tiers are involved?
h. Whether any change in the Target System model is likely or not? Should that is possibility – that should
be taken into account to analyze the consequences of the change in the eventual conversion process.
Based on the information gathered in response to the queries, listed above, KMG will enhance the
Requirement definition. All these information will greatly influence in determining the Conversion
Framework, method and ETL tools to be used or not.
12
Legacy System Data Conversion and Migration
An especially tricky part of doing any planning is that the target database is usually a ―moving target‖ – it is
being developed at the same time as the cleansing and conversion software, and is always subject to change.
If the new software is not a custom application, but a commercially available package, then it makes the
schedule creation process little easier even though that possibility still remains as most of the package
system often heavily customized.
KMG will create a schedule – using a project plan and pert chart, showing tasks involved, dependencies
between the tasks, and the staff required to perform each task. The schedule will include time to be spent
on task of familiarizing with the legacy systems’ operations and its data elements and MIS staff with the data
cleansing effort, and the tasks required from them.
KMG Project manager develops the schedules in consultation with the Client and the Project team
member, taking their experience and issues into consideration.
Project schedules would be revised based on the actual time taken to resolve complex issues and
procurement of necessary hardware, software resources and team resources.
13
Legacy System Data Conversion and Migration
managed. KMG would include a detailed procedure for controlling changes to the
A Change Control Board (CCB) could be constituted, consisting of KMG Project manager and Client
Project Owner, to review any proposed database changes and its impact. This would be critical to ensure
communication between the members of the project teams – whether software development team, Data
Analysts/ SMEs.
KMG will review and recommend, if required, Business freeze in multiple areas – which could be a critical
and required component of such Legacy Data conversion effort. Such requirement would be explained and
reviewed with business data owners, management and auditors.
Business freeze requirements should be addressed in detail through a separate document and circulated
well before the timelines - established for the cutover plan.
14
Legacy System Data Conversion and Migration
5 Analysis
In this methodology, the first stage of data conversion and migration is Data Classification, by creating
Data Profile for the data elements used in Legacy System(s) and Target Application.
Figure 2
During the Analysis phase of the Project, KMG produces various documents which serves as the
specification guideline for the Execution (Convert and Migrate) phases of the project.
15
Legacy System Data Conversion and Migration
The analysis is undertaken by the KMG Data Analysts/SMEs with the help of Client Business Analysts /
Technical Leads, and documented in Data Profile repository. This repository, which integrates source and
target data in mutual format, provides a profile of the various entities - visibility into that data’s usage,
capacity, and growth patterns. Various interfaces and reports utilizing the to-be-migrated data are also
considered.
KMG’s Data profiling process consist of three sequential steps with each step building on the information
produced in the previous steps. Data sources are profiled in three dimensions: down columns (column
profiling) ; across rows (dependency profiling); and across tables (redundancy profiling).
Column Profiling. Column profiling analyzes the values in each column or field of source data, inferring
detailed characteristics for each column, including data type and size, range of values, frequency and
distribution of values, cardinality and null and uniqueness characteristics. This step allows analysts to detect
and analyze data content quality problems and evaluate discrepancies between the inferred, true meta data
and the documented meta data.
Dependency Profiling. Dependency profiling analyzes data across rows comparing values in every column with
values in every other column and infers all dependency relationships that exist between attributes within
each table. Dependency profiling identifies primary keys and whether or not expected dependencies (e.g.,
those imposed by a new application) are supported by the data. It also identifies "gray-area dependencies"
those that are true most of the time, but not all of the time, and are usually an indication of a data quality
problem.
Redundancy Profiling. Redundancy profiling compares data between tables of the same or different data
sources, determining which columns contain overlapping or identical sets of values. It looks for repeating
patterns among an organization's "islands of information". Redundancy profiling identifies attributes containing
the same information but with different names (synonyms) and attributes that have the same name but
different business meaning (homonyms). It also helps determine which columns are redundant and can be
eliminated and which are necessary to connect information between tables. Redundancy profiling eliminates
processing overhead and reduces the probability of error in the target database.
KMG believes that developing an accurate profile of existing data sources is the essential first step in any
successful data migration project. The most significant problem associated with this phase could be if there
are frequent changes to the Target Application System Data model. Any change in the target system model
would have to be taken into account to analyze the consequences of the change in the eventual conversion
process. This renders the whole process to be iterative until a point wherein there is a freeze on the Target
System Data structure model or a complete understanding of the legacy system has been reached.
16
Legacy System Data Conversion and Migration
KMG’s objective of this process is to produce a comprehensive mapping between Legacy and Target System.
Every data field that is going to be migrated from the source system to the target system must be defined and
examined to ensure compliance with field lengths, data types, domain values permitted, system rules,
integrity checks and any other possible issues.
a. Un-mapped Target System Data Entity – this are the cases where data requirement is not satisfied by
Legacy/Source system. Specification could be developed for Deriving/defaulting data values for such
cases – specifically when that data is mandatory field in Target System.
b. Un Mapped Legacy System Data Entity - for which no Target Data element exists in the target system.
This will lead to data loss, which might NOT be desirable in all cases.
Data mapping process aims at identification and resolution of such gaps and such issues can be
addressed in different ways.
- Data Map (Transformation specification): At the end of the data mapping process, a detailed document
would be in place that would show the target field identified for each legacy field. Apart from identifying
the target fields to which the legacy fields are mapped, mapping specifications define the rules to be
applied in the conversion process. These rules are commonly known as Transformation rules.
- Referential Rules for Integrity check and domain values permitted,
- Validation rules – Application specific Business rules
Data mapping is an iterative process. For any change in the design of the target system or change in rule for
setting a value of a particular field there is a need to amend the mapping specification reflecting the changes
in Transformation rules.
The resulting Data Map (Transformation specification) document would be used later
a. In conjunction with third-party data migration tools to extract, scrub, transform and load the data from
the old system to the new system or
b. Develop a customized Application system to convert the Legacy / Source system data into Target System
data model. This will provide essential information to the programmers creating conversion routines to
move data from the source to the target database.
17
Legacy System Data Conversion and Migration
KMG believes that Data Cleansing is critical to the success of any Data Conversion and Migration project. If not
undertaken then business processes will not operate as designed. Data cleansing always takes more time and
more resources than anyone anticipates. It is for this reason that data cleansing efforts will be launched as
early as possible to make subsequent phase of the project easier and avoid delays.
a. Cleansing at the source: This involves the cleansing directly in the production of the existing legacy
system or systems. The main advantage of this approach is the exponential reduction in complications. It
makes conversion process very simple.
b. Cleansing through external means: This type of cleansing is generally accomplished by spreadsheets. A
report is sent to the persons responsible for data cleansing with data that is required to be cleansed. Care
should be taken that the spreadsheet contains enough data to make the businessperson understand what
they have to cleanse. It should also cater to the technical requirements so that it can be incorporated in
the data conversion process with ease. Spreadsheet design is of utmost importance as it can be
problematic, if at a later stage, a deficiency were to be identified with the basic design of the
spreadsheet which results in an inability to accommodate it in the conversion process.
KMG, suggests that data can be cleansed in the source system or in a staging area. Due to audit requirements
and for ease of cleansing, it is recommended that all data cleansing be performed in Legacy Systems unless
not viable.
KMG’s approach for data cleansing includes working with Client Integration/Functional teams to define /
execute automated data cleansing – based on the findings of the analysis phase. Multiple cycles of cleansing
could take place. Extracts are made periodically for validation of Cleansing activities and can be handed off
to the Technical Development team for sample loads into Staging area.
Deliverables/Outputs
a. Modified source data that increases the success of automated data conversion
b. Control metrics
c. Data Cleansing Requirement Specification (/ Recommendation)
18
Legacy System Data Conversion and Migration
Figure 3
The generic framework for a Data Conversion process consists of following steps
a. Data Extraction – Read and Gather data from source data store(s) into another storage, and if required
converted to the data format of the Target System (e.g. EBCIDIC to ASCII) and loaded in Staging Area
b. Validation and Cleansing – to confirm content and structure of extracted data in light of business rules
and fulfills integration rules based on the referential rules of Target System. Data Cleansing is performed
at this time based on requirements identified during Analysis phase.
c. Transformation - convert the extracted data from its previous form into the target form. Transformation
occurs by using Transformation Rules defined in Data Map (Transformation specification) and lookup
tables.
d. Validation -Target System - confirm content and structure of transformed data is valid for target.
e. DATA LOAD - Write the data into the target database, either through script or copying data using system
utilities.
Steps (a), (b), (c) and (d), shown in figure 3.0 above, are part of the ―Convert‖ phase of the Data Conversion
and Migration methodology. Step (d) i.e. DATA LOAD is part of the ―Migrate‖ phase of the project.
KMG makes necessary changes, depending on the specific requirement of each project, to the tasks as defined
in the various steps of the framework and create the executable process for Conversion of the Legacy Data
19
Legacy System Data Conversion and Migration
At this stage, KMG, in consultation with Client IT team, analyze and determines whether
- To use a data transformation tool to manage their data Conversion/migration effort? or
- Design and Develop an application system – specifically designed to fulfill that specific conversion
requirement based on the Data Map (Transformation Specification) by a group of technical experts?
KMG also determines, based on some of the information and Client requirement – whether to execute the
Conversion process in all at once or move data over through a controlled phase of multiple releases? KMG
analyze pros and cons to both options, considering which approach will best fit for the project based on
organization needs - to be evaluated on a variety of factors, like how much data involved and requirement of
the Target System.
Testing process in Data Conversion and Migration project could be categorized into two :
- Physical errors are the result of syntactical errors of the scripts / programs, which can be easily
identified and resolved.
- Logical errors – these are identified and resolved during Test phase. Such errors are result of the quality
of the mapping effort. During Implementation/Testing scripts/programs developed based on the Data Map
i.e. Transformation specification, are executed.
Based on the on the Data Map (Transformation Rules) KMG, creates Test Plans – in which KMG identifies Legacy
System data element and determines the Target system element and the expected results based on set of extract to
be used for testing. This test plan is to be prepared for each of the data element being converted.
Response to the following queries would be gathered and verified by Testing team
a. How many records were expected to be created by the scripts being tested?
b. Did the correct number of records get created? If not, why?
c. Has the data been loaded into the correct fields?
d. Is the data load complete – or are certain fields missing?
e. Has the data been formatted correctly?
f. Are any post-migration clean-up tasks in order?
The goal of a successful data migration is to keep the length of the deploy phase(s) to a minimum.
During Pilot /Testing, KMG would determine the quality of data mapping, by providing the populated target
data structures to the users that assisted in the analysis and design of the conversion scripts/application
system.
That would help Client Integration/Functional team to understand the data and would allow the user to
physically interact with the new, populated data structures of the Target System.
20
Legacy System Data Conversion and Migration
Data Extract process and Transformation (Conversion) process would produce reports from respective
processing and those could be used to log and validate the results:
- Record counts from the legacy input data, records with errors, records without errors, and total records
- Record counts for the converted output data
- Counts for the anticipated number of transactions
- Date and time of the start and end of the run
- Any codes that were not found in the crosswalk table, and the record in which they were encountered
- Any fields missing or in error and not found in the conversion table
- Summarized dollar values comparable with legacy system subtotals/totals.
This will help maintain the Conversion tables, enable Client functional teams to verify that the data loaded
match the data extracted. This will allow the KMG’s project Team to estimate the execution time required for
future conversion runs and provide an audit tracking mechanism.
21
Legacy System Data Conversion and Migration
6 Convert
6.1 Design & Develop Migration Tool
The data conversion process can be accomplished by the following methods:
Using a data conversion tool.
Scripts developed specifically for the purpose of conversion in the project.
Manual data conversion and migration.
The choice of a right conversion tool for a given project is always debatable but the following significant
factors form the basis of a rational decision
1. Cost involved in procuring a tool and having trained personnel to run the tool. Is this cost less than the
cost of employing developers to script the conversion process?
2. Does the tool require any customization? If yes, then the cost and time scales of such a customization
effort should be ascertained.
3. The volume of data to be converted and migrated. If the volume of data is much less, manual data
conversion and migration is the best option.
4. Whether the Legacy System data and Target environment could be hosted on same environment as
difference in operating system / data format would involve manual extraction and conversion to target
environment
A data transformation tool offers a central repository to support the storage and ongoing maintenance of data
mappings, lending itself to teams of all sizes. The most commonly used alternative tends to be spreadsheets,
which could fail when one needs to map one source data element to one or more target elements.
Spreadsheets typically prevent more than one person from making modifications at one time, resulting in a
great deal of unnecessary administrative overhead.
Flexible Reporting - Reporting is easily accomplished, and serves as a comprehensive communication device
between the systems and business analysts.
Code Generation from Mapping Rules – Technical Leads/Analysts can generate the migration scripts directly
from the mappings they have defined, reducing the overall programming requirements.
Script Scheduling Utilities - Some tools provide far more sophisticated options such as script compilation and
scheduling utilities. Scheduling utilities can cut the overall script execution time of a migration project by
more than one third in almost every case, by processing scripts in parallel as opposed to sequentially.
Automatic Integrity Violation Detection - Another important and useful feature of a data transformation tool
is the automatic detection of integrity violations.
Tailored Application (Manual Programming) - The manual approach to data migration is a method – very
useful when there are limited number entities involved and where Client wants to avoid significant
investment in a data migration tool or large amounts of effort by one or more programmers.
22
Legacy System Data Conversion and Migration
Pilot Migration. Complete end-to-end migration in the pilot environment. Coordinate with business users in
doing data validation, verify and evaluate the control mechanism and metrics.
Test Extract and Load. Once the Pilot Migration is completed, test the extract of the migrated data and test
the load into the mock target environment.
Key Activities
Deliverables/Outputs
23
Legacy System Data Conversion and Migration
Figure below illustrates the various operations that are carried out in the staging area.
24
Legacy System Data Conversion and Migration
Technical Mapping Specifications. Technical mapping specifications are translation of mapping specifications
from a technical standpoint prepared by technical team, and these serve as a primary input to code
development. Care should be taken to incorporate each and every mapping rule defined in the mapping
specification.
Overall Conversion Process Design. Though each data conversion process design varies depending on the
system to be migrated the philosophy remains the same, which is explained below.
Legacy System(s) Data Import. Data from the production database of the legacy system is imported to the
staging area.
Data Conversion Process. The data conversion process is executed on the imported legacy system data in the
staging area.
Converted Data. The data conversion process gives rise to the converted data that is in the required target
data structure model.
Final Extract and Load. Load the converted data into the target environment.
Key Activities
Deliverables/Outputs
25
Legacy System Data Conversion and Migration
a. Is the entire set of mapping rules defined in mapping specifications document incorporated?
b. Does the data converted follow the data structure model of the target system?
c. Are all the fields mapped from the legacy system converted in the appropriate way in the
target system?
d. Are all the constraints of the target system fulfilled or is there is any risk of data load rejections?
e. Did the correct number of records get created in the target system?
f. Was the cleansing data incorporated correctly in the conversion process?
g. Do the legacy and target system records reconcile?
26
Legacy System Data Conversion and Migration
7 Migrate
7.1 Develop Migration Statistics
There are specific goals associated with implementing an effective data migration strategy. Primarily, data
must be migrated from the source platform to the target platform completely and accurately, and according
to company and regulatory policies on information controls and security. This means no dropped or
incomplete records, and no data fields that fail validation or other quality controls in the target environment.
Another goal of data migration is that the process be done quickly, with as short a downtime window as
possible. Finally, the cost of data migration must be manageable, in terms of technology and staff
requirements.
There are many metrics that can measure the effectiveness and efficiency of data migrations:
The data load scripts that load data into target system are tested after the loading is done to the target
system.
Delta Data
Another important aspect of a data migration project is accounting for the data input to the production legacy
system in between the last export of legacy system and the data conversion process run. This is commonly
known as delta data. It has to be understood that the window of the conversion process is finite and the
legacy system still operates during this window of conversion process execution. There are various methods to
address this issue, notable of them are
1. If the volume of data input to production legacy system is very low, then a manual data conversion is
a wise choice.
2. If the data volume does not permit for a manual data conversion then the conversion process is re-run
on the delta data in a separate staging area and migrated to production.
27
Legacy System Data Conversion and Migration
It has to be appreciated that even though the delta data is migrated, there still exists another set of delta
data unless the use of the legacy system is frozen in the window wherein the data conversion process is
executed. A data freeze on production legacy system is not always a very good idea in all the areas of
business, as some of them cannot afford a freeze on their systems for a single day. So delta data conversion
can be eliminated but such a decision is dependant on the nature of the business in which it is carried out.
Deployment phase
The first step in the design stage is to put together a project plan and structure. As part of this process there
should be close analysis of any dependencies in data migrations; where possible, such dependencies and
complexities should be reduced to better manage deployment risk. During the deploy phase the following
occurs:
The above may be conducted in phases, or as part of a parallel run or pilot depending upon the
migration approach chosen.
28
Legacy System Data Conversion and Migration
Each and every phase explained in the data conversion and migration process is iterative. No conversion and
migration process is accomplished in a single go. It has to be refined as and when a change is triggered at the
design level, generally for the target systems and occasionally for the legacy system (when a certain aspect of
the legacy system is omitted in the preliminary analysis by oversight).
RISK MITIGATION
Rollback - When importing data into a target system, what happens if the data migration fails? Are we
prepared to either utilize existing transaction rollback functionality or do we have capacity to design and
build our own if none exists? How do we manage the client expectation in such cases? Do we have a mitigation
plan in place? Have we discussed these with the client IS team and business users?
Answers to these questions gives us an additional layer of security and contributes a lot in terms of executing
the project in time, within the budget as well as managing customer expectation all along.
Replication - The issue being: what happens in case of disaster or irrecoverable system failure? Migrating data
to a backup system at the same time as a new target system should be seriously considered to add one more
layer of security and ensure that the disaster recovery plan is in place.
Data Profiling - Gain a complete understanding of the content, structure, quality, and integrity of the data of
the source system.
Data Mapping - Develop an accurate set of data mapping specifications from the source system to the target
system.
29
Legacy System Data Conversion and Migration
Development - Selecting an ETL tool to automate the migration process and make it more scalable should be
a high-priority item.
Quality Assurance - Conduct mock migrations, pilot migrations before the final migration run; this will ensure
that the migration process is robust and trusted.
30
Client-side participants include Business Analysts who explain business processes of both systems, validate data profiles, map data elements, and analyze business impacts of data conversion. Technical Leads assist in exploring both system environments, identify system entities, resolve queries, and make technical decisions related to data exchanges. The Project Coordinator/Manager manages data conversion processes, coordinates resources, and modifies the migration plan if required .
KMG suggests utilizing a comprehensive data cleansing process which involves cleansing data directly within the legacy system or externally through spreadsheets, emphasizing the importance of addressing discrepancies early to prevent propagation of issues. They recommend using automated cleansing and iterative validation checks to manage data discrepancies .
Data cleansing is crucial for ensuring accurate and relevant data conversion. It involves removing errors, duplicates, and ensuring consistency which is critical for smooth operational processes post-migration. KMG highlights two approaches: cleansing at the source for simplicity and cleansing through external means with spreadsheets to handle complex cases. They recommend beginning the cleansing early to avoid project delays .
Technical decisions on data exchange configuration critically impact how data is transformed and loaded. They include resolving technical queries and decisions on data format changes which facilitate smooth data integration. Proper implementation avoids migration failures and ensures efficient data handling in the target system environment .
Technical incompatibilities such as format differences, e.g., EBCDIC to ASCII, can complicate migration. KMG plans to address these through thorough environmental assessment and data mapping exercises to identify and map required transformations. They also implement referential and validation rules during the planning phases to mitigate incompatibility issues .
KMG's framework involves several steps: Data Extraction to gather and move data to a staging area, Validation and Cleansing to verify extracted data against business rules, Transformation to convert data into the target format, and Data Load to transfer data into the target system. Each step is critical to ensure data accuracy and integrity throughout the conversion .
A staging area is vital to isolate the conversion process from production environments. According to KMG, it is set up on a separate server to host data conversions, thus preventing disturbances to the existing data. It supports operations such as validation, cleansing, and transformation before loading data into the target environment .
KMG employs rigorous validation through test plans to check if mapping rules are incorporated, and if data conforms to the target structure. They use automated processes for validation against business rules and perform internal checks to ensure referential integrity, thereby preventing data rejection issues .
KMG considers several criteria, including the nature and function of data, whether data transformation is required to fit the target system's format, and if any data is redundant and can be eliminated. They evaluate transformation needs based on data usage requirements in the target environment and potential conflicts with ongoing applications .
The Project Owner plays a crucial role by holding decision-making power, resolving conflicts, and ensuring project alignment with organizational goals. The Coordinator facilitates communication, assists in project phases, and ensures proper resource allocation, thus ensuring smooth execution of the migration process .









