Sanath
ETL Developer/Informatica
SUMMARY:
Solutions-oriented Data Engineer with 8+ years of experience specializing in ETL design and
development using tools like Informatica (PowerCenter, IICS), SSIS, Talend, and Apache Airflow.
Hands-on experience with Informatica Data Quality (IDQ), Informatica Intelligent Cloud Services (IICS),
and Snowflake on AWS for implementing end-to-end cloud-based data quality and warehousing
solutions.
Proficient in SQL and NoSQL databases, including DB2, SQL Server, Snowflake, and MongoDB. Skilled in
SQL, PL/SQL, and basic Python scripting for data validation, automation, and troubleshooting.
Experienced in scripting languages (Python, Bash, PySpark), cloud data platforms (AWS Glue, S3), and
big data technologies (Spark, Hadoop, Kafka).
Skilled in designing and optimizing real-time and batch ETL pipelines, emphasizing data integrity,
quality, and governance.
Proficient in ETL Testing, Data Validation, API Testing (Postman, SOAP UI), and experienced with HP
ALM (QC) and other test management tools.
Strong skills in data profiling, data cleansing, data standardization, and exception handling using
Informatica IDQ and ETL Testing best practices.
Experienced in migrating data across AWS S3, Azure Blob Storage, Oracle, SQL Server, DB2, and
Teradata environments.
Knowledgeable in Dimensional Modeling (Star Schema, Snowflake Schema), Slowly Changing
Dimensions (SCD Types 1, 2, 3), and Change Data Capture (CDC) methodologies.
Skilled in ETL job orchestration and monitoring using Control-M, Crontab, and UNIX Shell Scripting for
automation.
Well-versed with Agile, Scrum, and Waterfall methodologies, delivering data integration projects on
time with high quality.
Skilled in UNIX Shell scripting for workflow automation and ETL process control.
Led offshore/onshore teams, delivering projects in Agile, Scrum, and Waterfall environments.
Proficient in ETL testing, including data validation, integrity checks, functional testing, regression
testing, and UAT; strong experience with HP ALM (QC) and API testing (Postman, SOAP UI).
Excellent understanding of SDLC, STLC, and the Bug Life Cycle, with strong problem-solving abilities
and adaptability to dynamic environments.
Strong communication skills, capable of effectively presenting data-driven insights and project updates
to both technical and non-technical stakeholders.
TECHNICAL SKILLS:
ETL Tools: Informatica PowerCenter 10.2/10.5/9.6/9.1, Informatica PowerExchange 10.2/9.6/9.1, Informatica Data Quality (IDQ) 12.2, Informatica Intelligent Cloud Services (IICS/IDMC), Talend, SSIS, AWS Glue, Azure Data Factory
Database: Oracle 12c/11g/10g, MS SQL Server 2016/2014, PostgreSQL, Teradata 15/14/13, DB2-UDB, Netezza 7.1, MongoDB
Cloud: AWS S3, Redshift, Teradata, Hadoop Hive, Azure Blob Storage, Snowflake (on AWS)
BI/Reporting Tools: Tableau 10.1, Power BI, OBIEE 11g/10g
Data Modeling Tools: ER/Studio 15.0, Erwin 9.2, PowerDesigner 15.0
Languages: SQL, PL/SQL, T-SQL, UNIX Shell Scripts, Java, PySpark, Bash, Python
Operating Systems: UNIX/LINUX, Windows, MS Windows Server, MS-DOS
Other Applications: Toad, SQL Developer, SSMS, VersionOne, TFS, Control-M, Autosys, JIRA, QC, SnapLogic
Protocol: ODBC, JDBC, OLE DB, TCP/IP
Server: LINUX, HP AIX
Methodologies and Data Visualization: Agile (Scrum), Waterfall model, Power BI, Tableau
Oracle BI Tools: OBIEE, OBIA, DAC, Oracle BI Server
WORK EXPERIENCE:
Fidelity, Dallas, TX (Senior Informatica Developer) May 2023-Present
Responsibilities:
Led the implementation of cloud-based data quality frameworks using Informatica IDQ on AWS,
ensuring robust validation, standardization, and cleansing processes for Snowflake data warehouses.
Ensured data governance and integrity by implementing robust validation, cleansing, and
transformation logic using Python and PySpark.
Developed real-time data integration solutions using Informatica Cloud and integrated Kafka-based
streaming pipelines with Spark Streaming.
Implemented and monitored real-time and batch data quality processes on Informatica Cloud (IICS),
enabling proactive issue detection.
Developed ETL pipelines using Informatica Cloud (IDMC/IICS), Azure Data Factory (ADF), and SSIS to
ingest, cleanse, and validate data from Oracle, SQL Server, Salesforce, and Teradata into Snowflake and
Azure Blob Storage.
Implemented parameterized Talend jobs for dynamic data loading based on environment
configurations (Dev, QA, Prod).
Collaborated with QA and DevOps teams to automate Talend job deployments using CI/CD tools,
reducing deployment time by 40%.
Conducted Data Validation and API testing using Postman and SOAP UI for end-to-end data quality
checks.
Validated and optimized Spark streaming pipelines for data loads between Azure Blob Storage and
Teradata environments.
Integrated on-premise databases with Salesforce Cloud using Informatica Cloud Services, managing
deployment and synchronization.
Led performance tuning initiatives for OBIEE dashboards, Oracle BI Server queries, and backend SQL
operations to enhance reporting KPIs.
Conducted ETL validation and functional testing using Informatica DVO, advanced SQL queries, and
API testing tools (Postman, SOAP UI).
Developed automation scripts for dynamic SQL generation, flat file creation, data extraction, and post-
load validation.
Environment: Informatica PowerCenter 10.2.1, PowerExchange 10.2.1, Informatica Customer 360 (MDM),
Informatica BES API Services, SSIS, Talend, IDMC/IICS, OBIEE, Informatica Data Quality 12.2 (IDQ),
Oracle 12c, PostgreSQL, Tableau, SQL Developer, Azure, TOAD, CDQ, DB2 Mainframe, CDC, Autosys,
Snowflake, AWS, Visio, MS SQL Server 2012, Windows 7, JIRA, Harvest
Delta Air Lines, Atlanta (Sr. Informatica Developer) Feb 2021-April 2023
Responsibilities:
Developed and managed ETL pipelines using Informatica PowerCenter, Informatica Cloud
(IICS/IDMC), and SSIS, integrating data from diverse sources into enterprise data warehouses.
Designed and built staging, intermediate, and fact models using DBT and BigQuery, applying schema
design, data cleansing, and transformation logic based on business requirements.
Analyzed tester notes, identified root causes of data issues, implemented fixes, and managed defect
backlogs.
Worked directly with Informatica Global Support to troubleshoot and resolve technical issues related
to the MDM platform.
Applied knowledge of IBM DataStage job orchestration principles (server jobs, parallel jobs,
sequencers) while optimizing ETL frameworks across cloud and on-premises systems.
Automated batch workflows and ETL job orchestration using Control-M, optimizing scheduling,
dependency management, error handling, and notification strategies to meet SLAs.
Implemented metadata-driven ETL designs to enable dynamic job configurations and minimize manual
interventions across multiple pipelines.
Built incremental data loading frameworks using Change Data Capture (CDC) techniques, ensuring
efficient handling of large datasets.
Developed complex SQL queries, aggregation logic, and performance-optimized transformations for
data integration and reporting needs.
Used Apache Airflow to structure, schedule, and monitor ETL workflows, enhancing pipeline
automation and reliability.
Integrated Informatica ETL processes with Google BigQuery and Azure Blob Storage to support
scalable data lake and warehouse solutions.
Worked with Apache Spark and Scala to enhance data processing performance for large-scale
transformation requirements.
Used Kafka and Cloud Storage solutions for large-scale data movement between on-premise and cloud
environments.
Created dynamic ETL scripts using Python, UNIX Shell Scripting, and Maven for automating ETL builds
and deployments.
Participated in the operationalization of data pipelines in GCP, leveraging BigQuery Executor and
Dataflow jobs for real-time and batch data loads.
Worked closely with business and technical teams to translate requirements into scalable, efficient ETL
designs using Informatica, SSIS, and Cloud-based solutions.
Implemented ETL testing strategies for data validation, reconciliation, and performance testing across
Informatica and cloud pipelines.
Delivered end-to-end ETL development, scheduling, data quality validation, and cloud integration
using Informatica, SSIS, Control-M, Airflow, and BigQuery in Agile/Scrum environments.
Environment: Informatica Intelligent Cloud Services (IICS), Informatica PowerCenter 10.2, SSIS, Informatica
Data Quality 10.1, Informatica Customer 360 (MDM), Oracle 12c, DB2, Teradata 15, AWS, RDBMS, SnapLogic,
Snowflake 3.27, Erwin 9.2, PL/SQL, Putty, Shell Scripting, Control-M, WinSCP, Notepad++, JIRA, Python
scripting, AutoSys
Farmers New World Life Insurance (Informatica Developer) June 2019-Jan 2021
Responsibilities:
Reviewed and analyzed business requirements and functional documentation for test scenario
development.
Developed ETL programs using Informatica to implement the business requirements.
Created a Parser to convert unstructured data into XML format and a Mapper to map the XML data
for the Unstructured Data transformation in PowerCenter (PWC) mappings.
Developed the audit activity for all the Informatica IICS/IDMC (cloud) mappings.
Automated and scheduled Intelligent Data Management Cloud (IDMC/IICS) jobs to run daily, with
email notifications for any failures.
Applied strong technical skills in Oracle, SQL, PL/SQL, and MySQL.
Transferred data from different source systems to Hadoop Hive tables.
Used PowerExchange for Hadoop to extract data from HDFS and load data to HDFS/Hive.
Deployed DT services and created a deployment group to deploy code from the Dev to the Test environment.
Created complex mappings in PowerCenter Designer using Aggregator, Expression, Filter, Sequence
Generator, Update Strategy, Union, Lookup, Joiner, Source Qualifier, Unstructured Data,
and DX transformations.
Used various data migration services and the Schema Conversion Tool along with the Matillion ETL tool.
Created several procedures, functions, triggers, and packages in PL/SQL to implement the required
functionality.
Created Partners, Profiles, Accounts, and Endpoints for multiple clients.
Loaded diverse data types (structured, JSON, XML, flat files, etc.) into the Snowflake data warehouse.
Extensively used ETL to load data from flat files, EDI files, staging tables, and normalized tables.
Performed performance tuning at the functional level and the mapping level, using relational SQL
wherever possible to minimize data transfer over the network.
Created logical and physical database models using Erwin Data Modeler based on star and snowflake
schemas.
Improved data quality using the Informatica Data Quality (IDQ) tool.
Created new rules and a mapplet for IDQ mappings.
Integrated Informatica Data Quality (IDQ) with Informatica PowerCenter.
Worked on an Enterprise Data Warehouse with high volumes of data related to Auto Licensing and Tax
collection.
Performed extensive functional and regression testing of the frontend and middleware via SOAP, WSDL,
and business process management through Java Message Service (JMS) inside the web service client.
Tested data quality tickets for history and incremental fixes in the EDW (Enterprise Data Warehouse).
Acted as the technical contact for query and database performance issues, working with developers
and business users on query requests and enhancements.
Validated data using T-SQL on very large databases (VLDB).
Followed Agile development methodology and adhered to strict quality standards in requirement
gathering.
Analyzed defects and reported root causes to the Technology/Developer team.
Retested defects and escalated them to stakeholders based on defect classification.
Tracked and reported testing results using HP Quality Center (ALM 12).
Environment: Informatica PowerCenter 9.6.1, Informatica PowerExchange 9.6, IDMC, IICS, Oracle 11g,
SQL, PL/SQL, DB2 8.0, MS SQL Server 2008, T-SQL, Flat Files, Windows XP, Snowflake, UNIX, Linux, SAP
BODS.
Quess Corp Limited, India ([Link] Developer) June 2016-Aug 2018
Responsibilities:
Designed, developed, and maintained ETL pipelines for efficient data extraction, transformation, and
loading from various structured and unstructured sources.
Worked extensively with Informatica PowerCenter, Apache Airflow, and Azure Data Factory (ADF) to
orchestrate and automate data workflows.
Designed data models and schemas for optimized data storage and retrieval in Snowflake, Azure SQL
Database, and HBase.
Developed and optimized complex SQL queries, stored procedures, and Snowflake UDFs for data
processing and reporting.
Implemented data integration solutions using Azure Data Lake Storage (ADLS), Amazon S3, and
Google Cloud Storage to support enterprise data warehouse initiatives.
Ensured data quality and governance by implementing validation, cleansing, and reconciliation
processes.
Developed ETL solutions in Python and Scala, leveraging Apache Spark and PySpark for large-scale
data transformations.
Optimized ETL processes, reducing data processing time by 20% through performance tuning and
parallel processing.
Collaborated with business stakeholders, data analysts, and data scientists to understand data
requirements and deliver accurate reports and dashboards.
Utilized Git, GitHub, and Bitbucket for version control, ensuring efficient code deployment and
collaboration.
Monitored and troubleshot ETL jobs, reducing failure rates by 30% through proactive issue resolution
and automation.
Ensured compliance with data security and privacy regulations, implementing RBAC and encryption
mechanisms across ETL workflows.
Environment: Informatica PowerCenter, Apache Airflow, Azure Data Factory, Apache Spark, Hadoop,
PySpark, Hive, Sqoop, AWS (S3, RDS, Glue, Lambda, Redshift), Azure (ADLS, ADF, Synapse, Databricks), GCP
(BigQuery, Cloud Storage), Snowflake, Azure SQL Database, MySQL, SQL Server, HBase, SQL, Python, Scala,
Shell Scripting, Git, GitHub, Bitbucket, Jenkins, CSV, JSON, Parquet, Avro, ORC, Linux, Windows.
EDUCATION:
Bachelor’s degree, Vardhaman College of Engineering (2016)