Jagan Mohan Kanimetta
sravanmaganti2@[Link]
+1 7327310281
[Link]
Summary
15+ years of Information Technology SDLC (Software Development Life Cycle) with a strong
background in Data Platform, Data Migration, ETL Processes and Data warehousing solutions with
4+ years of experience on data engineering cloud platforms.
Extensive experience with different phases of the project lifecycle, including project initiation,
requirement gathering, system design, coding, testing, and debugging client-server-based
applications.
Experience in Azure cloud data platforms to analyze and validation of the data between different
processing zones.
Proficient in designing, implementing, and managing large-scale data infrastructure and ETL
pipelines.
Expertise in SQL, Data Warehousing, Data Lake, and Azure Data Factory, along with a strong
foundation in optimizing and automating data workflows.
Experience in data processing using Azure Databricks using Lakehouse Architecture.
Experience in Azure Databricks, Apache Spark, Clusters, notebook, Workspace, Data loading,
Running Spark Jobs.
Experience in developing applications using PySpark and Spark-SQL for data extraction,
transformation and aggregation of data from multiple file formats.
Demonstrated experience in automated pipelines orchestration using Databricks workflows.
Demonstrated proficiency in Azure Cloud Services, managing critical components such as Azure
SQL, Azure Table Storage and Azure Lake ensuring robust and secure data storage, processing, and
retrieval.
Ability to architect solutions, create Data Platform roadmaps and enable scalability in Azure Data
Lake Azure Databricks and Azure Data Factory.
Experience with Developer tools like code Deploy, Code build, Code pipeline, design the overall
virtual private Cloud VPC environment including server instances, subnets, high availability zones
Proficient in writing complex SQL queries, including creating and managing Databricks Delta
tables, views, indexes, and stored procedures. Skilled in SQL-based analytics to build efficient
data models, processes, and transformation pipelines within Databricks environment.
Good knowledge Spark distributed frame work and performance concepts. Good experience on Code
optimization techniques in Spark, PySpark, Python.
Experience in working on complex data files structured/unstructured file formats such as XML, JSON,
and Text file.
Experience in Airflow for Data pipeline for job scheduling, orchestration & monitoring
Used Python (NumPy, SciPy, pandas, scikit-learn, seaborn) and Spark (PySpark, MLlib) to develop
variety of models and algorithms for analytic purposes.
Experience with different data formats like JSON, Avro, Parquet, ORC formats, and compressions like
Snappy & Bzip.
Experience in Data Integration, Migration and ETL process using Informatica Power Center
10.x/9.x/8.x.
Experienced in PL/SQL, UNIX shell scripting.
Worked with Stored Procedures, Triggers, Cursors, Indexes and Functions
TECHNICAL SKILLS:
Cloud Azure: Azure Data Lake, Data Factory, Databricks, Azure SQL
Platforms AWS: AWS Glue, DMS, IAM, S3, SQS, RDS, EC2 etc
Snowflake
ETL Tools Informatica 10.x/9x/8.x/7.x (Power Center/Power Mart) (Designer, Workflow Manager,
Workflow Monitor) , Informatica Power Exchange, Ab Initio 3.1
Data Erwin 4.0/3.5, Star Schema Modeling, Snowflake Modeling
Modeling
Tools Autosys, CA ESP, Control M, DB2A, DB2I, Endeavor, Perforce, Control-M and QMF.
Databases SQL Server, Azure SQL, Oracle 11g,Teradata 14.0/13/V2R5/V2R6, DB2
Programmin Python, PL/SQL,T-SQL, Unix Shell Scripting, MVS Cobol, JCL
g Languages
Data Azure Databricks, PySpark, Apache Airflow
Processing
Operating UNIX/LINUX, Windows
Systems
EDUCATION:
Master of Technology - 2005, JNTU, Hyderabad, India.
Bachelor of Technology - 2002, JNTU, Hyderabad, India.
TRAININGS & CERTIFICATIONS:
Databricks Certified Data Engineer - Associate
AWS Certified Solutions Architect - Associate
DB2 UDB Certified.
AINS 21 Certified.
PROFESSIONAL EXPERIENCE:
Bank of America, Charlotte/NC
Sep 23 – Till Date
Role : Lead Data Engineer
Responsibilities:
Designed and implemented data pipelines using Azure Data Factory to ingest, process, and
transform data from multiple sources into Azure Data Lake and SQL databases.
Built and maintained large-scale Azure Data Lake solutions to store unstructured and semi-
structured data, enabling high-performance analytics.
Leveraged Azure Databricks and Apache Spark to process large volumes of data efficiently,
reducing processing times by 30%.
Worked with ADF and its infrastructure, including Copy activity, Get metadata, Web activity,
execute pipeline, Azure data flows, IR’S, Dataset and linked service implementation, IAM,
triggers, synapse.
Executed complex data processing tasks using PySpark and Python, optimizing data workflows
for performance across distributed systems.
Created ETL workflows for data transformation and cleansing, improving the data quality and
reporting accuracy.
Implemented Azure Databricks notebooks to handle complex file transformations involving data
sourcing formats like csv/parquet/Json.
Enabled Unity Catalog for secure data governance within Databricks, managing access controls
and data cataloging.
Utilized PySpark RDDs (Resilient Distributed Datasets) and DataFrames for efficient data
manipulation and analysis in distributed computing environments
Implemented Slowly Changing Dimension (SCD), utilizing delta tables and change data feed.
Developed and maintained detailed documentation for all data engineering processes, including
data models, ETL workflows, and data transformation logic, ensuring transparency and ease of
knowledge transfer.
Created the DAGs in Airflow for orchestration of tasks through Python code and using the
operators.
Environment: Azure Databricks, Data Factory, Pyspark, Python, Spark SQL, Azure SQL, Informatica
10.x.
Mitsubishi Union Finance Group (MUFG), Charlotte, NC/Los Angeles CA
Projects: EDP Data Lake Pillar2, Application Production Support, OFSAA 6.1 upgrade etc.
Mar 15 – Aug23
Role : Lead Data Engineer
Responsibilities:
Engineered resilient data pipelines leveraging Azure Data Factory and Azure Blob Storage to
streamline data ingestion from on-premises and cloud sources.
Generated Spark jobs to handle data ingestion, transformation, and aggregation, significantly
reducing the time required for data preparation.
Formulated and optimized complex SQL queries to extract, transform, and load (ETL) sales data
from various sources into a centralized warehouse.
Testing & bug tracking and software maintenance in a CI/CD environment for Database and
Development Environment with GIT and Jenkin.
Development of Ingestion, Curation and Consumption process in Azure for new or existing sources.
Functional test case preparation, execution, logging and tracking defects in Jira
Report and discuss the status in scrum calls, attend all other meetings according to the Agile
practice.
Analyze business requirements and transformation rules for conversion into data validation test
scripts.
Responsible for BAU activities and production support of various applications and making sure no
impact on business.
Business development and delegating work to the teams by priority of the task and efficiency of the
team, as well as mentoring the team.
Design, Develop and Supported Extraction, Transformation and Load Process (ETL) for data
migration with Informatica 10.x/9.x with PL/SQL Packages.
Develop ETL mapping Documents like High Level Design (HLD) and Low Level design (LLD) for
every mapping and Mapping specification document for smooth transfer of project from
development to testing environment and then to production.
Performs the walkthrough on low-level design, Unit test plans and implementation plans at various
stages of the project prepared by the team; Ensures that all the team members are following the
PMP standards
Develop shell scripts and PL/SQL Procedures as part of Oracle data load.
Cloud Environment: Azure, PySpark , CI/CD ,Azure Data Factory, Azure SQL DB, Python
Environment: Informatica 10.x/9.5.1, Oracle 12c/11g, PL/SQL and UNIX Shell Scripting.
CIGNA- IM (CCW Accel Rx/Rebates), Bloomfield CT May 12 – Feb
15
Role : Informatica lead
Responsibilities:
Understand the business rules completely based on High Level document specifications and
implement the data transformation methodologies.
Business development and delegating work to the teams by priority of the task and efficiency of the
team, as well as mentoring the team.
Handles Offshore-Onsite-Client communication; prepares Functional Design documents and reviews
the deliverables and Quality Documentation.
Designed, Developed and Supported Extraction, Transformation and Load Process (ETL) for data
migration with Informatica 9.x with support of Teradata database.
Developed ETL mapping Documents like High Level Design (HLD) and Low Level design (LLD) for
every mapping and Data Migration document for smooth transfer of project from development to
testing environment and then to production.
Performs the walkthrough on low-level design, Unit test plans and implementation plans at various
stages of the project prepared by the team; Ensures that all the team members are following the
PMP standards; interacts with the client to get the approvals of the design, coding and
implementation
Environment: Informatica 9.1.1, Teradata 14, Oracle 11g and UNIX Shell Scripting.
Liberty Mutual Jan ‘ 10 –
Apr 12
Role: ETL Developer
Base Location : Hyderabad, India
Responsibilities:
Gathered the system requirements and created mapping document which gives detail
information about source to target mapping and business rules implementation.
Drafted Business Requirement Documents, System Requirement Specifications, Business Work
Flow Diagram, Use Case Diagram, Data Flow Diagram, Cross Functional Diagram to represent
Business and System requirements.
Designed, developed and debugged ETL mappings using Informatica designer tool.
Created complex mappings using Aggregator, Expression, Joiner, Filter, Sequence, Procedure,
Connected & Unconnected Lookup, Filter and Update Strategy transformations using
Informatica Power center designer.
Extensively used ETL to load data from different sources such as flat files, XML to Oracle.
Worked on mapping parameters and variables for the calculations done in aggregator
transformation.
Implemented slowly changing dimension for accessing the full history of accounts and
transaction information.
Tuned and monitored in Informatica workflows using Informatica workflow manager and
workflow monitor tools.
Environment: Informatica Power Center 8.6.1, Informatica power Exchange, Teradata, Unix and
Mainframe.
Marks and Spencers, UK Apr 08 - Dec
09
Role: Mainframe Developer
Base Location : Chennai, India
Responsibilities:
Attending client work group meetings and getting the requirements during the design phase.
Preparing Low Level Designs.
Coordinate and Communicate with the offshore by conducting weekly status calls.
Reviewing the offshore design docs & code deliverables and ensures that the coding is inline
with design specifications.
Ensuring quality process is followed at every stage of enhancement.
Training and Mentoring of the new joiners into the team and other teams by conducting KT
sessions.
Worked for CFTO and CSSM applications and implemented successfully in production
Writing the System Test Scripts and Test scenarios for the applications developed.
Environment: Cobol II, JCL, DB2