Optimizing Data Pipelines and Engineering Solutions in GCP Using Apache Spark and PySpark
Background

A leading provider of advanced data solutions was experiencing significant performance bottlenecks in its data processing pipelines. The company relied on legacy systems that could not keep pace with rapidly growing data volumes, leading to delays in analytics and reporting. These issues not only hampered operational efficiency but also drove up costs and customer …