Overview

This project involved designing and implementing a robust data architecture to create a Single Customer View (SCV) by integrating data from multiple heterogeneous sources. The solution ensured high-quality, real-time, and batch data processing, enabling reliable insights across platforms.

Data is passed to Databricks for cleansing and transformation:

  • Null value elimination

  • Date format normalization

  • Duplicate removal

  • Standardization across datasets

Cleaned data from all sources is then integrated to build a Single Customer View (SCV), enabling a 360° profile of each customer.

The transformed and unified data is stored back into ADLS, with two key downstream flows:

  • Batch loading into Azure Synapse Analytics and SQL databases for further analysis and BI reporting.

  • Streaming data pipelines powered by Azure Stream Analytics (ASA) and Azure Data Explorer (ADX) for near-real-time insights.

Extraction

Transformation

Loading

Data is sourced from:

  • Salesforce (event-driven)

  • SAP (REST API)

  • Mobile/Web via Firebase

  • Finacle Core Banking (SOAP API)

Data from each source is first landed in Azure Data Lake Storage (ADLS). Azure Data Factory (ADF) is used, particularly for the Finacle integration, to orchestrate and temporarily store the extracted data securely.

End-to-End Data Integration Architecture for Unified Customer View Using Azure Ecosystem

Impact

This architecture enabled unified, real-time, and self-service analytics capabilities across business units, enhanced decision-making, and improved customer engagement through personalized insights.

Bright living room with modern inventory
Bright living room with modern inventory