Enterprise Data Warehouse (EDW) Cloud Migration

From Legacy On-Premises EDW to a Scalable Azure & Microsoft Fabric Platform

About Client

The client is a provider of event technology and production services. The company supports meetings and events through services that include audiovisual production, digital solutions, and on-site support. They operate across multiple regions and work with venues, organizers, and enterprises to deliver event experiences. They also manage data related to event operations, customer engagement, and service delivery across its business systems. 

Background 

Client built a mature on-premises Enterprise Data Warehouse (EDW) on SQL Server, with SSIS for ETL and reporting through Tableau, Power BI, and Excel. While reliable for traditional analytics, it faced limitations in meeting growing demands for real-time insights, scalability, and advanced analytics. These evolving requirements underscore the necessity for a modern, cloud-native platform to enable real-time intelligence and supporting analytics at scale.

Business ChallengesTechnical Challenges
Data is 1 day old — no real-time availabilitySlow EDW queries and poor ETL performance
EDW access limited to a small analyst communityScaling DB, SSIS, and Power BI gateways is costly and complex
Poor data quality in some cases due to source-level issuesOn-premises stack is outdated with limited vendor support
No viable path for AI/ML workloads or data mining

To address these challenges, client initiated a phased migration to a cloud-based EDW leveraging Azure and Microsoft Fabric, adopting a Lambda Architecture to support both batch and real-time processing.

Objective 

The core objective was to retire the on-premises EDW and replace it with a modern, cloud-native data platform capable of supporting batch and real-time workloads, enabling broader self-service analytics, and laying the foundation for AI/ML capabilities. Specific expectations included:

  • Azure-hosted, horizontally scalable infrastructure with high availability and reliability
  • Support for both batch and real-time data processing, differentiated by use case
  • Embedded data quality framework for source-level validation
  • Role-based access control (RBAC) and broader user enablement 
  • Support for structured, semi-structured and unstructured data
  • Unified, certified Power BI reporting environment with self-service analytics
  • Platform readiness for future AI/ML workflows and experimentation

Our Solution: Microsoft Fabric

The transformation followed a structured, phased approach that adopted Microsoft Fabric as the cloud data platform, implementing a Lambda Architecture that supports both batch and real-time processing workloads. The architecture comprises three layers:

Lambda Architecture Layers

Batch LayerELT pipelines via Azure Data Factory and Fabric Notebooks, processing data into a medallion architecture (Bronze → Silver → Gold)
Speed LayerMicrosoft Fabric Event Stream for near-real-time ingestion via Azure Event Hub into KQL Database and Lakehouse Delta Tables
Serving LayerT-SQL and Spark endpoints exposing data to Power BI, Excel, and other consumers across all data stores

All data is standardized in Delta Lake format within OneLake, eliminating redundant copies across compute engines. Microsoft Purview integration was incorporated for governance and data lineage. The platform was sized at F128 Microsoft Fabric capacity — exceeding the baseline F64 estimate — to accommodate additional workloads and future growth. DevOps/CI-CD pipelines and a data quality framework were established from Phase 1 as foundational standards applied across all subsequent phases.

Oracle Data Migration (Phase I)

Migrate the Oracle EDW on-premises ETL and all associated Power BI reporting to cloud. Oracle was selected as the Phase 1 workload because it is the most loosely coupled dataset in the EDW – minimizing dependencies – and requires a source-layer change from Oracle ATP to Oracle Object Storage, which is best handled as a standalone effort. Two distinct Oracle data streams are in scope: GL (General Ledger) and Discover. Both run as separate batches in the on-premises EDW and will be independently managed in the cloud pipeline.

Data Pipeline Architecture

Oracle Object StorageAzure Blob Storage (ADF)Fabric Bronze LayerFabric Silver LayerFabric Gold Layer

Each medallion layer maps to a specific transformation stage: Bronze ingests raw files as-is; Silver applies transformations to match the existing ATP table structure; Gold processes data into the final structure mirroring the on-premises EDW.

Migrated Oracle ERP data (GL and Discover) along with common reference datasets (LocationList, WLCEmployee, RLS) to the cloud. This phase established the foundational architecture using a medallion model (Bronze, Silver, Gold), implemented end-to-end pipelines, migrated historical data, and transitioned Power BI dashboards. Success was defined by full data and dashboard parity with improved ETL performance, enabling decommissioning of on-prem Oracle ETL.

Success Criteria

On-premises EDW data matches the Cloud platform data – row counts, aggregates and key business metrics validated
Power BI dashboards fed from on-premises data match those fed from the cloud platform
Cloud ETL runtime is equal to or better than the on-premises runtime (currently 8 hours avg.)

 

Phase 1 Exit Milestone: Upon successful completion and validation, on-premises ETL for Oracle sources will be permanently decommissioned.

Expansion & Platform Scaling (Phase II)

Extend cloud migration to operational and workforce data sources, including Compass (CRM), NWF (workforce data), and Medallia (survey/API data) while building the batch processing layer of the data platform. This phase involved building 47 tables each in bronze and silver layers, and 30 tables in the gold layer, over a 19-week timeline. The objective was to migrate all ETL and reporting pipelines for these sources from on-premises systems to the cloud, covering the full data lifecycle from ingestion to summary layers. The scope included setting up source connectivity, migrating historical data, developing pipelines across bronze, silver, and gold layers, transitioning Power BI dashboards, extending CI/CD frameworks from Phase 1, and executing proofs-of-concept for SharePoint Shortcuts and custom Airflow-style orchestration. 

Success Criteria

Data and dashboard parity, along with improved ETL performance
Key workstreams progressed from initial POCs to source integration, pipeline development, and reporting migration
Seamless transition of reporting layers to cloud, UAT, production deployment, and documentation.

Future Phases (Planned)

Phase 3: Data quality, reliability, and model refinement

Phase 4: Migration of remaining data sources and reporting layers 

Overall Migration Roadmap

Phase 1 is the first of four phases in the full EDW cloud migration. Each phase builds on the foundation established by the previous one.

 

PhaseFocusObjectiveExit Condition
Phase 1 Oracle MigrationMigrate Oracle ETL (GL & Discover) and Oracle Power BI dashboards to the cloudOracle on-prem ETL decommissioned
Phase 2 Bronze & Silver (All Sources)Ingest and transform all remaining source data into bronze and silver layersCloud Bronze/Silver validated; on-prem still running
Phase 3Gold Layer (All Sources)Build Dimensions, Facts, and Summaries for Phase 2 sources in Gold layerFull cloud EDW parity achieved
Phase 4Full Reporting CutoverMigrate all remaining Power BI dashboards/datasets to cloud and decommission on-premOn-prem EDW fully retired

Business Value

Operational Benefits

  • On-time ETL completion for field reporting
  • Horizontally scalable — no infrastructure ceiling
  • Different SLAs per source based on business need
  • Larger user base supported on EDW
Data Quality & Governance

  • Embedded Data Quality Framework
  • Better integration with Microsoft Purview governance
  • Role-based access controls (RBAC) at platform level
  • Historical data retention and consistency
Self-Service Analytics

  • Excel: business users access data for daily analytics
  • Power BI: certified and ad-hoc reporting at global scale
  • Platform supports exploration and experimentation
Future Readiness

  • AI/ML workflow support built into platform design
  • Unstructured and semi-structured data support
  • Proven blueprint for Phases 2–4

Ongoing Support Services

Post-migration, the team will provide continuous support across the cloud platform and reporting layer. Core responsibilities include:

Platform Maintenance

  • Load monitoring
  • ETL & data issue resolution
  • Performance optimizations
Data Management

  • Incorporating new data points
  • Modifying existing data per business requirements
Reporting

  • New report development
  • Updates to existing reports and datasets

The team can be augmented as needed when project volume exceeds steady-state capacity.

A phased, cloud-native EDW transformation enabled client to modernize its data platform, improve performance and scalability and unlock enterprise-wide analytics capabilities.

Leave a Reply

Your email address will not be published. Required fields are marked *

BizAcuity
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.