Enterprise Data Warehouse (EDW) Cloud Migration

From Legacy On-Premises EDW to a Scalable Azure & Microsoft Fabric Platform

About Client

The client is a provider of event technology and production services. The company supports meetings and events through services that include audiovisual production, digital solutions, and on-site support. They operate across multiple regions and work with venues, organizers, and enterprises to deliver event experiences. They also manage data related to event operations, customer engagement, and service delivery across its business systems.

Background

Client built a mature on-premises Enterprise Data Warehouse (EDW) on SQL Server, with SSIS for ETL and reporting through Tableau, Power BI, and Excel. While reliable for traditional analytics, it faced limitations in meeting growing demands for real-time insights, scalability, and advanced analytics. These evolving requirements underscore the necessity for a modern, cloud-native platform to enable real-time intelligence and supporting analytics at scale.

Business Challenges	Technical Challenges
Data is 1 day old — no real-time availability	Slow EDW queries and poor ETL performance
EDW access limited to a small analyst community	Scaling DB, SSIS, and Power BI gateways is costly and complex
Poor data quality in some cases due to source-level issues	On-premises stack is outdated with limited vendor support
	No viable path for AI/ML workloads or data mining

To address these challenges, client initiated a phased migration to a cloud-based EDW leveraging Azure and Microsoft Fabric, adopting a Lambda Architecture to support both batch and real-time processing.

Objective

The core objective was to retire the on-premises EDW and replace it with a modern, cloud-native data platform capable of supporting batch and real-time workloads, enabling broader self-service analytics, and laying the foundation for AI/ML capabilities. Specific expectations included:

Azure-hosted, horizontally scalable infrastructure with high availability and reliability
Support for both batch and real-time data processing, differentiated by use case
Embedded data quality framework for source-level validation
Role-based access control (RBAC) and broader user enablement
Support for structured, semi-structured and unstructured data
Unified, certified Power BI reporting environment with self-service analytics
Platform readiness for future AI/ML workflows and experimentation

Our Solution: Microsoft Fabric

The transformation followed a structured, phased approach that adopted Microsoft Fabric as the cloud data platform, implementing a Lambda Architecture that supports both batch and real-time processing workloads. The architecture comprises three layers:

Lambda Architecture Layers

Batch Layer	ELT pipelines via Azure Data Factory and Fabric Notebooks, processing data into a medallion architecture (Bronze → Silver → Gold)
Speed Layer	Microsoft Fabric Event Stream for near-real-time ingestion via Azure Event Hub into KQL Database and Lakehouse Delta Tables
Serving Layer	T-SQL and Spark endpoints exposing data to Power BI, Excel, and other consumers across all data stores

All data is standardized in Delta Lake format within OneLake, eliminating redundant copies across compute engines. Microsoft Purview integration was incorporated for governance and data lineage. The platform was sized at F128 Microsoft Fabric capacity — exceeding the baseline F64 estimate — to accommodate additional workloads and future growth. DevOps/CI-CD pipelines and a data quality framework were established from Phase 1 as foundational standards applied across all subsequent phases.

Oracle Data Migration (Phase I)

Migrate the Oracle EDW on-premises ETL and all associated Power BI reporting to cloud. Oracle was selected as the Phase 1 workload because it is the most loosely coupled dataset in the EDW – minimizing dependencies – and requires a source-layer change from Oracle ATP to Oracle Object Storage, which is best handled as a standalone effort. Two distinct Oracle data streams are in scope: GL (General Ledger) and Discover. Both run as separate batches in the on-premises EDW and will be independently managed in the cloud pipeline.

Data Pipeline Architecture

Oracle Object Storage

→

Azure Blob Storage (ADF)

→

Fabric Bronze Layer

→

Fabric Silver Layer

→

Fabric Gold Layer

Each medallion layer maps to a specific transformation stage: Bronze ingests raw files as-is; Silver applies transformations to match the existing ATP table structure; Gold processes data into the final structure mirroring the on-premises EDW.

Migrated Oracle ERP data (GL and Discover) along with common reference datasets (LocationList, WLCEmployee, RLS) to the cloud. This phase established the foundational architecture using a medallion model (Bronze, Silver, Gold), implemented end-to-end pipelines, migrated historical data, and transitioned Power BI dashboards. Success was defined by full data and dashboard parity with improved ETL performance, enabling decommissioning of on-prem Oracle ETL.

Success Criteria

✓	On-premises EDW data matches the Cloud platform data – row counts, aggregates and key business metrics validated
✓	Power BI dashboards fed from on-premises data match those fed from the cloud platform
✓	Cloud ETL runtime is equal to or better than the on-premises runtime (currently 8 hours avg.)

Phase 1 Exit Milestone: Upon successful completion and validation, on-premises ETL for Oracle sources will be permanently decommissioned.

Expansion & Platform Scaling (Phase II)

Extend cloud migration to operational and workforce data sources, including Compass (CRM), NWF (workforce data), and Medallia (survey/API data) while building the batch processing layer of the data platform. This phase involved building 47 tables each in bronze and silver layers, and 30 tables in the gold layer, over a 19-week timeline. The objective was to migrate all ETL and reporting pipelines for these sources from on-premises systems to the cloud, covering the full data lifecycle from ingestion to summary layers. The scope included setting up source connectivity, migrating historical data, developing pipelines across bronze, silver, and gold layers, transitioning Power BI dashboards, extending CI/CD frameworks from Phase 1, and executing proofs-of-concept for SharePoint Shortcuts and custom Airflow-style orchestration.

Success Criteria

✓	Data and dashboard parity, along with improved ETL performance
✓	Key workstreams progressed from initial POCs to source integration, pipeline development, and reporting migration
✓	Seamless transition of reporting layers to cloud, UAT, production deployment, and documentation.

Future Phases (Planned)

Phase 3: Data quality, reliability, and model refinement

Phase 4: Migration of remaining data sources and reporting layers

Overall Migration Roadmap

Phase 1 is the first of four phases in the full EDW cloud migration. Each phase builds on the foundation established by the previous one.

Phase	Focus	Objective	Exit Condition
Phase 1	Oracle Migration	Migrate Oracle ETL (GL & Discover) and Oracle Power BI dashboards to the cloud	Oracle on-prem ETL decommissioned
Phase 2	Bronze & Silver (All Sources)	Ingest and transform all remaining source data into bronze and silver layers	Cloud Bronze/Silver validated; on-prem still running
Phase 3	Gold Layer (All Sources)	Build Dimensions, Facts, and Summaries for Phase 2 sources in Gold layer	Full cloud EDW parity achieved
Phase 4	Full Reporting Cutover	Migrate all remaining Power BI dashboards/datasets to cloud and decommission on-prem	On-prem EDW fully retired

Business Value

Operational Benefits

On-time ETL completion for field reporting
Horizontally scalable — no infrastructure ceiling
Different SLAs per source based on business need
Larger user base supported on EDW

Data Quality & Governance

Embedded Data Quality Framework
Better integration with Microsoft Purview governance
Role-based access controls (RBAC) at platform level
Historical data retention and consistency

Self-Service Analytics

Excel: business users access data for daily analytics
Power BI: certified and ad-hoc reporting at global scale
Platform supports exploration and experimentation

Future Readiness

AI/ML workflow support built into platform design
Unstructured and semi-structured data support
Proven blueprint for Phases 2–4

Ongoing Support Services

Post-migration, the team will provide continuous support across the cloud platform and reporting layer. Core responsibilities include:

Platform Maintenance

Load monitoring
ETL & data issue resolution
Performance optimizations

Data Management

Incorporating new data points
Modifying existing data per business requirements

Reporting

New report development
Updates to existing reports and datasets

The team can be augmented as needed when project volume exceeds steady-state capacity.

A phased, cloud-native EDW transformation enabled client to modernize its data platform, improve performance and scalability and unlock enterprise-wide analytics capabilities.

Talk to Our Experts

Share this case study

Enterprise Data Warehouse (EDW) Cloud Migration

About Client

Background

Objective