Zelarsoft

CASE STUDY: Building a Data Warehouse solution on AWS Redshift

Abbott Laboratories is an American multinational medical devices and health care company with headquarters in Abbott Park,Illinois, United State

ZELAR ROLE IN PROJECT

The project involved building a data warehouse solution on Cloud (Amazon Redshift) and building various ETL layers. Zelarsoft was involved in the development of the data warehouse solution, provisioning, and managing the infrastructure on AWS Cloud.

A Team of 10 associates with different technical skills was assigned to work for this project

THE CHALLENGE

The client, a medical device and healthcare company had two major challenges. They were using multiple systems for their operations and these systems generated data that was in silos and in various formats.


The client did not own the data and was charged for its usage. The client had a vendor for doing the ETL work and owning the data(storing and managing). This vendor charged the client for any data access requests.

THE SOLUTION

  • To solve the above challenges, a solution was envisioned to build a data lake. As part of this, an ELT pipeline was built for extracting data from multiple sources, cleansing it, and loading it into a staging layer. The developers applied business rules to do the transformations and store this transformed data in a DataWarehouse built on Amazon Redshift.
  • The Data Warehouse provided a single source of truth for the organization-wide data. Since Amazon Redshift is used as a Data Warehouse solution, it provided the client with a solution that is scalable, cost-effective, secure, and highly available.
  • Since the solution was owned by the client, they solved the problem of recurring costs they were paying to the ETL vendor.
  • The data warehouse solution integrated well with the BI tool(Tableau) for report generation and eased the BI developer’s job.

KEY RESULTS

  • With this new solution in place, the client got a single source of truth for their data and was able to own their data.
  • With this, they saved significant costs (40%) as earlier they had to pay the ETL vendor for data access.
  • The new solution is cloud hosted and hence was more scalable, fast (columnar database), secure, and highly available.