Modern Data Platform

Using Bigquery as a Data Lake for the Organization

Company Profile

Dr. Reddy’s Laboratories Ltd. is a multinational pharmaceutical company based in Hyderabad, Telangana, India. The company manufactures and sells a wide range of pharmaceutical products in India and over 25 countries through its three businesses – Pharmaceutical Services & Active Ingredients, Global Generics, and Proprietary Products. Its major markets include India, USA, Russia and CIS, Germany, UK, Venezuela, S. Africa, Romania, and New Zealand.

Founded by Dr. K Anji Reddy on February 24, 1984, the company’s portfolio of products and services includes APIs, custom pharmaceutical services, generics, biosimilars, and differentiated formulations. The company’s major therapeutic areas of focus are gastrointestinal, cardiovascular, diabetology, oncology, pain management, and anti-infectives.

The company’s offerings cover active pharmaceutical ingredients, branded formulations, generic drugs, biologics, specialty products and new chemical entities (NCE) that are sold in North America, Europe and the emerging markets of Asia, Africa, and South America.

Business Situation

The customer has multiple in house projects ongoing where their R&D data is being collected in multiple databases and warehouses. This data is valuable and needs to be consolidated for better analytics. The customer is in the process of pushing all their data into a Data lake. Google Bigquery along with Google Cloud Storage will be used as the Data lake to which multiple sources will be feeding the data. Using Bigquery as their Data lake the customer plans to build applications that will be used by their R&D team, data analysts, data scientists, and others. This Data lake should eventually be used for analytics and driving efficiencies in various streams. 

Google Cloud Implementation

SpringML helped migrate data from their onprem sources/databases into Bigquery. Multiple data sources were analyzed to create a consistent way in which data could be migrated. As an extension to the Datalake and to leverage the valuable data that is now available in BigQuery, we are in the process of helping the customer build a platform using AppEngine. This platform will be used to deliver business centric apps to various internal organizations. The same data is being used to build analytics dashboards and apply BQML where applicable to get insights into the data.

Thought Leadership

Check out our recent blogs and videos on best practices and implementation approaches.