Building A World Class Genetics Center Based On Data Scalability

Accelerating drug discovery has become a necessity. I recently spoke with Jeffrey Reid, Head of Genomics and Data Engineering for Regeneron. Reid works in the Regeneron Genetics Center (RGC), a research initiative that seeks to improve patient care by using genomic approaches to speed drug discovery and development. The genetics center is a unit of Regeneron (NASDAQ: REGN), a leading biotechnology company that has been at the forefront of drug discovery for three decades. The firm’s focus on translating science into medicine has led to seven FDA-approved treatments. The Regeneron Genetics Center is engaged in one of the largest genetic sequencing efforts in the world.

Reid describes his role as existing at the intersection of science and data, noting that he is responsible for “taking raw data and turning it into usable facts about genomes.” His role in data engineering entails deploying algorithms that enable drug development. As part of building a large genetic sequencing center, Reid works with more than 80 industry and academic research partners to combine genetic data with electronic health record (EHR) data to understand how genetics affects health.

To enable these drug discovery efforts, Reid and his team have deployed the Databricks technology platform to help mine genomic data at scale. Reid remarks, “We bring to bear a lot of robotics in the lab and analysis automation.” He emphasizes the urgency of operating at scale, given the billions of combinations of genotypes and phenotypes that can be mined for drug development insights. “We need to identify every possible association between each genotype and phenotype. This requires us to analyze billions of cells of information,” says Reid.
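To make the scale argument concrete, here is a minimal sketch of an all-pairs scan: every genotype is scored against every phenotype, so the work grows with the product of the two counts. This is an illustration only, not Regeneron's pipeline. The variant and trait names and the toy values are made up, and a plain Pearson correlation stands in for whatever statistical test a real study would use.

```python
from itertools import product
from math import sqrt

# Toy, made-up data: allele counts (0, 1, or 2) per sample for each variant,
# and one measured value per sample for each trait. Names are hypothetical.
genotypes = {
    "variant_A": [0, 1, 2, 0, 1, 2],
    "variant_B": [2, 2, 0, 1, 0, 1],
}
phenotypes = {
    "trait_X": [1.0, 2.0, 3.0, 1.0, 2.0, 3.0],
    "trait_Y": [5.0, 3.0, 4.0, 5.0, 3.0, 4.0],
}


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (a stand-in test)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def scan_associations(genotypes, phenotypes):
    """Score every genotype/phenotype pair, strongest association first."""
    results = [
        (g_name, p_name, pearson(g_values, p_values))
        for (g_name, g_values), (p_name, p_values)
        in product(genotypes.items(), phenotypes.items())
    ]
    results.sort(key=lambda row: abs(row[2]), reverse=True)
    return results


if __name__ == "__main__":
    for g_name, p_name, r in scan_associations(genotypes, phenotypes):
        print(f"{g_name} vs {p_name}: r = {r:+.2f}")
```

With two variants and two traits the scan runs four tests. Millions of variants against thousands of phenotypes yield billions of tests, and because each pair is scored independently, the work can be spread across a cluster.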

Databricks provides Regeneron with a scalable solution for mining these vast amounts of data. Reid notes that in the past, there was no scalable approach to managing data volumes this large, and research companies depended on home-built solutions based on antiquated approaches and technologies. According to Reid, Databricks delivers an enterprise platform that operates on the FAIR data principles of making data “findable, accessible, interoperable, and reusable” and helps drive scientific insights. Reid characterizes the technology environment at Regeneron as one of “tune up, deploy, tear down clusters” that support collaborative research initiatives such as Project Glow, an open-source toolkit for large-scale genomic analysis that was jointly created by the Regeneron Genetics Center and Databricks.
