Big Data News Hubb

Enabling the Customer Data Platform with Databricks ETL Support

by admin
April 11, 2023
in Big Data


Customer Data Platforms (CDPs) play an increasingly important role in the enterprise marketing landscape. By bringing together data from a wide variety of internal and external sources to construct a 360-degree view of the customer around a shared understanding of customer identity, the CDP enables marketers to develop rich insights to drive targeted engagement.

More narrowly focused than general-purpose data platforms, the CDP provides native support for ingesting frequently used data sources and for common transformations that turn raw data into informational assets ready for consumption by marketing teams. This built-in functionality helps accelerate time to value but may feel constraining when teams face more complex data transformation challenges. This is when marketing teams may turn to their data engineers, and those data engineers turn to their preferred data processing platform, Databricks.

The Right Tool for the Right Job

Databricks has long been recognized for its ability to tackle large, complex data processing challenges. With its support for both structured and unstructured data sources, its high degree of extensibility and ease of integration with open source technologies, its blurring of the boundary between real-time and periodic batch processing, and its careful attention to workload management, Databricks has transformed how organizations think about analytic data processing.

This might appear to position Databricks as a rival to the CDP. Both systems support the processing of data, the generation of insights through analytics and the delivery of data and insights to downstream systems. But in our vision of a modern marketing ecosystem, we see the CDP and Databricks as complementary systems, each best fit for specific tasks, that when properly integrated can help organizations maximize the potential of their customer information assets and minimize costs.

Complex Data Abound

Returning to the idea of complex data processing challenges in the CDP landscape, consider the processing of product reviews, social media content, clickstream data, or airline bookings with nested arrays of values. All of these information sources originate from customers and can provide valuable insights the marketing team can leverage to drive better engagement. But the data volumes involved (sometimes billions or trillions of records) and the complexity of the data (e.g., XML, JSON or semi-structured text) are such that they must be carefully digested before they become useful to marketers.
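To make the idea of nested booking data concrete, here is a minimal sketch of unpacking a reservation into one flat row per traveler. The XML layout, element names and attribute names are invented for illustration and do not reflect any real airline schema. Plain Python is used so the logic is easy to follow; on Databricks the same step would more likely run as a Spark job over many files.

```python
import xml.etree.ElementTree as ET

# Hypothetical booking payload: element and attribute names are assumptions.
BOOKING_XML = """
<booking id="B1001">
  <passengers>
    <passenger customer_id="C1"><fare currency="USD">250.00</fare></passenger>
    <passenger customer_id="C2"><fare currency="USD">250.00</fare></passenger>
    <passenger customer_id="C3"><fare currency="USD">125.50</fare></passenger>
  </passengers>
</booking>
"""


def unpack_booking(xml_text):
    """Return one flat row per passenger so revenue ties to each individual."""
    root = ET.fromstring(xml_text.strip())
    rows = []
    for passenger in root.iter("passenger"):
        fare = passenger.find("fare")
        rows.append({
            "booking_id": root.get("id"),
            "customer_id": passenger.get("customer_id"),
            "currency": fare.get("currency"),
            "fare": float(fare.text),
        })
    return rows


rows = unpack_booking(BOOKING_XML)
for row in rows:
    print(row)
```

Each resulting row carries the booking identifier alongside a single customer and fare, which is the shape a downstream system needs in order to attribute revenue to individuals rather than to the reservation as a whole.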

By flowing these data through Databricks, data engineers can bring the full power of the lakehouse platform to bear. Product feedback can be tagged for sentiment and tone, and topics can be extracted. Images can be interpreted and products in view can be identified. Individual clicks can be condensed to summary information that captures the flow of a customer's recent visit to a website. And airline booking data in XML can be unpacked to neatly tie revenue to multiple individuals on the reservation. This information can then flow from Databricks into the CDP, where marketers use these details to determine who to engage and in what manner without having to wade through an ocean of raw data. The raw data are still preserved in the Databricks environment for analysts and data scientists who will have use for the data in its original, unaltered form.
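As one illustration of condensing individual clicks into visit-level summaries, the sketch below groups click events by customer and derives a few summary fields. The event tuples, field names and the choice of summary metrics are all assumptions made for this example, and plain Python stands in for what would typically be a distributed aggregation on Databricks.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical click events: (customer_id, ISO timestamp, page).
CLICKS = [
    ("C1", "2023-04-01T10:00:00", "/home"),
    ("C1", "2023-04-01T10:04:00", "/cart"),
    ("C2", "2023-04-01T11:00:00", "/home"),
    ("C1", "2023-04-01T10:01:30", "/products/shoes"),
]


def summarize_visits(clicks):
    """Condense raw clicks into one summary record per customer."""
    by_customer = defaultdict(list)
    for customer_id, ts, page in clicks:
        by_customer[customer_id].append((datetime.fromisoformat(ts), page))

    summaries = {}
    for customer_id, events in by_customer.items():
        events.sort()  # order clicks chronologically
        first_ts, entry_page = events[0]
        last_ts, exit_page = events[-1]
        summaries[customer_id] = {
            "pages_viewed": len(events),
            "entry_page": entry_page,
            "exit_page": exit_page,
            "duration_seconds": (last_ts - first_ts).total_seconds(),
            "path": " > ".join(page for _, page in events),
        }
    return summaries


summaries = summarize_visits(CLICKS)
for customer_id, summary in summaries.items():
    print(customer_id, summary)
```

Only the summary records would need to flow to the CDP, while the raw click events remain available in the lakehouse for analysts who want the full detail.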

The Lakehouse Unlocks Insights for CDPs

To demonstrate how the Databricks lakehouse might assist a CDP with this kind of ETL offload, we partnered with our friends at Amperity on a scenario where customer data in the Amperity CDP is used to drive a targeted email campaign. The campaign is executed via the Salesforce Marketing Cloud (SFMC), where customer segments and individual consumer email addresses are pushed from Amperity to the SFMC platform. Scheduled jobs send messages to targeted individuals, and SFMC captures details about which emails were delivered, opened and clicked through, or otherwise bounced or triggered an unsubscribe request.

Details of these email message events, which can run into the billions of records in just a few weeks, are captured by SFMC and made accessible to the marketer through a daily extract. Instead of feeding this high-volume data directly into Amperity, it is processed via Databricks, allowing for the capture of detailed information from ongoing email marketing campaigns while limiting the details flowing back. The 360-degree customer view housed in Amperity now has just those bits of information needed to understand the customer journey and define the next round of engagement.
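The kind of reduction described here can be sketched as a per-recipient rollup of email events. This is not the logic from the accompanying notebooks: the event names, the tuple layout and the summary fields are assumptions for illustration, and they are not SFMC's actual extract schema. In practice this aggregation would run on Databricks over the daily extract rather than over an in-memory list.

```python
from collections import Counter, defaultdict

# Hypothetical email events: (recipient email, event type).
EVENTS = [
    ("ana@example.com", "delivered"),
    ("ana@example.com", "opened"),
    ("ana@example.com", "clicked"),
    ("ben@example.com", "delivered"),
    ("ben@example.com", "unsubscribed"),
    ("cy@example.com", "bounced"),
    ("ana@example.com", "delivered"),
]


def summarize_email_events(events):
    """Roll raw email events up to one compact record per recipient."""
    counts = defaultdict(Counter)
    for email, event_type in events:
        counts[email][event_type] += 1

    summary = {}
    for email, c in counts.items():
        summary[email] = {
            "delivered": c["delivered"],
            "opened": c["opened"],
            "clicked": c["clicked"],
            "bounced": c["bounced"],
            # A single unsubscribe request is enough to flag the recipient.
            "unsubscribed": c["unsubscribed"] > 0,
        }
    return summary


email_summary = summarize_email_events(EVENTS)
for email, record in email_summary.items():
    print(email, record)
```

The output is one small record per recipient, regardless of how many raw events were logged, which is the reason the volume flowing back into the CDP stays manageable.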

To see this process in action, please check out the accompanying notebooks, where we capture the Databricks process along with the Salesforce and Amperity integrations that surround it. We hope this demonstration helps our customers envision their own ETL offload scenarios in which Databricks can help them achieve their customer engagement goals.



© Big Data News Hubb All rights reserved.
