Building an Open, Petabyte-Scale Data Platform With Databricks



Come hear how digital native companies are leveraging a new data architecture, the data lakehouse, which delivers data warehouse performance at data lake economics, all powered by open source technologies. The lakehouse combines the best of data warehouses and data lakes into a single architecture that can serve all data use cases, including BI, streaming analytics, data science and machine learning.

 

At this event, we’ll also explore Delta Lake, an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions and scalable metadata handling, and it unifies streaming and batch data processing. We’ll talk about how Delta Lake makes the lakehouse vision possible.

 

We’ll also cover best practices for using powerful open source technologies to build and extend your data platform investments. You’ll learn about the security and cost advantages of cloud-based data lakes. And finally, you’ll learn how data teams at digital native companies are lowering costs, speeding up time to market and powering new innovations that disrupt industries.

 

Lastly, you’ll be able to interact with data engineers, data scientists and ML engineers and learn from one another. Databricks engineers and open source committers for Apache Spark™, Delta Lake and MLflow will be present to discuss emerging trends and ways for you to get involved in the open source community.

 

Register today so you can:

  • Hear about the open lakehouse architecture and the advantages it offers over data warehouses and lakes
  • Find out how to extend and simplify your data platform by adopting lakehouse architecture concepts
  • See how you can add reliability, performance and governance to your open data lake
  • Hear how digital natives build highly scalable and reliable data pipelines for analytics and machine learning
  • Network with and hear from your data engineering and machine learning peers at other digital native companies

 

Agenda (PT):

  • 10:00–10:20 AM Enabling an Open, Petabyte-Scale Data Architecture With Databricks
  • 10:20–10:40 AM Lakehouse Architecture in Practice at Scribd
  • 10:40–11:20 AM Customer Panel — Lessons Learned in Building Data Platforms
  • 11:20–12:00 PM AMA With the Databricks Technical Team

Speaker and Presenter Information


Tyler Croy
Director of Platform Engineering
Scribd
 

Hien Luu
Sr. Engineering Manager
DoorDash
 

Chris Locklin
Engineering Manager
Grammarly
 

Sherwin Wu
Engineering Manager

Relevant Government Agencies

Other Federal Agencies, Federal Government, State & Local Government


Event Type
Virtual


This event has no exhibitor/sponsor opportunities


When
Wed, Sep 29, 2021, 1:00pm ET


Cost
Complimentary: $0.00




Organizer
Databricks

