Artwork

Content provided by Brian Olsen and Trino Community. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Brian Olsen and Trino Community or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

45: Trino swimming with the DolphinScheduler

1:54:46
 
Share
 

Manage episode 358485307 series 2796878
Content provided by Brian Olsen and Trino Community. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Brian Olsen and Trino Community or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

DolphinScheduler is a popular Apache data workflow orchestrator that enables running complex data pipelines. They recently added a Trino integration and will be demonstrating how to use DolphinScheduler to enable a series of transformations on the data lakehouse with Trino.

- Intro Music: 0:00

- Intro: 0:31

- Trino release 407: 13:22

- What is workflow orchestration?: 21:12

- Why do we need a workflow orchestration tool for building a data lake?: 31:07

- What is Apache DolphinScheduler?: 37:35

- Does DolphinScheduler have any computing engine or storage layer?: 53:11

- What are the differences with other workflow orchestration, such as Apache Airflow?: 58:46

- Demo: Creating a simple Trino workflow in DolphinScheduler: 1:26:44

- PR: Improve performance of Parquet files: 1:47:04

Show Notes: https://trino.io/episodes/45

Show Page: https://trino.io/broadcast/

  continue reading

57 episodes

Artwork
iconShare
 
Manage episode 358485307 series 2796878
Content provided by Brian Olsen and Trino Community. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Brian Olsen and Trino Community or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

DolphinScheduler is a popular Apache data workflow orchestrator that enables running complex data pipelines. They recently added a Trino integration and will be demonstrating how to use DolphinScheduler to enable a series of transformations on the data lakehouse with Trino.

- Intro Music: 0:00

- Intro: 0:31

- Trino release 407: 13:22

- What is workflow orchestration?: 21:12

- Why do we need a workflow orchestration tool for building a data lake?: 31:07

- What is Apache DolphinScheduler?: 37:35

- Does DolphinScheduler have any computing engine or storage layer?: 53:11

- What are the differences with other workflow orchestration, such as Apache Airflow?: 58:46

- Demo: Creating a simple Trino workflow in DolphinScheduler: 1:26:44

- PR: Improve performance of Parquet files: 1:47:04

Show Notes: https://trino.io/episodes/45

Show Page: https://trino.io/broadcast/

  continue reading

57 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide