Go offline with the Player FM app!
Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney
Archived series ("Inactive feed" status)
When? This feed was archived on June 20, 2020 04:08 (). Last successful fetch was on May 14, 2020 16:18 ()
Why? Inactive feed status. Our servers were unable to retrieve a valid podcast feed for a sustained period.
What now? You might be able to find a more up-to-date version using the search function. This series will no longer be checked for updates. If you believe this to be in error, please check if the publisher's feed link below is valid and contact support to request the feed be restored or if you have any other concerns about this.
Manage episode 196856342 series 1980903
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the "10 Things I Hate About pandas"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage
81 episodes
Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney
Archived series ("Inactive feed" status)
When? This feed was archived on June 20, 2020 04:08 (). Last successful fetch was on May 14, 2020 16:18 ()
Why? Inactive feed status. Our servers were unable to retrieve a valid podcast feed for a sustained period.
What now? You might be able to find a more up-to-date version using the search function. This series will no longer be checked for updates. If you believe this to be in error, please check if the publisher's feed link below is valid and contact support to request the feed be restored or if you have any other concerns about this.
Manage episode 196856342 series 1980903
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the "10 Things I Hate About pandas"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage
81 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.