Artwork

Content provided by Tomasz Nurkiewicz. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Tomasz Nurkiewicz or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

#93: K-means clustering: machine learning algorithm to easily split observations into multiple buckets

4:16
 
Share
 

Manage episode 352266111 series 2680464
Content provided by Tomasz Nurkiewicz. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Tomasz Nurkiewicz or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

K-means clustering is an algorithm for partitioning data into multiple, non-overlapping buckets. For example, if you have a bunch of points in two-dimensional space, this algorithm can easily find concentrated clusters of points. To be honest, that’s quite a simple task for humans. Just plot all the points on a piece of paper and find areas with higher density. For example, most of the points are located on the top-left of the plane, some at the bottom and a few at the centre-right. However, this is not that straightforward once you can no longer rely on graphical representation. For instance, when your data points live 3-, 4- or 100-dimensional space. Turns out, this is not that uncommon. Let me clarify.

Read more: https://nurkiewicz.com/93

Get the new episode straight to your mailbox: https://nurkiewicz.com/newsletter

  continue reading

98 episodes

Artwork
iconShare
 
Manage episode 352266111 series 2680464
Content provided by Tomasz Nurkiewicz. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Tomasz Nurkiewicz or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

K-means clustering is an algorithm for partitioning data into multiple, non-overlapping buckets. For example, if you have a bunch of points in two-dimensional space, this algorithm can easily find concentrated clusters of points. To be honest, that’s quite a simple task for humans. Just plot all the points on a piece of paper and find areas with higher density. For example, most of the points are located on the top-left of the plane, some at the bottom and a few at the centre-right. However, this is not that straightforward once you can no longer rely on graphical representation. For instance, when your data points live 3-, 4- or 100-dimensional space. Turns out, this is not that uncommon. Let me clarify.

Read more: https://nurkiewicz.com/93

Get the new episode straight to your mailbox: https://nurkiewicz.com/newsletter

  continue reading

98 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide