
Content provided by Talk Python To Me Podcast and Michael Kennedy. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Talk Python To Me Podcast and Michael Kennedy or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

#283: Web scraping, the 2020 edition

48:34
 
Web scraping is pulling the HTML of a website down and parsing useful data out of it. The use cases for this type of functionality are endless. Have a bunch of data on governmental sites that is only listed online in HTML without a download? There's an API for that! Do you want to keep abreast of what your competitors are featuring on their site? There's an API for that. Need alerts for changes on a website, for example when enrollment opens at your college and you want to be first to get in and avoid the 8am Monday morning course slot? There's an API for that. That API is screen scraping, and Attila Tóth from ScrapingHub is here to tell us all about it. Full show notes at https://talkpython.fm/episodes/show/283/web-scraping-the-2020-edition
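
To make the "pull the HTML down and parse useful data out of it" idea concrete, here is a minimal sketch of the parsing half, using only Python's standard library. It is not taken from the episode: the sample HTML, the `CellCollector` class, and the `scrape_courses` function are hypothetical names invented for illustration, and the download step is left out (in practice it might be done with something like `urllib.request.urlopen`).

```python
from html.parser import HTMLParser

# Hypothetical HTML standing in for a page that was already downloaded.
SAMPLE_HTML = """
<html><body>
  <table id="courses">
    <tr><td>Biology 101</td><td>Open</td></tr>
    <tr><td>Chemistry 201</td><td>Closed</td></tr>
  </table>
</body></html>
"""


class CellCollector(HTMLParser):
    """Collects the text of every <td> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []          # one list of cell strings per <tr>
        self._in_cell = False   # True while inside a <td> element

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag == "td":
            if not self.rows:   # tolerate a <td> that has no enclosing <tr>
                self.rows.append([])
            self.rows[-1].append("")
            self._in_cell = True

    def handle_endtag(self, tag):
        if tag == "td" and self._in_cell:
            # Trim surrounding whitespace once the cell is complete.
            self.rows[-1][-1] = self.rows[-1][-1].strip()
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self.rows[-1][-1] += data


def scrape_courses(html):
    """Return a {course name: status} mapping parsed from the table rows."""
    parser = CellCollector()
    parser.feed(html)
    parser.close()
    return {row[0]: row[1] for row in parser.rows if len(row) == 2}


courses = scrape_courses(SAMPLE_HTML)
open_courses = [name for name, status in courses.items() if status == "Open"]
print(open_courses)
```

Run on a schedule against a real enrollment page, a check like `open_courses` could drive the kind of alert the description mentions, though a real site would need its own selectors and error handling.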
