The Undocumented Web: scraping, private APIs, proxies and “alternative solutions”


By Wes Bos and Scott Tolinski - Full Stack JavaScript Web Developers.

What is the undocumented web? Scott and Wes dive into it, discussing undocumented APIs, faking, scraping, automation, and proxies, along with tips, tricks, and best practices.

Kyle Prinsloo’s Freelancing & Beyond — Sponsor

Kyle Prinsloo teaches you everything you need to know about freelancing, including how to quit your job, earn a side-income and start taking control of your life. Check it out at and use the coupon code “syntax” at checkout to get 25% off.

Freshbooks - Sponsor

Get a 30 day free trial of Freshbooks at and put SYNTAX in the “How did you hear about us?” section.

Show Notes

3:05 - What is the undocumented web?

  • When traditional methods don’t give you what you want
  • Wild West

6:10 - Undocumented APIs

17:34 - Faking

  • Faking User Agents
    • Sending headers from requests
    • Free Blackberry wifi on plane
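
A minimal sketch of the user-agent faking idea above: send headers that look like they came from a regular browser. The helper name `buildHeaders` and the example User-Agent string are assumptions for illustration, not something named in the episode.

```javascript
// Hypothetical helper: builds request headers that imitate a desktop browser.
// The User-Agent string is an example value, not one named in the episode.
const BROWSER_UA =
  "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 " +
  "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36";

function buildHeaders(userAgent = BROWSER_UA, extra = {}) {
  return {
    "User-Agent": userAgent,
    Accept: "text/html,application/xhtml+xml",
    "Accept-Language": "en-US,en;q=0.9",
    ...extra, // caller-supplied headers override the defaults
  };
}

// Usage (Node 18+ has a global fetch; the URL is a placeholder):
// const res = await fetch("https://example.com/", { headers: buildHeaders() });
```

Keeping the headers in one function makes it easy to swap in a different User-Agent per request.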

23:52 - Scraping

  • What is scraping?
  • Cheerio - jQuery-like Node package

27:30 - Automation

33:30 - Proxies

  • Need a fresh IP address
  • Need a different country
  • Side note: Canadians buying stuff in the USA: postal code M4S 1P1 === ZIP code 41100 (the three digits plus 00)
  • API throttling
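
One way to stay under an API's throttling limit is to space your own requests out. The sketch below is an assumption about how that could look, not the hosts' code: `createThrottle` is a hypothetical helper, and the clock is injectable so the logic can be checked without real timers.

```javascript
// Hypothetical helper: spaces requests at least `minIntervalMs` apart.
// `now` is injectable so the logic can be checked without real timers.
function createThrottle(minIntervalMs, now = () => Date.now()) {
  let nextAllowedAt = 0; // earliest timestamp the next request may start

  // Reserves the next slot and returns how many ms the caller should wait.
  return function reserve() {
    const current = now();
    const startAt = Math.max(current, nextAllowedAt);
    nextAllowedAt = startAt + minIntervalMs;
    return startAt - current;
  };
}

// Usage: wait for the reserved slot before each request.
// const reserve = createThrottle(1000);
// await new Promise((resolve) => setTimeout(resolve, reserve()));
```

Each call reserves a slot, so several requests queued at once end up one interval apart.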

39:00 - Tips and Tricks

  • Multiple API keys
  • diskDB - Text files as databases
  • Search GitHub for API keys
  • Inspect Element and Delete Overlays
  • Delete cookies
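
The "multiple API keys" tip could be sketched as a simple round-robin rotation, so each key only carries part of the request load. `createKeyRotator` and the key values are hypothetical names for illustration.

```javascript
// Hypothetical helper: cycles through several API keys in round-robin order.
function createKeyRotator(keys) {
  if (!Array.isArray(keys) || keys.length === 0) {
    throw new Error("at least one API key is required");
  }
  let index = 0; // position of the key to hand out next

  return function nextKey() {
    const key = keys[index];
    index = (index + 1) % keys.length; // wrap around to the first key
    return key;
  };
}

// Usage (placeholder key values):
// const nextKey = createKeyRotator(["key-one", "key-two"]);
// const apiKey = nextKey();
```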

44:45 - Stories

××× SIIIIICK ××× PIIIICKS ×××

Shameless Plugs

Tweet us your tasty treats!
