The Complex World of Data Scientists and Black-Box Algorithms

25:17
 
Share
 

Manage episode 218658719 series 1951941
By Data Crunch Corporation. Discovered by Player FM and our community — copyright is owned by the publisher, not Player FM, and audio is streamed directly from their servers. Hit the Subscribe button to track updates in Player FM, or paste the feed URL into other podcast apps.
Hilary Mason is a huge name in the data science space, and she has an extensive understanding of what's happening in this space. Today, she answers these questions for us: What are the backgrounds of your typical data scientists? What are key differences between software engineering and data science that most companies get wrong? How should you measure the effectiveness of your work or your team's work as a data scientist for the best results? What is a good approach for creating a successful data product? How can we peak behind the curtain of black-box deep learning algorithms? Below is a partial transcript. For the full interview, listen to the podcast episode by selecting the Play button above or by selecting this link, or you can also listen to the podcast through Apple Podcasts, Google Play, Stitcher, and Overcast. Curtis: Today we hear from one of the biggest thinkers in the data science space, someone who DJ Patil endorses on LinkedIn for data science skills. She worked at bit.ly, the url shortener, and is a data scientist in residence at venture capital firm Accel Partners, a firm that helped fund some companies you may know, like Facebook, Slack, Etsy, Venmo, Vox Media, Lynda.com, Cloudera, Trifacta—and you get the picture. Ginette: The partner of this VC firm said that Accel wouldn’t have brought on just any data scientist. This position was specifically created because this particular data scientist might be able to join their team. Curtis: But beyond her position as data in residence with Accel, she founded a company that’s doing very interesting research, and today, she shares with us some of her experiences and perspective on where AI is headed. Ginette: I’m Ginette. Curtis: And I’m Curtis. Ginette: And you are listening to Data Crunch. Curtis: A podcast about how data and prediction shape our world. Ginette: A Vault Analytics production. Hilary: I'm Hilary Mason, and I'm the founder and CEO of Fast Forward Labs (Please note that Hilary is now the VP of Research at Cloudera). In addition to that, I'm a data science in residence for Accel Partners. And I've been working in what we now call data science, or even now call AI, for about twenty years at this point. Started my career in academic machine learning and decided startups were more fun and have been doing that for about 10, 12 years depending on how you count now, and it's a lot of fun! Ginette: Something I’d like to note here is there’s been a very recent change: Hilary’s company, Fast Forward Labs, and Cloudera recently joined forces, and Hilary’s new position is Vice President of Research at Cloudera. Now, one thing that Hilary talks to is where the data scientists she works with come from, which is a great example of the different paths people take to get into this field. Hilary I am a computer scientist, and I have studied computer science. It's funny because now at Fast Forward, our team only has only two computer scientists on it, and one of them is our general counsel, and one is me, and I'm running the business, so most of the people doing data science here come from very different backgrounds. We have a bunch of physicists, mathematicians, a neuroscientist, a person who does brilliant machine learning design who was an English major, and so data science is one of those fields where one of the things I really love about it is that people come to it from so many different backgrounds, but mine happens to be computer science. The people on our team at Fast Forward typically have a PhD in a quantitative field, such as physics, neuroscience, electrical engineering, and then have, through that, learned sufficient programming skill. One of the jokes I make about my team is that we're essentially a halfway house for wayward academics in the sense that we can absorb people and teach them to be good software engineers, help them understand the difference between theoretical machine learning an...

69 episodes