Scraping Websites with Python

Sometimes scraping is the only way to extract meaningful data when there are no options like an accessible API. Parsing raw HTML can be intimidating and full of failures if you aren't used to existing tooling that can help you parse faster and more efficiently. In this video, learn all the basi...

Full description

Bibliographic Details
Main Authors: Deza, Alfredo, Gift, Noah (Author)
Format: eBook
Language:English
Published: Pragmatic AI Solutions 2021
Edition:1st edition
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
Description
Summary:Sometimes scraping is the only way to extract meaningful data when there are no options like an accessible API. Parsing raw HTML can be intimidating and full of failures if you aren't used to existing tooling that can help you parse faster and more efficiently. In this video, learn all the basics including some advanced techniques to parse HTML and extract data with the Scrapy library in Python. k Topics include: * Install, configure, and create a new project with Scrapy, a powerful scraping library written in Python * See what is required to start parsing a website, including looking at raw HTML, tags, and CSS. * Identify data to create a dataset or datasets to perform data science analysis later * Capture parsed data and save it in different formats locally * Ultra fast scraping techniques by using the filesystem directly A few resources that are helpful if you are trying to do scraping, some of them covered in the course: * Scrapy Library * Scrapy Getting started tutorial
Item Description:Mode of access: World Wide Web
Made available through: Safari, an O'Reilly Media Company
Physical Description:1 video file, approximately 60 min.