Scraping Websites with Python
Main Authors: | , |
---|---|
Format: | eBook |
Language: | English |
Published: | Pragmatic AI Solutions, 2021 |
Edition: | 1st edition |
Collection: | O'Reilly - Collection details see MPG.ReNa |
Summary: | Sometimes scraping is the only way to extract meaningful data when no alternative, such as an accessible API, is available. Parsing raw HTML can be intimidating and failure-prone if you aren't familiar with the existing tooling that helps you parse faster and more efficiently. In this video, learn the basics, along with some advanced techniques, for parsing HTML and extracting data with the Scrapy library in Python. Topics include: * Install, configure, and create a new project with Scrapy, a powerful scraping library written in Python * See what is required to start parsing a website, including looking at raw HTML, tags, and CSS * Identify data to create one or more datasets for later data science analysis * Capture parsed data and save it locally in different formats * Apply ultra-fast scraping techniques by using the filesystem directly. A few helpful resources for scraping, some of them covered in the course: * Scrapy library * Scrapy getting-started tutorial |
---|---|
Item Description: | Mode of access: World Wide Web. Made available through: Safari, an O'Reilly Media Company. |
Physical Description: | 1 video file, approximately 60 min. |