BeautifulSoup
HTML and XML parsing with Python's BeautifulSoup library
#1
Getting Started with Web Scraping in Python
Learn the basics of web scraping with Python using the Requests library and BeautifulSoup. Your first scraper in 10 minutes.
#2
CSS Selectors for Web Scraping
Master CSS selectors to extract exactly the data you need. Classes, IDs, attributes, and advanced selector patterns.
#3
Handling Pagination in Web Scraping
Learn how to scrape paginated websites by following next-page links, handling page numbers, and collecting data across multiple pages.
#18
Extracting Data from HTML Tables
Scrape HTML tables from websites using BeautifulSoup and pandas. Handle complex tables with rowspan, colspan, and nested elements.
#19
Scraping Images and Files
Download images, PDFs, and other files while web scraping. Learn URL resolution, streaming downloads, and file organization best practices.
#20
Building a Price Monitoring Scraper
Build a complete price monitoring scraper that tracks product prices over time, detects price drops, and sends alerts. A real-world scraping project.
#25
Building a News Aggregator Scraper
Build a complete news aggregator that collects articles from multiple sources using RSS feeds and web scraping. Deduplicate, categorize, and store results.
#1
HTML Parsing with BeautifulSoup - Complete Guide
Master HTML parsing with BeautifulSoup4 in Python. Learn to navigate the DOM, find elements, extract text, and handle attributes.
#2
CSS Selectors vs XPath - When to Use Which
Compare CSS selectors and XPath for web scraping. Learn the syntax, strengths, and best use cases for each approach.
#7
Extracting Structured Data from Unstructured HTML
Techniques for pulling structured records from messy, inconsistent HTML pages. Handle missing elements, variable layouts, and embedded metadata.
#9
Handling Malformed HTML
Learn techniques for parsing broken, incomplete, and malformed HTML that you commonly encounter when web scraping.