DEV Community

Scraping

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
I Built an API That Turns Any Website Into JSON Using Just CSS Selectors

I Built an API That Turns Any Website Into JSON Using Just CSS Selectors

Comments
2 min read
Why I ditched regex scrapers for an LLM parser (and when you shouldn't)

Why I ditched regex scrapers for an LLM parser (and when you shouldn't)

1
Comments
4 min read
How to Scrape Google Search Results with Python Without Getting Blocked (2026)

How to Scrape Google Search Results with Python Without Getting Blocked (2026)

Comments
10 min read
Proxy Rotation & Session Management for AI Web Agents

Proxy Rotation & Session Management for AI Web Agents

Comments
6 min read
I Built an API That Turns Any Website Into JSON Using Just CSS Selectors

I Built an API That Turns Any Website Into JSON Using Just CSS Selectors

Comments
2 min read
Rate Limits & Anti-Bots in Agentic Scraping

Rate Limits & Anti-Bots in Agentic Scraping

1
Comments
5 min read
The Anti-Bot Detection Checklist I Use Before Every Scraping Project

The Anti-Bot Detection Checklist I Use Before Every Scraping Project

Comments
4 min read
My Apify Promotion Filter: Scale Clean APIs, Hold Back Noisy Demand

My Apify Promotion Filter: Scale Clean APIs, Hold Back Noisy Demand

Comments
3 min read
Regex broke my scraper: Using LLMs for robust data extraction

Regex broke my scraper: Using LLMs for robust data extraction

2
Comments
5 min read
I Thought I Knew Web Scraping — Until I Hit JavaScript

I Thought I Knew Web Scraping — Until I Hit JavaScript

Comments
4 min read
My web scraping nightmare ended when I let an LLM read the HTML

My web scraping nightmare ended when I let an LLM read the HTML

Comments
5 min read
Why I Gave Up on Regex and Started Using AI for Web Scraping

Why I Gave Up on Regex and Started Using AI for Web Scraping

Comments
5 min read
I Spent a Weekend Fighting Flaky Scrapers — Here’s What Finally Worked

I Spent a Weekend Fighting Flaky Scrapers — Here’s What Finally Worked

Comments
5 min read
Advanced Headless Browser Anti-Bot Techniques: TLS & Canvas

Advanced Headless Browser Anti-Bot Techniques: TLS & Canvas

Comments
6 min read
Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.