crawler

A 3-post collection

The Tale of Creating a Distributed Web Crawler

Around 6 million records with about 15 fields each. This was the dataset that I wanted to analyze for a data analysis project of mine. But »

Benoit Bernard on web, crawler, scraper, distributed, scaling, python, politeness 12 September 2017

The Case of the Mysterious Python Crash

It was almost 11PM. My distributed web crawler had been running for a few hours when I discovered a very weird thing. One of its log »

Benoit Bernard on python, crawler, logs, linux, crash, requests, eventlet, signals, timeout 14 March 2017

Using Uber's Pyflame and Logs to Tackle Scaling Issues

Here I was again, looking at my screen in complete disbelief. This time, it was different though. My distributed web crawler seemed to be slowing down »

Benoit Bernard on python, crawler, scaling, performance, profiler, uber, pyflame, logs, mongodb, zeromq, linux 14 February 2017
Benoit Bernard © 2021