Posts by

Lorenzo Fontana

Distributed Data Analysis with Plain UNIX Commands and Docker Swarm


Reading Time: 11 minutesThe purpose of this post is to show how powerful and flexible Docker Swarm can be when combined with standard UNIX tools to analyze data in a distributed fashion. To do this, let’s write a simple MapReduce implementation in bash/sh that uses Docker Swarm to schedule Map jobs on nodes across the cluster. “Let’s see […]

Continue Reading