Recent Posts (all)

My @NoSQLmatters Barcelona talk

It seems like, right now, I only write in this blog when a get a new child or when I speak at some public event. Not breaking the tradition is this post, with the link to the slides accompanying the talk.

As John Gruber aptly put it (although for another conference):

Great crowd, great venue, and an amazing array of fellow speakers. It was a real honor and a thrill to speak [there].

I completely share John’s take if I had to describe my experience at NoSQL matters Barcelona.

Without further ado, here’s the link to the slides.


It took more than 9 months (9 months and four days to be precise), but Elia Lanzani is now here. He’s a healthy boy. He weights 3.8 kg and he seems like he could be a sleeper (I certainly hope so!).

The real hero is of course my wife that almost didn’t blink while delivering. If before this I was writing little here, rest assured I’ll write even less now :)

My NoSQL matters Dublin talk

I was honored to have been invited at NoSQL matters Dublin 2014 as a speaker. The title of the talk was Real time data driven applications (and SQL vs NoSQL databases) and being there was a great pleasure. Dublin is lovely, they have really good beer, the typical Irish breakfast charmed me and beef had me yearning so good was it.

As a cherry on top of that, my wife flew with me, as we got married six years ago.

You can find the slides for the talk here in pdf and on Speakerdeck. I’ve even uploaded some pictures to Flickr.

My PyData Berlin talk, now with video.

As announced (you heard it here first!) I talked at PyData Berlin 2014. You can find the slides at Speakerdeck and a video of the presentation on Youtube. The video seems to be rather popular on the PyData Youtube channel, but we’ll see if it’ll stand the test of time.

Kudos to Numfocus and Pydata for promoting the conference and kudos to the organisation for putting the video’s online in record time (faster than WWDC’s video’s).

Web Excursions for July 26, 2014

Lot of bookmarks this time:

A command-line utility written in Go for batch-sending email.
Ma (negative space)
I’m always fascinated by the Japanese culture.
Find out what’s keeping your Mac awake
Sometimes selecting Sleep from the Apple menu doesn’t do anything. In that case, there’s a Terminal command that’ll tell you which processes are keeping your Mac awake. This is SO useful!
What I Wish I Knew When Learning Haskell ( Stephen Diehl )
A skimmable reference for intermediate level Haskell topics and an aggregate of the best external resources for diving into those subjects with more depth.
A command line tool for showing the progress of long-running coreutil functions like mv and cp.
You’re probably using the wrong dictionary « the blog
Add the best English dictionary to your Mac.
Online syntax highlighting for “MySQL”
Online syntax highlighting for more than 100 languages, including C#, Java, PHP, Basic, Perl, Python, Pascal, SQL, JavaScript and others.
Vim Awesome
Awesome Vim plugins from across the universe.
Cloudmarks - Canisbos
Cloudmarks (formerly Moofmarks) is a Safari extension that works with cloud bookmarking services Pinboard, Delicious, Kippt, and Google Bookmarks, letting you access your cloud bookmarks in a convenient popover.
A web-based launchd.plist generator.
100+ Interesting Data Sets for Statistics
Looking for interesting data sets? Here’s a list of more than 100 of the best stuff, from dolphin relationships to political campaign donations to death row prisoners.
The utility contacts gives you access from the terminal to view and search all your records in the Address Book database.
Shaping up with Angular.js
Learn to build an application using Angular.js
Syncthing replaces Dropbox and BitTorrent Sync by being open and decentralised. Runs on OS X, Windows, Linux, FreeBSD and Solaris.
Vincent takes Python data structures and translates them into Vega visualization grammar. It allows for quick iteration of visualization designs via getters and setters on grammar elements, and outputs the final visualization to JSON. Perhaps most importantly, Vincent groks Pandas DataFrames and Series in an intuitive way.

Maciej at Beyond Tellerand

I need to be on vacation in a house without internet connectivity1 to catch up with what is happening on the web. So two months after Maciej Cegłowski gave a talk at Beyond Tellerand in Düsseldorf, here am I, linking to it.

The talk is about the consequences of the internet on our lives. Maciej worked at Yahoo and runs a profitable business selling something (almost) all his competitors are giving away for free so he’s no stupid and mostly knows what he’s saying. As a bonus, his sense of humour is almost unmatched on the web (follow him if you don’t believe me).

If you’re short on time (like me), at least read it up to the second animal picture. But you’ll miss gems like:

You can dress up a bug and call it a feature. You can also put dog crap in the freezer and call it ice cream. But people can taste the difference.

  1. You may see this post appear on my site days (if not weeks) after I wrote it. [return]

Pydata Berlin 2014

I am thrilled to announce that I will speak this next July (25th and 26th, to be precise) at Pydata Berlin 2014, about Python and pandas as back end to real-time data driven applications. From the abstract of the talk:

For data, and data science, to be the fuel of the 21th century, data driven applications should not be confined to dashboards and static analyses. Instead they should be the driver of the organizations that own or generates the data. Most of these applications are web-based and require real-time access to the data. However, many Big Data analyses and tools are inherently batch-driven and not well suited for real-time and performance-critical connections with applications. Trade-offs become often inevitable, especially when mixing multiple tools and data sources. In this talk we will describe our journey to build a data driven application at a large Dutch financial institution. We will dive into the issues we faced, why we chose Python and pandas and what that meant for real-time data analysis (and agile development). Important points in the talk will be, among others, the handling of geographical data, the access to hundreds of millions of records as well as the real time analysis of millions of data points.

The full schedule is available and if you’re into Python and data I warmly suggest you go. Registrations are still open.

Xebicon 2014

In case you’re in the Netherlands (or nearby! Hello Belgians), and are interested in big data and data science, software development, continuous integration or architecture, then you should come to Xebicon 2014. At €95 it is dirty cheap (you get good food too!) but if you’re a swell chap I have a promo code for you (just drop me a line and you’ll get the early bird price).

Some highlights from the sessions:

The basis of all Haskell love-hate relationships

Depends on what you think about this snippet of code:1

let square y = y * y; limit = 100 in [(x, y, z) | y <- [1..limit], x <- [1..y], z <- [1..limit],  square x + square y == square z]

  1. It finds all the right triangles with integer sides smaller than limit without duplicates. If you want the version with duplicates, just use x <- [1..limit] [return]

Web Excursions for April 18, 2014

Learn Web Development From Scratch - SlideRule
Udacity’s awesome course, CS 253 - Web Development, by Steve Huffman, forms the backbone of this course.
Spiped is a utility for creating symmetrically encrypted and authenticated pipes between socket addresses, so that one may connect to one address and transparently have a connection established to another address. This is similar to ‘ssh -L’ functionality, but does not use SSH and requires a pre-shared symmetric key.
Skala Color, a Mac color picker by Bjango
Skala Color is a compact and feature-rich OS X color picker that works with a huge variety of formats, covering everything you’re likely to need for web, iOS, Android, and OS X development — Hex, CSS RGBA, CSS HSLA, UIColor, NSColor and more.
tiimgreen/github-cheat-sheet · GitHub
An extremely nice collection of tips for git and GitHub. I already had a bunch of convenience aliases set up, but I was missing gits status -sb.
Alexandru Cobuz wrote on April 10th, 2014 at 05:54:
PredictionIO is an open source machine learning server for software developers to create predictive features, such as personalization, recommendation and content discovery. I will definitely check it out at the next Google Friday.
A fast MySQL driver written in pure C/C++ for Python. Compatible with gevent through monkey patching.
Bypassing “clang: error: unknown argument”
This page describe how to bypass that nasty Clang 5.1 problem when compiling stuff in Mavericks. Note that, for fish users, the fix is using
set -x ARCHFLAGS -Wno-error=unused-command-line-argument-hard-error-in-future
pip install whatever
A curated list of free programming books hosted at GitHub.
Data scientists need their own GitHub. Here are four of the best options
Devs have GitHub. Now data scientists have more tools to help them work together.
Quick tip: Best practices for rechargeable batteries - The Sweet Setup
Some of the best (good?) practices for rechargeable batteries.