The Simpson's Paradox in the offline evaluation of recommendation systems