Monthly Archives: March 2017

Defeat the Titans with salt !!!

Last time, we discussed how you could use Brickhouse’s sketch set implementation to scalably handle counting uniques. Even with sketch sets, however, there are times when skew or unbalanced datasets can reak havoc with your jobs. Even when Hive uses … Continue reading

Posted in Uncategorized | Leave a comment