Monthly Archives: March 2013

Exploding multiple arrays at the same time with numeric_range

Hive allows you to emit all the elements of an array into multiple rows using the explode UDTF, but there is no easy way to explode multiple arrays at the same time. Say you have a table my_table which contains … Continue reading

Posted in Uncategorized | 7 Comments

Use collect to avoid the self-join

Hive is the 5GL for MapReduce One of the confusions in describing what Brickhouse is about, is that Hive has multiple purposes, and different uses for different people. It is analogous to SQL, but that is the trick. There are … Continue reading

Posted in Uncategorized | Tagged , , | Leave a comment