Hive Partitioning with Spark
I experimented with Hive partitioning and some of the things I discussed in this blog post are:
- Query response time decreased by 75% when I use partitioning.
- Partitioning can help us to organize our data in ways that work better for our use cases (for example, to partition each experiment output in a partitioned table)
Full article available here