Tag Archives: performance issues

Distribution, Not Just for Joins

So I think in previous posts the need for distribution and therefore the benefit for colocation of data to handle joins has been pretty well covered.  But there are several other scenarios where data needs to be distributed to process. … Continue reading

Posted in Performance Tuning for Netezza | Tagged , , , , | 4 Comments

Distributed Joins, Process Skew

This post will build on concepts introduced in the Distributed Joins, The Basics and Distributed Joins, Modeling for Colocation posts.  I’m assuming some familiarity with table skew, where a distribution key is chosen where some key values has significantly more rows than … Continue reading

Posted in Best Practices, Performance Tuning for Netezza | Tagged , , , , | 8 Comments