Quicksort
Usage
Often the best practical choice for sorting because it is remarkably efficient on the average, expected running time is $O(n\ lg\ n)$, and the hidden constant factors are quite small.
Moreover, Quicksort has a smaller constant than merge sort. So if they’re both $O(n log n)$ time, quicksort is faster, and in practice it is because it hits the average case way more often than the worst case.
Complexity
Worst | Best | Average | |
---|---|---|---|
Time | $O(n^2)$ | $O(n log n)$ | $O(n log n)$ |
Space | $O(1)$ | $O(1)$ | $O(1)$ |
Tweaks in implementation might optimize the algorithm for specific use cases.
Explanation
Quicksort applies the Divide & Conquer technique.
-
Divide: Partition (rearrange) the array $A[p..r]$ into two (possible empty) subarrays $A[p..q-1]$ and $A[q+1..r]$ such that each element of $A[p..q-1]$ is less than or equal to $A[q]$, which is, in turn, less than or equal to each element of $A[q+1..r]$. Compute the index $q$ as part of this partitioning procedure.
-
Conquer: Sort the two subarrays $A[p..q-1]$ and $A[q+1..r]$ by recursive calls to quicksort.
-
Combine: Because the subarrays are already sorted, no work is needed to combine them, entire array $A[p..q]$ is sorted.
Performance
Running time depends on whether the partitioning is balanced or not, which in turn depends on which elements are used for patitioning.
If the partitioning is balanced, the algorithm runs asymptotically as fast as marge sort. If the partiotining is unbalanced, however, it can run asymptotically as aslowly as insertion sort.
Worst-case partitioning
If the element used for partitioning is either the smallest or the biggest one then partitioning produces one subproblem with n-1 elements and one with 0 elements.
Best-case partitioning
In the most even possible split, partitioning produces two subproblems, each of size no more than n/2: one is of size | n/2 | and one of size | n/2 | -1. |
Balanced partitioning
The average-case running time is much closer to the best case.
When we run quicksort on a random input array, the partitioning is highly unlikely to happen in the same way at every level. We expect that some of the splits will be reasonably well balanced an some will be fairly unbalanced.
In the average case, partitioning produces a mix of “good” and “bad” splits, which are distributed randomly throughout the recursion tree.
Implementation details
Pseudocode
QUICKSORT(A, 0, A.length-1)
QUICKSORT(A, p, r)
if p < r:
q = PARTITION(A, p, r)
QUICKSORT(A, p, q-1)
QUICKSORT(A, q+1, r)
PARTITION(A, p, r)
x = A[r]
i = p-1
for j = p to r-1
if A[j] <= x
i = i + 1
exchange A[i] with A[j]
exchange A[i+1] with A[r]
return I + 1
Randomized version
We cannot always expect that permutations of input numbers are equally likely.
We could use random sampling, instead of always using $A[r]$ as the pivot, we will select a randomly chosen element for the subarray $A[p..r]$. We do so by first exchanging the element $A[r]$ with the randomly chosen element.
By randomly sampling the range $p,…,r$ we ensure that the pivot element $x = A[r]$ is equally likely to be any of the $r-p+1$ elements in the subarray, and we can expect the partitioing to be reasonably well balanced on average.
RANDOMIZED-PARTITION(A,p,r):
i = RANDOM(p, r)
exchange A[r] with A[i]
return PATITION(A, p, r)