Non-Recursive Merge Sort | Baeldung on Computer Science

1. Overview

In this tutorial, we’ll discuss how to implement the merge sort algorithm using an iterative algorithm. First of all, we’ll explain the merge sort algorithm and the recursive version of it.

After that, we’ll discuss the iterative approach of this algorithm. Also, we’ll present a simple example to clarify the idea.

Finally, we’ll present a comparison between the iterative and the recursive algorithms and when to use each one.

2. Merge Sort Algorithm

As we know, the merge sort algorithm is an efficient sorting algorithm that enables us to sort an array within $\boldsymbol{O(n \times log(n))}$ time complexity, where $n$ is the number of values.

Usually, we find that the recursive approach more widespread. Thus, let’s quickly remember the steps of the recursive algorithm so that it’s easier for us to understand the iterative one later. The recursive version is based on the divide and conquers strategy:

Divide: In this step, we divide the input into two halves, the pivot being the midpoint of the array. This step is carried out recursively for all the half arrays until there are no more halves to divide.
Conquer: In this step, we sort and merge the divided parts from bottom to top and get the complete sorted result.

3. The Iterative Approach

3.1. General Idea

As we showed in the recursive version, we divided the input into two halves. This process continued until we got each element of the array alone. Then, we merged the sorted parts from bottom to top until we get the complete result containing all the values sorted.

As usual, when we’re trying to think about moving from a recursive version to an iterative one, we have to think in the opposite way of the recursion. Let’s list a few thoughts that will help us implement the iterative approach:

Consider each element of the array as a sorted part. As a start, this part contains a single value.
In the second step, merge every two adjacent elements, such that we get sorted parts. In the beginning, each part has two values. However, the last part may contain less than two values if the number of parts was odd.
Keep performing steps 1 and 2 until the size of the part reaches the entire array. By then, we can say that the result is sorted.

Now, we can jump into the implementation. However, to simplify the algorithm, we’ll first implement a function responsible for merging two adjacent parts. After that, we’ll see how to implement the complete algorithm based on the merge function.

3.2. Merge Function

Let’s implement a simple function that merges two sorted parts and returns the merged sorted array, which contains all elements in the first and second parts.

Take a look at the implementation:

algorithm Merge(A, L1, R1, L2, R2):
    // INPUT
    //     A = the array to be sorted
    //     L1 = the start of the first part
    //     R1 = the end of the first part
    //     L2 = the start of the second part
    //     R2 = the end of the second part
    // OUTPUT
    //     Returns the merged sorted array

    temp <- []
    index <- 0
    while L1 <= R1 and L2 <= R2:
        if A[L1] <= A[L2]:
            temp[index] <- A[L1]
            index <- index + 1
            L1 <- L1 + 1
        else:
            temp[index] <- A[L2]
            index <- index + 1
            L2 <- L2 + 1

    while L1 <= R1:
        temp[index] <- A[L1]
        index <- index + 1
        L1 <- L1 + 1

    while L2 <= R2:
        temp[index] <- A[L2]
        index <- index + 1
        L2 <- L2 + 1

    return temp

In the merging function, we use three $while$ loops. The first one is to iterate over the two parts together. In each step, we take the smaller value from both parts and store it inside the $temp$ array that will hold the final answer.

Once we add the value to the resulting $temp$ , we move $index$ one step forward. The variable $index$ points to the index that should hold the next value to be added to $temp$ .

In the second $while$ loop, we iterate over the remaining elements from the first part. We store each value inside $temp$ . In the third $while$ loop, we perform a similar operation to the second $while$ loop. However, here we iterate over the remaining elements from the second part.

The second and third $while$ loops are because after the first $while$ loop ends, we might have remaining elements in one of the parts. Since all of these values are larger than the added ones, we should add them to the resulting answer.

The complexity of the merge function is $\boldsymbol{O(len1 + len2)}$ , where $len1$ is the length of the first part, and $len2$ is the length of the second one.

Note that the complexity of this function is linear in terms of the length of the passed parts. However, it’s not linear compared to the full array $A$ because we might call the function to handle a small part of it.

3.3. Merge Sort

Now let’s use the merge function to implement the merge sort iterative approach.

Take a look at the implementation:

algorithm IterativeMergeSort(A, n):
    // INPUT
    //     A = the array
    //     n = the size of the array
    // OUTPUT
    //     sorted array

    len <- 1
    while len < n:
        i <- 0
        while i < n:
            L1 <- i
            R1 <- i + len - 1
            L2 <- i + len
            R2 <- i + 2 * len - 1
            
            if L2 >= n:
                break

            if R2 >= n:
                R2 <- n - 1

            temp <- Merge(A, L1, R1, L2, R2)
            for j <- 0 to R2 - L1 + 1:
                A[i + j] <- temp[j]

            i <- i + 2 * len

        len <- 2 * len

    return A

Firstly, we start $len$ from $1$ that indicates the size of each part the algorithm handles at this step.

In each step, we iterate over all parts of size $len$ and calculated the beginning and end of each two adjacent parts. Once we determined both parts, we merged them using the $merge$ function defined in algorithm 1.

Note that we handled two special cases. The first one is if $L2$ reaches the outside of the array, while the second one is when $R2$ reaches the outside. The reason for these cases is that the last part may contain less than $len$ elements. Therefore, we adjust its size so that it doesn’t exceed $n$ .

After the merging ends, we copy the elements from $temp$ into their respective places in $A$ .

Note that in each step, we doubled the length of a single part $len$ . The reason is that we merged two parts of length $len$ . So, for the next step, we know that all parts of the size $2 \times len$ are now sorted.

Finally, we return the sorted $A$ .

The complexity of the iterative approach is $\boldsymbol{O(n \times log(n))}$ , where $n$ is the length of the array. The reason is that, in the first $while$ loop, we double the value $len$ in each step. So, this is $O(log(n))$ . Also, in each step, we iterate over each element inside the array twice and call the $merge$ function for the complete array in total. Thus, this is $O(n)$ .

4. Example

Let’s take an example to understand the iterative version more clearly. Suppose we have an array $A$ as follows:

Let’s apply the iterative algorithm to $A$ .

In the first step, we’ll be merging the $1^{st}$ element with the $2^{nd}$ one. As a result, we’ll get a new sorted part which contains the first two values. Since the $2^{nd}$ is smaller than the $1^{st}$ , we’ll swap both of them.

Similarly, we perform this operation for the $3^{rd}$ and $4^{th}$ elements. However, in this case, they’re already sorted, and we keep their order. We continue this operation for the $5^{th}$ and $6^{th}$ values as well.

After these steps, $A$ will have its parts of size 2 sorted as follows:

In the second step, the $len$ becomes 2. Hence, we have three parts. Each of these parts contains two elements. Similarly to the previous steps, we have to merge the first and the second parts. Thus, we get a new part that contains the first four values. For the third part, we just skip it because it was sorted in the previous step, and we don’t have a fourth part of merging it with.

Take a look at the result after these steps:

In the last step, the $len$ becomes 4, and we have two parts. The first one contains four elements, whereas the second one contains two. We call the merging function for these parts.

After the merging ends, we get the final sorted answer:

5. Comparison

As usual, recursion reduces the size of the code, and it’s easier to think about and implement. On the other hand, it takes more memory because it uses the stack memory which is slow in execution.

For that, we prefer to use the iterative algorithm because of its speed and memory saving. However, if we don’t care about execution time and memory as if we have a small array, for example, we can use a recursive version.

6. Conclusion

In this tutorial, we explained how to implement the merge sort algorithm using an iterative approach. Firstly, we discussed the merge sort algorithm using the recursive version of the algorithm.

Then, we discussed the iterative version of this algorithm. Also, we explained the reason behind its complexity.

After that, we provided a simple example to clarify the idea well.

Finally, we presented a comparison between the iterative and the recursive algorithms and showed the best practice of using each of them.

Full Archive

About Baeldung

Core Concepts

Operating Systems

Artificial Intelligence

Graph Theory

Latex