Python中的归并排序

2025年4月17日 | 阅读9分钟

归并排序与快速排序算法类似，都基于分治（divide and conquer）的思想。它是最流行和最高效的排序算法之一。它是分治类算法的绝佳范例。

它将给定的列表分成两个子列表，分别对这两个子列表进行递归调用，然后合并两个已排序的子列表。我们定义了用于合并两个子列表的 merge() 函数。

子列表会不断地被分成两半，直到每个子列表只包含一个元素。然后，我们将一对单元素列表合并成双元素列表，并在此过程中进行排序。排序后的双元素对会被合并成四元素列表，以此类推，直到得到排序后的列表。

归并排序概念

让我们看下面的归并排序图。

我们将给定的列表分成了两个部分。如果列表不能被平均分割，这完全没关系。

归并排序可以通过两种方式实现：自顶向下（top-down）方法和自底向上（bottom-up）方法。在上面的例子中，我们使用了自顶向下方法，这通常是归并排序最常用的方法。

自底向上方法提供了更多的优化，我们稍后将进行定义。

算法的核心部分是如何合并两个已排序的子列表。让我们来合并两个已排序的归并列表。

A : [2, 4, 7, 8]
B : [1, 3, 11]
sorted : empty

首先，我们观察两个列表的第一个元素。我们发现 B 的第一个元素更小，所以我们将其添加到我们的排序列表中，并向前移动 B 列表的指针。

A : [2, 4, 7, 8]
B : [1, 3, 11]
Sorted : 1

现在我们来看下一对元素 2 和 3。2 更小，所以我们将其添加到我们的排序列表中，并向前移动 A 列表的指针。

A : [2, 4, 7, 8]
B : [1, 3, 11]
Sorted : 1

继续这个过程，我们最终得到一个排序列表 {1, 2, 3, 4, 7, 8, 11}。可能会出现两种特殊情况。

如果两个子列表有相同的元素 - 在这种情况下，我们可以选择其中一个子列表，并将元素添加到排序列表中。技术上讲，我们可以同时向前移动两个子列表的指针，并将元素添加到排序列表中。

当一个子列表中的元素用完时，只需要将另一个子列表中的剩余元素按顺序添加到排序列表中。

我们应该记住，我们可以按任何顺序对元素进行排序。我们这里按升序排序，但也可以轻松地按降序排序。

实施

归并排序算法是使用自顶向下方法实现的。这可能看起来有点难，所以我们将详细阐述每一步。在这里，我们将对两种类型的集合实现此算法：整数元素列表（通常用于介绍排序）和自定义对象（更实用和现实的场景）。

排序数组

算法的主要思想是将（子）列表分成两半并递归地对它们进行排序。我们继续这个过程，直到我们得到只包含一个元素的列表。让我们理解以下用于分割的函数：

def merge_sort(array, left_index, right_index):   
       if left_index >= right_index:   
                 return middle = (left_index + right_index)//2   
       merge_sort(array, left_index, middle)   
       merge_sort(array, middle + 1, right_index)   
       merge(array, left_index, right_index, middle)    

我们的主要关注点是在排序发生之前将列表分成子部分。我们需要得到整数值，所以我们使用 // 运算符来获取索引。

让我们通过以下步骤来理解上述过程。

第一步是创建列表的副本。第一个列表包含从 [left_index,...,middle] 到 [middle+1,?,right_index] 的列表。
我们使用指针遍历两个列表的副本，选择两个值中较小的一个，并将它们添加到排序列表中。一旦我们将一个元素添加到列表中，我们就相应地向前移动排序列表的指针。
将另一个副本中剩余的元素添加到排序数组。

让我们用 Python 程序来实现归并排序。

Python 程序

# Here, we are declaring the function to divide the lists in to the two sub lists 
# Here, we are passing the list1, left index, right index as the parameters  
def merge_sort(list1, left_index, right_index):  
    if left_index >= right_index:    # here, we are checking the if condition 
        return  
    middle = (left_index + right_index)//2   
# Here, we are finding the middle of the given two numbers
    merge_sort(list1, left_index, middle)      
# Here, we are calling the merge sort function till the middle number we got 
    merge_sort(list1, middle + 1, right_index)  
# Here, we are calling the merge sort function till the end of the list i.e., right index
    merge(list1, left_index, right_index, middle)  
# Here, we are calling the merge function to merge the divided list using the merge   # sort function above 
# Here, we are defining a function for merge the list after dividing  
def merge(list1, left_index, right_index, middle):  
   # Here, we are creating subparts of a lists  
    left_sublist = list1[left_index:middle + 1]  
    right_sublist = list1[middle+1:right_index+1] 
    # Here, we are initializing the values for variables that we use to keep  
    # track of where we are in each list1  
    left_sublist_index = 0  
    right_sublist_index = 0  
    sorted_index = left_index 
    # Here, we are traversing the both copies until we get run out one element  
    while left_sublist_index < len(left_sublist) and right_sublist_index < len(right_sublist):        # here, we are declaring a while loop
        # If our left_sublist has the smaller element, put it in the sorted  
        # part and then move forward in left_sublist (by increasing the pointer)  
        if left_sublist[left_sublist_index] <= right_sublist[right_sublist_index]: 
        # Here, we are checking the if condition, if it is true then we will enter the block 
            list1[sorted_index] = left_sublist[left_sublist_index]  
            left_sublist_index = left_sublist_index + 1  
        # Otherwise add it into the right sublist  
        else:  
            list1[sorted_index] = right_sublist[right_sublist_index]  
            right_sublist_index = right_sublist_index + 1  
            # Here, we are moving forward in the sorted part  
        sorted_index = sorted_index + 1  
     # Here, we will go through the remaining elements and add them  
    while left_sublist_index < len(left_sublist):  # here, we are declaring a while loop
        list1[sorted_index] = left_sublist[left_sublist_index]  
        left_sublist_index = left_sublist_index + 1  
        sorted_index = sorted_index + 1  
      while right_sublist_index < len(right_sublist):# here, we are declaring a while loop
        list1[sorted_index] = right_sublist[right_sublist_index]  
        right_sublist_index = right_sublist_index + 1  
        sorted_index = sorted_index + 1  
  list1 = [44, 65, 2, 3, 58, 14, 57, 23, 10, 1, 7, 74, 48] 
print("The given list before performing the merge sort is: ", list1) 
# Here, this is the input unsorted array given by the user
merge_sort(list1, 0, len(list1) -1)  
print("The given list after performing the merge sort is:", list1)     
# here, we are printing the list1 after performing the merge sort amd the merge      
# functions

输出

The given list before performing the merge sort is:
[44, 65, 2, 3, 58, 14, 57, 23, 10, 1, 7, 74, 48]
The given list after performing the merge sort is:
[1, 2, 3, 7, 10, 14, 23, 44, 48, 57, 58, 65, 74]

排序自定义对象

我们也可以使用 Python 类来排序自定义对象。此算法与上述算法几乎相同，但我们需要使其更通用，并传递比较函数。

我们将创建一个自定义类 Car，并为其添加一些字段。我们将对以下算法进行一些修改，使其更通用。我们可以通过使用 lambda 函数来实现这一点。

让我们理解下面的例子。

Python 程序

class Car:    # here, we are declaring a class named car
    def __init__(self, make, model, year):  
        self.make = make      
# Here, we are using the self to declare the make variables locally
        self.model = model  
# Here, we are using the self to declare the model variables locally
        self.year = year    
# Here, we are using the self to declare the year variables locally
    def __str__(self):  
        return str.format("Make: {}, Model: {}, Year: {}", self.make, self.model, self.year)  
   # Here, we are returning the format of the strings given
def merge(list1, l, r, m, comp_fun):  
# Here, we are defining a function for merge the list using the compound function 
    left_copy = list1[l:m + 1]      # here, we are coping the left part of the list
    r_sublist = list1[m+1:r+1]   # here, we are coping the right part of the list
    left_copy_index = 0     # here, we are coping the left part indexes of the list
    r_sublist_index = 0     # here, we are coping the right part indexes of the list
    sorted_index = l  
    while left_copy_index < len(left_copy) and r_sublist_index < len(r_sublist):  
# Here, we are declaring a while loop
        # Here, we are using the comp_fun instead of a simple comparison operator  
        if comp_fun(left_copy[left_copy_index], r_sublist[r_sublist_index]):  
# Here, we are checking the if condition, if it is true then we will enter the block
            list1[sorted_index] = left_copy[left_copy_index]  
            left_copy_index = left_copy_index + 1  
        else:    # if the condition is false then we will enter the else block
            list1[sorted_index] = r_sublist[r_sublist_index]  
            r_sublist_index = r_sublist_index + 1  
        sorted_index = sorted_index + 1  
    while left_copy_index < len(left_copy):     # Here, we are declaring a while loop
        list1[sorted_index] = left_copy[left_copy_index]  
        left_copy_index = left_copy_index + 1  
        sorted_index = sorted_index + 1 
    while r_sublist_index < len(r_sublist):      # Here, we are declaring a while loop
        list1[sorted_index] = r_sublist[r_sublist_index]  
        r_sublist_index = r_sublist_index + 1  
        sorted_index = sorted_index + 1  
def merge_sort(list1, l, r, comp_fun):  
# Here, we are declaring the merge sort function to sort the given list
    if l >= r: 
# Here, we are checking the if condition, if it is true then we will enter the block
        return  
    m = (l + r)//2     # here, we are finding the middle element of the list
    merge_sort(list1, l, m, comp_fun)   
# Here, we are calling the merge sort function till the middle number we got   
    merge_sort(list1, m + 1, r, comp_fun)  
# Here, we are calling the merge sort function from the middle number we got
    merge(list1, l, r, m, comp_fun)  
# Here, we are calling the merge function to merge the divided list using the merge   # sort function above
car1 = Car("Renault", "33 Duster", 2001)  
car2 = Car("Maruti", "Maruti Suzuki Dzire", 2015)  
car3 = Car("Tata motor", "Jaguar", 2004)  
car4 = Car("Cadillac", "Seville Sedan", 1995)  
list1 = [car1, car2, car3, car4]  
merge_sort(list1, 0, len(list1) -1, lambda carA, carB: carA.year < carB.year)  
print("Cars sorted by year:")  
for car in list1:     # here, we are declaring the for loop to iterate through list1
    print(car)     # here, we are printing all the data of the car and the list
print()  
merge_sort(list1, 0, len(list1) -1, lambda carA, carB: carA.make < carB.make)  
print("Cars sorted by make:")  
for car in list1:  # here, we are declaring the for loop to iterate through list1
    print(car)     # here, we are printing all the data of the car and the list  

输出

Cars sorted by year:
Make: Cadillac, Model: Seville Sedan, Year: 1995
Make: Renault, Model: 33 Duster, Year: 2001
Make: Tata motor, Model: Jaguar, Year: 2004
Make: Maruti, Model: Maruti Suzuki Dzire, Year: 2015

Cars sorted by make:
Make: Cadillac, Model: Seville Sedan, Year: 1995
Make: Maruti, Model: Maruti Suzuki Dzire, Year: 2015
Make: Renualt, Model: 33 Duster, Year: 2001
Make: Tata motor, Model: Jaguar, Year: 2004

优化

我们可以改进归并排序算法的性能。首先，让我们理解自顶向下和自底向上归并排序之间的区别。自底向上方法通过迭代地对相邻列表的元素进行排序，而自顶向下方法则将列表分解为两个部分。

给定的列表是 [10, 4, 2, 12, 1, 3]，而不是将其分解为 [10], [4], [2], [12], [1], [3] - 我们将其分成可能已排序的子列表：[10, 4], [2], [1, 12], [3]，现在准备对它们进行排序。

对于较小的子列表，归并排序在时间和空间上都是效率较低的算法。因此，对于较小的子列表，插入排序比归并排序更有效。

结论

归并排序是一种流行且高效的算法。对于大型列表，它是一种更有效的算法。它不依赖于任何可能导致糟糕运行时间的糟糕决策。

归并排序有一个主要缺点。它使用额外的内存来存储合并前的列表的临时副本。然而，归并排序在软件中被广泛使用。它的性能很快，并且能产生出色的结果。

我们简要讨论了归并排序的概念，并通过 lambda 函数（用于比较）在简单整数列表和自定义对象上进行了实现。

下一个主题Python 快速排序

Python中的归并排序

归并排序概念

实施

排序数组

Python 程序

排序自定义对象

Python 程序

优化

结论

联系信息

关注我们

教程

面试题

在线编译器

Python

Java

.Net Framework

AI, ML and Data Science

Cloud Technology

B.Tech and MCA

Web Technology

PHP

Software Testing

Technical Interview

Java Interview

Python

Web Interview

Database Interview

B.Tech / MCA

Important Interview

Software Testing Interview

Company Interviews

Online Compilers

Multiple Choice Questions

Python教程

Python变量和数据类型

Python控制语句

Python数据结构

Python函数

Python模块

Python OOP

Python异常处理

Python文件处理

Python搜索和排序

Python高级主题

Python MySQL

Python MongoDB

Python SQLite

Python MCQ

Python Tkinter (GUI)

Python Web Blocker

Python内置函数

Python字符串函数

Python列表

Python字典

Plotly

相关教程

Python中的归并排序

归并排序概念

实施

排序数组

Python 程序

排序自定义对象

Python 程序

优化

结论

相关帖子

Python中的搜索算法

Python中的线性搜索

Python中的排序算法

Python中的快速排序

Python中的二分搜索

Python中的Tim排序

Python中的冒泡排序

Python中的插入排序

Python中的选择排序

Python中的堆排序

订阅 Tpoint Tech

联系信息

关注我们

教程

面试题

在线编译器