Serialization and Deserialization of a Binary Tree in Python

January 5, 2025 | 6 min read

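Serialization is the process of storing a tree data structure in a file so that it can be restored later when needed. The only requirement is that the structure of the tree is preserved. Deserialization is the exact reverse: it rebuilds the tree from the file.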

Depending on the type of tree, the serialization process can be simplified.

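For example, if the binary tree is a binary search tree, it is enough to store either its preorder or its postorder traversal. Because the keys are ordered, either traversal alone is sufficient to recover the complete structure of the tree.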
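As a rough sketch of the binary search tree case, the snippet below stores only the preorder traversal and rebuilds the tree by passing down the range of values each subtree may contain. The names `Node`, `bst_preorder`, and `bst_from_preorder` are illustrative and are not part of the program later in this article, and the keys are assumed to be distinct.

```python
class Node:
    def __init__(self, val):
        self.val = val
        self.left = None
        self.right = None


def bst_preorder(root):
    """Return the preorder traversal of a binary search tree as a list."""
    if root is None:
        return []
    return [root.val] + bst_preorder(root.left) + bst_preorder(root.right)


def bst_from_preorder(values):
    """Rebuild a binary search tree from its preorder traversal (distinct keys assumed)."""
    pos = 0  # index of the next unread value

    def build(low, high):
        nonlocal pos
        # Stop when the values run out or the next value belongs to another subtree
        if pos == len(values) or not (low < values[pos] < high):
            return None
        node = Node(values[pos])
        pos += 1
        node.left = build(low, node.val)    # smaller keys go to the left
        node.right = build(node.val, high)  # larger keys go to the right
        return node

    return build(float("-inf"), float("inf"))


root = bst_from_preorder([8, 3, 1, 6, 10, 14])
print(bst_preorder(root))  # [8, 3, 1, 6, 10, 14]
```

No markers for empty children are needed here, because the ordering of the keys already tells us where each subtree ends.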

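If the given binary tree is a complete binary tree, a level-order traversal is enough to capture its structure. In a complete binary tree every level is fully filled, except possibly the last, which is filled from left to right, so the level-order traversal has no gaps and needs no markers for empty nodes. The first value is the root, and the values that follow belong to the next levels in order. Since the defining property of a complete binary tree is stated in terms of its levels, level-order traversal is the most natural fit.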
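The complete binary tree case can be sketched as follows. The idea is that, in level order, the children of the value at index i sit at indices 2i + 1 and 2i + 2. The names `Node`, `level_order`, and `complete_from_level_order` are assumptions made for this illustration only.

```python
from collections import deque


class Node:
    def __init__(self, val):
        self.val = val
        self.left = None
        self.right = None


def level_order(root):
    """Return the node values level by level, from left to right."""
    values = []
    queue = deque([root] if root else [])
    while queue:
        node = queue.popleft()
        values.append(node.val)
        if node.left:
            queue.append(node.left)
        if node.right:
            queue.append(node.right)
    return values


def complete_from_level_order(values):
    """Rebuild a complete binary tree from its level-order traversal."""
    nodes = [Node(v) for v in values]
    for i, node in enumerate(nodes):
        left, right = 2 * i + 1, 2 * i + 2
        # Indices past the end of the list simply mean "no child"
        if left < len(nodes):
            node.left = nodes[left]
        if right < len(nodes):
            node.right = nodes[right]
    return nodes[0] if nodes else None


root = complete_from_level_order([1, 2, 3, 4, 5, 6])
print(level_order(root))  # [1, 2, 3, 4, 5, 6]
```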

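If the given binary tree is a full binary tree, a preorder traversal alone is enough. A full binary tree is one in which every node has either 0 or 2 children, so it suffices to record, for each node, whether it is an internal node or a leaf.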
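A minimal sketch of the full binary tree case is shown below. Each node is stored in preorder as a (value, is_leaf) pair, which is one possible way to keep the internal/leaf information. The names `Node`, `serialize_full`, and `deserialize_full` are hypothetical, and the tree is assumed to be non-empty.

```python
class Node:
    def __init__(self, val, left=None, right=None):
        self.val = val
        self.left = left
        self.right = right


def serialize_full(root):
    """Return preorder (value, is_leaf) pairs for a non-empty full binary tree."""
    is_leaf = root.left is None  # in a full tree a node has 0 or 2 children
    pairs = [(root.val, is_leaf)]
    if not is_leaf:
        pairs += serialize_full(root.left) + serialize_full(root.right)
    return pairs


def deserialize_full(pairs):
    """Rebuild the full binary tree from the pairs produced by serialize_full."""
    it = iter(pairs)

    def build():
        val, is_leaf = next(it)
        node = Node(val)
        if not is_leaf:
            # An internal node is always followed by exactly two subtrees
            node.left = build()
            node.right = build()
        return node

    return build()


tree = Node(1, Node(2), Node(3, Node(4), Node(5)))
pairs = serialize_full(tree)
print(pairs)
# [(1, False), (2, True), (3, False), (4, True), (5, True)]
```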

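For a general binary tree, we would normally need both the preorder and the inorder traversal to reconstruct it. However, we can save space by storing only the preorder traversal, provided we also record where the empty (null) children are. We use # as the marker for this purpose.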

So we store each node followed by its children in preorder, writing # for every empty child. Finally, this traversal can be saved to a file.

Code

# Python program to serialize a binary tree

# Creating a constructor class to create and store the binary tree nodes
class TreeNode:

  # Defining the function to initialize the pointers of the tree node
  def __init__(self, val):
    self.val = val
    self.left = None
    self.right = None

# Creating a class to hold the binary tree and perform operations on it
class BinaryTree:

  # Storing the root node of the binary tree
  def __init__(self):
    self.root = None

  # Defining the function to serialize the given binary tree
  def serialize(self, root):

    # If the tree is empty, then return None
    if not root:
      return None

    # We will use the stack data structure to store the nodes of the tree
    stack = [root]

    # Creating a list to store the serialized list of nodes
    l = []

    # Using a while loop which will work until the stack is empty
    while stack:
      n = stack.pop()

      # If the current node of the tree is Null, then we will store it as a "#"
      if not n:
        l.append("#")
      else:
        # If the node is not null, store its value and push its children on the stack; the right child is pushed first so that the left child is popped (visited) first
        l.append(str(n.val))
        stack.append(n.right)
        stack.append(n.left)

    # Returning the serialized tree
    return ",".join(l)

  # Defining a function to deserialize the given serialized form of the binary tree and return a binary tree
  def deserialize(self, s):
    if not s:
      return None

    # Iterating over the tokens avoids keeping the read position in a global variable
    tokens = iter(s.split(","))
    return self.utility(tokens)

  def utility(self, tokens):
    token = next(tokens)

    # A "#" marks an empty subtree, so we return None
    if token == "#":
      return None

    # We will create a tree node for the current value and then rebuild its left and right subtrees recursively, in preorder
    root = TreeNode(int(token))
    root.left = self.utility(tokens)
    root.right = self.utility(tokens)
    return root

  # Defining the function to create the inorder traversal for the constructed tree
  def inordertraversal(self, root):
    if root:
      self.inordertraversal(root.left)
      print(root.val, end = " ")
      self.inordertraversal(root.right)


# Creating a binary tree
t = BinaryTree()
t.root = TreeNode(2)
t.root.left = TreeNode(18)
t.root.right = TreeNode(20)
t.root.left.left = TreeNode(14)
t.root.left.right = TreeNode(1)
t.root.left.right.left = TreeNode(12)
t.root.left.right.right = TreeNode(4)

# Serializing the tree
serialized_tree = t.serialize(t.root)
print("Serialized tree:")
print(serialized_tree)
print()

# Deserializing the tree
deserialized_tree = t.deserialize(serialized_tree)

print("Inorder traversal of the deserialized tree:")
t.inordertraversal(deserialized_tree)

Output

Serialized tree:
2,18,14,#,#,1,12,#,#,4,#,#,20,#,#

Inorder traversal of the deserialized tree:
14 18 12 1 4 2 20

How much extra space does the above solution need?

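For n keys, the above technique needs n + 1 markers. When the keys are large, or the data items associated with them are large, this may be preferable to the simple option of storing every key twice (once in each of two traversals).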
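The n + 1 figure follows from a simple count: a binary tree with n nodes has 2n child slots, and n - 1 of them are occupied by the non-root nodes, which leaves n + 1 empty slots. As a quick check, the snippet below counts the markers in the serialized string printed by the program above.

```python
# Serialized form printed by the program above
serialized = "2,18,14,#,#,1,12,#,#,4,#,#,20,#,#"

tokens = serialized.split(",")
markers = tokens.count("#")     # number of "#" markers
keys = len(tokens) - markers    # number of real keys

print(keys, markers)  # 7 8
```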

Can we optimize it further?

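There are several ways to optimize the above solution. A closer look at the serialized tree shows that every leaf node requires two markers. A simple optimization is to add one extra bit to every node that indicates whether it is an internal node or an external (leaf) node.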

Because leaves can then be recognized by this bit, we no longer need to store two markers for each leaf.

An internal node with only one child still requires one marker.

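For example, in the accompanying figure, the ' symbol indicates that a node's internal-node bit is set, and the '/' character indicates a NULL marker.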
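One possible way to realize this idea for a binary tree is sketched below, using the same notation: a trailing ' on a token plays the role of the internal-node bit, and / is the NULL marker. The function names `serialize_opt` and `deserialize_opt` and the token format are assumptions made for this sketch, and the keys are assumed to be integers.

```python
class Node:
    def __init__(self, val, left=None, right=None):
        self.val = val
        self.left = left
        self.right = right


def serialize_opt(root):
    """Return preorder tokens; internal nodes carry a trailing ', leaves need no markers."""
    if root is None:
        return ["/"]  # NULL marker for a missing child of an internal node
    if root.left is None and root.right is None:
        return [str(root.val)]  # leaf: bit not set, no markers stored
    return [str(root.val) + "'"] + serialize_opt(root.left) + serialize_opt(root.right)


def deserialize_opt(tokens):
    """Rebuild the tree from the tokens produced by serialize_opt."""
    it = iter(tokens)

    def build():
        token = next(it)
        if token == "/":
            return None
        if token.endswith("'"):
            # Internal node: exactly two subtrees (or NULL markers) follow
            node = Node(int(token[:-1]))
            node.left = build()
            node.right = build()
            return node
        return Node(int(token))  # leaf

    return build()


# Node 18 has only a left child, so it still needs one NULL marker
tree = Node(2, Node(18, Node(14)), Node(20))
tokens = serialize_opt(tree)
print(tokens)  # ["2'", "18'", '14', '/', '20']
```

With the plain # scheme the same tree would need six markers; here only the single missing child of node 18 is marked.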

Code

# Python program to show how to serialize and deserialize the given N-ary tree

# Creating a class to create and store the nodes of the N-ary tree
class Node:

  # Defining the function to initialize the key and the list of children of the node
  def __init__(self, key):
    self.key = key
    self.children = []

# Defining a function to create a node for the N-ary tree
def newNode(key):
  temp = Node(key)
  return temp

# Defining the function to serialize the N-ary tree and then store this serialized tree in a file
def serialize(root, file):

  # Specifying the base case of the function
  if not root:
    return

  # If the base case is not satisfied, then save the current node and recursively perform the same function for its children nodes.
  # Every token is followed by a space so that keys with more than one character, such as '10' and '11', can be read back intact
  file.write(root.key + " ")

  for c in root.children:
    serialize(c, file)

  # Placing an identifier or marker at the end of the children's list
  file.write(") ")

# Defining a function to deserialize the tree saved in the given file
def deserialize(file):

  # Reading the whole file and splitting it on whitespace gives one token per key or ")" marker
  tokens = iter(file.read().split())
  return build(tokens)

# Defining a helper function to rebuild a subtree from the remaining tokens
def build(tokens):

  # Reading the next item. If there are no more items, or the item is the end-of-children marker, then we will return None
  v = next(tokens, None)
  if v is None or v == ")":
    return None

  # We will create a new tree node for this item and recursively perform this function for the children nodes of the current node
  root = newNode(v)

  # Calling the function for the children nodes
  while True:
    c = build(tokens)
    if c is None:
      break
    root.children.append(c)

  # At the end, we will return the root node
  return root

# Defining a function to create a tree
def createTree():
  root = newNode('1')
  root.children = [newNode('2'), newNode('3'), newNode('4')]
  root.children[0].children = [newNode('5'), newNode('6')]
  root.children[2].children = [newNode('7'), newNode('8'), newNode('9'), newNode('10')]
  root.children[0].children[1].children = [newNode('11')]
  return root

# Defining a function to perform traversal on the given N-ary tree
def traverse(root):
  if root:
    print(root.key, end = " ")
    for c in root.children:
      traverse(c)

# Creating a tree and performing serializing and deserializing on this tree
root = createTree()

# We will open a file and write the serialized tree in the file
with open("N_ary tree.txt", "w") as file:
  serialize(root, file)

# We will open the same file and deserialize the tree stored in the file
with open("N_ary tree.txt", "r") as file:
  root1 = deserialize(file)

print("The deserialized N-Ary tree from file is ")
traverse(root1)

Output

The deserialized N-Ary tree from file is
1 2 5 6 11 3 4 7 8 9 10

Time complexity: The time complexity of this program is O(N), where N is the number of nodes.

Auxiliary space: The space complexity of this program is O(H + N), where H is the height of the tree (the maximum recursion depth).
