Node.js 中的批处理

2025 年 3 月 3 日 | 阅读 4 分钟

在 Node.js 中，批处理 是一种通过分批或分组处理来有效处理海量数据的方法。它涉及到处理异步和并发任务。这种方法与单独处理每个项目相反。批处理可用于处理文件、数据库更新、数据转换等。它可以减少内存耗尽的可能性，并大大提高系统或程序的性能。

一般工作流程

让我们概述一下 Node.js 批处理的典型工作流程

首先，我们以批次收集需要处理的数据。这可以是一个文件、一个数组、数据库数据等。
我们设计可以处理单个项目和一组项目的函数。
我们根据系统的能力和数据的类型来计算理想的批次大小。我们弄清楚需要多少批次。
我们迭代地处理每个批次，提取数据，并应用指定的函数来处理它。
我们加入错误处理程序和日志记录，以确保操作的顺利进行并便于调试。
批处理完成后，我们就会得到一团糟。

Node.js 批处理方法

有几种方法可以处理 Node.js 文件批次。其中包括以下几种

同步方法： 我们按顺序处理每个批次，不使用并行化或异步操作。
Promises： 当活动需要异步方面，如网络请求时，我们可以使用 promises 来处理异步过程，从而实现更有序、更易读的代码。
Streams： Node.js Streams 可用于更快地分析大型数据集，并减少内存使用。
并行批处理： 我们还可以使用并行批处理来进一步提高性能，利用系统的全部容量。

让我们以一个电子商务系统的订单处理为例。假设我们有一个客户订单数据集，并且我们必须根据新的定价计划调整订单价格。我们将使用 Node.js 中的批处理来更新价格。这里，批处理是通过使用简单的同步方法实现的，该方法利用函数调用和简单的循环。

 
const ordersData = [
  { orderId: 1, product: 'Mobile', quantity: 2, price: 18 },
  { orderId: 2, product: 'Microphone', quantity: 1, price: 25 },
  { orderId: 3, product: 'Pants', quantity: 3, price: 30 },
  { orderId: 4, product: 'Laptop', quantity: 1, price: 20 },
  { orderId: 5, product: 'Table', quantity: 4, price: 27 },
  { orderId: 6, product: 'Charger', quantity: 2, price: 35 },
  { orderId: 7, product: 'Jacket', quantity: 1, price: 22 },
  { orderId: 8, product: 'Shirt', quantity: 2, price: 28 },
  { orderId: 9, product: 'Jeans', quantity: 3, price: 12 },
  { orderId: 10, product: 'Macbook', quantity: 2, price: 38 },
  { orderId: 11, product: 'Jacket', quantity: 1, price: 25 },
  { orderId: 12, product: 'Cars', quantity: 3, price: 26 },
  { orderId: 13, product: 'Bike', quantity: 1, price: 32 },
  { orderId: 14, product: 'Television', quantity: 3, price: 18 },
  { orderId: 15, product: 'TV', quantity: 4, price: 40 },
];
const batchSize = 5;

function updateOrderPrice(order) {
  const newPrice = parseFloat((order.price * 1.1).toFixed(1));
  return { ...order, price: newPrice };
}

function processABatch(batch, processingFunction) {
  for (const order of batch) {
    const updatedOrder = processingFunction(order);
    console.log(`Order ${updatedOrder.orderId} - Updated Price: $${updatedOrder.price}`);
  }
}

const numOfBatches = Math.ceil(ordersData.length / batchSize);

for (let batchIndex = 0; batchIndex < numOfBatches; batchIndex++) {
  const start = batchIndex * batchSize;
  const end = Math.min(start + batchSize, ordersData.length);

  const batch = ordersData.slice(start, end);

  processABatch(batch, updateOrderPrice);

  console.log(`Batch ${batchIndex + 1} processed.`);
  console.log("");
}

console.log('Batch processing of order prices is complete. ');   

输出

说明

首先，我们定义一个数组，其中包含有关需要处理的批次大小和客户订单的信息。
定义了 updateOrderPrice 方法来处理单个订单，方法是将订单价格四舍五入到最近的十分之一（10%）。
然后，我们定义 processABatch 函数来处理每个订单批次。在这种情况下，它将在遍历批次时为每个订单调用 processingFunction 函数和 updateOrderPrice 函数。
使用数据和预先确定的批次大小，我们计算批次数。
我们迭代地循环遍历每个批次，使用 slice 方法为每个批次提取数据批次。
我们调用 processABatch 函数来处理当前批次的数据。
在实际应用中，批处理可用于执行更复杂的操作，例如处理来自 CSV 文件的海量数据、数据库更新或数据转换。通过针对特定用例定制代码，我们可以优化 Node.js 的批处理。处理大量数据可以更有效，节省时间，并使用更少的内存。

下一主题Convert-ipv6-to-ipv4-in-nodejs

Node.js 中的批处理

一般工作流程

Node.js 批处理方法

说明

联系信息

关注我们

教程

面试题

在线编译器

Python

Java

.Net Framework

AI, ML and Data Science

Cloud Technology

B.Tech and MCA

Web Technology

PHP

Software Testing

Technical Interview

Java Interview

Python

Web Interview

Database Interview

B.Tech / MCA

Important Interview

Software Testing Interview

Company Interviews

Online Compilers

Multiple Choice Questions

Node.js 教程

Node.js MySQL

Node.js MongoDB

区别

其他

Node.js 选择题

Node.js Express

面试题

Node.js 中的批处理

一般工作流程

Node.js 批处理方法

说明

相关帖子

Node-Canvas 是什么？

Node.js 中的 dns.resolveSrv(hostname, callback) 函数

Node.js 中的 buf.includes(value[,byteOffset][,encoding]) 函数

Node.js date-and-time Date.isLeapYear() 方法

2024 年 8 大 Node.js 设计模式

Node.js Stream readable.pipe() 方法

Node.js stats.dev 属性

Node.js 中的 tracingChannel.traceCallback(fn[, position[, context[, thisArg[, ...args]]]]) 函数

Node.js zlib.bytesWritten 属性

如何使用 Node.js 执行 SOAP 请求？

订阅 Tpoint Tech

联系信息

关注我们

教程

面试题

在线编译器