Image Forgery Detection Using Machine Learning

June 25, 2025 | 9 min read


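In the digital age, image forgery has become increasingly common: individuals and organizations alike produce fake photographs for a range of purposes, including deception and propaganda. Tools and methods for detecting image forgery are therefore increasingly necessary, and machine learning is one of the most promising approaches.

Machine learning is a branch of artificial intelligence that lets computers learn from data without being explicitly programmed. It has many applications in image processing, including identifying fake images. The goal is to train a model to recognize the patterns that distinguish authentic images from manipulated ones, so that it can flag fakes.

Splicing, retouching, and copy-move forgery are only a few of the many forms of image forgery. Copy-move forgery copies a region of an image and pastes it elsewhere in the same image, for example to duplicate or hide an object. Splicing combines regions taken from different images into a single new image. Retouching alters the appearance of an image, for example by adjusting color, contrast, or local detail. Machine learning can be used to detect the traces these manipulations leave behind.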
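To make the difference between copy-move and splicing concrete, here is a minimal sketch on random NumPy arrays standing in for images. The helper names `copy_move` and `splice`, the patch coordinates, and the array sizes are illustrative assumptions and are not part of the pipeline built later in this article.

```python
import numpy as np

def copy_move(img, src, dst, size):
    """Copy a size x size patch from `src` to `dst` inside the same image."""
    out = img.copy()
    (sy, sx), (dy, dx) = src, dst
    out[dy:dy + size, dx:dx + size] = img[sy:sy + size, sx:sx + size]
    return out

def splice(host, donor, src, dst, size):
    """Paste a size x size patch taken from a different image (`donor`)."""
    out = host.copy()
    (sy, sx), (dy, dx) = src, dst
    out[dy:dy + size, dx:dx + size] = donor[sy:sy + size, sx:sx + size]
    return out

# Random arrays stand in for two unrelated photographs (assumption for the demo).
rng = np.random.default_rng(0)
host = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
donor = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)

cm_forged = copy_move(host, src=(0, 0), dst=(32, 32), size=16)
sp_forged = splice(host, donor, src=(0, 0), dst=(32, 32), size=16)

# Copy-move leaves two identical regions inside one image.
print(np.array_equal(cm_forged[0:16, 0:16], cm_forged[32:48, 32:48]))  # True
# Splicing leaves a region whose content comes from the donor image.
print(np.array_equal(sp_forged[32:48, 32:48], donor[0:16, 0:16]))  # True
```

A copy-move detector can look for duplicated regions within one image, while a splicing detector has to rely on inconsistencies between the pasted region and its surroundings, such as a different compression history.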

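Advantages of Using Machine Learning for Image Forgery Detection

Compared with more traditional forgery detection techniques, machine learning offers notable advantages, including speed, automation, accuracy, flexibility, scalability, and consistency. These make it well suited to detecting fake images in a wide range of applications. The main advantages include:

Scalability: machine learning algorithms can be applied to large image datasets, so forgery can be detected at scale. This matters for applications such as social media monitoring and forensic investigation.
Flexibility: the algorithms can adapt to different types of forgery and different image processing methods, which can make detection more reliable than earlier techniques.
Accuracy: the algorithms can learn to tell authentic and fake images apart even from subtle details, which can make detection more precise than earlier techniques.
Consistency: the algorithms give consistent results, so detection is carried out in a systematic and reliable way.
Automation: once trained, the algorithms detect forgery without human intervention, which saves time and resources and reduces the risk of human error.

The choice of features and classification algorithm also needs consideration, because different algorithms may perform better on different types of manipulation or image characteristics.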

Python Implementation

We will build a model that predicts whether a given image has been forged.

Importing Libraries

import os
import cv2
import random
import itertools
from tqdm import tqdm
from pathlib import Path
from natsort import natsorted
from os import makedirs, listdir
from os.path import join, exists, isdir
from PIL import Image, ImageChops, ImageEnhance

import sklearn
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
import matplotlib as mpl
from sklearn.metrics import confusion_matrix

import tensorflow as tf
from tensorflow.keras import backend as K
from tensorflow.keras import Sequential,Model
from tensorflow.keras.preprocessing import image
from tensorflow.keras.optimizers import SGD, Adam
from tensorflow.keras.utils import to_categorical
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D, Flatten
from tensorflow.keras.preprocessing.image import load_img, img_to_array
from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint, LearningRateScheduler, TensorBoard
from tensorflow.keras.applications import (MobileNetV2, Xception, InceptionV3, EfficientNetB7, ResNet101, NASNetLarge,
                                           VGG19, VGG16, DenseNet201)
from tensorflow.keras.applications import (mobilenet_v2, xception, inception_v3, efficientnet, resnet, nasnet, vgg19,
                                           vgg16, densenet)

np.random.seed(2)
plt.rcParams['figure.dpi'] = 100
mpl.rcParams['figure.figsize'] = (8, 6)
colors = plt.rcParams['axes.prop_cycle'].by_key()['color']

Next, create a class that holds the configuration values used throughout the rest of the code.

class Config:
    CASIA1 = "CASIA1"
    CASIA2 = "CASIA2"
    autotune = tf.data.experimental.AUTOTUNE
    epochs = 30
    batch_size = 32
    lr = 1e-3
    name = 'xception'
    n_labels = 2
    image_size = (224, 224)
    decay = 1e-6
    momentum = 0.95
    nesterov = False

# Dictionary having models and their preprocess

models = {
    'densenet': DenseNet201,
    'xception': Xception,
    'inceptionv3': InceptionV3,
    'effecientnetb7': EfficientNetB7,
    'vgg19': VGG19,
    'vgg16': VGG16,
    'nasnetlarge': NASNetLarge,
    'mobilenetv2': MobileNetV2,
    'resnet': ResNet101
}
# To use => myNet = models['densenet']()

preprocess = {
    'densenet': densenet.preprocess_input,
    'xception': xception.preprocess_input,
    'inceptionv3': inception_v3.preprocess_input,
    'effecientnetb7': efficientnet.preprocess_input,
    'vgg19': vgg19.preprocess_input,
    'vgg16': vgg16.preprocess_input,
    'nasnetlarge': nasnet.preprocess_input,
    'mobilenetv2': mobilenet_v2.preprocess_input,
    'resnet': resnet.preprocess_input
}

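Computing Error Level Analysis (ELA)

Error level analysis (ELA) assesses an image by measuring how much error JPEG recompression introduces in each region. The image is re-saved at a known quality and compared, pixel by pixel, with the original.

Regions that have been edited or pasted in usually have a different compression history from the rest of the image, so they tend to show a different error level. The frequency and strength of these differences are what the model will later learn from.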
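The idea can be tried without any dataset. The sketch below is an illustration under assumptions: it builds a synthetic gradient image, simulates tampering by pasting uncompressed random noise into one region, and compares the mean ELA error inside and outside that region. The helper names `jpeg_roundtrip` and `ela_map` and the quality value 90 are choices made for this example only.

```python
import io

import numpy as np
from PIL import Image, ImageChops

def jpeg_roundtrip(img, quality):
    """Compress a PIL image to JPEG in memory and decode it again."""
    buf = io.BytesIO()
    img.save(buf, "JPEG", quality=quality)
    buf.seek(0)
    return Image.open(buf).convert("RGB")

def ela_map(img, quality=90):
    """Absolute per-pixel difference between an image and its recompression."""
    recompressed = jpeg_roundtrip(img, quality)
    return np.asarray(ImageChops.difference(img, recompressed))

# Synthetic "authentic" image: a smooth gradient already saved as JPEG once.
ramp = np.linspace(0, 255, 128).astype(np.uint8)
gradient = np.stack([np.tile(ramp, (128, 1))] * 3, axis=-1)
authentic = jpeg_roundtrip(Image.fromarray(gradient), quality=90)

# Simulated tampering: paste uncompressed random noise into one region.
rng = np.random.default_rng(0)
pixels = np.array(authentic)
pixels[32:64, 32:64] = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)
tampered = Image.fromarray(pixels)

ela = ela_map(tampered, quality=90)
patch_error = ela[32:64, 32:64].mean()      # inside the pasted region
background_error = ela[96:128, :].mean()    # rows far away from the paste
print(f"pasted region: {patch_error:.2f}, untouched region: {background_error:.2f}")
```

On this synthetic input the pasted region shows a clearly higher error than the untouched rows. Real photographs are less clear-cut, which is why a classifier is trained on the ELA maps instead of applying a fixed threshold.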

def compute_ela_cv(path, quality):
    SCALE = 15
    orig_img = cv2.imread(path)
    if orig_img is None:
        raise FileNotFoundError(f'Could not read image: {path}')

    # Recompress in memory (still in BGR order) instead of writing a temporary file
    _, buffer = cv2.imencode('.jpg', orig_img, [cv2.IMWRITE_JPEG_QUALITY, quality])
    compressed_img = cv2.imdecode(buffer, cv2.IMREAD_COLOR)

    # Absolute difference, scaled with saturation so that large errors clip at 255
    # instead of wrapping around in uint8
    diff = cv2.convertScaleAbs(cv2.absdiff(orig_img, compressed_img), alpha=SCALE)
    # Convert to RGB for display with matplotlib
    return cv2.cvtColor(diff, cv2.COLOR_BGR2RGB)


def convert_to_ela_image(path, quality):
    temp_filename = 'temp_file_name.jpg'
    ela_filename = 'temp_ela.png'
    image = Image.open(path).convert('RGB')
    image.save(temp_filename, 'JPEG', quality = quality)
    temp_image = Image.open(temp_filename)

    ela_image = ImageChops.difference(image, temp_image)

    extrema = ela_image.getextrema()
    max_diff = max([ex[1] for ex in extrema])
    if max_diff == 0:
        max_diff = 1

    scale = 255.0 / max_diff
    ela_image = ImageEnhance.Brightness(ela_image).enhance(scale)
   
    return ela_image


def random_sample(path, extension=None):
    if extension:
        items = Path(path).glob(f'*.{extension}')
    else:
        items = Path(path).glob(f'*')
       
    items = list(items)
       
    p = random.choice(items)
    return p.as_posix()

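Testing on an Authentic Image

We now run ELA on an authentic image: a copy of the original is recompressed at a given JPEG quality and compared with the original to reveal regions that may have been digitally manipulated or edited. The ELA image uses color intensity to highlight how strongly each part of the image responds to recompression. The code plots the original followed by ELA maps at JPEG quality 97 down to 76.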

p = join(Config.CASIA2, 'Au/')
p = random_sample(p)
orig = cv2.imread(p)
orig = cv2.cvtColor(orig, cv2.COLOR_BGR2RGB) / 255.0
init_val = 100
columns = 3
rows = 3

fig=plt.figure(figsize=(15, 10))
for i in range(1, columns*rows +1):
    quality=init_val - (i-1) * 3
    img = compute_ela_cv(path=p, quality=quality)
    if i == 1:
        img = orig.copy()
    ax = fig.add_subplot(rows, columns, i)
    ax.title.set_text(f'q: {quality}')
    plt.imshow(img)
plt.show()

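Output (figure): a 3x3 grid with the authentic image followed by its ELA maps at JPEG quality 97 down to 76.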

Testing on a Tampered Image

Now we run the same ELA analysis on a tampered image.

p = join(Config.CASIA2, 'Tp/')
p = random_sample(p)
orig = cv2.imread(p)
orig = cv2.cvtColor(orig, cv2.COLOR_BGR2RGB) / 255.0
init_val = 100
columns = 3
rows = 3

fig=plt.figure(figsize=(15, 10))
for i in range(1, columns*rows +1):
    quality=init_val - (i-1) * 3
    img = compute_ela_cv(path=p, quality=quality)
    if i == 1:
        img = orig.copy()
    ax = fig.add_subplot(rows, columns, i)
    ax.title.set_text(f'q: {quality}')
    plt.imshow(img)
plt.show()

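Output (figure): a 3x3 grid with the tampered image followed by its ELA maps at JPEG quality 97 down to 76.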

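Testing on Spliced Fake Images

We apply the same ELA step to every image in the dataset, authentic and tampered (including spliced) alike, and convert each result into a numeric tensor with a label so that it can be fed to the model.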
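The label of each image comes from the name of its parent directory: `Au` for authentic and `Tp` for tampered. As a plain-Python sketch of that encoding (the function name `label_from_path` and the example file names are hypothetical), the lookup can be written as:

```python
from pathlib import PurePosixPath

LABELS = ("Au", "Tp")  # index 0 = authentic, index 1 = tampered

def label_from_path(file_path):
    """Return the integer label encoded by the parent directory name."""
    folder = PurePosixPath(file_path).parent.name
    return LABELS.index(folder)

# Hypothetical file names, used only to show the mapping.
print(label_from_path("CASIA2/Au/example_a.jpg"))  # 0
print(label_from_path("CASIA2/Tp/example_b.tif"))  # 1
```

Keeping this mapping in mind matters later: class index 0 means authentic and class index 1 means tampered, so any list of class names used for plotting has to follow the same order.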

import albumentations as A
from albumentations import OneOf, Compose

@tf.function
def tensor_aug(img):
    img = tf.image.random_flip_left_right(img, 5)
    img = tf.image.random_flip_up_down(img, 5)
    return img

@tf.function
def batch_aug(images, labels):
    images = tf.map_fn(lambda img: tensor_aug(img), images)
    return images, labels



def ela_process(file_path):
    QUALITY = 95
    SCALE = 15
    LABELS = np.array(['Au', 'Tp'])

    parts = tf.strings.split(file_path, os.path.sep)
    one_hot = parts[-2] == LABELS
    # Integer encode the label: 0 = 'Au' (authentic), 1 = 'Tp' (tampered)
    label = tf.argmax(one_hot)
    label = tf.cast(label, tf.float32)

    # Load the image (BGR) and resize it to the model input size
    orig = cv2.imread(file_path.numpy().decode('utf-8'))
    orig = cv2.resize(orig, Config.image_size, interpolation=cv2.INTER_AREA)

    # Recompress in memory; cv2.imencode returns (success flag, encoded buffer)
    _, buffer = cv2.imencode(".jpg", orig, [cv2.IMWRITE_JPEG_QUALITY, QUALITY])
    compressed_img = cv2.imdecode(buffer, cv2.IMREAD_COLOR)

    # Scaled absolute difference, saturating at 255, converted to RGB
    diff = cv2.convertScaleAbs(cv2.absdiff(orig, compressed_img), alpha=SCALE)
    diff = cv2.cvtColor(diff, cv2.COLOR_BGR2RGB)
    img = preprocess[Config.name](diff)

    return img, label


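jpg_pattern = '../input/casia-dataset/CASIA2/*/*jp*g'
tif_pattern = '../input/casia-dataset/CASIA2/*/*tif'

# shuffle=False keeps the file order fixed so the split below is reproducible
jpg_files = tf.data.Dataset.list_files(jpg_pattern, shuffle=False)
tif_files = tf.data.Dataset.list_files(tif_pattern, shuffle=False)

data_ds = jpg_files.concatenate(tif_files)

def tensor_preprocess(path):
    img, label = tf.py_function(ela_process, [path], [tf.float32, tf.float32])
    # py_function drops static shapes; restore them for Keras
    img.set_shape((*Config.image_size, 3))
    label.set_shape(())
    return img, label

n_data = data_ds.cardinality().numpy()
n_val = int(.2 * n_data)
# Shuffle once only: reshuffling on every epoch would move validation files
# into the training split
data_ds = data_ds.shuffle(n_data, seed=2, reshuffle_each_iteration=False)

train_ds = data_ds.skip(n_val).shuffle(n_data - n_val).map(
    tensor_preprocess, num_parallel_calls=Config.autotune).batch(Config.batch_size).map(
    batch_aug, num_parallel_calls=Config.autotune)

val_ds = data_ds.take(n_val).map(
    tensor_preprocess, num_parallel_calls=Config.autotune).batch(Config.batch_size)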

for img, label in train_ds:
    print(label)
    break

Output: one batch of labels, printed as a float32 tensor of 0s (authentic) and 1s (tampered).

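Modeling

In machine learning, modeling means building a mathematical model of a problem that can generate predictions or decisions from input data. Here the model is a pretrained convolutional network with a small binary classification head that ends in a single sigmoid unit.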
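As a small worked example of what a single sigmoid output and a binary cross-entropy loss compute, here is a NumPy sketch. The logits and labels are made-up numbers, and the helper names `sigmoid` and `binary_cross_entropy` are chosen for this illustration; Keras computes the same quantities internally with its own implementation.

```python
import numpy as np

def sigmoid(z):
    """Map a raw score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def binary_cross_entropy(y_true, y_prob, eps=1e-7):
    """Mean binary cross-entropy; probabilities are clipped to avoid log(0)."""
    y_prob = np.clip(y_prob, eps, 1.0 - eps)
    losses = -(y_true * np.log(y_prob) + (1.0 - y_true) * np.log(1.0 - y_prob))
    return float(np.mean(losses))

# Made-up logits for three images; label 1 = tampered, label 0 = authentic.
logits = np.array([-2.0, 0.0, 3.0])
y_true = np.array([0.0, 1.0, 1.0])

probs = sigmoid(logits)
loss = binary_cross_entropy(y_true, probs)
preds = (probs >= 0.5).astype(int)  # threshold the probability at 0.5

print(probs.round(3), preds, round(loss, 4))
```

Because the head has one output, the matching loss is binary cross-entropy and the matching accuracy metric is binary accuracy; the categorical variants expect one output per class.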

METRICS = [
    tf.keras.metrics.BinaryAccuracy(name='accuracy'),  # the model has a single sigmoid output
    tf.keras.metrics.Precision(name='precision'),
    tf.keras.metrics.Recall(name='recall'),
    tf.keras.metrics.AUC(name='auc'),
    tf.keras.metrics.AUC(name='prc', curve='PR'), # precision-recall curve
]

def create_model(optimizer, name='mobilenetv2', loss='binary_crossentropy'):
    """
    Creates a binary classifier on top of a pretrained backbone.
    All layers are left trainable.
    Args:
        optimizer(tf.keras.optimizers): initialized tensorflow optimizer.
        name(str): one of the keys in the `models` dictionary.
        loss: loss function or its name.
       
    """
   
    base_model = models[name](include_top=False, weights='imagenet', input_shape=(224, 224, 3))
    # model = Model(base_model.inputs, base_model.layers[-1].output)

    x = GlobalAveragePooling2D()(base_model.output)
    x = Dense(1024, activation='relu')(x)
    output = Dense(1, activation='sigmoid')(x)
   
    model = Model(base_model.inputs, output)
   
    model.compile(loss=loss,
                  optimizer=optimizer,
                  metrics=METRICS)
    return model

def scheduler(epoch, lr):
    # Multiply the learning rate by 0.9 every 25 epochs; otherwise keep it unchanged.
    # LearningRateScheduler passes in the current epoch and learning rate.
    if epoch % 25 == 0 and epoch != 0:
        return lr * 0.9
    return lr

def generate_path(path_to_output, last_run=False):
    """
    Creates a new path and returns the address.
    Notes:
        It is easy to overwrite a previous model by accident, so
        this function creates a new path for each run.
    """
    if not isdir(path_to_output):
        makedirs(path_to_output)
   
    runs = natsorted([path for path in listdir(path_to_output) if path.startswith("run_tf_data")])
    if last_run:
        if not bool(runs):
            path = join(path_to_output, "run_tf_data_1")
        else:
            path = join(path_to_output, runs[-1])

        return path
    if not bool(runs):
        path = join(path_to_output, 'run_tf_data_1')
    else:
        f = runs[-1].rsplit("data_")[1]
        path = join(path_to_output, 'run_tf_data_' + str(int(f) + 1))
   
    return path

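Initializing the Model

Initializing the model means defining its architecture and parameters, together with the loss and the optimizer. It is done before the training phase and is a key step in the machine learning pipeline.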
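The optimizer used in this article is stochastic gradient descent with momentum. As a rough sketch of what one update does, assuming the classic (non-Nesterov) momentum rule and the learning rate and momentum values from `Config`, the step can be written in NumPy as follows. The function name `sgd_momentum_step` and the toy objective f(w) = w^2 are assumptions for illustration.

```python
import numpy as np

def sgd_momentum_step(w, grad, velocity, lr=1e-3, momentum=0.95):
    """One SGD step with classic momentum: update the velocity, then the weight."""
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity

# Toy objective f(w) = w**2 with gradient 2*w (assumption for the demo).
w, v = np.array([1.0]), np.array([0.0])

w, v = sgd_momentum_step(w, 2.0 * w, v)
first_w, first_v = w.copy(), v.copy()
print(first_w, first_v)  # [0.998] [-0.002]

# Further steps keep accumulating velocity and move w towards the minimum at 0.
for _ in range(199):
    w, v = sgd_momentum_step(w, 2.0 * w, v)
print(abs(w[0]) < 0.1)  # True
```

The velocity term carries part of the previous update into the next one, which smooths noisy gradients and speeds up progress along directions where the gradient keeps the same sign.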

loss=tf.keras.losses.BinaryCrossentropy()
optimizer = SGD(learning_rate=Config.lr,
#                 decay=Config.decay,
                momentum=Config.momentum,
                nesterov=Config.nesterov)
"""
Model             Params
mobilenet           3M
effecientnetb7      66M
nasnetlarge         89M
inceptionv3         23M
xception            22M
resnet              44M
densenet            20M

"""


model = create_model(optimizer, name=Config.name, loss=loss)

# model.summary()


path = generate_path('checkpoints')
weight_path = join(path, 'weights')
tensorboard_path = join(path, 'logs')

makedirs(weight_path)
makedirs(tensorboard_path)

ckpt = ModelCheckpoint(
    filepath=join(weight_path, 'best.weights.h5'),  # a file inside the weights directory
    monitor='val_loss',
    save_best_only=True,
    save_weights_only=True
)

tensorboard = TensorBoard(
    log_dir=tensorboard_path,
    write_graph=True
)

reduce_lr = LearningRateScheduler(scheduler)


callbacks = [ckpt,
#              reduce_lr,
             tensorboard]

history = model.fit(
    train_ds,
    epochs=6,  # the batch size is already set by .batch() on the dataset
    callbacks=callbacks,
    validation_data=val_ds
)

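Output: the per-epoch training log, reporting the loss and the metrics in METRICS for the training and validation sets.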

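Evaluation

Evaluation uses a set of metrics to measure how well the model performs on its task, here binary classification. The goal is to determine how well the model does and to identify areas that need improvement.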
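To show how the main metrics relate to each other, here is a minimal NumPy sketch that derives the confusion matrix, precision, recall, and accuracy from labels and predicted probabilities. The function name `binary_report`, the 0.5 threshold, and the six example values are assumptions for illustration and are not results from the model.

```python
import numpy as np

def binary_report(y_true, y_prob, threshold=0.5):
    """Confusion matrix (rows = actual, columns = predicted) and basic metrics."""
    y_true = np.asarray(y_true).astype(int)
    y_pred = (np.asarray(y_prob) >= threshold).astype(int)

    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))

    return {
        "confusion": [[tn, fp], [fn, tp]],
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
        "accuracy": (tp + tn) / len(y_true),
    }

# Made-up labels (1 = tampered) and predicted probabilities.
y_true = [0, 0, 1, 1, 1, 0]
y_prob = [0.1, 0.6, 0.8, 0.4, 0.9, 0.2]

report = binary_report(y_true, y_prob)
print(report["confusion"])  # [[2, 1], [1, 2]]
print(report["precision"], report["recall"], report["accuracy"])
```

Precision answers "of the images flagged as tampered, how many really were", while recall answers "of the tampered images, how many were flagged". Looking at both guards against a model that scores well on accuracy simply by favoring the majority class.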

def plot_loss(history, label, n):
    # Using a log scale on the y-axis to show the wide range of values.
    plt.semilogy(history.epoch, history.history['loss'],
               color=colors[n], label='Train ' + label)
    plt.semilogy(history.epoch, history.history['val_loss'],
               color=colors[n], label='Val ' + label,
               linestyle="--")
    plt.xlabel('Epoch')
    plt.ylabel('Loss')
    plt.legend()

plot_loss(history, Config.name, 0)

Output (figure): training and validation loss per epoch on a log-scale y-axis.

def plot_metrics(history):
    metrics = ['loss', 'prc', 'precision', 'recall']
    for n, metric in enumerate(metrics):
        name = metric.replace("_"," ").capitalize()
        plt.subplot(2,2,n+1)
        plt.plot(history.epoch, history.history[metric], color=colors[0], label='Train')
        plt.plot(history.epoch, history.history['val_'+metric],color=colors[0],
                 linestyle="--", label='Val')
        plt.xlabel('Epoch')
        plt.ylabel(name)
        if metric == 'loss':
            plt.ylim([0, plt.ylim()[1]])
        elif metric == 'auc':
            plt.ylim([0.8,1])
        else:
            plt.ylim([0,1])

        plt.legend()

plot_metrics(history)

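Output (figure): a 2x2 grid of loss, PR AUC, precision, and recall per epoch for the training and validation sets.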

val_ds_x = []
val_ds_y = []

for _, (val_x_batch, val_y_batch) in enumerate(val_ds):
    for val_x, val_y in zip(val_x_batch, val_y_batch):
        val_ds_x.append(val_x)
        val_ds_y.append(val_y)

val_data = (tf.convert_to_tensor(val_ds_x, dtype=tf.float32),
            tf.convert_to_tensor(val_ds_y, dtype=tf.float32))
val_data[0].numpy().shape

Output: the shape of the validation image tensor, (number of validation images, 224, 224, 3).

# Index 0 = 'Au' (authentic), index 1 = 'Tp' (fake), matching LABELS in ela_process
class_names = ['authentic', 'fake']

test_predictions_baseline = model.predict(val_data[0].numpy(), batch_size=Config.batch_size)

def plot_cm(label_matrix, predictions):
   
    # Round predicted probabilities to 0/1; ground truth comes from the labels
    preds = np.around(np.squeeze(predictions))
    gt = np.around(np.squeeze(label_matrix))
   
    cm = confusion_matrix(gt,
                          preds,
                          labels=np.array([0, 1]))
    plt.figure(figsize=(8,8))
    # Let seaborn place the class names at the centre of each cell
    sns.heatmap(cm, annot=True, fmt="d", cmap="icefire_r",
                xticklabels=class_names, yticklabels=class_names)
    plt.title('Confusion matrix')
    plt.ylabel('Actual label')
    plt.xlabel('Predicted label')

baseline_results = model.evaluate(val_data[0].numpy(),
                                  val_data[1].numpy(),
                                  batch_size=Config.batch_size,
                                  verbose=0)

for name, value in zip(model.metrics_names, baseline_results):
    print(name, ': ', value)
print()

plot_cm(val_data[1].numpy(), np.squeeze(test_predictions_baseline))

Output: the evaluation metric values, followed by the confusion-matrix heatmap.

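The model's accuracy is good, and the confusion matrix shows how the validation images were split between correct and incorrect predictions for each class.

Conclusion

Overall, using machine learning to detect image forgery is a promising strategy for addressing the growing problem of image forgery. The need for large datasets of authentic and fake images, for reliable feature extraction methods, and for algorithms capable of detecting sophisticated forgeries are only a few of the open problems. With further research and development, machine learning has great potential for verifying the authenticity and integrity of digital photographs.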
