EigenFaces

2025年03月17日 | 阅读 9 分钟

Eigenfaces 是计算机视觉中的一个关键概念，它提供了一种强大的面部分析和识别技术。Eigenfaces 通过利用主成分分析的数学概念，以简洁明了的方式描绘人脸照片。尽管 Eigenfaces 存在一些挑战，但它仍然持续推动面部生物识别领域的创新和研究，促进了安全、监控和人机交互等领域的发展。

Eigenfaces 由一组特征向量组成，这些特征向量是从一组人脸图像协方差矩阵中获得的。每个 Eigenface 代表了能够最好地反映数据集中面部外观最显著变化的基函数的主要成分。这些 Eigenfaces 为高维人脸图像空间提供了一个低维表示。

代码

现在我们已经使用 Olivetti 数据集完成了人脸识别，之后我们还对 Eigenfaces 进行了 PCA。目前，我们将专注于人脸识别。

导入库

import numpy as np # linear algebra
import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)

#Visualization
import matplotlib.pyplot as plt

#Machine Learning
from sklearn.model_selection import train_test_split
from sklearn.decomposition import PCA
from sklearn.svm import SVC
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn import metrics
import warnings
warnings.filterwarnings('ignore')

Olivetti 数据集

Olivetti 数据集简介

data=np.load("../input/olivetti_faces.npy")
target=np.load("../input/olivetti_faces_target.n

让我们确认以上信息。

print("There are {} images in the dataset".format(len(data)))
print("There are {} unique targets in the dataset".format(len(np.unique(target))))
print("Size of each image is {}x{}".format(data.shape[1],data.shape[2]))
print("Pixel values were scaled to [0,1] interval. e.g:{}".format(data[0][0,:4]))

输出

现在我们将展示 Olivetti 数据集中 48 位不同的个体。

def show_40_distinct_people(images, unique_ids):
    #Creating 4X10 subplots in  18x9 figure size
    fig, axarr=plt.subplots(nrows=4, ncols=10, figsize=(18, 9))
    #For easy iteration flattened 4X10 subplots matrix to 40 array
    axarr=axarr.flatten()
    	
    #iterating over user IDs
    for unique_id in unique_ids:
        image_index=unique_id*10
        axarr[unique_id].imshow(images[image_index], cmap='gray')
        axarr[unique_id].set_xticks([])
        axarr[unique_id].set_yticks([])
        axarr[unique_id].set_title("face id:{}".format(unique_id))
    plt.suptitle("There are 40 distinct people in the dataset")
show_40_distinct_people(data, np.unique(target))

输出

如上面的图库所示，数据集中包含四十位不同个体的面部照片。

现在我们将展示选定目标的 10 张人脸图像。

def show_10_faces_of_n_subject(images, subject_ids):
    cols=10# Each subject has 10 distinct face images
    rows=(len(subject_ids)*10)/cols #
    rows=int(rows)
    
    fig, axarr=plt.subplots(nrows=rows, ncols=cols, figsize=(18,9))
    #axarr=axarr.flatten()
    
    for i, subject_id in enumerate(subject_ids):
        for j in range(cols):
            image_index=subject_id*10 + j
            axarr[i,j].imshow(images[image_index], cmap="gray")
            axarr[i,j].set_xticks([])
            axarr[i,j].set_yticks([])
            axarr[i,j].set_title("face id:{}".format(subject_id))
   
#You can play around subject_ids to see other people's faces
show_10_faces_of_n_subject(images=data, subject_ids=[0,5, 21, 24, 36])

输出

在不同的光照、面部表情和面部细节（眼镜、胡须）的背景下，每个个体的面部都有不同的特征。

#We reshape images for the learning  model
X=data.reshape((data.shape[0],data.shape[1]*data.shape[2]))
print("X shape:",X.shape)

输出

分割数据集

数据集中每个个体有十张面部照片。百分之三十的面部照片将用于测试，百分之七十用于训练。为了确保每个个体拥有相同数量的训练和测试照片，使用了分层抽样（stratify）功能。每个个体将有七张训练照片和三张测试图像。测试和训练比例可以调整。

X_train, X_test, y_train, y_test=train_test_split(X, target, test_size=0.3, stratify=target, random_state=0)
print("X_train shape:",X_train.shape)
print("y_train shape:{}".format(y_train.shape))

输出

y_frame=pd.DataFrame()
y_frame['subject ids']=y_train
y_frame.groupby(['subject ids']).size().plot.bar(figsize=(15,8),title="Number of Samples for Each Classes")

输出

PCA

import mglearn
mglearn.plots.plot_pca_illustration()

输出

上面的图形展示了一个虚构的二维数据集示例。在第一个图中，实际数据点以颜色区分，以便更容易辨认。程序首先寻找最大的方差方向，即“组件 1”。这是数据最相关的方向，或者说属性之间相关性最强的方向。

当算法找到一个与第一个方向正交（成直角）且包含最大信息量（方差）的方向时，它会选择该方向。在二维空间中，一个直角只有一个可能的方向；然而，在高维空间中，有多个（无限个）正交方向。

from sklearn.decomposition import PCA
pca=PCA(n_components=2)
pca.fit(X)
X_pca=pca.transform(X)

number_of_people=10
index_range=number_of_people*10
fig=plt.figure(figsize=(10,8))
ax=fig.add_subplot(1,1,1)
scatter=ax.scatter(X_pca[:index_range,0],
            X_pca[:index_range,1], 
            c=target[:index_range],
            s=10,
           cmap=plt.get_cmap('jet', number_of_people)
          )

ax.set_xlabel("First Principle Component")
ax.set_ylabel("Second Principle Component")
ax.set_title("PCA projection of {} people".format(number_of_people))

fig.colorbar(scatter)

输出

pca=PCA()
pca.fit(X)

plt.figure(1, figsize=(12,8))

plt.plot(pca.explained_variance_, linewidth=2)
 
plt.xlabel('Components')
plt.ylabel('Explained Variaces')
plt.show()

输出

从下面的图形可以看出，90 个或更多的 PCA 组件对应于同一组数据。现在，让我们使用九十九个 PCA 组件来创建分类过程。

n_components=90

pca=PCA(n_components=n_components, whiten=True)
pca.fit(X_train)

输出

我们将看一看平均脸。

fig,ax=plt.subplots(1,1,figsize=(8,8))
ax.imshow(pca.mean_.reshape((64,64)), cmap="gray")
ax.set_xticks([])
ax.set_yticks([])
ax.set_title('Average Face')

输出

现在，来看 Eigenfaces。

number_of_eigenfaces=len(pca.components_)
eigen_faces=pca.components_.reshape((number_of_eigenfaces, data.shape[1], data.shape[2]))

cols=10
rows=int(number_of_eigenfaces/cols)
fig, axarr=plt.subplots(nrows=rows, ncols=cols, figsize=(15,15))
axarr=axarr.flatten()
for i in range(number_of_eigenfaces):
    axarr[i].imshow(eigen_faces[i],cmap="gray")
    axarr[i].set_xticks([])
    axarr[i].set_yticks([])
    axarr[i].set_title("eigen id:{}".format(i))
plt.suptitle("All Eigen Faces".format(10*"=", 10*"="))

输出

X_train_pca=pca.transform(X_train)
X_test_pca=pca.transform(X_test)

print(X_train_pca.shape)
print(y_train.shape)
print(X_test_pca.shape)
print(y_test.shape)
print(y_train)

from keras.utils import to_categorical

y_train=to_categorical(y_train,40)
y_test=to_categorical(y_test,40)
print(y_train.shape)

输出

模型

现在，我们将构建一个模型，该模型最终能够识别面孔。

from keras.models import Sequential
from keras.layers import Dense, Activation

from keras.optimizers import Adam
from keras.callbacks import ReduceLROnPlateau
from keras.layers import Dropout

from keras import regularizers
model=Sequential()
model.add(Dense(256, activation='relu', input_dim=90))
model.add(Dropout(0.2))
model.add(Dense(128, activation='relu'))
model.add(Dropout(0.1))
model.add(Dense(128, activation='relu'))
model.add(Dropout(0.05))
model.add(Dense(40, activation='softmax'))

epochs=100
batch_size=128
red_lr=ReduceLROnPlateau(monitor='val_acc', factor=0.1, min_delta=0.0001, patience=2, verbose=1)
model.compile(optimizer=Adam(lr=1e-3),loss='categorical_crossentropy',metrics=['accuracy'])

model.summary()

输出

现在，我们需要训练模型。

History = model.fit(X_train_pca,y_train, epochs = epochs, validation_data = (X_test_pca,y_test),batch_size=128, verbose = 1)

输出

plt.plot(History.history['acc'])
plt.plot(History.history['val_acc'])
plt.title('Model Accuracy')
plt.ylabel('Accuracy')
plt.xlabel('Epochs')
plt.legend(['train', 'test'])
plt.show()

输出

我们的模型的准确率相当不错。

plt.plot(History.history['loss'])
plt.plot(History.history['val_loss'])
plt.title('Model Loss')
plt.ylabel('Loss')
plt.xlabel('Epochs')
plt.legend(['train', 'test'])
plt.show()

输出

看起来足够好了。

正如我们之前讨论过的，现在将向您展示 PCA 在实际应用中的用法。所以，我们将继续进行代码的后续部分。现在，我们将处理一个不同的数据集。

# Importing Libraries
import numpy as np # linear algebra
import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)
import matplotlib.pyplot as plt
import skimage.io as skio



import os
print(os.listdir("../input"))

设置测试集和训练集

数据集由训练集和测试集组成。原始数据集总共有 400 张照片（40 位个体，每人 10 张照片）。下面的函数用于准备测试集，该函数保留了每位个体的一张照片作为测试集（总共 40 张），其余 360 张照片保留在训练集中。

def load_train_test_set():
    faces  = np.load('../input/olivetti_faces.npy')
    target = np.load('../input/olivetti_faces_target.npy')
    print("Original faces.shape:"+ str(faces.shape))
    print("Original target.shape:"+ str(target.shape))
    	
    # create a test set that takes one face for each person
    test_index = list(range(9,400,10))
    faces_test = faces[test_index]
    target_test = target[test_index]
    
    # Preprare training set by removing items in the test set
    faces_train = faces.copy()
    target_train =  target.copy()
    for i in test_index[::-1]:
        faces_train  = np.delete(faces_train,i,axis =0 )
        target_train = np.delete(target_train,i,axis =0 )
    
    print("faces_train.shape:" +str(faces_train.shape)) 
    print("target_train.shape:" +str(target_train.shape))
    print("faces_test.shape:" +str(faces_test.shape)) 
    print("target_test.shape:" +str(target_test.shape))
    return faces_train, target_train, faces_test, target_test

faces_train, target_train, faces_test, target_test = load_train_test_set()

现在，我们将可视化一些训练图像。

def show_sample_training_and_test_images(faces_train, faces_test):
    fig = plt.figure()
    fig.add_subplot(1, 4, 1)
    plt.imshow(faces_train[0], cmap='gray')
    fig.add_subplot(1, 4, 2)
    plt.imshow(faces_train[1], cmap='gray')
    fig.add_subplot(1, 4, 3)
    plt.imshow(faces_train[9], cmap='gray')
    fig.add_subplot(1, 4, 4)
    plt.imshow(faces_train[10], cmap='gray')
    fig = plt.figure()
    fig.add_subplot(1, 2, 1)
    plt.imshow(faces_test[0], cmap='gray')
    fig.add_subplot(1, 2, 2)
    plt.imshow(faces_test[1], cmap='gray')
show_sample_training_and_test_images(faces_train, faces_test)

输出

预处理数据

在数据准备过程中，从每个特征的训练样本均值中减去均值，然后将结果除以标准差，以对图像进行中心化处理。这有两个目的：

数据被标准化。这样做可以避免任意大的数值。
它消除了所有特征的单位。这确保了单位（例如，一个以厘米为单位，另一个以米为单位）不会导致一个特征的值范围与其他特征的值范围不同。

def preprocess_data( faces_train, faces_test ):
    # flatten the images from 
    X_train =  np.reshape(faces_train,(faces_train.shape[0], -1 ))
    X_test =   np.reshape(faces_test, (faces_test.shape[0], -1 ))
    mu = np.mean(X_train, axis = 0 )
    std_dev = np.mean(X_train, axis = 0)
    std_dev_mod = np.copy(std_dev)
    std_dev_mod[std_dev == 0 ] = 1
    X_train = (X_train - mu)/std_dev_mod
    # normalize the test set with the same mu and std values as a training set 
    X_test = (X_test - mu)/std_dev_mod
    return X_train, X_test   

X_train, X_test = preprocess_data(faces_train, faces_test )
print("X_train.shape=" + str(X_train.shape))
print("X_test.shape="  + str(X_test.shape)) 

输出

PCA

主成分分析 (PCA) 是一种统计技术，它通过正交变换，从一组可能相关的变量的观测值中创建主成分——一组线性不相关的变量值。这些主成分可以被推断为仅仅是根据其特征值排列的特征向量。代表图像的特征值最大的特征向量提供了关于该图像最多的信息。通过选择足以从图像中提取大部分显著特征的特征向量，并在该特征基上对图像向量进行正交投影，我们可以降低图像的维度。

class PCA:
    def __init__( self, optimize = False ):
        self.__optimize = optimize
        
    def __calculate_covariance_matrix( self, X ):
        if self.__optimize:
            return X @ X.T
        else:
            return X.T @ X

    def __get_eigenvecs_sorted_by_eigenvals( self, S ):
        w, v = np.linalg.eig(S)
        sorted_index = np.argsort(w)[::-1]
        eigenvals = w[sorted_index]
        eigenvecs = v[:,sorted_index]
        return eigenvals, eigenvecs

    def __get_eigenvec_for_original_matrix(  self, X, eigenvecs ):
        U = X.T @ eigenvecs
        return U
    
    def fit( self, X ):
        S = self.__calculate_covariance_matrix( X )
        eigenvals,eigenvecs =   self.__get_eigenvecs_sorted_by_eigenvals( S )
        if self.__optimize :
            B = self.__get_eigenvec_for_original_matrix(X, eigenvecs )
        else:
            B = eigenvecs
            
        self.__B = B
        self.__w = eigenvals
        return B
        
    def plot_eigenvals ( self ):
        c = np.cumsum ( self.__w )
        plt.plot( c )
        
    def get_num_components ( self, variance_threshold):
        variance_ratio = self.__w/np.sum(self.__w)
        s = 0
        i = -1
        while s < variance_threshold and s < 1.0:
            i += 1
            s +=  variance_ratio[i]
            
        return i

def show_images( images, num_images_to_show ):
    fig = plt.figure()
    for i in range(1,num_images_to_show+1):
        fig.add_subplot(1,num_images_to_show,i)
        img = np.reshape( images[:,i-1], (64,64) )
        plt.imshow(img, cmap='gray'

寻找 K 的最优值

现在，我们来解决确定 K 的最优值的问题。

绘制特征值累积和，以确定代表训练数据中最大方差的特征向量数量。下面的图形说明了前 50 个特征占了大部分方差。
我们提取对应于 90% 方差的特征向量（您可以调整此数字）。<

# Now since the number of dimensions (D=4096 ) >> the number of training samples (N = 360), we calculate the NXN covariance matrix instead of D X D
pca = PCA( optimize = True )
B = pca.fit(X_train)
pca.plot_eigenvals()
num_dim = pca.get_num_components(0.9)
print( num_dim )
B = B[:,:num_dim]
# show top 4 eigenfaces
show_images( B, 4 )

输出

class Projection:
    def __init__( self, B ):
        self.B = B
        
    def reduce_dim( self, X ):
        return  X @ B @ np.linalg.inv(B.T @ B)  

    def reconstruct( self, X_reduced ):
        return X_reduced @ B.T
    
    def get_projection_matrix( self ):
        P = B @ np.linalg.inv(B.T @ B) @ B.T
        return P
    
    def project( self , X ):
        P = self.get_projection_matrix()
        return X @ P


proj = Projection( B )
X_train_reduced = proj.reduce_dim(X_train )
print("X_train_reduced.shape="+str(X_train_reduced.shape) )
show_images(X_train.T, 1)
r_img = proj.reconstruct( X_train_reduced[0,:])
r_img = np.reshape(r_img,(4096,1))
show_images(r_img, 1)

输出

X_test_reduced =proj.reduce_dim(X_test)
print("X_test_reduced.shape="+str(X_test_reduced.shape) )

输出

图像识别

PCA 是人脸识别的一个有用工具。其思想是：

缩小人脸图像的比例，以便能够进行识别。
将其与人脸类别的平均图像进行比较，在这种情况下，单个个体的平均图像对应一个类别。
该个体属于与平均图像的欧氏距离最短的类别，即类别由该人的面部图像表示。

class ImageClassifier:
    def __init__( self, class_count ):
        self.class_count = class_count
        
    def __get_class_mean( self, X, target ):
        class_count = self.class_count
        N,D = X.shape
        mu = np.zeros((class_count,D))
        for i in range(class_count):
            mu[i,:] = (1/N) * np.sum(X[target == i,: ], axis = 0 )
        return mu 

    def __dist(self, v1, v2 ):
        diff = v1-v2
        d = np.sqrt(np.dot(diff,diff))
        return d
                  
    def fit( self,  X, target  ):
        class_count = self.class_count
        mu = self.__get_class_mean( X, target )
        self.mu = mu
        
    def predict( self, test_img ):
        min_dist = np.float('inf')
        min_class = -1
        for i in range( self.class_count ):
            d = self.__dist(test_img, self.mu[i,:])
            if d < min_dist:
                min_dist = d
                min_class = i
            
        return min_class, self.mu[min_class,:]

print("X_test_reduced.shape=" + str(X_test_reduced.shape))
print("X_train_reduced.shape=" + str(X_train_reduced.shape))
img_classifer = ImageClassifier(40)   
img_classifer.fit(X_train_reduced,target_train )
recognized_class, mu_rec = img_classifer.predict( X_test_reduced[5,: ] )
print("recognized_class="+ str(recognized_class))

输出

重构不完整图像

PCA 的另一个用途是从部分数据中重建图像。计划是将不完整图像投影到特征基的 K = 64 特征向量投影矩阵上。

def get_half_image( test_image_index ):
    orig_image = np.copy(X_test[test_image_index,:])
    D, = orig_image.shape
    orig_image = np.reshape(orig_image, (1,D) )
    # Blacken the lower half of the face
    
    half_image = np.copy(orig_image)
    half_image[0, 2048: 4096] = 0
    return half_image, orig_image

def reconstruct_half_images( test_indexes ):
    for i in test_indexes:
        half_image, orig_image = get_half_image(i)
        N,D = half_image.shape
        new_image = proj.project( half_image  )
        #target, mu_rec = recognize_image( reduced_half_image[0,: ], X_train_reduced, target_train  )
        #print(target)
        #new_image = reconstruct_image( reduced_half_image, B )
        new_image[0,0:2048,] = orig_image[0, 0:2048]
        images_for_display = np.concatenate((orig_image.T, half_image.T, new_image.T), axis=1 )
        show_images(images_for_display, 3)

reconstruct_half_images([0,10,30])

输出

下一个主题使用机器学习进行图像字幕生成

EigenFaces

导入库

Olivetti 数据集

分割数据集

PCA

模型

设置测试集和训练集

预处理数据

PCA

寻找 K 的最优值

图像识别

重构不完整图像

联系信息

关注我们

教程

面试题

在线编译器

Python

Java

.Net Framework

AI, ML and Data Science

Cloud Technology

B.Tech and MCA

Web Technology

PHP

Software Testing

Technical Interview

Java Interview

Python

Web Interview

Database Interview

B.Tech / MCA

Important Interview

Software Testing Interview

Company Interviews

Online Compilers

Multiple Choice Questions

机器学习

监督式学习

分类

杂项

相关教程

面试题

EigenFaces

导入库

Olivetti 数据集

分割数据集

PCA

模型

设置测试集和训练集

预处理数据

PCA

寻找 K 的最优值

图像识别

重构不完整图像

相关帖子

使用 PyCaret 构建机器学习分类模型

使用梯度下降进行线性回归

如何在 PyTorch 中获取模型摘要

什么是 MLOps

微分和积分微积分

机器学习中的数据分析

置信区间

C GAN

机器学习中的数据可视化

理解用于机器学习回归的 3 种最常见的损失函数

订阅 Tpoint Tech

联系信息

关注我们

教程

面试题

在线编译器