Python 中的 15 个统计假设检验

2024 年 8 月 29 日 | 阅读 15 分钟

有数百种用于检验假设的统计检验。然而，机器学习项目只需要其中少数几种。在本教程中，我们将介绍一些最重要的假设检验，如果您想从事统计建模相关领域的工作，就必须了解它们。我们将使用 Python 编程语言来实现这些检验。

下面提到的每个假设检验都包含与该检验相关的以下信息

这个检验叫什么？
我们在检验中检查什么？
实施检验的关键假设是什么？
如何解释检验结果？
如何在 Python 中实施检验？

请注意，这些假设非常重要。如果违反了数据样本的预期分布或所需样本大小等假设，检验结果将不准确。基于这些结果的解释将高度不可靠。因此，在应用这些检验之前，检查这些假设非常重要。

数据样本通常需要足够大，才能揭示其分布情况，以便进行分析和说明领域。

在某些情况下，可以调整数据使其符合假设。仅举两个例子，这可以通过从接近正态分布的数据中去除异常值使其更接近正态分布，或者当给定数据样本的方差不同时调整检验的自由度来完成。

最后，对于某个特定问题（例如正态性），可能会有多种检验可用。通过统计学，我们无法获得问题的精确解决方案；相反，我们获得的是概率性解决方案。因此，通过以多种方式思考同一主题，我们可能会得出不同的答案。因此，可能需要多种检验来解决我们可能遇到的某些数据相关问题。

正态性检验

在本节中，我们将看到用于检验给定数据样本是否具有高斯分布的检验。数据遵循高斯分布的假设是许多统计建模技术的基本要求。因此，这些检验非常重要。

Shapiro-Wilk 检验

因此，此检验用于检验给定数据样本是否具有高斯分布或正态分布。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。此假设的缩写是 IID。

解释

H0：样本遵循高斯分布

H1：给定样本不遵循高斯分布。

代码

# Python program to perform the Shapiro-Wilk Normality Test

# Importing the required modules
from scipy.stats import shapiro

# Creating a dataset
data = [0.863, 2.717, 0.221, -0.965, -0.255, -1.476, 0.560, -1.578, -2.637, -1.969]

# Performing the test
stat, p = shapiro(data)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The data follows a Gaussian distribution')
else:
 print('The data does not follows a Gaussian distribution')

输出

The statistic value is: 0.9621855020523071, and the p-value is 0.8104783892631531
The data does not follow a Gaussian distribution.

D'Agostino's K^2 检验

此检验用于检验给定数据样本是否为高斯分布。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。

解释

H0：样本遵循高斯分布

H1：给定样本不遵循高斯分布。

代码

# Python program to perform the D'Agostino's K^2 Normality Test

# Importing the required modules
from scipy.stats import normaltest

# Creating a dataset
data = [0.863, 2.717, 0.221, -0.965, -0.255, -1.476, 0.560, -1.578, -2.637, -1.969]

# Performing the test
stat, p = normaltest(data)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The data follows a Gaussian distribution')
else:
 print('The data does not follow a Gaussian distribution')

输出

The statistic value is: 1.0653637027947445, and the p-value is 0.5870285334466323
The data does not follow a Gaussian distribution

Anderson-Darling 检验

此检验用于检验给定数据样本是否为高斯分布。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。

解释

H0：样本遵循高斯分布

H1：给定样本不遵循高斯分布。

代码

# Python program to perform the Anderson-Darling Normality Test

# Importing the required modules
from scipy.stats import anderson

# Creating a dataset
data = [0.863, 2.717, 0.221, -0.965, -0.255, -1.476, 0.560, -1.578, -2.637, -1.969]

# Performing the test
res = anderson(data)
print(f'The statistic value is: {res.statistic}')
print()
# For each significance level, checking if the hypothesis holds or not
for i in range(len(res.critical_values)):
  sig_level, critical_value = res.significance_level[i], res.critical_values[i]
  
  # Comparing each statistic with the critical value corresponding to the ith significance level
  print(f"The critical value at {sig_level}% is {critical_value}")
  if res.statistic > critical_value:
    print(f'The data follow a Gaussian distribution at {sig_level}%')
  else:
    print(f'The data does not follow a Gaussian distribution at {sig_level}%')
  print()

输出

The statistic value is: 0.20692157645671116

The critical value at 15.0% is 0.501
The data does not follow a Gaussian distribution at 15.0%

The critical value at 10.0% is 0.57
The data does not follow a Gaussian distribution at 10.0%

The critical value at 5.0% is 0.684
The data does not follow a Gaussian distribution at 5.0%

The critical value at 2.5% is 0.798
The data does not follow a Gaussian distribution at 2.5%

The critical value at 1.0% is 0.95
The data does not follow a Gaussian distribution at 1.0%

相关性检验

现在我们将看到比较两个样本并判断它们是否相关的检验。

Pearson 相关系数

此检验用于检验给定两个数据样本之间是否存在线性关系。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
每个样本的观测值遵循正态分布。
每个样本的观测值具有相同的方差。

解释

H0：给定两个样本不相关，即它们是独立的。

H1：给定样本之间存在某种依赖关系。

代码

# Python program to implement Pearson's Correlation test

# Importing the required classes
from scipy.stats import pearsonr

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Pearson Test
stat, p = pearsonr(sample1, sample2)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('Both data samples are dependent on each other')
else:
 print('The data samples are independent of each other')

输出

The statistic value is: 0.6135196215696078, and the p-value is 0.05922727627191346
The data samples are independent of each other

Spearman 秩相关

这是 Pearson 检验的进一步发展。它检验给定样本是否具有单调关系。这种关系可以是线性的也可以是非线性的。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都经过排序。

解释

H0：给定两个样本不相关，即它们是独立的。

H1：给定样本之间存在某种依赖关系。

代码

# Python program to implement Spearman's Rank Correlation test

# Importing the required classes
from scipy.stats import spearmanr

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Spearman's Test
stat, p = spearmanr(sample1, sample2)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('Both data samples are dependent on each other')
else:
 print('The data samples are independent of each other')

输出

The statistic value is: 0.6969696969696969, and the p-value is 0.02509667588225183
Both data samples are dependent on each other

Kendall 秩相关

这是 Pearson 检验的进一步发展。它检验给定样本是否具有单调关系。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都经过排序。

解释

H0：给定两个样本不相关。

H1：给定样本之间存在某种依赖关系。

代码

# Python program to implement the Kendall's Rank Correlation test

# Importing the required classes
from scipy.stats import kendalltau

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Kendall rank Test
stat, p = kendalltau(sample1, sample2)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('Both data samples are dependent on each other')
else:
 print('The data samples are independent of each other')

输出

The statistic value is: 0.5111111111111111, and the p-value is 0.04662257495590829
Both data samples are dependent on each other

卡方检验

Pearson 检验只能用于数值。Spearman 和 Kendall 秩相关检验可用于序数数据。序数数据是具有一定顺序的分类数据。但对于名义数据（无序的分类数据），这些检验不能使用。要检验名义数据之间的依赖性或关系，我们使用卡方检验。

假设

用于计算列联表的观测值应是独立的。
列联表的每个单元格应包含超过 25 个观测值。

解释

H0：给定两个样本不相关。

H1：给定样本之间存在某种依赖关系。

代码

# Python program to implement the Chi-Squared test

# Importing the required classes
from scipy.stats import chi2_contingency

# Creating a sample observations table
table = [[30, 25, 34, 26, 31],[25, 29, 31, 34, 32]]

# Implementing the Chi-squared Test
stat, p, dof, expected_freq= chi2_contingency(table)

print("Expected Frequencies are", expected_freq)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The data samples are dependent on each other')
else:
 print('The data samples are independent of each other')

输出

Expected Frequencies are [[27.03703704 26.54545455 31.95286195 29.49494949 30.96969697]
 [27.96296296 27.45454545 33.04713805 30.50505051 32.03030303]]
The statistic value is: 1.8882030380034551, and the p-value is 0.7563117707680647
The data samples are independent of each other

平稳性检验

时间序列是一个非常重要的话题。在时间序列上执行的模型要求时间序列数据是平稳的。因此，要应用任何模型，我们首先需要检查时间序列数据是否平稳。现在我们将看到用于检查数据平稳性的检验。

增广迪基-富勒单位根检验

通过此检验，我们检查给定时间序列数据是否具有单位模根。或者，更专业地说，数据是否是自回归的？自回归时间序列是平稳的。如果时间序列具有单位模根，则它不平稳。

假设

观测值应按时间顺序排列。

解释

H0：时间序列具有单位根（序列不平稳）。

H1：不存在单位模根（序列是平稳的）。

代码

# Python program to implement the Augmented Dickey-Fuller unit root test

# Importing the required classes
from statsmodels.tsa.stattools import adfuller

# Creating a time series data
time_series = [2, 4, -1, -2, 5, 8, -4, -9, 9, 10]

# Implementing the Augmented Dickey-Fuller unit root test
stat, p, lag, o, c, t = adfuller(time_series)

print("The order of the autoregressive model is", lag)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The given time series is stationary')
else:
 print('The given time series is not stationary')

输出

The order of the autoregressive model is 1
The statistic value is: -10.232070586545865, and the p-value is 4.998574442108246e-18
The given time series is stationary

Kwiatkowski-Phillips-Schmidt-Shin

此检验用于检验给定时间序列是否具有平稳趋势。如果序列是趋势平稳的，则意味着该序列是确定性的。

假设

观测值应按时间顺序排列。

解释

H0：给定时间序列具有平稳趋势。

H1：给定时间序列不具有平稳趋势。

代码

# Python program to implement the Kwiatkowski Phillips Schmidt Shin test

# Importing the required classes
from statsmodels.tsa.stattools import kpss

# Creating a time series data
time_series = [2, 4, -1, -2, 5, 8, -4, -9, 9, 10]

# Implementing the Kwiatkowski Phillips Schmidt Shin Test
stat, p, lag, c= kpss(time_series)

print("The order of the autoregressive model is", lag)
print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The given time series is stationary')
else:
 print('The given time series is not stationary')

输出

The order of the autoregressive model is 0
The statistic value is: 0.09930151338766009, and the p-value is 0.1
The given time series is not stationary

参数统计假设检验

现在我们将看到参数检验。在这些检验中，我们检验一个或多个样本的某个参数是否等于或不同于某个值或彼此。

Student's t 检验

在此检验中，参数是给定样本的均值。我们检查两个样本的均值是否独立，换句话说，是否彼此显着不同。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都遵循正态分布。
两个样本的观测值具有相同的方差。

解释

H0：给定样本的均值相等。

H1：给定样本的均值不相等。

代码

# Python program to implement the Student's t-test

# Importing the required classes
from scipy.stats import ttest_ind

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Kwiatkowski Phillips Schmidt Shin Test
stat, p = ttest_ind(sample1, sample2)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The given samples have unequal mean values')
else:
 print('The given samples have equal mean values')

输出

The statistic value is: 0.6713796580759667, and the p-value is 0.5105037120903526
The given samples have equal mean values

配对 Student's t 检验

在此检验中，参数也是均值。但是，此检验用于两个样本配对的情况。如果两个值是在相同样本经过某个处理前后观测到的，则称这两个样本是配对的。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都遵循正态分布。
两个样本的观测值具有相同的方差。
每个样本的观测值都是配对的。

解释

H0：配对样本的均值相等。

H1：配对样本的均值不相等。

代码

# Python program to implement the Paired Student's t-test

# Importing the required classes
from scipy.stats import ttest_rel

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Paired Student's t-test
stat, p = ttest_rel(sample1, sample2)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The paired samples have unequal mean values')
else:
 print('The paired samples have equal mean values')

输出

The statistic value is: 0.9502747511161275, and the p-value is 0.36679175997294733
The paired samples have equal mean values

方差分析检验 (ANOVA)

在此检验中，我们使用方差来确定两个或多个样本是否彼此不同或相同。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都遵循正态分布。
两个样本的观测值具有相同的方差。

解释

H0：给定样本的均值相等。

H1：给定多个样本中的一个或多个均值不相等。

代码

# Python program to implement the Analysis of Variance Test

# Importing the required classes
from scipy.stats import f_oneway

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]
sample3 = [-0.308, 0.656, 0.918, -2.148, -0.413, 0.329, 0.157, 0.369, -0.850, -1.304]

# Implementing the Analysis of the Variance Test
stat, p = f_oneway(sample1, sample2, sample3)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The samples have unequal mean values')
else:
 print('The samples have equal mean values')

输出

The statistic value is: 0.3557581063875854, and the p-value is 0.7038772383760818
The samples have equal mean values

非参数统计假设检验

Mann-Whitney U 检验

此检验将检验从两个独立总体数据中抽取的样本是否相等。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都经过排序。

解释

H0：独立样本的基础分布是相同的。

H1：独立样本的基础分布不同。

代码

# Python program to implement the Mann-Whitney U Test

# Importing the required classes
from scipy.stats import mannwhitneyu

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Mann-Whitney U Test
stat, p = mannwhitneyu(sample1, sample2)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The samples have different distributions')
else:
 print('The samples have same distributions')

输出

The statistic value is: 60.0, and the p-value is 0.47267559351158717
The samples have the same distributions

Wilcoxon 符号秩检验

此检验用于检验给定两个或多个配对观测样本的分布是否相等。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都经过排序。
每个样本的观测值都是配对的。

解释

H0：独立样本的基础分布是相同的。

H1：独立样本的基础分布不同。

代码

# Python program to implement the Wilcoxon Signed-Rank Test

# Importing the required classes
from scipy.stats import wilcoxon

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Wilcoxon Signed-Rank Test
stat, p = wilcoxon(sample1, sample2)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The samples have different distributions')
else:
 print('The samples have the same distributions')

输出

The statistic value is: 15.0, and the p-value is 0.232421875
The samples have the same distributions

Kruskal-Wallis H 检验

此检验用于检验给定两个或多个观测样本的分布是否相等。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都经过排序。

解释

H0：独立样本的基础分布是相同的。

H1：独立样本的基础分布不同。

代码

# Python program to implement the Kruskal-Wallis H Test

# Importing the required classes
from scipy.stats import kruskal

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]

# Implementing the Kruskal-Wallis H Test
stat, p = kruskal(sample1, sample2)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The samples have different distributions')
else:
 print('The samples have the same distributions')

输出

The statistic value is: 0.5714285714285694, and the p-value is 0.4496917979688917
The samples have the same distributions

Friedman 检验

此检验用于检验给定两个或多个配对观测样本的分布是否相等。

假设

每个样本的观测值本质上是独立的，并且它们是同分布的。
两个样本的观测值都经过排序。
每个样本的观测值都是配对的。

解释

H0：独立样本的基础分布是相同的。

H1：独立样本的基础分布不同。

代码

# Python program to implement the Friedman Test

# Importing the required classes
from scipy.stats import friedmanchisquare

# Creating two random data samples
sample1 = [0.843, 3.817, 0.221, -0.445, -0.455, -1.236, 0.660, -1.428, -1.337, -1.769]
sample2 = [0.363, 3.317, 0.165, -7.525, -0.565, -1.546, 3.450, -1.558, -3.577, -1.279]
sample3 = [-0.308, 0.656, 0.918, -2.148, -0.413, 0.329, 0.157, 0.369, -0.850, -1.304]

# Implementing the Friedman Test
stat, p = friedmanchisquare(sample1, sample2, sample3)

print(f'The statistic value is: {stat}, and the p-value is {p}')

# Checking if the p-value is less than the level of significance 0.05
if p < 0.05:
 print('The samples have different distributions')
else:
 print('The samples have the same distributions')

输出

The statistic value is: 2.4000000000000057, and the p-value is 0.3011942119122012
The samples have the same distributions

总结

在本教程中，您了解了可以在机器学习项目中应用的主要假设检验。

特别是，您发现了

根据具体情况使用的多种检验类型，包括正态性检查、变量之间的相关性以及样本的配对性质。
每个检验的主要假设以及如何评估结果。
如何使用 Python API 执行检验？

下一主题在 Python 中克隆带有随机和下一个指针的链表

Python 中的 15 个统计假设检验

正态性检验

Shapiro-Wilk 检验

D'Agostino's K^2 检验

Anderson-Darling 检验

相关性检验

Pearson 相关系数

Spearman 秩相关

Kendall 秩相关

卡方检验

平稳性检验

增广迪基-富勒单位根检验

Kwiatkowski-Phillips-Schmidt-Shin

参数统计假设检验

Student's t 检验

配对 Student's t 检验

方差分析检验 (ANOVA)

非参数统计假设检验

Mann-Whitney U 检验

Wilcoxon 符号秩检验

Kruskal-Wallis H 检验

Friedman 检验

总结

联系信息

关注我们

教程

面试题

在线编译器

Python

Java

.Net Framework

AI, ML and Data Science

Cloud Technology

B.Tech and MCA

Web Technology

PHP

Software Testing

Technical Interview

Java Interview

Python

Web Interview

Database Interview

B.Tech / MCA

Important Interview

Software Testing Interview

Company Interviews

Online Compilers

Multiple Choice Questions

Python 问题

Python 中的 15 个统计假设检验

正态性检验

Shapiro-Wilk 检验

D'Agostino's K^2 检验

Anderson-Darling 检验

相关性检验

Pearson 相关系数

Spearman 秩相关

Kendall 秩相关

卡方检验

平稳性检验

增广迪基-富勒单位根检验

Kwiatkowski-Phillips-Schmidt-Shin

参数统计假设检验

Student's t 检验

配对 Student's t 检验

方差分析检验 (ANOVA)

非参数统计假设检验

Mann-Whitney U 检验

Wilcoxon 符号秩检验

Kruskal-Wallis H 检验

Friedman 检验

总结

相关帖子

Python 异步编程 - asyncio 和 await

Python Gzip 模块

有多少 Python 包

如何在 Python 中写平方根

Python 集合转列表

正则表达式

Python 中的 argparse

Python 的 Configparser 模块

Twitter API Python

Python 自动化模块

订阅 Tpoint Tech

联系信息

关注我们

教程

面试题

在线编译器