开发也能看懂的大模型：CNN_老汪软件技巧-棋牌游戏开发

作者：老汪软件技巧
发表时间：2024-11-25 07:03
浏览量：

前情提要：后端开发自学AI知识，内容比较浅显，偏实战；仅适用于入门了解，解决日常沟通障碍。

卷积神经网络（CNN）是一种专门用于处理数据具有格点结构（如图像）的深度学习模型。CNN尤其擅长处理图像数据，广泛应用于图像分类、目标检测等领域。

它的基本原理和结构包括以下几个关键部分：

卷积层（Convolutional Layer）：

激活函数（Activation Function）：

池化层（Pooling Layer）：

全连接层（Fully Connected Layer）：

Dropout层：

Softmax层：

工作原理：应用场景

卷积神经网络广泛应用于图像处理、自然语言处理以及其他需要处理结构化数据的任务中。

1. 图像相关任务1.1 图像分类1.2 目标检测1.3 图像分割1.4 图像生成与修复2. 视频分析2.1 视频分类2.2 动作识别2.3 视频分割案例：使用CNN进行CIFAR-10图像分类

这个案例将使用CIFAR-10数据集，该数据集是机器学习领域常用的一个图像识别任务。

环境准备

确保安装了以下Python库：

步骤 1：加载与预处理数据

首先，我们需要加载CIFAR-10数据集，并将其分成训练集和测试集。此外，还需对数据进行归一化处理，以提高模型的收敛速度。

CIFAR-10数据集可以通过其官方网站下载，以下是相关的下载地址：

from tensorflow.keras.datasets import cifar10
from tensorflow.keras.utils import to_categorical
# 加载数据集
(x_train, y_train), (x_test, y_test) = cifar10.load_data()
# 将图像像素值归一化到0到1之间
x_train, x_test = x_train / 255.0, x_test / 255.0
# 将标签转化为one-hot编码
y_train = to_categorical(y_train, 10)
y_test = to_categorical(y_test, 10)

步骤 2：构建CNN模型

定义一个简单的卷积神经网络模型。这个模型包含多个卷积层、池化层和全连接层。

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout
model = Sequential([
    Conv2D(32, (3, 3), activation='relu', input_shape=(32, 32, 3)),
    MaxPooling2D((2, 2)),
    
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    
    Flatten(),
    Dense(64, activation='relu'),
    Dropout(0.5),
    Dense(10, activation='softmax')
])

步骤 3：编译模型

选择合适的损失函数，优化器和评估指标来编译模型。

model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])

步骤 4：训练模型

使用训练数据拟合模型，通过验证集监控性能。

history = model.fit(x_train, y_train, epochs=10, 
                    validation_data=(x_test, y_test), 
                    batch_size=64)

步骤 5：评估模型

在测试数据集上评估模型性能。

test_loss, test_acc = model.evaluate(x_test, y_test, verbose=2)
print(f"Test accuracy: {test_acc}")

步骤 6：结果可视化

绘制训练过程中的损失和准确率曲线。

import matplotlib.pyplot as plt
plt.plot(history.history['accuracy'], label='accuracy')
plt.plot(history.history['val_accuracy'], label='val_accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.ylim([0, 1])
plt.legend(loc='lower right')
plt.show()

比较尴尬，服务器性能太差，任务直接被kill掉，有人跑出来可评论区留言。

总结

通过上述步骤，我们成功地训练了一个卷积神经网络来对CIFAR-10的数据进行分类。我们可以进一步对模型进行调整，比如增加更多的卷积层、更复杂的架构、数据增强等，以改善模型性能。希望这对你理解CNN在图像分类中的应用有所帮助！

解释步骤 2：构建CNN模型

在步骤 2 中，我们构建一个卷积神经网络（CNN）模型，以便对图像数据进行分类。

_基于模型开发_cnn模型搭建

下面逐层解释这个模型的构建过程：

导入所需模块：

模型结构：

model = Sequential([
    Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)),
    MaxPooling2D((2, 2)),
    
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    Flatten(),
    Dense(64, activation='relu'),
    Dropout(0.5),
    Dense(10, activation='softmax')
])

MaxPooling2D((2, 2)) :

Conv2D(64, (3, 3), activation='relu') :

MaxPooling2D((2, 2)) :

Flatten() :

Dense(64, activation='relu') :

Dropout(0.5) :

Dense(10, activation='softmax') :

通过这个结构，模型能够提取图像中的空间层次特征，并最终根据特征判断图像属于哪个类别。不同的层负责不同的功能，如卷积层提取特征，池化层降低计算量，全连接层整合信息，而Dropout层则有助于提高模型的泛化能力。

完整代码


from tensorflow.keras.datasets import cifar10
from tensorflow.keras.utils import to_categorical
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout
import matplotlib.pyplot as plt
# 加载数据集
(x_train, y_train), (x_test, y_test) = cifar10.load_data()
# 将图像像素值归一化到0到1之间
x_train, x_test = x_train / 255.0, x_test / 255.0
# 将标签转化为one-hot编码
y_train = to_categorical(y_train, 10)
y_test = to_categorical(y_test, 10)
# 定义模型
model = Sequential([
    Conv2D(32, (3, 3), activation='relu', input_shape=(32, 32, 3)),
    MaxPooling2D((2, 2)),
    
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    
    Flatten(),
    Dense(64, activation='relu'),
    Dropout(0.5),
    Dense(10, activation='softmax')
])
# 编译模型
model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
# 训练模型
history = model.fit(x_train, y_train, epochs=10, 
                    validation_data=(x_test, y_test), 
                    batch_size=64)
# 评估模型
test_loss, test_acc = model.evaluate(x_test, y_test, verbose=2)
print(f"Test accuracy: {test_acc}")
# 可视化
plt.plot(history.history['accuracy'], label='accuracy')
plt.plot(history.history['val_accuracy'], label='val_accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.ylim([0, 1])
plt.legend(loc='lower right')
# plt.show()
plt.savefig('output.png')  # 保存为文件

上一条查看详情 +批量迁移 GitLab 仓库到 Forgejo

下一条 查看详情 +没有了