使用tensorflow的slim框架训练自己的图像识别模型

投稿
APP
微信扫一扫获取更多

使用tensorflow的slim框架训练自己的图像识别模型

兰舒凡

2018-08-23 21:30:29

图灵汇官网

图像识别模型实战指南：使用TensorFlow Slim框架

在进行图像识别模型开发的过程中，TensorFlow Slim框架提供了很好的API封装功能，使得构建和训练图像识别模型变得更加简单。本文将详细介绍如何使用Slim框架创建和训练自己的图像识别模型，并提供了一些实用技巧，以帮助读者快速上手。

InceptionV3模型简介

InceptionV3模型是Google LeNet模型的一个改进版本，其结构复杂且高效。更多关于InceptionV3的信息可以参考论文：https://arxiv.org/abs/1512.00567。

全文源码下载地址

感兴趣的读者可以在这里下载本文使用的源码：http://www.broadview.com.cn/book/5490。

训练思路

训练图像识别模型主要有三种思路：

只训练最终的全连接层：这种方法速度较快，但效果一般。
训练所有参数：这种方法速度较慢，但效果更好。
训练部分参数：例如训练深层参数，这是一种折中的方法。

本文将实现前两种方法，并使用TensorBoard对模型效果进行比较。

使用InceptionV3模型

本文采用的是InceptionV3模型，其框架结构如下：

文件结构

项目文件结构如下： - Useslim: 包含dataprepare和slim两个文件夹。 - data_prepare: 主要包含训练和验证数据集及生成TfRecord数据的函数。 - slim: 包含Slim框架的具体内容。

数据准备

分割数据集 将所有图片按目录结构存放，并将它们分为训练集和验证集。
生成TfRecord文件 生成TfRecord文件时，需要注意一些细节：
- 将xrange替换为range。
- 将shuffled_index = range(len(filenames))替换为shuffled_index = list(range(len(filenames)))。
- _process_image()函数中，打开文件时使用'rb'参数，_convert_to_example()函数中将相关参数转换为字节形式。

训练模型

下载Slim框架 Slim框架的下载地址为：https://github.com/tensorflow/models.git。
自定义数据集
- 在datasets文件夹中创建自定义的.py文件，例如从flowers.py拷贝到satellite.py，并进行相应修改。
- 注册自定义数据集：在dataset_factory.py中添加相应的注册语句。
- 创建satellite目录，包含data、train_dir和pretrained目录。
  - 将生成的TfRecord文件复制到data目录。
  - 下载InceptionV3模型，并解压后将.ckpt文件放入pretrained目录。
- 如果没有GPU，需要修改train_image_classifier.py文件中的参数，将clone_on_cpu设置为True。

训练命令示例

bash python train_image_classifier.py --train_dir=satellite/train_dir --dataset_name=satellite --dataset_split_name=train --dataset_dir=satellite/data --model_name=inception_v3 --checkpoint_path=satellite/pretrained/inception_v3.ckpt --checkpoint_exclude_scopes=InceptionV3/Logits,InceptionV3/AuxLogits --trainable_scopes=InceptionV3/Logits,InceptionV3/AuxLogits --max_number_of_steps=100000 --batch_size=32 --learning_rate=0.001 --learning_rate_decay_type=fixed --save_interval_secs=300 --save_summaries_secs=2 --log_every_n_steps=10 --optimizer=rmsprop --weight_decay=0.00004

验证模型性能

使用eval_image_classifier.py函数验证模型在验证集上的表现。由于数据集只有6类，可以将准确率改为top2准确率。

使用TensorBoard监控训练过程

使用以下命令启动TensorBoard： bash tensorboard --logdir satellite/train_dir 如果遇到问题，可以在浏览器中手动输入URL查看结果。

导出和使用模型

导出模型 使用export_inference_graph.py和freeze_graph.py导出和固化模型参数。 ```bash python exportinferencegraph.py --alsologtostderr --modelname=inceptionv3 --outputfile=satellite/inceptionv3infgraph.pb --dataset_name satellite

python freezegraph.py --inputgraph slim/satellite/inceptionv3infgraph.pb --inputcheckpoint slim/satellite/traindir/model.ckpt-202 --inputbinary true --outputnodenames InceptionV3/Predictions/Reshape1 --outputgraph slim/satellite/frozen_graph.pb ```
使用模型识别图片 使用classify_image_inception_v3.py识别图片。 bash python classify_image_inception_v3.py --model_path slim/satellite/frozen_graph.pb --label_path data_prepare/pic/label.txt --image_file test_image.jpg