目标检测 | MobileNetV3_mobilenetv3目标检测-CSDN博客

本文链接：https://blog.csdn.net/xiao_lxl/article/details/95479517

文章目录

轻量级网络
高效的网络构建模块
互补搜索
网络改进
非线性
MobileNetV3-Large网络结构
MobileNetV3-Small网络结构
实验结果

会议：CVPR 2019
标题：《Searching for MobileNetV3》

论文链接： https://arxiv.org/abs/1905.02244?context=cs

代码：感谢github上大佬们开源，开源代码整理如下：

（1）PyTorch实现1：https://github.com/xiaolai-sqlai/mobilenetv3
（2）PyTorch实现2：https://github.com/kuan-wang/pytorch-mobilenet-v3
（3）PyTorch实现3：https://github.com/leaderj1001/MobileNetV3-Pytorch
（4）Caffe实现：https://github.com/jixing0415/caffe-mobilenet-v3
（5）TensorFLow实现：https://github.com/Bisonai/mobilenetv3-tensorflow

本文仅作为个人学习笔记分享，图片来自于论文，如有侵权，请联系删除。

轻量级网络

从SqueezeNet开始模型的参数量就不断下降，为了进一步减少模型的实际操作数（MAdds），MobileNetV1利用了深度可分离卷积提高了计算效率，而MobileNetV2则加入了线性bottlenecks和反转残差模块构成了高效的基本模块。随后的ShuffleNet充分利用了组卷积和通道shuffle进一步提高模型效率。CondenseNet则学习保留有效的dense连接在保持精度的同时降低，ShiftNet则利用shift操作和逐点卷积代替了昂贵的空间卷积。

在这里插入图片描述
图1分别是MobileNetV3两个版本与其他轻量级网络在Pixel 1 手机上的计算延迟与ImageNet分类精度的比较。可见MobileNetV3 取得了显著的比较优势

在这里插入图片描述

图2是ImageNet分类精度、MAdd计算量、模型大小的比较，MobileNetV3依然是最优秀的。

高效的网络构建模块

MobileNetV3 是神经架构搜索得到的模型，其内部使用的模块继承自：

1. MobileNetV1 模型引入的深度可分离卷积（depthwise separable convolutions）；

2. MobileNetV2 模型引入的具有线性瓶颈的倒残差结构(the inverted residual with linear bottleneck)；

3. MnasNet 模型引入的基于squeeze and excitation结构的轻量级注意力模型。

这些被证明行之有效的用于移动端网络设计的模块是搭建MobileNetV3的积木。
在这里插入图片描述

互补搜索

在网络结构搜索中，作者结合两种技术：资源受限的NAS（platform-aware NAS）与NetAdapt，前者用于在计算和参数量受限的前提下搜索网络的各个模块，所以称之为模块级的搜索（Block-wise Search），后者用于对各个模块确定之后网络层的微调。

这两项技术分别来自论文：

M. Tan, B. Chen, R. Pang, V. Vasudevan, and Q. V. Le. Mnasnet: Platform-aware neural architecture search for mobile. CoRR, abs/1807.11626, 2018.

T. Yang, A. G. Howard, B. Chen, X. Zhang, A. Go, M. Sandler, V. Sze, and H. Adam. Netadapt: Platform-aware neural network adaptation for mobile applications. In ECCV, 2018

前者相当于整体结构搜索，后者相当于局部搜索，两者互为补充。