助力智能人群检测计数，基于YOLOv6开发构建通用场景下人群检测计数系统-编程知识

助力智能人群检测计数，基于YOLOv6开发构建通用场景下人群检测计数系统

在一些人流量比较大的场合，或者是一些特殊时刻、时段、节假日等特殊时期下，密切关注当前系统所承载的人流量是十分必要的，对于超出系统负荷容量的情况做到及时预警对于管理团队来说是保障人员安全的重要手段，本文的主要目的是想要基于通用的数据开发构建用于通用场景下的人群检测计数系统。

前文我们基于比较经典的YOLOv3开发实现了检测计数系统，感兴趣的话可以自行移步阅读即可：
《助力智能人群检测计数，基于YOLOv3开发构建通用场景下人群检测计数识别系统》

《助力智能人群检测计数，基于YOLOv5全系列模型【n/s/m/l/x】开发构建通用场景下人群检测计数识别系统》

首先看下实例效果：

Yolov6是美团开发的轻量级检测算法，截至目前为止该算法已经迭代到了4.0版本，每一个版本都包含了当时最优秀的检测技巧和最最先进的技术，YOLOv6的Backbone不再使用Cspdarknet，而是转为比Rep更高效的EfficientRep；它的Neck也是基于Rep和PAN搭建了Rep-PAN；而Head则和YOLOX一样，进行了解耦，并且加入了更为高效的结构。YOLOv6也是沿用anchor-free的方式，抛弃了以前基于anchor的方法。除了模型的结构之外，它的数据增强和YOLOv5的保持一致；而标签分配上则是和YOLOX一样，采用了simOTA；并且引入了新的边框回归损失：SIOU。
YOLOv5和YOLOX都是采用多分支的残差结构CSPNet，但是这种结构对于硬件来说并不是很友好。所以为了更加适应GPU设备，在backbone上就引入了ReVGG的结构，并且基于硬件又进行了改良，提出了效率更高的EfficientRep。RepVGG为每一个3×3的卷积添加平行了一个1x1的卷积分支和恒等映射的分支。这种结构就构成了构成一个RepVGG Block。和ResNet不同的是，RepVGG是每一层都添加这种结构，而ResNet是每隔两层或者三层才添加。RepVGG介绍称，通过融合而成的3x3卷积结构，对计算密集型的硬件设备很友好。

简单看下实例数据情况：

训练数据配置文件如下所示：

# Please insure that your custom_dataset are put in same parent dir with YOLOv6_DIR
train: ./dataset/images/train # train images
val: ./dataset/images/test # val images
test: ./dataset/images/test # test images (optional)# whether it is coco dataset, only coco dataset should be set to True.
is_coco: False# Classes
nc: 1  # number of classes# class names
names: ['person']

默认我先选择的是yolov6m系列的模型，基于finetune来进行模型的开发：

# YOLOv6m model
model = dict(type='YOLOv6m',pretrained='weights/yolov6m.pt',depth_multiple=0.60,width_multiple=0.75,backbone=dict(type='CSPBepBackbone',num_repeats=[1, 6, 12, 18, 6],out_channels=[64, 128, 256, 512, 1024],csp_e=float(2)/3,fuse_P2=True,),neck=dict(type='CSPRepBiFPANNeck',num_repeats=[12, 12, 12, 12],out_channels=[256, 128, 128, 256, 256, 512],csp_e=float(2)/3,),head=dict(type='EffiDeHead',in_channels=[128, 256, 512],num_layers=3,begin_indices=24,anchors=3,anchors_init=[[10,13, 19,19, 33,23],[30,61, 59,59, 59,119],[116,90, 185,185, 373,326]],out_indices=[17, 20, 23],strides=[8, 16, 32],atss_warmup_epoch=0,iou_type='giou',use_dfl=True,reg_max=16, #if use_dfl is False, please set reg_max to 0distill_weight={'class': 0.8,'dfl': 1.0,},)
)solver = dict(optim='SGD',lr_scheduler='Cosine',lr0=0.0032,lrf=0.12,momentum=0.843,weight_decay=0.00036,warmup_epochs=2.0,warmup_momentum=0.5,warmup_bias_lr=0.05
)data_aug = dict(hsv_h=0.0138,hsv_s=0.664,hsv_v=0.464,degrees=0.373,translate=0.245,scale=0.898,shear=0.602,flipud=0.00856,fliplr=0.5,mosaic=1.0,mixup=0.243,
)

终端执行：

python tools/train.py --batch-size 16 --conf configs/yolov6m_finetune.py --data data/self.yaml --fuse_ab --device 0 --name yolov6m --epochs 100 --workers 2

即可启动训练。

日志输出如下所示：

训练完成如下所示：

Training completed in 10.652 hours.
loading annotations into memory...
Done (t=0.27s)
creating index...
index created!
Loading and preparing results...
DONE (t=0.77s)
creating index...
index created!
Running per image evaluation...
Evaluate annotation type *bbox*
DONE (t=61.14s).
Accumulating evaluation results...
DONE (t=0.65s).Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.499Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.946Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.459Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.441Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.614Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.671Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.030Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.220Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.565Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.523Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.673Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.722

推理实例如下所示：