Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet

doi:10.1007/s11704-021-1207-x

Frontiers of Computer Science

2023, Vol. 17

Issue (1): 171304 https://doi.org/10.1007/s11704-021-1207-x

本期目录

Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet

Haoyu ZHAO¹, Weidong MIN^2,³(

), Jianqiang XU¹, Qi WANG¹, Yi ZOU¹, Qiyan FU¹

¹. School of Information Engineering, Nanchang University, Nanchang 330031, China
². School of Software, Nanchang University, Nanchang 330047, China
³. Jiangxi Key Laboratory of Smart City, Nanchang 330047, China

全文: PDF(6952 KB) HTML

Abstract：

Crowd counting is recently becoming a hot research topic, which aims to count the number of the people in different crowded scenes. Existing methods are mainly based on training-testing pattern and rely on large data training, which fails to accurately count the crowd in real-world scenes because of the limitation of model’s generalization capability. To alleviate this issue, a scene-adaptive crowd counting method based on meta-learning with Dual-illumination Merging Network (DMNet) is proposed in this paper. The proposed method based on learning-to-learn and few-shot learning is able to adapt different scenes which only contain a few labeled images. To generate high quality density map and count the crowd in low-lighting scene, the DMNet is proposed, which contains Multi-scale Feature Extraction module and Element-wise Fusion Module. The Multi-scale Feature Extraction module is used to extract the image feature by multi-scale convolutions, which helps to improve network accuracy. The Element-wise Fusion module fuses the low-lighting feature and illumination-enhanced feature, which supplements the missing illumination in low-lighting environments. Experimental results on benchmarks, WorldExpo’10, DISCO, USCD, and Mall, show that the proposed method outperforms the existing state-of-the-art methods in accuracy and gets satisfied results.

Key words： crowd counting meta-learning scene-adaptive Dual-illumination Merging Network

收稿日期: 2021-04-28 出版日期: 2022-03-01

Corresponding Author(s): Weidong MIN

引用本文:

. [J]. Frontiers of Computer Science, 2023, 17(1): 171304.
Haoyu ZHAO, Weidong MIN, Jianqiang XU, Qi WANG, Yi ZOU, Qiyan FU. Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet. Front. Comput. Sci., 2023, 17(1): 171304.

链接本文:

https://academic.hep.com.cn/fcs/CN/10.1007/s11704-021-1207-x
https://academic.hep.com.cn/fcs/CN/Y2023/V17/I1/171304

Fig.1

Fig.2

Fig.3

Fig.4

Fig.5

Fig.6

Fig.7

Fig.8

Tab.1

Tab.2

Tab.3

Tab.4

Tab.5

Tab.6

Tab.7

1	Q Wang , J Gao , W Lin , X Li . NWPU-crowd: a large-scale benchmark for crowd counting and localization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 43( 6): 2141– 2149
2	Y Liu , Q Wen , H Chen , W Liu , J Qin , G Han , S He . Crowd counting via cross-stage refinement networks. IEEE Transactions on Image Processing, 2020, 29 : 6800– 6812
3	J Gao , Q Wang , X Li . PCC Net: perspective crowd counting via spatial convolutional network. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 10): 3486– 3498
4	M K K Reddy, M A Hossain, M Rochan, Y Wang. Few-shot scene adaptive crowd counting using meta-learning. In: Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (WACV). 2020, 2803−2812
5	X Liu, J Van De Weijer, A D Bagdanov. Leveraging unlabeled data for crowd counting by learning to rank. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 7661−7669
6	C Zhang, H Li, X Wang, X Yang. Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2015, 833−841
7	C C Loy, S Gong, T Xiang. From semi-supervised to transfer counting of crowds. In: Proceedings of the 2013 IEEE International Conference on Computer Vision. 2013, 2256−2263
8	C Finn, P Abbeel, S Levine. Model-agnostic meta-learning for fast adaptation of deep networks. In: Proceedings of the 34th International Conference on Machine Learning. 2017, 1126−1135
9	M Zhao , C Zhang , J Zhang , F Porikli , B Ni , W Zhang . Scale-aware crowd counting via depth-embedded convolutional neural networks. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 10): 3651– 3662
10	Y Fang , S Gao , J Li , W Luo , L He , B Hu . Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting. Neurocomputing, 2020, 392 : 98– 107
11	D B Sam , S V Peri , M N Sundararaman , A Kamath , R V Babu . Locate, size, and count: accurately resolving people in dense crowds via detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43( 8): 2739– 2751
12	L Liu , H Lu , H Xiong , K Xian , Z Cao , C Shen . Counting objects by blockwise classification. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 10): 3513– 3527
13	X Wu , Y Zheng , H Ye , W Hu , T Ma , J Yang , L He . Counting crowds with varying densities via adaptive scenario discovery framework. Neurocomputing, 2020, 397 : 127– 138
14	D Hu, L Mou, Q Wang, J Gao, Y Hua, D Dou, X X Zhu. Ambient sound helps: audiovisual crowd counting in extreme conditions. 2020, arXiv preprint arXiv: 2005.07097
15	H Zhao , W Min , X Wei , Q Wang , Q Fu , Z Wei . MSR-FAN: multi-scale residual feature-aware network for crowd counting. IET Image Processing, 2021, 15( 14): 3512– 3521 https://doi.org/10.1049/ipr2.12175
16	H Zheng , Z Lin , J Cen , Z Wu , Y Zhao . Cross-line pedestrian counting based on spatially-consistent two-stage local crowd density estimation and accumulation. IEEE Transactions on Circuits and Systems for Video Technology, 2019, 29( 3): 787– 799
17	Z Shen, Y Xu, B Ni, M Wang, J Hu, X Yang. Crowd counting via adversarial cross-scale consistency pursuit. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 5245−5254
18	B Yang , W Zhan , N Wang , X Liu , J Lv . Counting crowds using a scale-distribution-aware network and adaptive human-shaped kernel. Neurocomputing, 2020, 390 : 207– 216
19	Z Zou , Y Cheng , X Qu , S Ji , X Guo , P Zhou . Attend to count: crowd counting with adaptive capacity multi-scale CNNs. Neurocomputing, 2019, 367 : 75– 83
20	L Wang , B Yin , X Tang , Y Li . Removing background interference for crowd counting via de-background detail convolutional network. Neurocomputing, 2019, 322 : 360– 371
21	J Chen , Z Wang . Crowd counting with segmentation attention convolutional neural network. IET Image Processing, 2021, 15( 6): 1221– 1231 https://doi.org/10.1049/ipr2.12099
22	S Jiang , X Lu , Y Lei , L Liu . Mask-aware networks for crowd counting. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 9): 3119– 3129
23	W Min , M Fan , X Guo , Q Han . A new approach to track multiple vehicles with the combination of robust detection and two classifiers. IEEE Transactions on Intelligent Transportation Systems, 2018, 19( 1): 174– 186
24	H Yang , L Liu , W Min , X Yang , X Xiong . Driver yawning detection based on subtle facial action recognition. IEEE Transactions on Multimedia, 2020, 23 : 572– 583 https://doi.org/10.1109/TMM.2020.2985536
25	Q Wang , W Min , D He , S Zou , T Huang , Y Zhang , R Liu . Discriminative fine-grained network for vehicle re-identification using two-stage re-ranking. Science China Information Sciences, 2020, 63( 11): 212102– https://doi.org/10.1007/s11432-019-2811-8
26	Y Ma , G Zhong , W Liu , Y Wang , P Jiang , R Zhang . ML-CGAN: conditional generative adversarial network with a meta-learner structure for high-quality image generation with few training data. Cognitive Computation, 2021, 13( 2): 418– 430
27	I Jung, K You, H Noh, M Cho, B Han. Real-time object tracking via meta-learning: efficient model adaptation and one-shot channel pruning. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 11205−11212, doi:
28	T Elsken, B Staffler, J H Metzen, F Hutter. Meta-learning of neural architectures for few-shot learning. In: Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2020, 12362−12372
29	C Xu , J Shen , X Du . A method of few-shot network intrusion detection based on meta-learning framework. IEEE Transactions on Information Forensics and Security, 2020, 15 : 3540– 3552
30	H J Ye , X R Sheng , D C Zhan . Few-shot learning with adaptively initialized task optimizer: a practical meta-learning approach. Machine Learning, 2020, 109( 3): 643– 664
31	A Nichol, J Achiam, J Schulman. On first-order meta-learning algorithms. 2018, arXiv preprint arXiv: 1803.02999v3
32	D Wang , Y Cheng , M Yu , X Guo , T Zhang . A hybrid approach with optimization-based and metric-based meta-learner for few-shot learning. Neurocomputing, 2019, 349 : 202– 211
33	N Lai , M Kan , C Han , X Song , S Shan . Learning to learn adaptive classifier–predictor for few-shot learning. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32( 8): 3458– 3470 https://doi.org/10.1109/TNNLS.2020.3011526
34	A B Chan, Z S J Liang, N Vasconcelos. Privacy preserving crowd monitoring: counting people without people models or tracking. In: Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. 2008, 1−7
35	Q Zhang , Y Nie , W S Zheng . Dual illumination estimation for robust exposure correction. Computer Graphics Forum, 2019, 38( 7): 243– 252
36	Y Zhang, J Zhang, X Guo. Kindling the darkness: a practical low-light image enhancer. In: Proceedings of the 27th ACM International Conference on Multimedia. 2019, 1632−1640
37	C Wei, W Wang, W Yang, J Liu. Deep Retinex decomposition for low-light enhancement. 2018, arXiv preprint arXiv: 1808.04560
38	X Guo , Y Li , H Ling . LIME: low-light image enhancement via illumination map estimation. IEEE Transactions on Image Processing, 2017, 26( 2): 982– 993
39	Y Li, X Zhang, D Chen. CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 1091−1100
40	W Liu, M Salzmann, P Fua. Context-aware crowd counting. In: Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2019, 5094-5103
41	J Chu , Z Guo , L Leng . Object detection based on multi-layer convolution feature fusion and online hard example mining. IEEE Access, 2018, 6 : 19959– 19967
42	Y Zhang , J Chu , L Leng , J Miao . Mask-Refined R-CNN: a network for refining object details in instance segmentation. Sensors, 2020, 20( 4): 1010–
43	Y Zhang, D Zhou, S Chen, S Gao, Y Ma. Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016, 589−597

[1]

Highlights

Download

Viewed

Full text

Abstract

Cited

Shared

Discussed