|
|
Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet |
Haoyu ZHAO1, Weidong MIN2,3( ), Jianqiang XU1, Qi WANG1, Yi ZOU1, Qiyan FU1 |
1. School of Information Engineering, Nanchang University, Nanchang 330031, China 2. School of Software, Nanchang University, Nanchang 330047, China 3. Jiangxi Key Laboratory of Smart City, Nanchang 330047, China |
|
|
Abstract Crowd counting is recently becoming a hot research topic, which aims to count the number of the people in different crowded scenes. Existing methods are mainly based on training-testing pattern and rely on large data training, which fails to accurately count the crowd in real-world scenes because of the limitation of model’s generalization capability. To alleviate this issue, a scene-adaptive crowd counting method based on meta-learning with Dual-illumination Merging Network (DMNet) is proposed in this paper. The proposed method based on learning-to-learn and few-shot learning is able to adapt different scenes which only contain a few labeled images. To generate high quality density map and count the crowd in low-lighting scene, the DMNet is proposed, which contains Multi-scale Feature Extraction module and Element-wise Fusion Module. The Multi-scale Feature Extraction module is used to extract the image feature by multi-scale convolutions, which helps to improve network accuracy. The Element-wise Fusion module fuses the low-lighting feature and illumination-enhanced feature, which supplements the missing illumination in low-lighting environments. Experimental results on benchmarks, WorldExpo’10, DISCO, USCD, and Mall, show that the proposed method outperforms the existing state-of-the-art methods in accuracy and gets satisfied results.
|
Keywords
crowd counting
meta-learning
scene-adaptive
Dual-illumination Merging Network
|
Corresponding Author(s):
Weidong MIN
|
Just Accepted Date: 06 August 2021
Issue Date: 01 March 2022
|
|
1 |
Q Wang , J Gao , W Lin , X Li . NWPU-crowd: a large-scale benchmark for crowd counting and localization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 43( 6): 2141– 2149
|
2 |
Y Liu , Q Wen , H Chen , W Liu , J Qin , G Han , S He . Crowd counting via cross-stage refinement networks. IEEE Transactions on Image Processing, 2020, 29 : 6800– 6812
|
3 |
J Gao , Q Wang , X Li . PCC Net: perspective crowd counting via spatial convolutional network. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 10): 3486– 3498
|
4 |
M K K Reddy, M A Hossain, M Rochan, Y Wang. Few-shot scene adaptive crowd counting using meta-learning. In: Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (WACV). 2020, 2803−2812
|
5 |
X Liu, J Van De Weijer, A D Bagdanov. Leveraging unlabeled data for crowd counting by learning to rank. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 7661−7669
|
6 |
C Zhang, H Li, X Wang, X Yang. Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2015, 833−841
|
7 |
C C Loy, S Gong, T Xiang. From semi-supervised to transfer counting of crowds. In: Proceedings of the 2013 IEEE International Conference on Computer Vision. 2013, 2256−2263
|
8 |
C Finn, P Abbeel, S Levine. Model-agnostic meta-learning for fast adaptation of deep networks. In: Proceedings of the 34th International Conference on Machine Learning. 2017, 1126−1135
|
9 |
M Zhao , C Zhang , J Zhang , F Porikli , B Ni , W Zhang . Scale-aware crowd counting via depth-embedded convolutional neural networks. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 10): 3651– 3662
|
10 |
Y Fang , S Gao , J Li , W Luo , L He , B Hu . Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting. Neurocomputing, 2020, 392 : 98– 107
|
11 |
D B Sam , S V Peri , M N Sundararaman , A Kamath , R V Babu . Locate, size, and count: accurately resolving people in dense crowds via detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43( 8): 2739– 2751
|
12 |
L Liu , H Lu , H Xiong , K Xian , Z Cao , C Shen . Counting objects by blockwise classification. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 10): 3513– 3527
|
13 |
X Wu , Y Zheng , H Ye , W Hu , T Ma , J Yang , L He . Counting crowds with varying densities via adaptive scenario discovery framework. Neurocomputing, 2020, 397 : 127– 138
|
14 |
D Hu, L Mou, Q Wang, J Gao, Y Hua, D Dou, X X Zhu. Ambient sound helps: audiovisual crowd counting in extreme conditions. 2020, arXiv preprint arXiv: 2005.07097
|
15 |
H Zhao , W Min , X Wei , Q Wang , Q Fu , Z Wei . MSR-FAN: multi-scale residual feature-aware network for crowd counting. IET Image Processing, 2021, 15( 14): 3512– 3521
https://doi.org/10.1049/ipr2.12175
|
16 |
H Zheng , Z Lin , J Cen , Z Wu , Y Zhao . Cross-line pedestrian counting based on spatially-consistent two-stage local crowd density estimation and accumulation. IEEE Transactions on Circuits and Systems for Video Technology, 2019, 29( 3): 787– 799
|
17 |
Z Shen, Y Xu, B Ni, M Wang, J Hu, X Yang. Crowd counting via adversarial cross-scale consistency pursuit. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 5245−5254
|
18 |
B Yang , W Zhan , N Wang , X Liu , J Lv . Counting crowds using a scale-distribution-aware network and adaptive human-shaped kernel. Neurocomputing, 2020, 390 : 207– 216
|
19 |
Z Zou , Y Cheng , X Qu , S Ji , X Guo , P Zhou . Attend to count: crowd counting with adaptive capacity multi-scale CNNs. Neurocomputing, 2019, 367 : 75– 83
|
20 |
L Wang , B Yin , X Tang , Y Li . Removing background interference for crowd counting via de-background detail convolutional network. Neurocomputing, 2019, 322 : 360– 371
|
21 |
J Chen , Z Wang . Crowd counting with segmentation attention convolutional neural network. IET Image Processing, 2021, 15( 6): 1221– 1231
https://doi.org/10.1049/ipr2.12099
|
22 |
S Jiang , X Lu , Y Lei , L Liu . Mask-aware networks for crowd counting. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 9): 3119– 3129
|
23 |
W Min , M Fan , X Guo , Q Han . A new approach to track multiple vehicles with the combination of robust detection and two classifiers. IEEE Transactions on Intelligent Transportation Systems, 2018, 19( 1): 174– 186
|
24 |
H Yang , L Liu , W Min , X Yang , X Xiong . Driver yawning detection based on subtle facial action recognition. IEEE Transactions on Multimedia, 2020, 23 : 572– 583
https://doi.org/10.1109/TMM.2020.2985536
|
25 |
Q Wang , W Min , D He , S Zou , T Huang , Y Zhang , R Liu . Discriminative fine-grained network for vehicle re-identification using two-stage re-ranking. Science China Information Sciences, 2020, 63( 11): 212102–
https://doi.org/10.1007/s11432-019-2811-8
|
26 |
Y Ma , G Zhong , W Liu , Y Wang , P Jiang , R Zhang . ML-CGAN: conditional generative adversarial network with a meta-learner structure for high-quality image generation with few training data. Cognitive Computation, 2021, 13( 2): 418– 430
|
27 |
I Jung, K You, H Noh, M Cho, B Han. Real-time object tracking via meta-learning: efficient model adaptation and one-shot channel pruning. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 11205−11212, doi:
|
28 |
T Elsken, B Staffler, J H Metzen, F Hutter. Meta-learning of neural architectures for few-shot learning. In: Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2020, 12362−12372
|
29 |
C Xu , J Shen , X Du . A method of few-shot network intrusion detection based on meta-learning framework. IEEE Transactions on Information Forensics and Security, 2020, 15 : 3540– 3552
|
30 |
H J Ye , X R Sheng , D C Zhan . Few-shot learning with adaptively initialized task optimizer: a practical meta-learning approach. Machine Learning, 2020, 109( 3): 643– 664
|
31 |
A Nichol, J Achiam, J Schulman. On first-order meta-learning algorithms. 2018, arXiv preprint arXiv: 1803.02999v3
|
32 |
D Wang , Y Cheng , M Yu , X Guo , T Zhang . A hybrid approach with optimization-based and metric-based meta-learner for few-shot learning. Neurocomputing, 2019, 349 : 202– 211
|
33 |
N Lai , M Kan , C Han , X Song , S Shan . Learning to learn adaptive classifier–predictor for few-shot learning. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32( 8): 3458– 3470
https://doi.org/10.1109/TNNLS.2020.3011526
|
34 |
A B Chan, Z S J Liang, N Vasconcelos. Privacy preserving crowd monitoring: counting people without people models or tracking. In: Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. 2008, 1−7
|
35 |
Q Zhang , Y Nie , W S Zheng . Dual illumination estimation for robust exposure correction. Computer Graphics Forum, 2019, 38( 7): 243– 252
|
36 |
Y Zhang, J Zhang, X Guo. Kindling the darkness: a practical low-light image enhancer. In: Proceedings of the 27th ACM International Conference on Multimedia. 2019, 1632−1640
|
37 |
C Wei, W Wang, W Yang, J Liu. Deep Retinex decomposition for low-light enhancement. 2018, arXiv preprint arXiv: 1808.04560
|
38 |
X Guo , Y Li , H Ling . LIME: low-light image enhancement via illumination map estimation. IEEE Transactions on Image Processing, 2017, 26( 2): 982– 993
|
39 |
Y Li, X Zhang, D Chen. CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 1091−1100
|
40 |
W Liu, M Salzmann, P Fua. Context-aware crowd counting. In: Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2019, 5094-5103
|
41 |
J Chu , Z Guo , L Leng . Object detection based on multi-layer convolution feature fusion and online hard example mining. IEEE Access, 2018, 6 : 19959– 19967
|
42 |
Y Zhang , J Chu , L Leng , J Miao . Mask-Refined R-CNN: a network for refining object details in instance segmentation. Sensors, 2020, 20( 4): 1010–
|
43 |
Y Zhang, D Zhou, S Chen, S Gao, Y Ma. Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016, 589−597
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|