1. Department of Computer Science, COMSATS University Islamabad, Attock Campus, Attock 43600, Pakistan 2. Department of Computer Science, Bahria University, Lahore 54600, Pakistan 3. Department of Industrial Security, Chung-Ang University, Seoul 06974, Republic of Korea 4. Department of Computer Engineering, Mokwon University, Daejeon 35349, Republic of Korea
Innovations on the Internet of Everything (IoE) enabled systems are driving a change in the settings where we interact in smart units, recognized globally as smart city environments. However, intelligent video-surveillance systems are critical to increasing the security of these smart cities. More precisely, in today’s world of smart video surveillance, person re-identification (Re-ID) has gained increased consideration by researchers. Various researchers have designed deep learning-based algorithms for person Re-ID because they have achieved substantial breakthroughs in computer vision problems. In this line of research, we designed an adaptive feature refinement-based deep learning architecture to conduct person Re-ID. In the proposed architecture, the inter-channel and inter-spatial relationship of features between the images of the same individual taken from nonidentical camera viewpoints are focused on learning spatial and channel attention. In addition, the spatial pyramid pooling layer is inserted to extract the multiscale and fixed-dimension feature vectors irrespective of the size of the feature maps. Furthermore, the model’s effectiveness is validated on the CUHK01 and CUHK02 datasets. When compared with existing approaches, the approach presented in this paper achieves encouraging Rank 1 and 5 scores of 24.6% and 54.8%, respectively.
Sequential arrangement of adaptive feature refinement block
1
P2
27
55
80
86
91
93
2
P3
19
57
?
?
?
?
3
P4
10
40
56
?
?
?
4
P5
13
28
46
57
?
?
Parallel arrangement of adaptive feature refinement block
6
P2
22
59
75
85
88
95
7
P3
23.1
52
?
?
?
?
8
P4
18
43
64
?
?
?
9
P5
13
28
51
62
?
?
Tab.2
No#
Camera view
Division
Testing subjects
1
P1
Training
971
2
P2
Testing
306
3
P3
Testing
109
4
P4
Testing
193
5
P5
Testing
239
Tab.3
Fig.10
Sr. No.
Authors
Method
Rank 1 score
Datasets
1
Li et al. [16]
Deep learning
24.26
CUHK01 and CUHK03
2
Koestinger et al. [48]
Distance metric learning
20.8
VIPeR and ToyCars
3
Zheng et al. [49]
Bag of words model
24.33
CUHK03
4
Fan et al. [50]
Clustering and fine-tuning
24.8
CUHK03
5
Feng et al. [51]
Distance metric learning
21.96
CUHK03
6
Proposed method
Deep learning
24.6
CUHK01 and CUHK02
Tab.4
1
P, Neirotti Marco A, De A C, Cagliano G, Mangano F Scorrano . Current trends in smart city initiatives: some stylised facts. Cities, 2014, 38: 25–36
2
P, Vlacheas R, Giaffreda V, Stavroulaki D, Kelaidonis V, Foteinos G, Poulios P, Demestichas A, Somov A R, Biswas K Moessner . Enabling smart cities through a cognitive management framework for the internet of things. IEEE Communications Magazine, 2013, 51( 6): 102–111
3
P, Singh A, Nayyar A, Kaur U Ghosh . Blockchain and fog based architecture for internet of everything in smart cities. Future Internet, 2020, 12( 4): 61
4
L, Zheng Y, Yang A G Hauptmann . Person re-identification: past, present and future. 2016, arXiv preprint arXiv: 1610.02984
5
D, Wu S J, Zheng X P, Zhang C A, Yuan F, Cheng Y, Zhao Y J, Lin Z-Q, Zhao Y L, Jiang D S Huang . Deep learning-based methods for person re-identification: a comprehensive review. Neurocomputing, 2019, 337: 354–371
6
A, Zahra N, Perwaiz M, Shahzad M M Fraz . Person re-identification: a retrospective on domain specific open challenges and future trends. 2022, arXiv preprint arXiv: 2202.13121
7
M, Ye J, Shen G, Lin T, Xiang L, Shao S C H Hoi . Deep learning for person re-identification: a survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44( 6): 2872–2893
8
W, Wu D, Tao H, Li Z, Yang J Cheng . Deep features for person re-identification on metric learning. Pattern Recognition, 2021, 110: 107424
9
X, Chen H, Xu Y, Li M Bian . Person re-identification by low-dimensional features and metric learning. Future Internet, 2021, 13( 11): 289
10
R, Li B, Zhang Z, Teng J Fan . A divide-and-unite deep network for person re-identification. Applied Intelligence, 2021, 51( 3): 1479–1491
11
Z, Ming M, Zhu X, Wang J, Zhu J, Cheng C, Gao Y, Yang X Wei . Deep learning-based person re-identification methods: a survey and outlook of recent works. Image and Vision Computing, 2022, 119: 104394
12
S, Lin C T Li . Person re-identification with soft biometrics through deep learning. In: Jiang R, Li C T, Crookes D, Meng W, Rosenberger C, eds. Deep Biometrics. Cham: Springer, 2020, 21–36
13
N, Shoukry El Ghany MA, Abd M A M Salem . Multi-modal long-term person re-identification using physical soft bio-metrics and body figure. Applied Sciences, 2022, 12( 6): 2835
14
A, Nambiar A, Bernardino J C Nascimento . Gait-based person re-identification: a survey. ACM Computing Surveys, 2020, 52( 2): 33
15
S, Woo J, Park J-Y, Lee I S Kweon . CBAM: convolutional block attention module. In: Proceedings of the 15th European Conference on Computer Vision. 2018, 3–19
16
W, Li R, Zhao T, Xiao X DeepReID: deep filter pairing neural network for person re-identification Wang . In: Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. 2014, 152–159
17
E, Ahmed M, Jones T K Marks . An improved deep learning architecture for person re-identification. In: Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition. 2015, 3908–3916
18
S, Chen J, Qin X, Ji B, Lei T, Wang D, Ni J-Z Cheng . Automatic scoring of multiple semantic attributes with multi-task feature leverage: a study on pulmonary nodules in CT images. IEEE Transactions on Medical Imaging, 2017, 36( 3): 802–814
19
Y, Huang H, Sheng Y, Zheng Z Xiong . DeepDiff: learning deep difference features on human body parts for person re-identification. Neurocomputing, 2017, 241: 191–203
20
H, Zhao M, Tian S, Sun J, Shao J, Yan S, Yi X, Wang X Tang . Spindle net: person re-identification with human body region guided feature decomposition and fusion. In: Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. 2017, 907–915
21
A, Hermans L, Beyer B Leibe . In defense of the triplet loss for person re-identification. 2017, arXiv preprint arXiv: 1703.07737
22
Z, He C, Jung Q, Fu Z Zhang . Deep feature embedding learning for person re-identification based on lifted structured loss. Multimedia Tools and Applications, 2019, 78( 5): 5863–5880
23
L, Wu Y, Wang X, Li J Gao . What-and-where to match: deep spatially multiplicative integration networks for person re-identification. Pattern Recognition, 2018, 76: 727–738
24
K, Chatfield K, Simonyan A, Vedaldi A Zisserman . Return of the devil in the details: delving deep into convolutional nets. In: Proceedings of British Machine Vision Conference. 2014
25
K, Simonyan A Zisserman . Very deep convolutional networks for large-scale image recognition. In: Proceedings of the 3rd International Conference on Learning Representations. 2015
26
L, Wu R, Hong Y, Wang M Wang . Cross-entropy adversarial view adaptation for person re-identification. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30( 7): 2081–2092
27
X, Zhu J, Liu H, Xie Z-J Zha . Adaptive alignment network for person re-identification. In: Proceedings of the 25th International Conference on Multimedia Modeling. 2019, 16–27
28
A, Wu W-S, Zheng J-H Lai . Robust depth-based person re-identification. IEEE Transactions on Image Processing, 2017, 26( 6): 2588–2603
29
Z, Imani H Soltanizadeh . Histogram of the node strength and histogram of the edge weight: two new features for RGB-D person re-identification. Science China Information Sciences, 2018, 61( 9): 092108
30
L, Ren J, Lu J, Feng J Zhou . Multi-modal uniform deep learning for RGB-D person re-identification. Pattern Recognition, 2017, 72: 446–457
31
A, Wu W-S, Zheng H-X, Yu S, Gong J Lai . RGB-infrared cross-modality person re-identification. In: Proceedings of 2017 IEEE International Conference on Computer Vision. 2017, 5390–5399
32
A, Møgelmose C, Bahnsen T B, Moeslund A, Clapes S Escalera . Tri-modal person re-identification with RGB, depth and thermal features. In: Proceedings of 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2013, 301–307
33
B N, Silva M, Khan K Han . Towards sustainable smart cities: a review of trends, architectures, components, and open challenges in smart cities. Sustainable Cities and Society, 2018, 38: 697–713
34
U, Majeed L U, Khan I, Yaqoob S M A, Kazmi K, Salah C S Hong . Blockchain for IoT-based smart cities: recent advances, requirements, and future challenges. Journal of Network and Computer Applications, 2021, 181: 103007
35
F, Ullah F, Al-Turjman A Nayyar . IoT-based green city architecture using secured and sustainable android services. Environmental Technology & Innovation, 2020, 20: 101091
36
J, Li J, Wang F Ullah . An end-to-end task-simplified and anchor-guided deep learning framework for image-based head pose estimation. IEEE Access, 2020, 8: 42458–42468
37
D H, Hubel T N Wiesel . Receptive fields and functional architecture of monkey striate cortex. The Journal of Physiology, 1968, 195( 1): 215–243
38
M, Bukhari K B, Bajwa S, Gillani M, Maqsood M Y, Durrani I, Mehmood H, Ugail S Rho . An efficient gait recognition method for known and unknown covariate conditions. IEEE Access, 2021, 9: 6465–6477
39
R, Ashraf S, Afzal A U, Rehman S, Gul J, Baber M, Bakhtyar I, Mehmood O Y, Song M Maqsood . Region-of-interest based transfer learning assisted framework for skin cancer detection. IEEE Access, 2020, 8: 147858–147871
40
M, Maqsood M, Bukhari Z, Ali S, Gillani I, Mehmood S, Rho Y-A Jung . A residual-learning-based multi-scale parallel-convolutions- assisted efficient CAD system for liver tumor detection. Mathematics, 2021, 9( 10): 1133
41
M, Maqsood S, Yasmin I, Mehmood M, Bukhari M Kim . An efficient DA-net architecture for lung nodule segmentation. Mathematics, 2021, 9( 13): 1457
42
Z, Niu G, Zhong H Yu . A review on the attention mechanism of deep learning. Neurocomputing, 2021, 452: 48–62
43
M H, Guo T X, Xu J J, Liu Z N, Liu P T, Jiang T J, Mu S H, Zhang R R, Martin M M, Cheng S M Hu . Attention mechanisms in computer vision: a survey. Computational Visual Media, 2022, 8( 3): 331–368
44
K, He X, Zhang S, Ren J Sun . Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37( 9): 1904–1916
45
S, Lazebnik C, Schmid J Ponce . Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2006, 2169–2178
46
W, Li X Wang . Locally aligned feature transforms across views. In: Proceedings of 2013 IEEE Conference on Computer Vision and Pattern Recognition. 2013, 3594–3601
47
W, Li R, Zhao X Wang . Human reidentification with transferred metric learning. In: Proceedings of the 11th Asian Conference on Computer Vision. 2012, 31–44
48
M, Köstinger M, Hirzer P, Wohlhart P M, Roth H Bischof . Large scale metric learning from equivalence constraints. In: Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition. 2012, 2288–2295
49
L, Zheng L, Shen L, Tian S, Wang J, Wang Q Tian . Scalable person re-identification: a benchmark. In: Proceedings of 2015 IEEE International Conference on Computer Vision. 2015, 1116–1124
50
H, Fan L, Zheng C, Yan Y Yang . Unsupervised person re-identification: clustering and fine-tuning. ACM Transactions on Multimedia Computing, Communications, and Applications, 2018, 14( 4): 83
51
G, Feng W, Liu D, Tao Y Zhou . Hessian regularized distance metric learning for people re-identification. Neural Processing Letters, 2019, 50( 3): 2087–2100