Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2021, Vol. 15 Issue (2) : 152317    https://doi.org/10.1007/s11704-020-9194-x
RESEARCH ARTICLE
Attention based simplified deep residual network for citywide crowd flows prediction
Genan DAI1, Xiaoyang HU1, Youming GE1, Zhiqing NING1, Yubao LIU1,2()
1. School of Data and Computer Science, Sun Yat-Sen University, Guangzhou 510006, China
2. Guangdong Key Laboratory of Big Data Analysis and Processing, Guangzhou 510006, China
 Download: PDF(969 KB)  
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Crowd flows prediction is an important problem of urban computing whose goal is to predict the number of incoming and outgoing people of regions in the future. In practice, emergency applications often require less training time. However, there is a little work on how to obtain good prediction performance with less training time. In this paper, we propose a simplified deep residual network for our problem. By using the simplified deep residual network, we can obtain not only less training time but also competitive prediction performance compared with the existing similar method. Moreover, we adopt the spatio-temporal attention mechanism to further improve the simplified deep residual network with reasonable additional time cost. Based on the real datasets, we construct a series of experiments compared with the existing methods. The experimental results confirm the efficiency of our proposed methods.

Keywords crowd flows prediction      spatio-temporal data mining      attention     
Corresponding Author(s): Yubao LIU   
Just Accepted Date: 17 January 2020   Issue Date: 24 December 2020
 Cite this article:   
Genan DAI,Xiaoyang HU,Youming GE, et al. Attention based simplified deep residual network for citywide crowd flows prediction[J]. Front. Comput. Sci., 2021, 15(2): 152317.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-020-9194-x
https://academic.hep.com.cn/fcs/EN/Y2021/V15/I2/152317
1 Y Zheng, L Capra, O Wolfson, H Yang. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology, 2014, 5(3): 38
https://doi.org/10.1145/2629592
2 J Zhang, Y Zheng, D Qi. Deep spatio-temporal residual networks for citywide crowd flows prediction. In: Proceedings of AAAI Conference on Artificial Intelligence. 2017, 1655–1661
3 L Wang, X Geng, X Ma, F Liu, Q Yang. Crowd flow prediction by deep spatio-temporal transfer learning. 2018, arXiv preprint arXiv:1802.00386
4 C Wu, T Yin, S Ge, K Yu. Ensemble learning for crowd flows prediction on campus. In: Proceedings of International Conference on Smart Computing and Communication. 2017, 103–113
https://doi.org/10.1007/978-3-319-73830-7_11
5 J Zhang, Y Zheng, D Qi, R Li, X Yi. DNN-based prediction model for spatio-temporal data. In: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 2016
https://doi.org/10.1145/2996913.2997016
6 X Song, Q Zhang, Y Sekimoto, R Shibasaki. Prediction of human emergency behavior and their mobility following large-scale disaster. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2014, 5–14
https://doi.org/10.1145/2623330.2623628
7 Z Fan, X Song, R Shibasaki, R Adachi. Citymomentum: an online approach for crowd behavior prediction at a citywide level. In: Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 2015, 559–569
https://doi.org/10.1145/2750858.2804277
8 R Silva, S M Kang, E M Airoldi. Predicting traffic volumes and estimating the effects of shocks in massive transportation systems. Proceedings of the National Academy of Sciences, 2015, 112(18): 5643–5648
https://doi.org/10.1073/pnas.1412908112
9 Y Xu, Q J Kong, R Klette, Y Liu. Accurate and interpretable bayesian mars for traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems, 2014, 15(6): 2457–2469
https://doi.org/10.1109/TITS.2014.2315794
10 J Bao, T He, S Ruan, Y Li, Y Zheng. Planning bike lanes based on sharingbikes’ trajectories. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2017, 1377–1386
https://doi.org/10.1145/3097983.3098056
11 Y Li, Y Zheng, H Zhang, L Chen. Traffic prediction in a bike-sharing system. In: Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. 2015
https://doi.org/10.1145/2820783.2820837
12 X Kong, Z Xu, G Shen, J Wang, Q Yang, B Zhang. Urban traffic congestion estimation and prediction based on floating car trajectory data. Future Generation Computer Systems, 2016, 61: 97–107
https://doi.org/10.1016/j.future.2015.11.013
13 Y Zheng, X Yi, M Li, R Li, Z Shan, E Chang, T Li. Forecasting finegrained air quality based on big data. In: Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015, 2267–2276
https://doi.org/10.1145/2783258.2788573
14 M M Hamed, H R Al-Masaeid, Z M B Said. Short-term prediction of traffic volume in urban arterials. Journal of Transportation Engineering, 1995, 121(3): 249–254
https://doi.org/10.1061/(ASCE)0733-947X(1995)121:3(249)
15 Q Y Ding, X F Wang, X Y Zhang, Z Q Sun. Forecasting traffic volume with space-time arima model. Advanced Materials Research, 2011, 156: 979–983
https://doi.org/10.4028/www.scientific.net/AMR.156-157.979
16 Y Bengio, P Simard, P Frasconi. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 1994, 5(2): 157–166
https://doi.org/10.1109/72.279181
17 S Hochreiter, J Schmidhuber. Long short-term memory. Neural Computation, 1997, 9(8): 1735–1780
https://doi.org/10.1162/neco.1997.9.8.1735
18 B Yu, H Yin, Z Zhu. Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting. In: Proceedings of International Joint Conferences on Artificial Intelligence. 2018, 3634–3640
https://doi.org/10.24963/ijcai.2018/505
19 Y LeCun, L Bottou, Y Bengio, P Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278–2324
https://doi.org/10.1109/5.726791
20 X Shi, Z Chen, H Wang, D Y Yeung, W K Wong, W C Woo. Convolutional LSTM network: a machine learning approach for precipitation nowcasting. In: Proceedings of Advances in Neural Information Processing Systems. 2015, 802–810
21 F Xiong, X Shi, D Y Yeung. Spatiotemporal modeling for crowd counting in videos. In: Proceedings of the IEEE International Conference on Computer Vision. 2017, 5151–5159
https://doi.org/10.1109/ICCV.2017.551
22 K He, X Zhang, S Ren, J Sun. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016, 770–778
https://doi.org/10.1109/CVPR.2016.90
23 L Chen, H Zhang, J Xiao, L Nie, J Shao, W Liu, T S Chua. SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 5659–5667
https://doi.org/10.1109/CVPR.2017.667
24 J Lu, C Xiong, D Parikh, R Socher. Knowing when to look: adaptive attention via a visual sentinel for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 375–383
https://doi.org/10.1109/CVPR.2017.345
25 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, A N Gomez, Ł Kaiser, I Polosukhin. Attention is all you need. In: Proceedings of Advances in Neural Information Processing Systems. 2017, 5998–6008
26 D Bahdanau, J Chorowski, D Serdyuk, P Brakel, Y Bengio. End-to-end attention-based large vocabulary speech recognition. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. 2016, 4945–4949
https://doi.org/10.1109/ICASSP.2016.7472618
27 J K Chorowski, D Bahdanau, D Serdyuk, K Cho, Y Bengio. Attentionbased models for speech recognition. In: Proceedings of Advances in Neural Information Processing Systems. 2015, 577–585
28 X Zhou, Y Shen, Y Zhu, L Huang. Predicting multi-step citywide passenger demands using attention-based neural networks. In: Proceedings of the 11th ACM International Conference onWeb Search and Data Mining. 2018, 736–744
https://doi.org/10.1145/3159652.3159682
29 X Geng, Y Li, L Wang, L Zhang, Q Yang, J Ye, Y Liu. Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting. In: Proceedings of AAAI Conference on Artificial Intelligence. 2019
https://doi.org/10.1609/aaai.v33i01.33013656
30 P Veliˇckovíc, G Cucurull, A Casanova, A Romero, P Lio, Y Bengio. Graph attention networks. In: Proceedings of International Conference on Learning Representations. 2018
31 Y Liang, S Ke, J Zhang, X Yi, Y Zheng. Geoman: multi-level attention networks for geo-sensory time series prediction. In: Proceedings of International Joint Conferences on Artificial Intelligence. 2018, 3428–3434
https://doi.org/10.24963/ijcai.2018/476
32 L Liu, R Zhang, J Peng, G Li, B Du, L Lin. Attentive crowd flow machines. In: Proceedings of the 26th ACM International Conference on Multimedia. 2018, 1553–1561
https://doi.org/10.1145/3240508.3240681
33 A Krizhevsky, I Sutskever, G E Hinton. Imagenet classification with deep convolutional neural networks. In: Proceedings of Advances in Neural Information Processing Systems. 2012, 1097–1105
34 X Hu, G Dai, Y Ge, Z Ning, Y Liu. A simplified deep residual network for citywide crowd flows prediction. In: Proceedings of the International Conference on Semantics, Knowledge and Grids. 2019, 60–67
https://doi.org/10.1109/SKG.2018.00016
35 D P Kingma, J Ba. Adam: a method for stochastic optimization. 2014, arXiv preprint arXiv:1412.6980
36 S Wu, Y Tang, Y Zhu, L Wang, X Xie, T Tan. Session-based recommendation with graph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2019, 346–353
https://doi.org/10.1609/aaai.v33i01.3301346
37 G E P Box, G M Jenkins, G C Reinsel, G M Ljung. Time series analysis: forecasting and control. Journal of the Operational Research Society, 2015, 22(2): 199–201
38 B M Williams, P K Durvasula, D E Brown. Urban freeway traffic flow prediction: application of seasonal autoregressive integrated moving average and exponential smoothing models. Transportation Research Record, 1998, 1644(1): 132–141
https://doi.org/10.3141/1644-14
39 H Lütkepohl. Vector Autoregressive Models. Cheltenham: Edward Elgar Publishing, 2013
40 M X Hoang, Y Zheng, A K Singh. FCCF: forecasting citywide crowd flows based on big data. In: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 2016
https://doi.org/10.1145/2996913.2996934
[1] Guoguang HUA, Lihong LI, Shiguang LIU. Multipath affinage stacked—hourglass networks for human pose estimation[J]. Front. Comput. Sci., 2020, 14(4): 144701-.
[2] Wenhao ZHENG, Hongyu ZHOU, Ming LI, Jianxin WU. CodeAttention: translating source code to comments by exploiting the code constructs[J]. Front. Comput. Sci., 2019, 13(3): 565-578.
[3] Wayne Xin ZHAO,Ji-Rong WEN,Xiaoming LI. Generating timeline summaries with social media attention[J]. Front. Comput. Sci., 2016, 10(4): 702-716.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed