Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2024, Vol. 18 Issue (4) : 184340    https://doi.org/10.1007/s11704-023-3058-2
Artificial Intelligence
Discriminative explicit instance selection for implicit discourse relation classification
Wei SONG1(), Hongfei HAN1, Xu HAN1, Miaomiao CHENG1, Jiefu GONG2, Shijin WANG2, Ting LIU3
1. Information Engineering College, Capital Normal University, Beijing 100048, China
2. iFLYTEK AI Research Institute, Hefei 230088, China
3. Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, Harbin 150006, China
 Download: PDF(12174 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Discourse relation classification is a fundamental task for discourse analysis, which is essential for understanding the structure and connection of texts. Implicit discourse relation classification aims to determine the relationship between adjacent sentences and is very challenging because it lacks explicit discourse connectives as linguistic cues and sufficient annotated training data. In this paper, we propose a discriminative instance selection method to construct synthetic implicit discourse relation data from easy-to-collect explicit discourse relations. An expanded instance consists of an argument pair and its sense label. We introduce the argument pair type classification task, which aims to distinguish between implicit and explicit argument pairs and select the explicit argument pairs that are most similar to natural implicit argument pairs for data expansion. We also propose a simple label-smoothing technique to assign robust sense labels for the selected argument pairs. We evaluate our method on PDTB 2.0 and PDTB 3.0. The results show that our method can consistently improve the performance of the baseline model, and achieve competitive results with the state-of-the-art models.

Keywords discourse analysis      PDTB      discourse relation      implicit discourse relation classification      data expansion     
Corresponding Author(s): Wei SONG   
Issue Date: 15 December 2023
 Cite this article:   
Wei SONG,Hongfei HAN,Xu HAN, et al. Discriminative explicit instance selection for implicit discourse relation classification[J]. Front. Comput. Sci., 2024, 18(4): 184340.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-023-3058-2
https://academic.hep.com.cn/fcs/EN/Y2024/V18/I4/184340
Fig.1  An overview of the proposed method. We train an argument pair type classifier, which gives the probability of being naturally implicit for a given argument pair. Together with a label-smoothing strategy for sense assignment, synthetic data for implicit discourse relation classification is created
ModelF1ACC
BERT85.0985.10
RoBERTa85.9886.00
RoBERTa-Ind83.3983.40
Tab.1  Argument pair type classification results
SensePDTB 2.0PDTB 3.0
Comparison89,62192,794
Contingency25,41325,204
Expansion62,75672,517
Temporal8,9278,595
Tab.2  The number of expanded argument pairs for top-level class senses
ParametersValue
Padding size128
OptimizerAdam W
Batch size64
Dropout0.1
Learning rate3e?5
Warmup200
Tab.3  Hyper-parameter settings for training
MethodTop-level Second-level
Macro-F1ACCACC
PDTB 2.0
Domain-Adaptation [10]39.46? ?
Freely omissible [9]40.5051.70?
Multi-task Learning [11]44.9857.27?
Active Learning [35]44.4860.63?
Knowledge [36]52.8959.6648.23
Knowledge [26]51.2459.94?
Multi-task Learning [12]58.4865.2654.32
XLNet-large [37]59.1068.7061.29
BMGF-RoBERTa [27]63.3969.0658.13
Our Method64.2969.75 61.69
PDTB 3.0
LSTM [38]?? 43.41
XLNet-large [37]68.3073.8064.83
MTL [39]??63.30
BMGF-RoBERTa66.5771.0061.02
Our Method69.9474.98 64.20
Tab.4  Overall comparison results on PDTB 2.0 and PDTB 3.0
ModelTop-level Second-level
Macro-F1ACCACC
PDTB 2.0
Baseline62.8369.46 58.96
Low ambiguity62.2368.7958.83
Freely omissible61.5567.90?
Our method64.2969.7561.69
PDTB 3.0
Baseline68.8674.41 62.93
Low ambiguity68.9373.8363.16
Freely omissible66.7371.95?
Our method69.9474.9864.20
Tab.5  Overall results of explicit data expansion methods
ModelTop-level
ComConExpTem
PDTB 2.0
Baseline61.0661.9177.0051.32
Low ambiguity58.7962.8576.2051.09
Freely omissible60.1560.4975.6449.93
Our method62.2363.4976.7354.71
PDTB 3.0
Baseline62.8377.3078.5556.76
Low ambiguity63.1975.4277.9859.13
Freely omissible59.1074.2776.4357.14
Our method64.8077.7778.4858.69
Tab.6  Comparison results on four top-level class senses
ModelMacro-F14-way ACC14-way ACC
Intra-sentential relations
Baseline76.5884.5676.21
Our method77.5785.13 77.16
Inter-sentential relations
Baseline62.76 69.9857.05
Our method64.02 70.5758.43
Tab.7  Results on intra/inter-sentential discourse relations on PDTB 3.0
Top-level Second-level
Macro-F1ACCACC
PDTB 2.0
Baseline62.8369.46 58.96
LA62.2368.7958.83
LA + LS62.1269.4659.09
APTC + LS64.2969.7561.69
APTC + LSstd62.6068.7960.56
PDTB 3.0
Baseline68.8674.41 62.93
LA68.9373.8363.16
LA + LS67.15 73.01 63.00
APTC + LS69.9474.9864.20
APTC + LSstd68.7273.8963.27
Tab.8  Ablation study of the proposed method
ModelTop-level Second-level
Macro-F1ACCACC
PDTB 2.0
DeBERTa64.22 70.79 60.36
+DataExp64.79 71.12 61.17
PDTB 3.0
DeBERTa69.7674.57 65.02
+DataExp70.21 74.89 66.20
Tab.9  The performance of DeBERTa-large on IDRC
  
  
  
  
  
  
  
1 B, Webber M, Egg V Kordoni . Discourse structure and language technology. Natural Language Engineering, 2012, 18( 4): 437–490
2 J R Hobbs . Coherence and coreference. Cognitive Science, 1979, 3( 1): 67–90
3 W C, Mann S A Thompson . Rhetorical structure theory: toward a functional theory of text organization. Text-Interdisciplinary Journal for the Study of Discourse, 1988, 8( 3): 243–281
4 Prasad R, Dinesh N, Lee A, Miltsakaki E, Robaldo L, Joshi A, Webber B. The Penn discourse TreeBank 2.0. In: Proceedings of the 6th International Conference on Language Resources and Evaluation. 2008
5 E, Pitler M, Raghupathy H, Mehta A, Nenkova A, Lee A Joshi . Easily identifiable discourse relations. In: Proceedings of COLING 2008: Companion Volume: Posters. 2008, 87–90
6 M, Kurfalı R Östling . Let’s be explicit about that: distant supervision for implicit discourse relation classification via connective prediction. In: Proceedings of the 1st Workshop on Understanding Implicit and Underspecified Language. 2021, 1–10
7 D, Marcu A Echihabi . An unsupervised approach to recognizing discourse relations. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 2002, 368–375
8 C, Sporleder A Lascarides . Using automatically labelled examples to classify rhetorical relations: an assessment. Natural Language Engineering, 2008, 14( 3): 369–416
9 A, Rutherford N Xue . Improving the inference of implicit discourse relations via classifying explicit discourse connectives. In: Proceedings of 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2015, 799–808
10 Y, Ji G, Zhang J Eisenstein . Closing the gap: domain adaptation from explicit to implicit discourse relations. In: Proceedings of 2015 Conference on Empirical Methods in Natural Language Processing. 2015, 2219–2224
11 Liu Y, Li S, Zhang X, Sui Z. Implicit discourse relation classification via multi-task neural networks. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016, 2750–2756
12 Y, Kishimoto Y, Murawaki S Kurohashi . Adapting BERT to implicit discourse relation classification with a focus on discourse connectives. In: Proceedings of the 12th Language Resources and Evaluation Conference. 2020, 1152–1158
13 W, Song L Liu . Representation learning in discourse parsing: a survey. Science China Technological Sciences, 2020, 63( 10): 1921–1946
14 J, Li M, Liu B, Qin T Liu . A survey of discourse parsing. Frontiers of Computer Science, 2022, 16( 5): 165329
15 W, Xiang B Wang . A survey of implicit discourse relation recognition. ACM Computing Surveys, 2023, 55( 12): 258
16 B, Webber R, Prasad A, Lee A Joshi . The Penn discourse Treebank 3.0 annotation manual. Philadelphia: University of Pennsylvania, 2019
17 J, Devlin M W, Chang K, Lee K Toutanova . BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019, 4171–4186
18 Y, Liu M, Ott N, Goyal J, Du M, Joshi D, Chen O, Levy M, Lewis L, Zettlemoyer V Stoyanov . RoBERTa: a robustly optimized BERT pretraining approach. 2019, arXiv preprint arXiv: 1907.11692
19 Z, Lin M Y, Kan H T Ng . Recognizing implicit discourse relations in the Penn discourse Treebank. In: Proceedings of 2009 Conference on Empirical Methods in Natural Language Processing. 2009, 343–351
20 Y, Ji J Eisenstein . Representation learning for text-level discourse parsing. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2014, 13–24
21 L, Qin Z, Zhang H Zhao . Implicit discourse relation recognition with context-aware character-enhanced embeddings. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. 2016, 1914–1924
22 H, Li J, Zhang C Zong . Implicit discourse relation recognition for English and Chinese with multiview modeling and effective representation learning. ACM Transactions on Asian and Low-Resource Language Information Processing, 2017, 16( 3): 19
23 H, Bai H Zhao . Deep enhanced representation for implicit discourse relation recognition. In: Proceedings of the 27th International Conference on Computational Linguistics. 2018, 571–583
24 Lei W, Xiang Y, Wang Y, Zhong Q, Liu M, Kan M Y. Linguistic properties matter for implicit discourse relation recognition: combining semantic interaction, topic continuity and attribution. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence and 30th Innovative Applications of Artificial Intelligence Conference and 8th AAAI Symposium on Educational Advances in Artificial Intelligence. 2018, 594
25 W, Shi V Demberg . Next sentence prediction helps implicit discourse relation classification within and across domains. In: Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019, 5790–5796
26 R, He J, Wang F, Guo Y Han . TransS-driven joint learning architecture for implicit discourse relation recognition. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 139–148
27 Liu X, Ou J, Song Y, Jiang X. On the importance of word and sentence representation learning in implicit discourse relation classification. In: Proceedings of the 29th International Joint Conference on Artificial Intelligence. 2021, 3830–3836
28 Z, Dou Y, Hong Y, Sun G Zhou . CVAE-based Re-anchoring for implicit discourse relation classification. In: Proceedings of Findings of the Association for Computational Linguistics: EMNLP 2021. 2021, 1275–1283
29 X, Wang S, Li J, Li W Li . Implicit discourse relation recognition by selecting typical training examples. In: Proceedings of COLING 2012. 2012, 2757–2772
30 M, Lan J, Wang Y, Wu Z Y, Niu H Wang . Multi-task attention-based neural networks for implicit discourse relationship representation and identification. In: Proceedings of 2017 Conference on Empirical Methods in Natural Language Processing. 2017, 1299–1308
31 C, Wu X, Shi Y, Chen Y, Huang J Su . Bilingually-constrained synthetic data for implicit discourse relation recognition. In: Proceedings of 2016 Conference on Empirical Methods in Natural Language Processing. 2016, 2306–2312
32 Malmi E, Pighin D, Krause S, Kozhevnikov M. Automatic prediction of discourse connectives. In: Proceedings of the 11th International Conference on Language Resources and Evaluation. 2018
33 C, Szegedy V, Vanhoucke S, Ioffe J, Shlens Z Wojna . Rethinking the inception architecture for computer vision. In: Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. 2016, 2818–2826
34 Y, Yang K, Zha Y, Chen H, Wang D Katabi . Delving into deep imbalanced regression. In: Proceedings of the 38th International Conference on Machine Learning. 2021, 11842–11851
35 Y, Xu Y, Hong H, Ruan J, Yao M, Zhang G Zhou . Using active learning to expand training data for implicit discourse relation recognition. In: Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. 2018, 725–731
36 Z, Dai R Huang . A regularization approach for incorporating event knowledge and coreference relations into neural discourse parsing. In: Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019, 2976–2987
37 N, Kim S, Feng C, Gunasekara L Lastras . Implicit discourse relation classification: we need to talk about evaluation. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 5404–5414
38 Liang L, Zhao Z, Webber B. Extending implicit discourse relation recognition to the PDTB-3. In: Proceedings of the 1st Workshop on Computational Approaches to Discourse. 2020, 135–147
39 H, Kiyomaru S Kurohashi . Contextualized and generalized sentence representations by contrastive self-supervised learning: a case study on discourse relation analysis. In: Proceedings of 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2021, 5578–5584
40 P, He X, Liu J, Gao W Chen . DeBERTa: decoding-enhanced BERT with disentangled attention. In: Proceedings of the 9th International Conference on Learning Representations. 2021
[1] FCS-23058-OF-WS_suppl_1 Download
[1] Sheng XU, Peifeng LI, Qiaoming ZHU. Incorporating contextual evidence to improve implicit discourse relation recognition in Chinese[J]. Front. Comput. Sci., 2024, 18(3): 183312-.
[2] Jiaqi LI, Ming LIU, Bing QIN, Ting LIU. A survey of discourse parsing[J]. Front. Comput. Sci., 2022, 16(5): 165329-.
[3] Yu HONG, Siyuan DING, Yang XU, Xiaoxia JIANG, Yu WANG, Jianmin YAO, Qiaoming ZHU, Guodong ZHOU. Focus-sensitive relation disambiguation for implicit discourse relation detection[J]. Front. Comput. Sci., 2019, 13(6): 1266-1281.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed