Multi-Stage Prompting for Knowledgeable Dialogue Generation

Resource type
Conference Paper
Authors/contributors
Title
Multi-Stage Prompting for Knowledgeable Dialogue Generation
Abstract
Existing knowledge-grounded dialogue systems typically use finetuned versions of a pretrained language model (LM) and large-scale knowledge bases. These models typically fail to generalize on topics outside of the knowledge base, and require maintaining separate potentially large checkpoints each time finetuning is needed. In this paper, we aim to address these limitations by leveraging the inherent knowledge stored in the pretrained LM as well as its powerful generation ability. We propose a multi-stage prompting approach to generate knowledgeable responses from a single pretrained LM. We first prompt the LM to generate knowledge based on the dialogue context. Then, we further prompt it to generate responses based on the dialogue context and the previously generated knowledge. Results show that our knowledge generator outperforms the state-of-the-art retrieval-based model by 5.8% when combining knowledge relevance and correctness. In addition, our multi-stage prompting outperforms the finetuning-based dialogue model in terms of response knowledgeability and engagement by up to 10% and 5%, respectively. Furthermore, we scale our model up to 530 billion parameters and demonstrate that larger LMs improve the generation correctness score by up to 10%, and response relevance, knowledgeability and engagement by up to 10%. Our code is available at: https://github.com/NVIDIA/Megatron-LM.
Date
2022-05
Proceedings Title
Findings of the Association for Computational Linguistics: ACL 2022
Conference Name
Findings 2022
Place
Dublin, Ireland
Publisher
Association for Computational Linguistics
Pages
1317–1337
Accessed
24/02/2024, 17:42
Library Catalogue
ACLWeb
Citation
Liu, Z., Patwary, M., Prenger, R., Prabhumoye, S., Ping, W., Shoeybi, M., & Catanzaro, B. (2022). Multi-Stage Prompting for Knowledgeable Dialogue Generation. In S. Muresan, P. Nakov, & A. Villavicencio (Eds.), Findings of the Association for Computational Linguistics: ACL 2022 (pp. 1317–1337). Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.findings-acl.104