We propose knowledge internalization (KI), which aims to complement the lexical knowledge into neural dialog models. Instead of further conditioning the knowledge-grounded dialog (KGD) models on externally retrieved knowledge, we seek to integrate knowledge about each input token internally into the model's parameters. To tackle the challenge due to the large scale of lexical knowledge, we adopt the contrastive learning approach and create an effective token-level lexical knowledge retriever that requires only weak supervision mined from Wikipedia. We demonstrate the effectiveness and general applicability of our approach on various datasets and diversified model structures.
Lexical Knowledge Internalization for Neural Dialog Generation
Knowledge internalization integrates lexical knowledge into neural dialog models' parameters through contrastive learning, enhancing effectiveness across datasets and model structures.
- Year
- 2022
- Venue
- ACL 2022 5
- Authors
- 5
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2205.01941ARXIV-DEFAULT
- TL;DR
- Semantic Scholar