面向一致性对话生成的对抗匹配网络与目标侧注意力机制研究

蔡恒毅* **; 王成瑞* **; 宋永浩*; 袁旭* **; 张程*; 赵晓芳*

文章摘要

蔡恒毅* **,王成瑞* **,宋永浩*,袁旭* **,张程*,赵晓芳*.面向一致性对话生成的对抗匹配网络与目标侧注意力机制研究[J].高技术通讯(中文),2022,32(2):131~142

面向一致性对话生成的对抗匹配网络与目标侧注意力机制研究

Research on towards coherent dialogue generation via adversarial matching network and target-side attention

DOI：10.3772/j.issn.1002-0470.2022.02.003

中文关键词: 生成式对话模型；神经张量网络；对抗学习；目标侧注意力机制

英文关键词: neural dialogue generation, neural tensor layer, adversarial learning, target-side attention mechanism

基金项目:

作者	单位
蔡恒毅* **
王成瑞* **
宋永浩*
袁旭* **
张程*
赵晓芳*

摘要点击次数: 1161

全文下载次数: 867

中文摘要:

序列到序列（seq2seq）方法在开放域对话生成领域中备受研究学者的关注。然而，标准的序列到序列模型容易产生语义冲突和不连贯的对话回复，这种不一致性是现有系统生成的回复显著有别于人类真实对话的重要原因之一。对话生成中的一致性既包括回复内部的语义一致性，也包括上文与其回复之间的外部关联性。本文提出了一个新的对话生成框架，称为基于张量匹配的生成式对抗网络（MatchGAN），以提高对话回复与其上文之间的外部关联性。与传统的基于最大似然估计的方法不同，该框架通过基于序列到序列模型的生成器和基于张量匹配网络的判别器之间的对抗学习来生成与上文相关的回复。通过使用匹配网络对上文与回复之间的多维关系进行建模，该模型所产生的回复更加符合人类对话的特点。此外，本研究进一步引入了目标侧注意力机制来增强所产生回复的内部语义一致性。实验结果表明，本文提出的框架能够产生高质量的对话回复，在量化指标评价和人工评测方面均优于其他基线方法。

英文摘要:

The sequence-to-sequence (seq2seq) approach has received great attention in the field of open-domain dialogue generation. However, the standard seq2seq model is prone to generate meaningless and incoherent responses, making it distinguish clearly from the human-human conversations. This coherence includes both the internal consistency of the long response and the external relevance between the post and its response. This work proposes a novel dialogue generation framework called matching-based generative adversarial network (MatchGAN) to improve external relevance. Instead of imitating the ground truth with supervised learning, this model can generate post-relevant responses through the generative-adversarial learning with a seq2seq-based generator and a matching-based discriminator, allowing the generated human-machine conversation more like a human-human conversation by discriminating whether a response is matching with the post. Furthermore the target-side attention mechanism is introduced to maintain the internal consistency of the generated responses. This new framework is able to generate coherent responses with high quality. Experimental results show that the proposed model can achieve substantial improvements in both metric-based and human evaluations among various baselines.

查看全文查看/发表评论下载PDF阅读器

关闭