简体繁体 English

使用深度学习技术的文本摘要

[英]Text summarization using deep learning techniques

原文 2014-05-19 04:53:35 2 2 python/ theano/ summarization/ deep-learning

I am trying to summarize text documents that belong to legal domain.我正在尝试总结属于法律领域的文本文档。

I am referring to the site deeplearning.net on how to implement the deep learning architectures.我指的是关于如何实现深度学习架构的网站 deeplearning.net。 I have read quite a few research papers on document summarization (both single document and multidocument) but I am unable to figure to how exactly the summary is generated for each document.我已经阅读了很多关于文档摘要（单文档和多文档）的研究论文，但我无法弄清楚每个文档的摘要是如何生成的。

Once the training is done, the network stabilizes during testing phase.训练完成后，网络会在测试阶段稳定下来。 So even if I know the set of features (which I have figured out) that are learnt during the training phase, it would be difficult to find out the importance of each feature (because the weight vector of the network is stabilized) during the testing phase where I will be trying to generate summary for each document.因此，即使我知道在训练阶段学习的一组特征（我已经弄清楚了），也很难在测试期间找出每个特征的重要性（因为网络的权重向量是稳定的）我将尝试为每个文档生成摘要的阶段。

I tried to figure this out for a long time but it's in vain.我试图弄清楚这一点很长一段时间，但它是徒劳的。

If anybody has worked on it or have any idea regarding the same, please give me some pointers.如果有人研究过它或对此有任何想法，请给我一些指示。 I really appreciate your help.我真的很感谢你的帮助。 Thank you.谢谢你。

2 个解决方案

I think you need to be a little more specific.我认为你需要更具体一点。 When you say "I am unable to figure to how exactly the summary is generated for each document", do you mean that you don't know how to interpret the learned features, or don't you understand the algorithm?当你说“我无法弄清楚每个文档的摘要是如何生成的”时，你的意思是你不知道如何解释学习到的特征，或者你不了解算法？ Also, "deep learning techniques" covers a very broad range of models - which one are you actually trying to use?此外，“深度学习技术”涵盖了非常广泛的模型——您实际尝试使用的是哪一个？

In the general case, deep learning models do not learn features that are humanly intepretable (albeit, you can of course try to look for correlations between the given inputs and the corresponding activations in the model).在一般情况下，深度学习模型不会学习人类无法理解的特征（尽管您当然可以尝试寻找给定输入与模型中相应激活之间的相关性）。 So, if that's what you're asking, there really is no good answer.所以，如果这就是你要问的，那么真的没有好的答案。 If you're having difficulties understanding the model you're using, I can probably help you :-) Let me know.如果您在理解您使用的模型时遇到困难，我可能可以帮助您 :-) 让我知道。

this is a blog series that talks in much detail from the very beginning of how text summarization works, recent research uses seq2seq deep learning based models, this blog series begins by explaining this architecture till reaching the newest research approaches这是一个博客系列，从一开始就详细讨论了文本摘要的工作原理，最近的研究使用了基于 seq2seq 深度学习的模型，这个博客系列从解释这种架构开始，直到达到最新的研究方法

Also this repo collects multiple implementations on building a text summarization model, it runs these models on google colab, and hosts the data on google drive, so no matter how powerful your computer is, you can use google colab which is a free system to train your deep models on此外，这个repo收集了构建文本摘要模型的多个实现，它在 google colab 上运行这些模型，并将数据托管在 google drive 上，因此无论您的计算机有多强大，您都可以使用 google colab 这是一个免费系统进行训练你的深层模型

If you like to see the text summarization in action, you can use this free api .如果您想查看实际的文本摘要，可以使用这个免费的 api 。

I truly hope this helps我真的希望这会有所帮助