
Disconnected Recurrent Neural Networks for Text Categorization

Baoxin Wang
Joint Lab of HIT and iFLYTEK, iFLYTEK Research, Beijing, China

INTRODUCTION

RNN can model the entire sequence and capture long-term dependencies, but it does not do well at extracting key patterns. In contrast, a convolutional neural network (CNN) is good at extracting local and position-invariant features. We present a novel model named disconnected recurrent neural network (DRNN), which incorporates position-invariance into RNN by constraining the distance of information flow in RNN. DRNN achieves the best performance on several benchmark datasets for text categorization.

MOTIVATION


Case 1: One of the seven great unsolved mysteries of mathematics may have been cracked by a reclusive Russian.

Case 2: A reclusive Russian may have cracked one of the seven great unsolved mysteries of mathematics.

In both cases the key phrase that determines the category is "unsolved mysteries of mathematics", which CNN can extract well thanks to its position-invariance. RNN can model the whole sequence and capture long-term dependencies, yet it does not handle this case well: the representation of the key phrase depends on all of the preceding words, so it changes as the key phrase moves. DRNN is proposed to deal with such issues while eliminating the burden of modeling the entire sentence.

THE MODEL

DRNN limits the distance of information flow in RNN. The important difference from a standard RNN is that the state of our model at each step depends only on the previous k-1 words rather than on all of the previous words. DRNN can be viewed as a special 1D CNN in which the convolution filters are replaced with recurrent units.
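To illustrate this constrained information flow, here is a minimal sketch in PyTorch (the poster does not name a framework, so this is an assumption) that runs a GRU over every length-k window independently and keeps the final state of each window. The class name, zero-padding scheme, and choice of GRU are illustrative, not the authors' released code.

```python
import torch
import torch.nn as nn

class DisconnectedRNN(nn.Module):
    """Sketch of a disconnected recurrent layer: the output at position t
    is computed from only the k most recent words, by running a GRU over
    each length-k window independently and keeping its last hidden state."""

    def __init__(self, input_size, hidden_size, window_size):
        super().__init__()
        self.k = window_size
        self.gru = nn.GRU(input_size, hidden_size, batch_first=True)

    def forward(self, x):
        # x: (batch, seq_len, input_size)
        batch, seq_len, dim = x.shape
        # Left-pad with k-1 zero vectors so every position has a full window.
        pad = x.new_zeros(batch, self.k - 1, dim)
        x = torch.cat([pad, x], dim=1)
        # All sliding windows: (batch, seq_len, k, input_size).
        windows = x.unfold(1, self.k, 1).permute(0, 1, 3, 2)
        # Run the GRU over each window independently; keep the final state.
        _, h_n = self.gru(windows.reshape(batch * seq_len, self.k, dim))
        return h_n.reshape(batch, seq_len, -1)
```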

Figure: Dropout applied on hidden states as well.

Figure: Model architecture.
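A brief usage sketch of the layer above, with dropout applied on the hidden states as in the figure (the dropout rate and dimensions are illustrative assumptions):

```python
layer = DisconnectedRNN(input_size=128, hidden_size=256, window_size=5)
drop = nn.Dropout(p=0.5)  # rate is an assumption; the poster does not state it

x = torch.randn(2, 20, 128)  # (batch, seq_len, embedding_dim)
h = drop(layer(x))           # dropout on the hidden states
print(h.shape)               # torch.Size([2, 20, 256])
```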

EXPERIMENTS

We apply DRNN to text categorization and conduct experiments on several datasets. The table lists the error rates (%) on seven datasets.

Table: Comparison among models.

Table: Effects of recurrent units and pooling methods.

Table: Results of different window sizes.
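As context for the pooling comparison, one plausible classifier head is max-pooling over the disconnected hidden states followed by a linear layer. This is a hedged sketch of one such variant (max-pooling is one of the pooling methods compared; the class name and sizes are illustrative):

```python
class DRNNClassifier(nn.Module):
    """Hypothetical classifier head: max-pooling over the DRNN hidden
    states, then a linear layer producing class logits."""

    def __init__(self, input_size, hidden_size, window_size, num_classes):
        super().__init__()
        self.drnn = DisconnectedRNN(input_size, hidden_size, window_size)
        self.fc = nn.Linear(hidden_size, num_classes)

    def forward(self, x):
        h = self.drnn(x)              # (batch, seq_len, hidden_size)
        pooled = h.max(dim=1).values  # max-pooling over positions
        return self.fc(pooled)        # (batch, num_classes)
```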

The type of task has a great impact on the optimal window size, whereas the size of the training data has little effect on it.
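In practice this suggests tuning the window size k per task. A hedged sweep sketch, with the candidate values purely illustrative:

```python
for k in (3, 5, 10, 15):
    model = DRNNClassifier(input_size=128, hidden_size=256,
                           window_size=k, num_classes=4)
    # Train on the task's training set and pick k by validation error;
    # per the poster, the best k tracks the task type, not the data size.
```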

Table: Error rates on three datasets compared to RNN and CNN.

Figure: The influence of window size on DRNN compared to CNN.

ACKNOWLEDGMENTS & CONTACT

This work was supported by the National Key Research and Development Program of China (No. 2016YFC0800806). To get in touch with our lab (HFL), please scan the QR code on the right. E-mail: [email protected]