a radical-aware attention-based model for chinese text ...home.ustc.edu.cn/~hqtao/index_files/aaai...

1
How to build a text classification model adapted to Chinese language? 4. Conclusion Introduction Motivation: Chinese is a language derived from Oracle Bone Inscriptions (pictographs), which is essentially different from English or other phonetic languages. Radical is a semantic unit of Chinese with some graphic characteristics, which might help us to recognize semantics. Fact : Most existing studies on text classification are professionally conducted for English, which may lose effectiveness on Chinese materials due to the huge difference between Chinese and English. Definition: To select the most appropriate assignment to an untagged text from a predefined set of tags. Radical-aware Attention-based Four-Granularity model 1. Dataset: We conduct experiments on two real-world datasets. Thirty - Third AAAI Conference on Artificial Intelligence (AAAI - 19), Honolulu, Hawaii, January 27 - February 1, 2019 University of Science and Technology of China (USTC) Challenges: Radical is useful, but how to introduce it to existing works is difficult. The properties of Chinese is hard to model. How to combine radicals with other features is also challenging. Speaker: Hanqing Tao Email: [email protected] Lab Homepage: http://bigdata.ustc.edu.cn/ A Radical-aware Attention-based Model for Chinese Text Classification Hanqing Tao 1 , Shiwei Tong 1 , Hongke Zhao 1 , Tong Xu 1, 2 *, Binbin Jin 1 , Qi Liu 1, 2 1 Anhui Province Key Lab. of Big Data Analysis and Application, University of Science and Technology of China, 2 School of Data Science, University of Science and Technology of China The peaks and valleys exactly reflect the classification effect of radicals. Radicals can help recognize semantics and classify Chinese texts. For example, the original meaning of radical “clothing” is closed to the concept of class “dress”, where the high tf-idf value is a convincing indication. Given: A predefined set of tags . Input space: An untagged text . Output space: The most appropriate assignment . Task: To learn a classification function : () . Implementation Attention Mechanism: 1) To capture the interrelations between radicals and their corresponding characters or words; 2) The radical information can be further modified by the attention weight sum of character context and word context. Experiments 2. Experimental Results of Different Methods 3. Discussion Our work presents a novel insight on how to leverage radicals. Simply introducing radicals to Chinese text classification cannot improve the performance well. Making rational use of radicals is necessary. Attention mechanism in RAFG can enhance the effect of radicals. Extensive experiments demonstrate the superiority of our model and the effectiveness of radicals in the task of Chinese text classification. Different from previous work, our goal is to take advantage of radicals and leverage four different granularities of features to comprehensively model Chinese texts. Furtherly, we systematically integrate these features into the task of Chinese text classification, so that to deal with the huge difference between Chinese and English. The Radical-aware Attention-based Four-Granularity (RAFG) Model Feature Acquisition Hidden Representation Calculation: Prediction: Objective Function: Word-level radicals are worthy of attention. Our model (RAFG) gains a higher performance than any other comparison methods Tf-idf Distributions of Some Radicals in 32 Classes A radical is often related to certain concepts Given a specific feature embedding sequence of a sentence = { 1 , 2 ,… , }, the hidden vector of a BLSTM is calculated as follows:

Upload: others

Post on 26-Mar-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Radical-aware Attention-based Model for Chinese Text ...home.ustc.edu.cn/~hqtao/index_files/aaai 19-poster-Tao.pdfSimply introducing radicals to Chinese text classification cannot

How to build a text

classification model

adapted to Chinese

language?

4. Conclusion

Introduction

Motivation: Chinese is a language derived from Oracle Bone Inscriptions (pictographs), which is essentially different from English or other phonetic languages. Radical is a semantic unit of Chinese with some graphic characteristics, which might help us to recognize semantics.

Fact : Most existing studies on text classification are professionally conducted for English, which may lose effectiveness on Chinese materials due to the huge difference between Chinese and English.

Definition: To select the most appropriate assignment to an untagged text from a predefined set of tags.

Radical-aware Attention-based Four-Granularity model

1. Dataset: We conduct experimentson two real-world datasets.

Thirty-Third AAAI Conference on

Artificial Intelligence (AAAI-19),

Honolulu, Hawaii,

January 27 - February 1, 2019

University of Science and

Technology of China

(USTC)

Challenges:Radical is useful, but how to introduce it to existing works is difficult.

The properties of Chinese is hard to model.

How to combine radicals with other features is also challenging.

Speaker: Hanqing Tao

Email: [email protected]

Lab Homepage: http://bigdata.ustc.edu.cn/

A Radical-aware Attention-based Model for Chinese Text ClassificationHanqing Tao1, Shiwei Tong1, Hongke Zhao1, Tong Xu1, 2*, Binbin Jin1, Qi Liu1, 2

1Anhui Province Key Lab. of Big Data Analysis and Application, University of Science and Technology of China,2School of Data Science, University of Science and Technology of China

The peaks and valleys exactly reflect the classification effect of radicals.

Radicals can help recognize semantics and classify Chinese texts.

For example, the original meaning of radical “clothing” is closed to the concept of class “dress”, where the high tf-idf value is a convincing indication.

Given: A predefined set of tags 𝑼.

Input space: An untagged text 𝑻.

Output space: The most appropriate assignment 𝑷∈𝑼.

Task: To learn a classification function 𝑭: 𝑭 (𝑻) → 𝑷.

Implementation

Attention Mechanism:

1) To capture the interrelationsbetween radicals and their corresponding characters or words;

2) The radical information can be further modified by the attention weight sum of character context and word context.

Experiments

2. Experimental Results of Different Methods

3. Discussion

Our work presents a novel insight on how to leverage radicals.

Simply introducing radicals to Chinese text classification cannot improve the performance well.

Making rational use of radicals is necessary.

Attention mechanism in RAFG can enhance the effect of radicals.

Extensive experiments demonstrate the superiority of our model and the effectiveness of radicals in the task of Chinese text classification.

Different from previous work, our goal is to take advantage of radicals and leverage four different granularities of features to comprehensively model Chinese texts. Furtherly, we systematically integrate these features into the task of Chinese text classification, so that to deal with the huge difference between Chinese and English.

The Radical-aware

Attention-based

Four-Granularity

(RAFG) Model

Feature

Acquisition

Hidden Representation Calculation:

Prediction:

Objective Function:

Word-level radicals are worthy of attention.

Our model (RAFG) gains a higher performance than any other comparison methods

Tf-idf Distributions of Some Radicals in 32 Classes

A radical is often related to certain concepts

Given a specific feature embedding sequence of a sentence 𝑠 = {𝑥1, 𝑥2, … , 𝑥𝑁}, the hidden vector of a BLSTM is calculated as follows: