Bi-Weighting Domain Adaptation for Cross-Language Text Classification

资源分类

2019-11-12 |

49 |

41 |

Abstract Text classi?cation is widely used in many realworld applications. To obtain satis?ed classi?cation performance, most traditional data mining methods require lots of labeled data, which can be costly in terms of both time and human efforts. In reality, there are plenty of such resources in English since it has the largest population in the Internet world, which is not true in many other languages. In this paper, we present a novel transfer learning approach to tackle the cross-language text classi?cation problems. We ?rst align the feature spaces in both domains utilizing some on-line translation service, which makes the two feature spaces under the same coordinate. Although the feature sets in both domains are the same, the distributions of the instances in both domains are different, which violates the i.i.d. assumption in most traditional machine learning methods. For this issue, we propose an iterative feature and instance weighting (Bi-Weighting) method for domain adaptation. We empirically evaluate the effectiveness and ef?ciency of our approach. The experimental results show that our approach outperforms some baselines including four transfer learning algorithms.

上一篇：Constituent Grammatical Evolution Loukas Georgiou and William J. Teahan

下一篇：Pattern Field Classification with Style Normalized Transformation

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Learning to Predi...

Much of model-based reinforcement learning invo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
The Variational S...

Unlike traditional images which do not offer in...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Rating-Boosted La...

The performance of a recommendation system reli...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com