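Computer science
Embedding
Scalability
Graph
Theoretical computer science
Artificial intelligence
Graph embedding
Set (abstract data type)
Machine learning
Database
Programming language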
Authors
Rahul Ragesh, Sundararajan Sellamanickam, Arun Iyer, Ram Bairi, Vijay Lingam
Source
Journal: Cornell University - arXiv
Date: 2020-01-01
Citations: 3
Identifier
DOI: 10.48550/arxiv.2008.12842
Abstract
We consider the problem of learning efficient and inductive graph convolutional networks for text classification with a large number of examples and features. Existing state-of-the-art graph embedding based methods such as predictive text embedding (PTE) and TextGCN have shortcomings in terms of predictive performance, scalability and inductive capability. To address these limitations, we propose a heterogeneous graph convolutional network (HeteGCN) modeling approach that unites the best aspects of PTE and TextGCN. The main idea is to learn feature embeddings and derive document embeddings using a HeteGCN architecture with different graphs used across layers. We simplify TextGCN by dissecting it into several HeteGCN models, which (a) helps to study the usefulness of individual models and (b) offers flexibility in fusing learned embeddings from different models. In effect, the number of model parameters is reduced significantly, enabling faster training and improving performance in scenarios with a small labeled training set. Our detailed experimental studies demonstrate the efficacy of the proposed approach.
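The idea of using different graphs across layers can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`normalize_rows`, `hetero_forward`), the row normalization, the toy co-occurrence graph, and the choice of a feature-feature graph in the first layer followed by a document-feature graph in the second are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def normalize_rows(m):
    """Scale each row to sum to 1; all-zero rows stay zero."""
    sums = m.sum(axis=1, keepdims=True)
    return np.divide(m, sums, out=np.zeros_like(m, dtype=float), where=sums != 0)

def hetero_forward(feat_graph, doc_feat, w0, w1):
    """Two-layer forward pass where each layer propagates over a different graph.

    feat_graph: (n_feats, n_feats) feature-feature graph (assumed here).
    doc_feat:   (n_docs, n_feats) document-feature graph, e.g. term counts.
    w0, w1:     weights; their shapes do not depend on the number of documents.
    """
    # Layer 1: learn feature embeddings over the feature-feature graph.
    # With one-hot input features, F @ I @ W0 reduces to F @ W0.
    feat_emb = np.maximum(normalize_rows(feat_graph) @ w0, 0.0)  # ReLU
    # Layer 2: derive document representations over the document-feature graph.
    doc_logits = normalize_rows(doc_feat) @ feat_emb @ w1
    return feat_emb, doc_logits

# Toy data (hypothetical sizes, random values; no training is performed).
rng = np.random.default_rng(0)
n_docs, n_feats, hidden, n_classes = 4, 6, 3, 2
doc_feat = rng.integers(0, 3, size=(n_docs, n_feats)).astype(float)
feat_graph = doc_feat.T @ doc_feat  # toy co-occurrence graph, an assumption
w0 = rng.normal(size=(n_feats, hidden))
w1 = rng.normal(size=(hidden, n_classes))

feat_emb, doc_logits = hetero_forward(feat_graph, doc_feat, w0, w1)
print(feat_emb.shape, doc_logits.shape)
```

In this sketch the weight matrices are sized by the number of features, not the number of documents, so a previously unseen document can be scored by passing only its feature row as `doc_feat`. That is one plausible reading of the inductive capability and reduced parameter count the abstract describes.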