Adaptive Framework for Deep Learning Based Dynamic and Temporal Topic Modeling from Big Data

计算机科学 自动汇总 主题模型 大数据 可扩展性 深度学习 机器学习 正规化(语言学) 人工智能 数据建模 情绪分析 流式数据 社会化媒体 数据科学 数据挖掘 万维网 数据库
作者
Ajeet Ram Pathak,Manjusha Pandey,Siddharth Swarup Rautaray
出处
期刊:Recent Patents on Engineering [Bentham Science]
卷期号:14 (3): 394-402 被引量:9
标识
DOI:10.2174/1872212113666190329234812
摘要

Background: The large amount of data emanated from social media platforms need scalable topic modeling in order to get current trends and themes of events discussed on such platforms. Topic modeling play crucial role in many natural language processing applications like sentiment analysis, recommendation systems, event tracking, summarization, etc. Objective: The aim of the proposed work is to adaptively extract the dynamically evolving topics over streaming data, and infer the current trends and get the notion of trend of topics over time. Because of various world level events, many uncorrelated streaming channels tend to start discussion on similar topics. We aim to find the effect of uncorrelated streaming channels on topic modeling when they tend to start discussion on similar topics. Methods: An adaptive framework for dynamic and temporal topic modeling using deep learning has been put forth in this paper. The framework approximates online latent semantic indexing constrained by regularization on streaming data using adaptive learning method. The framework is designed using deep layers of feedforward neural network. Results: This framework supports dynamic and temporal topic modeling. The proposed approach is scalable to large collection of data. We have performed exploratory data analysis and correspondence analysis on real world Twitter dataset. Results state that our approach works well to extract topic topics associated with a given hashtag. Given the query, the approach is able to extract both implicit and explicit topics associated with the terms mentioned in the query. Conclusion: The proposed approach is a suitable solution for performing topic modeling over Big Data. We are approximating the Latent Semantic Indexing model with regularization using deep learning with differentiable ℓ1 regularization, which makes the model work on streaming data adaptively at real-time. The model also supports the extraction of aspects from sentences based on interrelation of topics and thus, supports aspect modeling in aspect-based sentiment analysis.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
Doris完成签到 ,获得积分10
5秒前
benben应助FF采纳,获得10
7秒前
8秒前
11秒前
54zxy完成签到,获得积分10
13秒前
15秒前
小谷发布了新的文献求助10
17秒前
20秒前
SOLOMON应助何桉采纳,获得10
27秒前
北陌发布了新的文献求助10
27秒前
zzpj应助小谷采纳,获得10
29秒前
田様应助酷酷的小鸽子采纳,获得10
29秒前
29秒前
李冰发布了新的文献求助10
32秒前
32秒前
123发布了新的文献求助10
33秒前
科研通AI2S应助科研通管家采纳,获得10
33秒前
今后应助科研通管家采纳,获得10
33秒前
33秒前
36秒前
小徐发布了新的文献求助10
36秒前
38秒前
40秒前
41秒前
All发布了新的文献求助10
43秒前
开朗的骁应助翟函采纳,获得10
43秒前
蜗牛完成签到,获得积分10
46秒前
wjx发布了新的文献求助10
47秒前
个性的紫菜给勤恳飞风的求助进行了留言
48秒前
在水一方应助Sense采纳,获得10
54秒前
上官若男应助grumpysquirel采纳,获得10
57秒前
susui完成签到 ,获得积分10
59秒前
cctv18应助Asunnyya采纳,获得10
59秒前
1分钟前
wanci应助123采纳,获得10
1分钟前
yz完成签到 ,获得积分10
1分钟前
CodeCraft应助折镜采纳,获得10
1分钟前
星辰大海应助友好小笼包采纳,获得10
1分钟前
恰饭完成签到,获得积分10
1分钟前
高分求助中
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
Yuwu Song, Biographical Dictionary of the People's Republic of China 700
[Lambert-Eaton syndrome without calcium channel autoantibodies] 520
Sphäroguß als Werkstoff für Behälter zur Beförderung, Zwischen- und Endlagerung radioaktiver Stoffe - Untersuchung zu alternativen Eignungsnachweisen: Zusammenfassender Abschlußbericht 500
少脉山油柑叶的化学成分研究 430
Revolutions 400
MUL.APIN: An Astronomical Compendium in Cuneiform 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2454623
求助须知:如何正确求助?哪些是违规求助? 2126300
关于积分的说明 5415390
捐赠科研通 1854881
什么是DOI,文献DOI怎么找? 922509
版权声明 562340
科研通“疑难数据库(出版商)”最低求助积分说明 493579