Adaptive Framework for Deep Learning Based Dynamic and Temporal Topic Modeling from Big Data

计算机科学自动汇总主题模型大数据可扩展性深度学习机器学习正规化（语言学）人工智能数据建模情绪分析流式数据社会化媒体数据科学数据挖掘万维网数据库

作者

Ajeet Ram Pathak,Manjusha Pandey,Siddharth Swarup Rautaray

出处

期刊：Recent Patents on Engineering [Bentham Science]
日期：2021-01-19 卷期号：14 (3): 394-402 被引量：9

标识

DOI：10.2174/1872212113666190329234812

摘要

Background: The large amount of data emanated from social media platforms need scalable topic modeling in order to get current trends and themes of events discussed on such platforms. Topic modeling play crucial role in many natural language processing applications like sentiment analysis, recommendation systems, event tracking, summarization, etc. Objective: The aim of the proposed work is to adaptively extract the dynamically evolving topics over streaming data, and infer the current trends and get the notion of trend of topics over time. Because of various world level events, many uncorrelated streaming channels tend to start discussion on similar topics. We aim to find the effect of uncorrelated streaming channels on topic modeling when they tend to start discussion on similar topics. Methods: An adaptive framework for dynamic and temporal topic modeling using deep learning has been put forth in this paper. The framework approximates online latent semantic indexing constrained by regularization on streaming data using adaptive learning method. The framework is designed using deep layers of feedforward neural network. Results: This framework supports dynamic and temporal topic modeling. The proposed approach is scalable to large collection of data. We have performed exploratory data analysis and correspondence analysis on real world Twitter dataset. Results state that our approach works well to extract topic topics associated with a given hashtag. Given the query, the approach is able to extract both implicit and explicit topics associated with the terms mentioned in the query. Conclusion: The proposed approach is a suitable solution for performing topic modeling over Big Data. We are approximating the Latent Semantic Indexing model with regularization using deep learning with differentiable ℓ1 regularization, which makes the model work on streaming data adaptively at real-time. The model also supports the extraction of aspects from sentences based on interrelation of topics and thus, supports aspect modeling in aspect-based sentiment analysis.

求助该文献

最长约 10秒，即可获得该文献文件

Adaptive Framework for Deep Learning Based Dynamic and Temporal Topic Modeling from Big Data

今日热心研友