计算机科学
自编码
代表(政治)
特征学习
人工智能
任务(项目管理)
模式
模态(人机交互)
多模式学习
多任务学习
自然语言处理
机器学习
深度学习
模式识别(心理学)
社会学
经济
管理
法学
政治
社会科学
政治学
作者
Paul Mohan,Batley C Brilley,Nippun Kumar A. A.
标识
DOI:10.1109/i4tech55392.2022.9952528
摘要
Various kinds of Information retrieval or processing task can be difficult basis on how the information is viewed or represented. Representation learning is a technique that allows a system to discover the representation of the data which used for detecting features and classifying things. As compared with learning from one representation, learning from different representations make the task easier. For example, learning something from images and their texts or learning from video and their audio produces good results than learning from one source (either image or text)or (video or audio). Learning from multiple modalities helps to achieve the target correctly within short time. This paper tries to design a Multimodal Autoencoder with minimum loss that can produce shared representation of images and its texts
科研通智能强力驱动
Strongly Powered by AbleSci AI