A Method for Extracting Unstructured Threat Intelligence Based on Dictionary Template and Reinforcement Learning

计算机科学 人工智能 模棱两可 机器学习 过程(计算) 构造(python库) 程序设计语言 操作系统
作者
Xuren Wang,Rong Chen,Binghua Song,Jian Yang,Zhengwei Jiang,Xiaoqing Zhang,Xiaomeng Li,Shengqin Ao
出处
期刊:Computer Supported Cooperative Work in Design 被引量:2
标识
DOI:10.1109/cscwd49262.2021.9437858
摘要

In recent years, individuals, organizations and countries are all threatened by cyber threats to some degree. The proposal of threat intelligence sharing scheme has greatly helped the protection of cyber security. Traditional threat intelligence sharing scheme mainly collects and analyzes information manually, which include but not limited to Indicators of Compromise (IOC) and forms a machine readable report for Security Operations Center (SOC) to take corresponding action. Therefore, it is challenging and significant to easily and automatically share and exchange cyber threat intelligence (CTI). Aiming at extracting the information of CTI efficiently, we construct a model of automatic information extraction process of the entity recognition and relationship extraction, which are used to extract effective entities and relationships in threat intelligence reports and improve the efficiency of threat intelligence sharing. The specific content and research results include two aspects: (1) Research on threat intelligence entity recognition model. We use the BERT model as a corpus pre-training model based on the classic neural network BiLSTM-CRF, and proposes a model DT-BERT-BiLSTM-CRF based on the dictionary template. The BERT pre-training model makes full use of the contextual semantic information of the corpus and alleviates the problem of ambiguity in the process of threat intelligence entity recognition. By constructing a dictionary template of threat intelligence entities, the accuracy of entity recognition in the threat intelligence field is further improved. (2) Research on the extraction of ITC relations. We constructed the relation extraction data set with distant supervision methods. For alleviating the noise annotation data, we introduce the attention mechanism and reinforcement learning into traditional neural networks, proposing a model NR-RL-PCNN-ATT. Through a new reward mechanism, our model improves the sentence selection quality and the efficiency of relationship extraction.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
荔枝小妹发布了新的文献求助10
2秒前
无敌鱼发布了新的文献求助10
2秒前
wipmzxu完成签到,获得积分10
4秒前
顾矜应助小安采纳,获得10
4秒前
Cindy发布了新的文献求助10
7秒前
怡然谷雪完成签到,获得积分20
7秒前
9秒前
丘比特应助无敌鱼采纳,获得10
9秒前
14秒前
月光入梦完成签到 ,获得积分10
15秒前
17秒前
18秒前
18秒前
潇洒的诗桃应助zyyyy采纳,获得20
19秒前
20秒前
21秒前
科研小白发布了新的文献求助10
21秒前
老汤姆完成签到,获得积分10
22秒前
元羞花发布了新的文献求助10
23秒前
一颗橙发布了新的文献求助10
23秒前
淡定的弘发布了新的文献求助10
25秒前
25秒前
小安发布了新的文献求助10
25秒前
26秒前
27秒前
震动的强炫完成签到,获得积分10
27秒前
细胞顺子发布了新的文献求助10
30秒前
小鹿完成签到,获得积分10
32秒前
大模型应助自然夜天采纳,获得10
32秒前
wwwwwwjh完成签到,获得积分10
33秒前
knowledge完成签到,获得积分10
33秒前
YRRRR完成签到 ,获得积分10
35秒前
马家辉发布了新的文献求助10
40秒前
Axe驳回了Akim应助
42秒前
43秒前
FashionBoy应助容与采纳,获得10
47秒前
shinysparrow应助何不可采纳,获得10
49秒前
老王完成签到,获得积分10
51秒前
姜迟迟完成签到,获得积分10
52秒前
53秒前
高分求助中
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 1000
Corrosion and Oxygen Control 600
Yaws' Handbook of Antoine coefficients for vapor pressure 500
Python Programming for Linguistics and Digital Humanities: Applications for Text-Focused Fields 500
行動データの計算論モデリング 強化学習モデルを例として 500
Johann Gottlieb Fichte: Die späten wissenschaftlichen Vorlesungen / IV,1: ›Transzendentale Logik I (1812)‹ 400
The role of families in providing long term care to the frail and chronically ill elderly living in the community 380
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2555151
求助须知:如何正确求助?哪些是违规求助? 2179579
关于积分的说明 5619816
捐赠科研通 1900692
什么是DOI,文献DOI怎么找? 949363
版权声明 565579
科研通“疑难数据库(出版商)”最低求助积分说明 504712