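Python (programming language)
Web crawler
Crawling
Focused crawler
Internet
Web page
Computer science
Database
Operating system
Web server
Static web page
World Wide Web
Medicine
Anatomy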
Source
Journal: Journal of Physics: Conference Series
[IOP Publishing]
Date: 2021-09-01
Volume (issue): 2033 (1): 012205
Citations: 2
Identifier
DOI:10.1088/1742-6596/2033/1/012205
Abstract
With the development of computer and network technology, we often obtain information through the Internet. However, because network data is massive in volume and complex in format, it is difficult to extract valuable information from it. Research has found that web crawler technology can automatically obtain information from the Internet. This paper takes the crawling of second-hand housing information from Anjuke Xi'an as an example. Following the crawler principle and process, the structure of Anjuke's pages is first analyzed; a web crawler program is then designed and implemented using requests to fetch web pages, lxml to parse them, and SQL Server 2017 to store the data. The program collects and stores housing information for some cities in East China, and the housing price trend is finally analyzed from the collected data in Excel. The results show that the program can automatically obtain housing information from the Internet, providing a data source for later data analysis.
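The fetch, parse, and store pipeline described in the abstract might look roughly like the sketch below. It is not the authors' program: the page markup, the CSS class names, the table schema, and the function names are all hypothetical, since the abstract does not give Anjuke's actual page structure. SQLite (from the Python standard library) stands in for SQL Server 2017 so the example runs without a database server.

```python
import sqlite3

from lxml import html

# Hypothetical markup and made-up values; the real Anjuke page structure
# is not given in the abstract.
SAMPLE_PAGE = """
<html><body>
  <div class="listing"><h3 class="title">Two-bedroom flat</h3><span class="price">120</span></div>
  <div class="listing"><h3 class="title">Studio</h3><span class="price">65.5</span></div>
</body></html>
"""


def fetch_page(url, timeout=10):
    """Download one page with requests and return its HTML text."""
    import requests  # imported here so parsing and storage can run offline

    response = requests.get(
        url, headers={"User-Agent": "Mozilla/5.0"}, timeout=timeout
    )
    response.raise_for_status()  # fail loudly on HTTP errors
    return response.text


def parse_listings(page_source):
    """Extract (title, price) tuples from the page using lxml XPath queries."""
    tree = html.fromstring(page_source)
    listings = []
    for node in tree.xpath('//div[@class="listing"]'):
        title = node.xpath('string(.//h3[@class="title"])').strip()
        price = node.xpath('string(.//span[@class="price"])').strip()
        listings.append((title, float(price)))
    return listings


def store_listings(conn, listings):
    """Insert parsed rows into a table (SQLite here, not SQL Server)."""
    conn.execute("CREATE TABLE IF NOT EXISTS listings (title TEXT, price REAL)")
    conn.executemany(
        "INSERT INTO listings (title, price) VALUES (?, ?)", listings
    )
    conn.commit()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    store_listings(conn, parse_listings(SAMPLE_PAGE))
    print(conn.execute("SELECT title, price FROM listings").fetchall())
```

In a real run, `fetch_page` would supply the HTML in place of `SAMPLE_PAGE`, and the XPath expressions would need to be adapted to the structure of the target pages.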