高级检索

煤矿事故案例存储与检索研究

Study on storage and retrieval of coal mine accident cases

  • 摘要: 为解决事故案例非结构化、多源异构、难以共享的问题,提高事故案例在应急救援管理中的利用率,利用网络爬虫技术获取由各地监管部门发布在互联网上的大量实时事故案例,通过框架法构建数据结构以表示事故案例蕴含的知识,建立了一个通用、全面、共享的事故案例数据库;在事故案例数据库的基础上,初步提出了一种新的案例检索算法,利用搜索引擎中倒排索引技术实现对案例非结构化数据进行检索,同时结合传统案例相似度计算方式对结构化数据进行匹配,实现利用少量关键信息进行非结构化案例数据的高效筛选,可使系统依据指挥人员意愿结合非结构化数据和结构化数据,进行有侧重、有倾向的案例检索,以中国煤矿安全生产网为例对瓦斯、水灾、火灾事故案例进行自动爬取,实践结果表明,此案例检索流程及算法提高了案例检索的有效性和实用性。

     

    Abstract: In order to solve the problem of unstructured, heterogeneous, and difficult to share accident cases, and to improve the utilization rate of accident cases in emergency rescue management, this paper uses network crawler technology to obtain a large number of real-time accident cases published on the Internet by local regulatory authorities.The framework method constructs a data structure to express the knowledge contained in accident cases, and establishes a universal, comprehensive and shared dynamic database of accident cases; on the basis of accident case database, a new case retrieval algorithm is initially proposed, using the index technology as search engine to retrieve unstructured case data. At the same time, the traditional method of case similarity calculation is used to match structured data, and realize the efficient screening of unstructured case data with a small amount of key information, so that the system can be based on the wishes of the commander combining unstructured data and structured data, a focused and inclination case search was carried out. Taking China Coal Mine Safety Production Network as an example to automatically crawl gas, flood, and fire accident cases. The practical results show that this case retrieval process and algorithm improve the effectiveness and practicability of case retrieval.

     

/

返回文章
返回