陆晓松, 王国庆, 李勖之, 等. 场地环境大数据采集和机器学习方法在污染智能识别中的应用研究进展[J]. 生态与农村环境学报, 2022, 38(9): 1101-1111. DOI: 10.19741/j.issn.1673-4831.2021.0668
    引用本文: 陆晓松, 王国庆, 李勖之, 等. 场地环境大数据采集和机器学习方法在污染智能识别中的应用研究进展[J]. 生态与农村环境学报, 2022, 38(9): 1101-1111. DOI: 10.19741/j.issn.1673-4831.2021.0668
    LU Xiao-song, WANG Guo-qing, LI Xu-zhi, et al. Research Progress of Big Data of Site Environment Acquisition and Machine Learning Method in Pollution Intelligent Identification[J]. Journal of Ecology and Rural Environment, 2022, 38(9): 1101-1111. DOI: 10.19741/j.issn.1673-4831.2021.0668
    Citation: LU Xiao-song, WANG Guo-qing, LI Xu-zhi, et al. Research Progress of Big Data of Site Environment Acquisition and Machine Learning Method in Pollution Intelligent Identification[J]. Journal of Ecology and Rural Environment, 2022, 38(9): 1101-1111. DOI: 10.19741/j.issn.1673-4831.2021.0668

    场地环境大数据采集和机器学习方法在污染智能识别中的应用研究进展

    Research Progress of Big Data of Site Environment Acquisition and Machine Learning Method in Pollution Intelligent Identification

    • 摘要: 由于大数据技术的快速发展,用于分析挖掘场地污染特征和成因机制的数据量和类型也大幅增加,传统的场地环境数据获取、清洗和挖掘方法难以满足大数据的存储和处理要求。近年来,采用机器学习算法对场地多源异构数据进行挖掘,实现地块尺度、区域尺度的污染识别已成为研究热点。系统综述了场地污染智能识别大数据的获取、处理和挖掘方面的现状和不足,提出了利用5G和互联网、终端信息采集、网络爬虫、自然语言处理方法获取场地环境数据的应用对策。针对场地多源数据集成和融合的关键技术措施以及未来我国场地污染智能识别模式进行展望。

       

      Abstract: Due to the rapid development of big data technology, the amount and type of data used to analyze and mine site pollution characteristics and formation mechanisms have also increased significantly. The traditional methods of acquiring, cleaning and mining site environmental data are difficult to meet the requirements of big data storage and processing. In recent years, it has become a research hotspot to use machine learning algorithms to mine multi-source heterogeneous site data to realize pollution identification at site and regional scales. The status and deficiencies of big data acquisition, processing and mining on intelligent identification of site pollution are systematically reviewed. Then, the application countermeasures to obtain site environmental data using 5G and the Internet, terminal information collection, web crawler, and natural language processing methods are proposed. Finally the key technologies of site multi-source site data integration and fusion and the future intelligent identification mode of site pollution in China are prospected.

       

    /

    返回文章
    返回