基于并行化K-means的综合能源服务客户识别
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TM73

基金项目:

国家电网有限公司科技项目“能源互联网环境下的多源互联配电网及多样化用电方式的需求策略系统研究”


Implementation of integrated energy service for customer identification based on parallel K-means clustering
Author:
Affiliation:

Fund Project:

Research on the Demand Strategy System of Multi?source Inter?connection Power Distribution Network and Diversified Power Consumption Modes in the Energy Internet

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着电力体制改革的不断深入以及大数据技术的发展,传统的供电公司和综合能源服务企业急需改善现有的粗放型营销模式,实现不同用户需求的快速响应。针对综合能源服务潜在客户的精准识别问题,文中通过对综合能源服务潜在客户的标签进行分析,基于Spark内存计算平台提出了一种改进的并行化K-means聚类算法。首先,对聚类过程中初始聚类中心的选取和样本影响因素的权值进行改进;其次,基于优化后的权值对客户数据集进行聚类分析,对综合能源服务潜在客户进行识别;最后,采集综合能源服务企业的近期交易数据,在多节点的物理机上进行实验与分析。结果表明改进后的聚类算法更准确。在执行效率上,并发度高的算法执行效率优于单线程的算法具有较好的并行能力。

    Abstract:

    With the deepening reform of electric power enterprise and the development of big data technology, traditional power supply companies and integrated energy service enterprises have to change the present extensive marketing mode for offering rapid response to consumers' requirement. In order to improve the accurate identification of potential customers in integrated energy services, this paper marks the tags of potential customers, and proposes an improved parallel K-means clustering algorithm based on spark memory computing platform. Firstly, the selection of initial cluster center and the evaluation of sample influencing factors are improved. Secondly, based on the optimized weight of factors, cluster analysis is carried out on the data setting to identify the potential customers of integrated energy services. Finally, the recent transaction data of integrated energy service enterprises are collected, and the experimental results are carried out on a multi-node physical machine. The results show that the accuracy of improved K-means clustering model is boosted. In terms of executive effectiveness, the algorithm with high concurrency has better parallel ability than that with single thread.

    参考文献
    相似文献
    引证文献
引用本文

沈子垚,袁晓玲.基于并行化K-means的综合能源服务客户识别[J].电力工程技术,2021,40(2):107-113

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2020-09-05
  • 最后修改日期:2020-10-19
  • 录用日期:2020-08-18
  • 在线发布日期: 2021-04-02
  • 出版日期: 2021-03-28