Volume 1 Issue 3
May 2021
Fangwei NING, Yan SHI, Yishu CAI, Weiqing XU. Research and application progress of data mining technology in electric power system[J]. Journal of Advanced Manufacturing Science and Technology, 2021, 1(3): 2021007. doi: 10.51393/j.jamst.2021007

Research and application progress of data mining technology in electric power system

doi: 10.51393/j.jamst.2021007
  • Received Date: 2021-04-25
  • Revision Received Date: 2021-05-10
  • Available Online: 2021-06-22
  • Publish Date: 2021-05-19
  • With the rapid development of computer technology and the growing use of intelligent technologies in electric power engineering, the volume of data has increased exponentially. Data mining can uncover information hidden in these large data sets and transform it into useful knowledge that advances electric power technology. To survey the research and application progress of data mining in electric power engineering, this paper introduces several major data mining algorithms: the ANN (Artificial Neural Network), SVM (Support Vector Machine), decision tree, K-means, NBC (Naive Bayesian Classification), and Apriori algorithms. It then explains in detail how data mining is applied in this field to prediction, classification, clustering, and association rule analysis, with examples including electricity price prediction, power load forecasting, fault type identification, system state classification, generation-side association rules, and association analysis of power grid operation data. Finally, the paper summarizes the use of this technology in electric power engineering and offers an outlook on its future development.





