برنامه‌ریزی بهره‌برداری ریزشبکه‌ها مبتنی بر الگوریتم یادگیری تقویتی عمیق Journal Article

Writer: ناطقی، علیرضا ؛ زارع، حسن ؛ اسمعیلی، سعید ؛ اصغرپورعلمداری، حسین ؛

مهندسی و مدیریت انرژی تابستان 1401، سال یازدهم - شماره 2 Ranking Science-Research (Ministry of Science/ISC (‎10 page(s) - From 2 to 11 )

Keywords: گرادیان استراتژی قطعی عمیق فرایند تصمیم‌گیری مارکوف برنامه‌ریزی بهره‌برداری ریزشبکه microgrid Deep deterministic policy gradient Markov decision process Operational scheduling

fa en

Abstract:

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented to minimize total operational costs by learning the optimal strategy for operation scheduling of MG systems. This model-free algorithm deploys an actor-critic architecture which can not only model the continuous state and action spaces properly but also overcome the curse of dimensionality. In order to evaluate the efficiency of the proposed algorithm, the results were compared with the analytical method and a Q-based learning algorithm which demonstrates the capability of the DDPG method from the aspects of convergence, running time, and total costs.