بدائل البحث:
policy optimization » capacity optimization (توسيع البحث)
agent » agents (توسيع البحث)
يعرض 1 - 8 نتائج من 8 نتيجة بحث عن '(( binary task objective optimization algorithm ) OR ( agent based policy optimization algorithm ))', وقت الاستعلام: 0.10s تنقيح النتائج
  1. 1

    Resources Allocation for Drones Tracking Utilizing Agent-Based Proximity Policy Optimization حسب De Rochechouart, Maxence

    منشور في 2023
    "…In particular, the Proximity Policy Optimization (PPO) reinforcement algorithm is used to discover a policy for sensor selection that results in optimum sensor resource allocation. …"
    احصل على النص الكامل
  2. 2
  3. 3
  4. 4
  5. 5

    Integrated Energy Optimization and Stability Control Using Deep Reinforcement Learning for an All-Wheel-Drive Electric Vehicle حسب Reza Jafari (3494018)

    منشور في 2025
    "…To this end, three model-free DRL-based methods, based on deep deterministic policy gradient (DDPG), twin delayed deep deterministic policy gradient (TD3), and TD3 enhanced with curriculum learning (CL TD3), are developed for determining optimal yaw moment control and energy optimization online. …"
  6. 6
  7. 7
  8. 8

    A comprehensive review of deep reinforcement learning applications from centralized power generation to modern energy internet frameworks حسب Sakib Mahmud (15302404)

    منشور في 2025
    "…We present a structured taxonomy covering value-based, policy-based, actor-critic, model-based, and advanced multi-agent and multi-objective approaches, and link algorithms to tasks such as dispatch, microgrid coordination, real-time pricing, load balancing, and demand–response. …"