نتائج البحث - (( algorithm reward function ) OR ( algorithm spc function )) :: Library Catalog

بدائل البحث:
algorithm reward » algorithm lennard (توسيع البحث), algorithm towards (توسيع البحث), algorithm reduced (توسيع البحث)
reward function » related functions (توسيع البحث)
algorithm spc » algorithm etc (توسيع البحث), algorithm pca (توسيع البحث), algorithm seu (توسيع البحث)
spc function » gpcr function (توسيع البحث), fc function (توسيع البحث), _ function (توسيع البحث)

1

The average cumulative reward of algorithms. حسب Jianbin Zheng (587000)

منشور في 2025
"…The algorithm employs recurrent neural networks to capture and process historical information. …"

أضف إلى المفضلة

محفوظ في:
2

Reward function related parameters. حسب Honglei Pang (22693724)

منشور في 2025
الموضوعات:

أضف إلى المفضلة

محفوظ في:
3

Research efforts in designing reward function for the AD problem with different criteria. حسب Nesma M. Ashraf (10954037)

منشور في 2021

أضف إلى المفضلة

محفوظ في:
4

Comparative validation of TIS indicator in reward functions. حسب Yulin Tian (1457986)

منشور في 2025
الموضوعات:

أضف إلى المفضلة

محفوظ في:
5

Initial values of the reward shaping function components as individual’s chromosome. حسب Larasmoyo Nugroho (18078260)

منشور في 2024

أضف إلى المفضلة

محفوظ في:
6

Framework of the proposed signal control algorithm. حسب Hyosun Lee (1567246)

منشور في 2022

أضف إلى المفضلة

محفوظ في:
7

Reward function weight combinations. حسب Bosen Zeng (22404042)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
8

Comparative validation of overflow state feedback indicator in reward functions. حسب Yulin Tian (1457986)

منشور في 2025
الموضوعات:

أضف إلى المفضلة

محفوظ في:
9

Reward curve of DE-MADDPG algorithm in circular shaft-hole and square shaft-hole assembly. حسب Guohua Cao (697580)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
10

Results of proposed algorithms and algorithms in [9] for 18 instances. حسب Jin Zhang (53297)

منشور في 2023

أضف إلى المفضلة

محفوظ في:
11

Sound version of the SF transfer algorithm’s policy after learning for 25 episodes in scale task 2. حسب Lucas Lehnert (9525615)

منشور في 2020

أضف إلى المفضلة

محفوظ في:
12

Flowchart of step() function. حسب Raed Alharthi (18340157)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
13

Pseudo-code of DMDDPG algorithm. حسب Guohua Cao (697580)

منشور في 2025
"…First, we analyze the stages of hole-seeking, alignment, and insertion in the shaft-hole assembly process, based on a comprehensive study of the interactions between shafts and holes. Next, a reward function is designed by integrating the decoupled multi-agent deterministic deep deterministic policy gradient (DMDDPG) algorithm. …"

أضف إلى المفضلة

محفوظ في:
14

Comparison of assembly results of three algorithms. حسب Guohua Cao (697580)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
15

Flowchart of DQN algorithm. حسب Jiandong Qiu (20389944)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
16

Comparison of different algorithms. حسب Jiandong Qiu (20389944)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
17

Reward outcomes for each combination of task cue and policy in the experiment. حسب Sam Hall-McMaster (10343795)

منشور في 2025

أضف إلى المفضلة

محفوظ في:
18

A more detailed interaction between DDPG controller agent–GA-searched reward shaping function–Environment. حسب Larasmoyo Nugroho (18078260)

منشور في 2024

أضف إلى المفضلة

محفوظ في:
19

Route for bays29 output by ABSQL algorithm. حسب Jin Zhang (53297)

منشور في 2023
"…DSRABSQL builds upon the Q-learning (QL) algorithm. Considering its problems of slow convergence and low accuracy, four strategies within the QL framework are designed first: the weighting function-based reward matrix, the power function-based initial Q-table, a self-adaptive <i>ε-beam</i> search strategy, and a new Q-value update formula. …"

أضف إلى المفضلة

محفوظ في:
20

Simulation parameters for the QL, ABSQL and DSRABSQL algorithms. حسب Jin Zhang (53297)

منشور في 2023

أضف إلى المفضلة

محفوظ في:

1
2
3
4
5
6
7
8
9
10
11
التالي
[14]