DRL_code.zip
资源来源:本地上传资源
文件类型:ZIP
大小:17.37MB
评分:
5.0
上传者:sinat_39620217
更新日期:2025-02-18

强化学习算法合集(DQN、DDPG、SAC、TD3、MADDPG、QMIX等等)

资源文件列表(大概)

文件名
大小
examples/
-
examples/Baselines/
-
examples/Baselines/GridDispatch_competition/
-
examples/Baselines/GridDispatch_competition/README.md
334B
examples/Baselines/Halite_competition/
-
examples/Baselines/Halite_competition/torch/
-
examples/Baselines/Halite_competition/torch/rl_trainer/
-
examples/Baselines/Halite_competition/torch/rl_trainer/controller.py
20.6KB
examples/DDPG/
-
examples/DDPG/train.py
5.27KB
examples/AlphaZero/
-
examples/AlphaZero/Coach.py
8.8KB
examples/A2C/
-
examples/A2C/actor.py
4.52KB
examples/A2C/atari_model.py
3.17KB
examples/DQN/
-
examples/DQN/README.md
849B
examples/AlphaZero/README.md
1.91KB
examples/A2C/atari_agent.py
4KB
examples/Baselines/GridDispatch_competition/torch/
-
examples/Baselines/GridDispatch_competition/torch/grid_model.py
2.54KB
examples/Baselines/GridDispatch_competition/torch/README.md
1.6KB
examples/AlphaZero/.pic/
-
examples/AlphaZero/.pic/perfect_moves_rate.png
64.44KB
examples/DDPG/mujoco_model.py
2.1KB
examples/DQN_variant/
-
examples/DQN_variant/train.py
6.56KB
examples/CARLA_SAC/
-
examples/CARLA_SAC/carla_agent.py
1.71KB
examples/Baselines/Halite_competition/torch/train.py
8.93KB
examples/CARLA_SAC/train.py
5.4KB
examples/DQN/requirements.txt
43B
examples/CARLA_SAC/evaluate.py
2.62KB
examples/CARLA_SAC/carla_model.py
3.29KB
examples/Baselines/Halite_competition/torch/rl_trainer/obs_parser.py
3.27KB
examples/Baselines/Halite_competition/torch/rl_trainer/agent.py
4.21KB
examples/Baselines/Halite_competition/paddle/
-
examples/Baselines/Halite_competition/paddle/rl_trainer/
-
examples/Baselines/Halite_competition/paddle/rl_trainer/obs_parser.py
3.27KB
examples/ES/
-
examples/ES/train.py
7.53KB
examples/ES/obs_filter.py
6.09KB
examples/IMPALA/
-
examples/IMPALA/atari_model.py
2.85KB
examples/ES/noise.py
955B
examples/MADDPG/
-
examples/MADDPG/README.md
3.16KB
examples/IMPALA/actor.py
3.9KB
examples/IMPALA/README.md
1.84KB
examples/ES/optimizers.py
1.82KB
examples/DDPG/mujoco_agent.py
1.98KB
examples/MADDPG/requirements.txt
56B
examples/AlphaZero/connect4_aiplayer.py
4.72KB
examples/AlphaZero/utils.py
1.8KB
examples/AlphaZero/main.py
2.78KB
examples/Baselines/GridDispatch_competition/paddle/
-
examples/Baselines/GridDispatch_competition/paddle/grid_agent.py
1.85KB
examples/DQN/train.py
4.31KB
examples/Baselines/Halite_competition/paddle/README.md
3.39KB
examples/Baselines/GridDispatch_competition/paddle/grid_model.py
2.55KB
examples/Baselines/Halite_competition/paddle/rl_trainer/utils.py
7.59KB
examples/CQL/
-
examples/CQL/mujoco_agent.py
1.83KB
examples/Baselines/Halite_competition/paddle/rl_trainer/replay_memory.py
3.66KB
examples/Baselines/Halite_competition/paddle/rl_trainer/algorithm.py
5.32KB
examples/Baselines/Halite_competition/torch/encode_model.py
972B
examples/AlphaZero/alphazero_agent.py
3.64KB
examples/CARLA_SAC/env_utils.py
3.87KB
examples/CARLA_SAC/env_config.py
2.72KB
examples/Baselines/Halite_competition/paddle/rl_trainer/model.py
2.25KB
examples/AlphaZero/connect4_game.py
7.87KB
examples/Baselines/Halite_competition/paddle/rl_trainer/controller.py
20.55KB
examples/AlphaZero/connect4_model.py
3.13KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/
-
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/opensim_model.py
6.4KB
examples/IMPALA/train.py
9.43KB
examples/ES/es.py
1.22KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/opensim_agent.py
8.61KB
examples/TD3/
-
examples/TD3/mujoco_agent.py
1.88KB
examples/Baselines/GridDispatch_competition/paddle/train.py
7.05KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/final_submit/
-
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/final_submit/submit_model.py
5.18KB
examples/Baselines/Halite_competition/torch/rl_trainer/policy.py
2.54KB
examples/NeurIPS2019-Learn-to-Move-Challenge/
-
examples/NeurIPS2019-Learn-to-Move-Challenge/env_wrapper.py
16.85KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/final_submit/pelvisBasedObs_scaler.npz
4.22KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/pelvisBasedObs_scaler.npz
4.22KB
examples/Baselines/Halite_competition/torch/rl_trainer/algorithm.py
5.36KB
examples/Baselines/Halite_competition/paddle/rl_trainer/policy.py
2.46KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/final_submit/test.py
3.33KB
examples/NeurIPS2019-Learn-to-Move-Challenge/actor.py
1.86KB
examples/Baselines/Halite_competition/torch/test.ipynb
1.56KB
examples/ES/actor.py
4.37KB
examples/Baselines/Halite_competition/paddle/test.py
1.39KB
examples/NeurIPS2019-Learn-to-Move-Challenge/evaluate.py
11.41KB
examples/NeurIPS2019-Learn-to-Move-Challenge/final_submit/
-
examples/NeurIPS2019-Learn-to-Move-Challenge/final_submit/env_wrapper.py
9.75KB
examples/NeurIPS2019-Learn-to-Move-Challenge/evaluate_args.py
2.46KB
examples/ES/README.md
1.47KB
examples/NeurIPS2019-Learn-to-Move-Challenge/final_submit/submit_model.py
5.54KB
examples/DQN_variant/replay_memory.py
4.09KB
examples/NeurIPS2019-Learn-to-Move-Challenge/final_submit/official_obs_scaler.npz
2.2KB
examples/NeurIPS2019-Learn-to-Move-Challenge/official_obs_scaler.npz
2.2KB
examples/NeurIPS2019-Learn-to-Move-Challenge/final_submit/test.py
2.51KB
examples/Baselines/Halite_competition/torch/README.md
3.39KB
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/
-
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/eval_difficulty2.sh
256B
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/eval_difficulty3_first_target.sh
338B
examples/NeurIPS2019-Learn-to-Move-Challenge/opensim_agent.py
3.51KB
examples/ES/utils.py
2.06KB
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/eval_difficulty1.sh
255B
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/
-
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/
-
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/es_agent.py
2.85KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/
-
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/evaluate.py
2.79KB
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/eval_difficulty3.sh
292B
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/powernet_model.py
2.6KB
examples/PPO/
-
examples/PPO/atari_config.py
2.19KB
examples/NeurIPS2019-Learn-to-Move-Challenge/replay_memory.py
60B
examples/PPO/agent.py
4.43KB
examples/ES/requirements.txt
58B
examples/AlphaZero/actor.py
6.72KB
examples/PPO/mujoco_config.py
2.18KB
examples/Baselines/GridDispatch_competition/paddle/env_wrapper.py
4.52KB
examples/Baselines/Halite_competition/paddle/encode_model.py
974B
examples/Baselines/GridDispatch_competition/torch/env_wrapper.py
4.52KB
examples/tutorials/
-
examples/tutorials/homework/
-
examples/tutorials/homework/lesson4/
-
examples/tutorials/homework/lesson4/policy_gradient_pong/
-
examples/tutorials/homework/lesson4/policy_gradient_pong/model.py
1.08KB
examples/Baselines/Halite_competition/paddle/train.py
8.82KB
examples/tutorials/homework/lesson3/
-
examples/tutorials/homework/lesson3/dqn_mountaincar/
-
examples/tutorials/homework/lesson3/dqn_mountaincar/replay_memory.py
1.64KB
examples/tutorials/parl2_dygraph/
-
examples/tutorials/parl2_dygraph/lesson3/
-
examples/tutorials/parl2_dygraph/lesson3/dqn/
-
examples/tutorials/parl2_dygraph/lesson3/dqn/train.py
4.7KB
examples/tutorials/lesson5/
-
examples/tutorials/lesson5/ddpg/
-
examples/tutorials/lesson5/ddpg/replay_memory.py
1.64KB
examples/tutorials/homework/lesson4/policy_gradient_pong/agent.py
2.87KB
examples/tutorials/lesson1/
-
examples/tutorials/lesson1/gridworld.py
6.62KB
examples/tutorials/homework/lesson5/
-
examples/tutorials/homework/lesson5/ddpg_quadrotor/
-
examples/tutorials/homework/lesson5/ddpg_quadrotor/quadrotor_model.py
1.92KB
examples/Baselines/GridDispatch_competition/paddle/README.md
1.61KB
examples/tutorials/lesson4/
-
examples/tutorials/lesson4/policy_gradient/
-
examples/tutorials/lesson4/policy_gradient/agent.py
2.87KB
examples/CQL/train.py
4.36KB
examples/tutorials/homework/lesson4/policy_gradient_pong/train.py
4.23KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/multi_head_ddpg.py
4.82KB
examples/AlphaZero/requirements.txt
37B
examples/DQN/cartpole_agent.py
3.17KB
examples/A2C/.result/
-
examples/A2C/.result/result_a2c_paddle0.png
193.24KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/replay_memory.py
3.6KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/simulator_server.py
11.88KB
examples/others/
-
examples/others/deepes.py
3.13KB
examples/SAC/
-
examples/SAC/mujoco_model.py
2.55KB
examples/tutorials/homework/lesson2/
-
examples/tutorials/homework/lesson2/q_learning_frozenlake/
-
examples/tutorials/homework/lesson2/q_learning_frozenlake/agent.py
2.73KB
examples/tutorials/lesson2/
-
examples/tutorials/lesson2/q_learning/
-
examples/tutorials/lesson2/q_learning/agent.py
2.73KB
examples/CQL/README.md
1.51KB
examples/Baselines/GridDispatch_competition/torch/train.py
7.04KB
examples/Baselines/Halite_competition/torch/requirements.txt
25B
examples/Baselines/Halite_competition/paddle/rl_trainer/agent.py
4.03KB
examples/Baselines/Halite_competition/torch/rl_trainer/model.py
2.24KB
examples/DDPG/README.md
1.11KB
examples/DQN/cartpole_model.py
1.3KB
examples/Baselines/Halite_competition/paddle/submission.py
99.84KB
examples/A2C/requirements.txt
67B
examples/DDPG/requirements.txt
58B
examples/Baselines/Halite_competition/paddle/test.ipynb
1.46KB
examples/MADDPG/train.py
6.93KB
examples/TD3/requirements.txt
58B
examples/SAC/requirements.txt
58B
examples/CQL/requirements.txt
121B
examples/A2C/README.md
1.4KB
examples/A2C/train.py
7.1KB
examples/Baselines/Halite_competition/torch/config.py
1.35KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/test.py
3.22KB
examples/MADDPG/simple_model.py
3.59KB
examples/QuickStart/
-
examples/QuickStart/cartpole_model.py
1.23KB
examples/IMPALA/atari_agent.py
2.91KB
examples/Baselines/Halite_competition/torch/submission.py
100.1KB
examples/TD3/README.md
1.24KB
examples/QuickStart/cartpole_agent.py
2.27KB
examples/SAC/train.py
5.09KB
examples/MADDPG/simple_agent.py
4.43KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/final_submit/env_wrapper.py
17.21KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/env_wrapper.py
28.33KB
examples/DQN_variant/atari_model.py
3.3KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/final_submit/mlp_model.py
6.49KB
examples/OAC/
-
examples/OAC/requirements.txt
58B
examples/NeurIPS2019-Learn-to-Move-Challenge/README.md
3.2KB
examples/TD3/train.py
5.12KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/es.py
1.57KB
examples/PPO/requirements_mujoco.txt
58B
examples/PPO/env_utils.py
6.95KB
examples/NeurIPS2019-Learn-to-Move-Challenge/train.py
11.9KB
examples/NeurIPS2019-Learn-to-Move-Challenge/final_submit/mlp_model.py
6.46KB
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/train_difficulty1.sh
341B
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/train_difficulty2.sh
320B
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/es.py
1.23KB
examples/PPO/requirements_atari.txt
74B
examples/tutorials/homework/lesson5/ddpg_quadrotor/quadrotor_agent.py
2.65KB
examples/QMIX/
-
examples/QMIX/replay_buffer.py
3.33KB
examples/PPO/mujoco_model.py
1.96KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/README.md
659B
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/
-
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/velocity_distribution.png
27.9KB
examples/tutorials/homework/lesson5/ddpg_quadrotor/train.py
6.11KB
examples/QuickStart/README.md
435B
examples/QuickStart/requirements.txt
43B
examples/tutorials/parl2_dygraph/lesson5/
-
examples/tutorials/parl2_dygraph/lesson5/ddpg/
-
examples/tutorials/parl2_dygraph/lesson5/ddpg/replay_memory.py
1.64KB
examples/tutorials/parl2_dygraph/lesson3/dqn/replay_memory.py
1.64KB
examples/tutorials/parl2_dygraph/lesson3/homework/
-
examples/tutorials/parl2_dygraph/lesson3/homework/dqn_mountaincar/
-
examples/tutorials/parl2_dygraph/lesson3/homework/dqn_mountaincar/replay_memory.py
1.64KB
examples/tutorials/parl2_dygraph/lesson5/homework/
-
examples/tutorials/parl2_dygraph/lesson5/homework/ddpg_quadrotor/
-
examples/tutorials/parl2_dygraph/lesson5/homework/ddpg_quadrotor/quadrotor_model.py
2.13KB
examples/QMIX/qmix_config.py
2.69KB
examples/tutorials/parl2_dygraph/lesson3/dqn/agent.py
2.79KB
examples/tutorials/homework/lesson3/dqn_mountaincar/model.py
1.11KB
examples/tutorials/lesson3/
-
examples/tutorials/lesson3/dqn/
-
examples/tutorials/lesson3/dqn/model.py
1.11KB
examples/tutorials/homework/lesson2/q_learning_frozenlake/train.py
2.56KB
examples/tutorials/parl2_dygraph/lesson3/dqn/model.py
1.3KB
examples/QMIX/rnn_model.py
1.45KB
examples/A2C/a2c_config.py
1.29KB
examples/DQN/cartpole.jpg
110.07KB
examples/tutorials/parl2_dygraph/lesson3/homework/dqn_mountaincar/model.py
1.3KB
examples/tutorials/lesson5/ddpg/env.py
6.33KB
examples/AlphaZero/.pic/good_moves_rate.png
60.06KB
examples/Baselines/Halite_competition/torch/rl_trainer/replay_memory.py
3.6KB
examples/CARLA_SAC/README.md
2.78KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/utils.py
3.25KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/README.md
6.94KB
examples/tutorials/lesson5/ddpg/train.py
4.25KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/evaluate.py
2.79KB
examples/DQN_variant/atari_agent.py
4.11KB
examples/IMPALA/impala_config.py
1.5KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/competition.png
184.81KB
examples/PPO/storage.py
3.09KB
examples/OAC/mujoco_agent.py
1.85KB
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/train_difficulty3_first_target.sh
416B
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/es_agent.py
1.62KB
examples/tutorials/lesson3/dqn/replay_memory.py
1.64KB
examples/Baselines/Halite_competition/paddle/config.py
1.35KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/README.md
718B
examples/QMIX/utils.py
1.66KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/powernet_model.py
2.67KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/curriculum-learning.png
158.38KB
examples/Baselines/GridDispatch_competition/torch/grid_agent.py
1.97KB
examples/CARLA_SAC/.benchmark/
-
examples/CARLA_SAC/.benchmark/Lane_bend.gif
3.19MB
examples/tutorials/parl2_dygraph/README.md
1.38KB
examples/tutorials/parl2_dygraph/lesson5/homework/ddpg_quadrotor/quadrotor_agent.py
2.01KB
examples/tutorials/parl2_dygraph/lesson3/dqn/algorithm.py
2.86KB
examples/tutorials/parl2_dygraph/lesson3/homework/dqn_mountaincar/agent.py
2.79KB
examples/tutorials/lesson4/policy_gradient/algorithm.py
1.7KB
examples/tutorials/lesson4/policy_gradient/model.py
1.04KB
examples/tutorials/homework/lesson3/dqn_mountaincar/train.py
4.72KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/simulator_pb2.py
7.22KB
examples/tutorials/lesson3/dqn/agent.py
3.89KB
examples/Baselines/Halite_competition/torch/rl_trainer/utils.py
7.64KB
examples/tutorials/homework/lesson3/dqn_mountaincar/agent.py
3.89KB
examples/tutorials/homework/lesson2/sarsa_frozenlake/
-
examples/tutorials/homework/lesson2/sarsa_frozenlake/gridworld.py
6.53KB
examples/tutorials/homework/lesson2/q_learning_frozenlake/gridworld.py
6.53KB
examples/DQN_variant/.benchmark/
-
examples/DQN_variant/.benchmark/Dueling DQN.png
218.21KB
examples/tutorials/lesson2/sarsa/
-
examples/tutorials/lesson2/sarsa/gridworld.py
6.53KB
examples/tutorials/requirements.txt
126B
examples/tutorials/lesson2/sarsa/train.py
2.95KB
examples/tutorials/lesson2/q_learning/gridworld.py
6.53KB
examples/tutorials/lesson5/ddpg/model.py
1.73KB
examples/SAC/mujoco_agent.py
1.83KB
examples/tutorials/lesson3/dqn/train.py
4.82KB
examples/IMPALA/requirements.txt
74B
examples/DQN_variant/requirements.txt
79B
examples/TD3/mujoco_model.py
2.54KB
examples/Baselines/Halite_competition/torch/test.py
1.44KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/args.py
3.51KB
examples/tutorials/parl2_dygraph/lesson4/
-
examples/tutorials/parl2_dygraph/lesson4/policy_gradient/
-
examples/tutorials/parl2_dygraph/lesson4/policy_gradient/agent.py
1.8KB
examples/tutorials/parl2_dygraph/lesson4/homework/
-
examples/tutorials/parl2_dygraph/lesson4/homework/policy_gradient_pong/
-
examples/tutorials/parl2_dygraph/lesson4/homework/policy_gradient_pong/agent.py
1.8KB
examples/tutorials/parl2_dygraph/lesson3/homework/dqn_mountaincar/train.py
4.67KB
examples/OAC/README.md
1.04KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/utils.py
13.97KB
examples/tutorials/parl2_dygraph/lesson4/homework/policy_gradient_pong/train.py
4.29KB
examples/tutorials/parl2_dygraph/lesson4/policy_gradient/algorithm.py
1.94KB
examples/tutorials/parl2_dygraph/lesson4/homework/policy_gradient_pong/model.py
1.35KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/utils.py
2.59KB
examples/QMIX/qmix_agent.py
5.35KB
examples/OAC/mujoco_model.py
2.55KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/README.md
700B
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/demo.gif
4.58MB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/fastest.png
270.84KB
examples/PPO/atari_model.py
2.03KB
examples/PPO/README.md
2.48KB
examples/Baselines/Halite_competition/paddle/requirements.txt
32B
examples/tutorials/lesson5/ddpg/algorithm.py
3.46KB
examples/tutorials/lesson5/ddpg/agent.py
2.67KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/simulator_pb2_grpc.py
1.93KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/simulator_client.py
4.25KB
examples/tutorials/homework/lesson2/sarsa_frozenlake/train.py
2.67KB
examples/QMIX/qmixer_model.py
3.06KB
examples/QMIX/train.py
6.51KB
examples/tutorials/lesson4/policy_gradient/train.py
3.66KB
examples/CQL/mujoco_model.py
2.78KB
examples/tutorials/parl2_dygraph/requirements.txt
130B
examples/SAC/README.md
1.24KB
examples/NeurIPS2019-Learn-to-Move-Challenge/train_args.py
2.73KB
examples/DQN_variant/README.md
2.65KB
examples/QMIX/README.md
1.31KB
examples/QMIX/requirements.txt
37B
examples/QMIX/env_wrapper.py
3.11KB
examples/QuickStart/train.py
3.83KB
examples/AlphaZero/MCTS.py
5.83KB
examples/tutorials/parl2_dygraph/lesson5/ddpg/train.py
4.21KB
examples/tutorials/lesson3/dqn/algorithm.py
3.02KB
examples/tutorials/parl2_dygraph/lesson5/homework/ddpg_quadrotor/train.py
6.06KB
examples/tutorials/parl2_dygraph/lesson5/ddpg/agent.py
2.31KB
examples/NeurIPS2019-Learn-to-Move-Challenge/opensim_model.py
5.81KB
examples/tutorials/parl2_dygraph/lesson4/policy_gradient/model.py
1.26KB
examples/tutorials/parl2_dygraph/lesson4/policy_gradient/train.py
3.65KB
examples/tutorials/lesson2/q_learning/train.py
2.85KB
examples/NeurIPS2019-Learn-to-Move-Challenge/scripts/train_difficulty3.sh
416B
examples/tutorials/README.md
1.74KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track1/agent.py
19.08KB
examples/tutorials/homework/lesson2/sarsa_frozenlake/agent.py
2.77KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/track2/agent.py
12.96KB
examples/tutorials/lesson2/sarsa/agent.py
2.77KB
examples/PPO/train.py
5.99KB
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/images/
-
examples/NeurIPS2020-Learning-to-Run-a-Power-Network-Challenge/images/l2rpn.jpeg
69.44KB
examples/OAC/train.py
5.45KB
examples/Baselines/Halite_competition/paddle/model/
-
examples/Baselines/Halite_competition/paddle/model/latest_ship_model.pth
325.14KB
examples/AlphaZero/Arena.py
3.24KB
examples/QuickStart/performance.gif
237.51KB
examples/NeurIPS2019-Learn-to-Move-Challenge/image/
-
examples/NeurIPS2019-Learn-to-Move-Challenge/image/performance.gif
782.27KB
examples/CARLA_SAC/.benchmark/carla_sac.png
141.86KB
examples/A2C/.result/result_a2c_paddle1.png
203.23KB
examples/Baselines/Halite_competition/torch/model/
-
examples/Baselines/Halite_competition/torch/model/latest_ship_model.pth
338.03KB
examples/NeurIPS2018-AI-for-Prosthetics-Challenge/image/last course.png
360.06KB
examples/QMIX/images/
-
examples/QMIX/images/paddle2.0_qmix_result.png
97.1KB
examples/tutorials/parl2_dygraph/lesson5/ddpg/algorithm.py
3.69KB
examples/tutorials/parl2_dygraph/lesson5/ddpg/env.py
6.33KB
examples/CARLA_SAC/model.ckpt
4.63MB
examples/ES/mujoco_agent.py
2.77KB
examples/tutorials/parl2_dygraph/lesson5/ddpg/model.py
1.94KB
examples/ES/es_config.py
1.2KB
examples/ES/mujoco_model.py
1.93KB

资源内容介绍

强化学习算法合集(DQN、DDPG、SAC、TD3、MADDPG、QMIX等等)内涵20+强化学习经典算法代码。对应使用教程什么的参考博客:多智能体(前沿算法+原理)https://blog.csdn.net/sinat_39620217/article/details/115299073?spm=1001.2014.3001.5502强化学习基础篇(单智能体算法)https://blog.csdn.net/sinat_39620217/category_10940146.html

用户评论 (0)

相关资源

No module named 'tensorflow.examples'

No module named 'tensorflow.examples'

946.46KB26金币

图数据集(图结构数据集)+图神经网络-深度学习

这是总结的深度学习中常用的11个图数据集。1. 近年来,深度学习越来越关注图方向的任务,通过利用图神经网络去挖掘现实中各种可以利用图来表示事物(社交网络,论文引用网络,分子结构)等等,来学习更好的表示,去实现下游任务。2. 图数据是由一些点和一些线构成的,能表示一些实体之间的关系,图中的点就是实体,线就是实体间的关系。如下图,v就是顶点,e是边,u是整张图。attrinbutes(feature)是信息的意思,每个点、每条边、每个图都是有信息的。3. 图数据集对于图任务的科研是必备的。深度学习中常用的图数据集:Cora、Citeseer(Cite)、Pubmed、DBLP、ACM、AMAP、AMAC、Corafull、WIKI、BAT、EAT、UAT。

31.04MB22金币

基于遗传算法的CVRP建模求解-Python代码

https://blog.csdn.net/qq_43276566/article/details/129402447基于遗传算法的CVRP建模求解-Python代码

480.07KB27金币

强化学习方法合集11.zip

本文件夹包含强化学习方法的python案例代码,Markov文件夹里是马尔科夫环境的编写-鸟儿找伴。Markov-D

3.33MB20金币

鸢尾花数据集(xlsx格式)

这是我进行KNN分类时所用的资源,训练集为105个数据,测试集为45个数据;前4列为特征,第5列为类别,"setosa"视为1,"versicolor"视为2,"virginica"视为3;测试集顺序略有调整。资源免费。

16.09KB30金币

适用于Yolo训练和测试的coco数据集标签文件 train2017.txt和val2017.txt

适用于Yolo训练和测试的coco数据集标签文件 train2017.txt和val2017.txt和train2017.zip和val2017.zipyolov7 yolov5 yolov8

22.05MB26金币

全新的SOTA模型YOLOv9

当今的深度学习方法专注于如何设计最合适的目标函数,以使模型的预测结果尽可能接近真实值。同时,还需要设计一个合适的架构,以便获取足够的信息进行预测。现有方法忽略了一个事实:当输入数据经过逐层的特征提取和空间变换时,会丢失大量信息。本文将深入探讨数据在深度网络中传输时数据丢失的重要问题,即信息瓶颈和可逆函数问题。我们提出了可编程梯度信息(PGI)的概念,以应对深度网络实现多个目标所需的各种变化。PGI能够为目标任务提供完整的输入信息,以计算目标函数,从而获得可靠的梯度信息以更新网络权重。此外,我们还设计了一种基于梯度路径规划的新型轻量级网络架构——广义高效层聚合网络(GELAN)。GELAN的架构证明了PGI在轻量级模型上获得了优越的结果。我们在基于MS COCO数据集的目标检测上验证了所提出的GELAN和PGI。结果显示,GELAN仅使用传统的卷积运算符就实现了比基于深度卷积的最新方法更好的参数利用率。PGI可用于从轻量级到大型的各种模型,它可以获取完整信息,使得从零开始训练的模型比使用大型数据集预训练的最新模型获得更好的结果,比较结果如图1所示。

1.46MB24金币

Halcon车牌识别源码

Halcon车牌识别源码

30.59MB30金币

TA-Lib的whl文件

由于近期的TA_Lib的资源网站https://www.lfd.uci.edu/~gohlke/pythonlibs/无法找到相应的文件,整理了从python3.7到3.11的TA_Lib文件

4.13MB26金币

LIDC-IDRI数据集

img和对应的mask:png格式 13218张图片,大小64*64(只包含肺结节区域)LIDC-IDRI原数据集处理太麻烦了,下载感觉都比较费劲;(github有个代码可以处理获得比较精确的mask,但是本身LIDC-IDRI原数据集就很大,下载感觉有些费事,所以也就没去尝试)LUNA16数据集我根据网上DSB tutorial以及参考这个的处理方法最终得到的图片训练得到的dice才为0.3左右,而且LUNA16数据集的mask都是圆形(我觉得这样用于语义分割不太准确吧,毕竟是像素级分类)找了很久,忘记在哪里下载的了,找到这个数据集,使用U-net训练,得到的dice是0.8816. 如果有大佬可以分享一下512*512的图片,不胜感激。侵,则删~

78.67MB13金币

mamba、causal-conv1d安装.whl文件

用于配置Mamba环境,安装mamba依赖。

152.7MB22金币

Informer模型实战案例(代码+数据集+参数讲解)

本篇博客带大家看的是Informer模型进行时间序列预测的实战案例,它是在2019年被提出并在ICLR 2020上被评为Best Paper,可以说Informer模型在当今的时间序列预测方面还是十分可靠的,Informer模型的实质是注意力机制+Transformer模型,Informer模型的核心思想是将输入序列进行自注意力机制的处理,以捕捉序列中的长期依赖关系,并利用Transformer的编码器-解码器结构进行预测,通过阅读本文你可以学会利用个人数据集训练模型。Informer是一种用于长序列时间序列预测的Transformer模型,但是它与传统的Transformer模型又有些不同点,与传统的Transformer模型相比,Informer具有以下几个独特的特点:1. ProbSparse自注意力机制:Informer引入了ProbSparse自注意力机制,该机制在时间复杂度和内存使用方面达到了O(Llog L)的水平,能够有效地捕捉序列之间的长期依赖关系。2. 自注意力蒸馏:通过减少级联层的输入,自注意力蒸馏技术可以有效处理极长的输入序列,提高了模型处理长序列的能力

115.86MB16金币