Check out Upstream on-demand 👉 Watch now!

ddpg
Release 0.2.0

Tensorflow implimentation of the DDPG algorithm

Homepage PyPI Python

Keywords: deep, deterministic, policy, gradient, ddpg, machine, learning
License: MIT
Install: pip install ddpg==0.2.0

Documentation

DDPG

Implimentation of DDPG algorithm which is installable with pip.

The original DDPG algorithm was proposed in the paper: Continuous Control with Deep Reinforcement Learning

http://arxiv.org/abs/1509.02971

It is still a problem to implement Batch Normalization on the critic network. However the actor network works well with Batch Normalization.

Installation

Install the package

pip3 install ddpg

Getting Started

The DDPG algorithm acts on environments which follow the openai-gym api.

# Create a test environment with gym
env = gym.make('MountainCarContinuous-v0')

Train the DDPG agent:

from ddpg import DDPG

# Create a new agent
agent = DDPG(env)

# Train the agent
agent.train()

# Save the weights
agent.model_save()

Dependencies

Python3
Tensorflow 1.1
NumPy
Matplotlib

Some Evaluations

1 InvertedPendulum

2 InvertedDoublePendulum

3 Hopper unsolved

Reference

1 https://github.com/rllab/rllab

2 https://github.com/MOCR/DDPG

3 https://github.com/SimonRamstedt/ddpg

Dependencies: 3
Dependent packages: 0
Dependent repositories: 0
Total releases: 2
Latest release: Jul 19, 2017
First release: Jul 19, 2017
Stars: 1
Forks: 0
Watchers: 1
Contributors: 0
Repository size: 187 KB
SourceRank: 6

Source repo 2FA enabled: TEXT!
Package manager 2FA enabled: TEXT!
Is security responsive: TEXT!
Dependencies are managed: TEXT!
Issue-free release available: TEXT!
Succession plan available: TEXT!
Package manager 2FA enabled: TEXT!

Releases

0.2.0: Jul 19, 2017
0.1: Jul 19, 2017

Something wrong with this page? Make a suggestion

Export .ABOUT file for this package

Last synced: 2021-02-13 22:33:40 UTC

Login to resync this project