Life as a maintainer after the xz utils backdoor hack 👉 Watch now!

active-pre-train-ppg
Release 0.0.8

Unsupervised pre-training with PPG

Keywords: deep-reinforcement-learning, reinforcement-learning, self-supervised-learning, unsupervised-learning, unsupervised-reinforcement-learning
License: MIT
Install: pip install active-pre-train-ppg==0.0.8

Documentation

Unsupervised On-Policy Reinforcement Learning

This work combines Active Pre-Training with an On-Policy algorithm, Phasic Policy Gradient.

Active Pre-Training

Is used to pre-train a model free algorithm before defining a downstream task. It calculates the reward based on an estimatie of the particle based entropy of states. This reduces the training time if you want to define various tasks - i.e. robots for a warehouse.

Phasic Policy Gradient

Improved Version of Proximal Policy Optimization, which uses auxiliary epochs to train shared representations between the policy and a value network.

Dependencies: 10
Dependent packages: 0
Dependent repositories: 0
Total releases: 8
Latest release: Mar 10, 2022
First release: Mar 10, 2022
Stars: 0
Forks: 0
Watchers: 1
Contributors: 1
Repository size: 345 KB
SourceRank: 7

Source repo 2FA enabled: TEXT!
Package manager 2FA enabled: TEXT!
Is security responsive: TEXT!
Dependencies are managed: TEXT!
Issue-free release available: TEXT!
Succession plan available: TEXT!
Package manager 2FA enabled: TEXT!

Releases

0.0.8: Mar 10, 2022
0.0.7: Mar 10, 2022
0.0.6: Mar 10, 2022
0.0.5: Mar 10, 2022
0.0.4: Mar 10, 2022
0.0.3: Mar 10, 2022
0.0.2: Mar 10, 2022
0.0.1: Mar 10, 2022

Contributors

See all contributors

Something wrong with this page? Make a suggestion

Export .ABOUT file for this package

Last synced: 2022-03-10 10:25:33 UTC

active-pre-train-ppg
Release 0.0.8

Release 0.0.8

0.0.8

0.0.7

0.0.6

0.0.5

0.0.4

0.0.3

0.0.2

0.0.1

Documentation

Unsupervised On-Policy Reinforcement Learning

Active Pre-Training

Phasic Policy Gradient

Stats

Development practices

Releases

Contributors

active-pre-train-ppg Release 0.0.8

Release 0.0.8 Toggle Dropdown 0.0.8 0.0.7 0.0.6 0.0.5 0.0.4 0.0.3 0.0.2 0.0.1

Documentation

Unsupervised On-Policy Reinforcement Learning

Active Pre-Training

Phasic Policy Gradient

Stats

Development practices

Releases

Contributors

active-pre-train-ppg
Release 0.0.8

Release 0.0.8

0.0.8

0.0.7

0.0.6

0.0.5

0.0.4

0.0.3

0.0.2

0.0.1