jupyter-notebook - Jupyter Notebook 内から完全にカスタム OpenAI ジム環境を実行することは可能ですか?

Question

簡単に言えば、カスタムのopenAIジム環境用のPythonコードがいくつか与えられました。コマンドラインから ExperimentGrid を介してコードを正常に実行できますが、スクリプトを呼び出すのではなく、Jupyter ノートブック内から実験全体を実行できるようにしたいと考えています。これは、今後行ういくつかの実験にとってより便利です。

私の質問: Jupyter Notebook 内から完全にカスタムOpenAI ジム環境で実験を実行することは可能ですか? Jupyter からジムの標準環境 (SpaceInvaders-v0 や CartPole-v0 など) を実行している例をたくさん見てきましたが、それでも、彼らは環境を次のように呼び出しています。

env=gym.make('SpaceInvaders-v0')

基本的に、その環境のスクリプトを舞台裏で実行します。

以下は、コマンドラインから実行するようにコードをセットアップする方法と、Jupyter で発生するエラーの基本的な説明です。

アドバイスをいただければ幸いです。私は確かに、Gym、Python、および Linux の初心者です。

私の基本的な環境コードは、たとえば envs/mygames/Custom_Env.py で次のように構成されています。

various import statements (numpy, gym, pyglet, copy)
class Entity()
class State()
class The_Custom_Env(core.Env) # This is the main environment class
class Shell_Class # This class calls The_Custom_Env and provides some arguments

mygames/__ init__.py で、Shell_Class をインポートします。

from gym.envs.mygames.Custom_Env import Shell_Class

envs/__ init__.py で、環境を登録しました

register(
id='TEST-v0',
entry_point='gym.envs.mygames:Shell_Class', 
max_episode_steps=200,
reward_threshold=25.0,)

最後に、このコードを含むスクリプトをコマンドラインから実行すると、実験は問題なく動作します。

from spinup.utils.run_utils import ExperimentGrid
from spinup import ppo_pytorch
import torch

if __name__ == '__main__':
    import argparse
    parser = argparse.ArgumentParser()
    parser.add_argument('--cpu', type=int, default=4)
    parser.add_argument('--num_runs', type=int, default=1)
    args = parser.parse_args()

    eg = ExperimentGrid(name='super-cool-test')
    eg.add('env_name', 'TEST-v0', '', True)
    eg.add('seed', [10*i for i in range(args.num_runs)])
    eg.add('epochs', [10])
    eg.add('steps_per_epoch', 4000)
    eg.add('ac_kwargs:hidden_sizes', [(32, 32)], 'hid')
    eg.add('ac_kwargs:activation', [torch.nn.ReLU], '')
    eg.add('pi_lr', [0.001])
    eg.add('clip_ratio', 0.3)
    eg.run(ppo_pytorch, num_cpu=args.cpu)

私のジュピターの試み

Custom_env.py のすべてのコードをセル #1 に配置しました。次に、セル #2 に環境を登録しました。

gym.register(
id='TEST-v1',
entry_point='__main__:Shell_Class',
max_episode_steps=200,
reward_threshold=25.0,)

この Q/A: Register gym environment that is defined within a jupyter notebook cell に基づいて、セル #3 で環境を作成します。

gym.make('TEST-v1')

この説明のない出力を取得します。

<TimeLimit<Shell_Class< TEST-v1 >>>

セル #4 では、次のように Jupyter 内で直接 ExperimentGrid コードを実行しようとしました。

from spinup.utils.run_utils import ExperimentGrid
from spinup import ppo_pytorch
import torch

num_runs=1
cpu=4
env_name='TEST-v1'
eg = ExperimentGrid(name='Jupyter-test')
eg.add('env_name', env_name, '', True)
eg.add('seed', [10*i for i in range(num_runs)])
eg.add('epochs', 500)
eg.add('steps_per_epoch', 4000)
eg.add('ac_kwargs:hidden_sizes', [(32, 32)], 'hid')
eg.add('ac_kwargs:activation', [torch.nn.ReLU], '')
eg.add('pi_lr', 0.001)
eg.add('clip_ratio', 0.3)
eg.run(ppo_pytorch, num_cpu=cpu)

実験は通常どおり開始されますが、何らかのエラーが発生します。

> ================================================================================
ExperimentGrid [Jupyter-test] runs over parameters:

 env_name                                 [] 

    TEST-v1

 seed                                     [see] 

    0

 epochs                                   [epo] 

    500

 steps_per_epoch                          [ste] 

    4000

 ac_kwargs:hidden_sizes                   [hid] 

    (32, 32)

 ac_kwargs:activation                     [] 

    ReLU

 pi_lr                                    [pi] 

    0.001

 clip_ratio                               [cli] 

    0.3

 Variants, counting seeds:               1
 Variants, not counting seeds:           1

================================================================================

Preparing to run the following experiments...

Jupyter-test_test-v1

================================================================================

Launch delayed to give you a few seconds to review your experiments.

To customize or disable this behavior, change WAIT_BEFORE_LAUNCH in
spinup/user_config.py.

================================================================================
                                                                                
Running experiment:

Jupyter-test_test-v1

with kwargs:

{
    "ac_kwargs":    {
        "activation":   "ReLU",
        "hidden_sizes": [
            32,
            32
        ]
    },
    "clip_ratio":   0.3,
    "env_name": "TEST-v1",
    "epochs":   500,
    "pi_lr":    0.001,
    "seed": 0,
    "steps_per_epoch":  4000
}





================================================================================


There appears to have been an error in your experiment.

Check the traceback above to see what actually went wrong. The 
traceback below, included for completeness (but probably not useful
for diagnosing the error), shows the stack leading up to the 
experiment launch.

================================================================================



---------------------------------------------------------------------------
CalledProcessError                        Traceback (most recent call last)
<ipython-input-14-de843fd528cf> in <module>
     15 eg.add('pi_lr', 0.001)
     16 eg.add('clip_ratio', 0.3)
---> 17 eg.run(ppo_pytorch, num_cpu=cpu)

~/Downloads/spinningup/spinup/utils/run_utils.py in run(self, thunk, num_cpu, data_dir, datestamp)
    544 
    545             call_experiment(exp_name, thunk_, num_cpu=num_cpu, 
--> 546                             data_dir=data_dir, datestamp=datestamp, **var)
    547 
    548 

~/Downloads/spinningup/spinup/utils/run_utils.py in call_experiment(exp_name, thunk, seed, num_cpu, data_dir, datestamp, **kwargs)
    169     cmd = [sys.executable if sys.executable else 'python', entrypoint, encoded_thunk]
    170     try:
--> 171         subprocess.check_call(cmd, env=os.environ)
    172     except CalledProcessError:
    173         err_msg = '\n'*3 + '='*DIV_LINE_WIDTH + '\n' + dedent("""

~/anaconda3/envs/spinningup/lib/python3.6/subprocess.py in check_call(*popenargs, **kwargs)
    309         if cmd is None:
    310             cmd = popenargs[0]
--> 311         raise CalledProcessError(retcode, cmd)
    312     return 0
    313

jupyter-notebook - Jupyter Notebook 内から完全にカスタム OpenAI ジム環境を実行することは可能ですか?

0 に答える 0

Related

Reference