dacbench.runner

Module Contents

Functions

run_benchmark(env, agent, num_episodes, logger=None)

Run single benchmark env for a given number of episodes with a given agent

run_dacbench(results_path, agent_method, num_episodes, bench=None, seeds=None)

Run all benchmarks for 10 seeds for a given number of episodes with a given agent and save result

dacbench.runner.current_palette
dacbench.runner.run_benchmark(env, agent, num_episodes, logger=None)

Run single benchmark env for a given number of episodes with a given agent

Parameters
  • env (gym.Env) – Benchmark environment

  • agent – Any agent implementing the methods act, train and end_episode (see AbstractDACBenchAgent below)

  • num_episodes (int) – Number of episodes to run

  • logger (dacbench.logger.Logger: logger to use for logging. Not closed automatically like env) –

dacbench.runner.run_dacbench(results_path, agent_method, num_episodes, bench=None, seeds=None)

Run all benchmarks for 10 seeds for a given number of episodes with a given agent and save result

Parameters
  • bench

  • results_path (str) – Path to where results should be saved

  • agent_method (function) – Method that takes an env as input and returns an agent

  • num_episodes (int) – Number of episodes to run for each benchmark

  • seeds (list[int]) – List of seeds to runs all benchmarks for. If None (default) seeds [1, …, 10] are used.