docs: add ask and tell Documentations (#236)

Sohambasu07 · web-flow · commit c663171d3db1 · 2025-09-07T12:15:59.000+05:30
diff --git a/docs/api/index.md b/docs/api/index.md
diff --git a/docs/api/overview.md b/docs/api/overview.md
@@ -0,0 +1,11 @@
+
+This content tree contains the core components and utilities of NePS, designed to simplify and enhance the process of running optimization experiments. Below is an overview of the files and their purposes:
+
+- [`neps.run`](neps/api.md): Provides the built-in NePS runtime to **sample new trials** and **evaluate them automatically**.
+- [`neps.runtime`](neps/runtime.md): Implements the `Worker`, offering functions to create a worker, sample new trials, and evaluate them.
+- `neps.optimizers`:
+    - [`neps.algorithms`](neps/optimizers/algorithms.md): Contains a collection of optimization algorithms, such as random search, ASHA, PriorBand, HyperBand, and more, for sampling new trials.
+    - [`neps.AskAndTell`](neps/optimizers/ask_and_tell.md): An alternative to `neps.run` that allows full control of the evaluation loop. This is useful when you don’t want to use NePS’ runtime but still want to benefit from its optimizers and state management.
+- [`neps.state`](neps/state/neps_state.md): Manages the state of workers, trials, and optimizers, ensuring reproducibility and continuity.
+- [`neps.status`](neps/status/status.md): Provides functions to retrieve the status of a run and export it to CSV files for analysis.
+- [`neps.plot`](neps/plot/plot.md): Includes tools to visualize the results of a neural pipeline search run.
diff --git a/docs/getting_started.md b/docs/getting_started.md
@@ -68,5 +68,8 @@ Understand how to leverage multi-fidelity optimization for efficient model tunin
 * **[Utilizing Expert Priors for Hyperparameters](examples/efficiency/expert_priors_for_hyperparameters.md)**:
 Learn how to incorporate expert priors for more efficient hyperparameter selection.
 
+* **[Benefiting NePS State and Optimizers with custom runtime](examples/experimental/ask_and_tell_example.md)**:
+Learn how to use AskAndTell, an advanced tool for leveraging optimizers and states while enabling a custom runtime for trial execution.
+
 * **[Additional NePS Examples](examples/index.md)**:
 Explore more examples, including various use cases and advanced configurations in NePS.
diff --git a/docs/index.md b/docs/index.md
@@ -17,10 +17,10 @@ In addition to the features offered by traditional HPO and NAS libraries, NePS s
     NePS excels in efficiently tuning hyperparameters using algorithms that enable users to make use of their prior knowledge, while also using many other efficiency boosters.
      - [PriorBand: Practical Hyperparameter Optimization in the Age of Deep Learning (NeurIPS 2023)](https://arxiv.org/abs/2306.12370)
      - [πBO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization (ICLR 2022)](https://arxiv.org/abs/2204.11051) <br /> <br />
-1. **Neural Architecture Search (NAS) with Expressive Search Spaces:** <br />
+2. **Neural Architecture Search (NAS) with Expressive Search Spaces:** <br />
     NePS provides capabilities for designing and optimizing architectures in an expressive and natural fashion.
      - [Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars (NeurIPS 2023)](https://arxiv.org/abs/2211.01842) <br /> <br />
-1. **Zero-effort Parallelization and an Experience Tailored to DL:** <br />
+3. **Zero-effort Parallelization and an Experience Tailored to DL:** <br />
      NePS simplifies the process of parallelizing optimization tasks both on individual computers and in distributed
      computing environments. As NePS is made for deep learners, all technical choices are made with DL in mind and common
      DL tools such as Tensorboard are [embraced](https://automl.github.io/neps/latest/reference/analyse/#visualizing-results).
@@ -101,6 +101,8 @@ Discover how NePS works through these examples:
 
 - **[Utilizing Expert Priors for Hyperparameters](examples/efficiency/expert_priors_for_hyperparameters.md)**: Learn how to incorporate expert priors for more efficient hyperparameter selection.
 
+- **[Benefiting NePS State and Optimizers with custom runtime](examples/experimental/ask_and_tell_example.md)**: Learn how to use AskAndTell, an advanced tool for leveraging optimizers and states while enabling a custom runtime for trial execution.
+
 - **[Additional NePS Examples](examples/index.md)**: Explore more examples, including various use cases and advanced configurations in NePS.
 
 ## Contributing
diff --git a/neps/optimizers/ask_and_tell.py b/neps/optimizers/ask_and_tell.py
@@ -12,21 +12,25 @@
 [`tell()`][neps.optimizers.ask_and_tell.AskAndTell.tell] results back to the optimizer.
 
 ```python
-from neps import AskAndTell
+import neps
 
 # Wrap an optimizer
-my_optimizer = AskAndTell(MyOptimizer(space, ...))
+space = neps.SearchSpace({"a": neps.Float(0, 1), "b": neps.Integer(1, 10)})
+my_optimizer = neps.AskAndTell(optimizer=neps.algorithms.random_search(space))
 
 # Ask for a new configuration
 trial = my_optimizer.ask()
 
 # The things you would normally get into `evaluate_pipeline`
-config_id = trial.config_id
+config_id = trial.metadata.id
 config = trial.config
-previous_config_id = trial.metadata.previous_trial_id
-previous_trial_path = trial.metadata.previous_trial_location
+# other metadata you might want to use:
+# trial.metadata.previous_trial_id, trial.metadata.previous_trial_location
 
 # Evaluate the configuration
+def evaluate(config):
+    # Dummy evaluation function
+    return config["a"] * 2 + config["b"]
 loss = evaluate(config)
 
 # Tell the optimizer the result
diff --git a/neps_examples/experimental/ask_and_tell_example.py b/neps_examples/experimental/ask_and_tell_example.py
@@ -0,0 +1,146 @@
+"""
+# AskAndTell Example: Custom Trial Execution with NePS
+
+This script demonstrates how to use the `AskAndTell` interface from NePS to implement a custom trial execution workflow. 
+The `AskAndTell` interface provides full control over the evaluation loop, allowing you to manage how trials are executed 
+and results are reported back to the optimizer. This is particularly useful when you need to handle trial execution manually.
+
+## Aim of This File
+
+The goal of this script is to run a **successive halving** optimization process with 3 rungs. The first rung will evaluate 
+9 trials in parallel. The trials are managed manually using the `AskAndTell` interface, and the SLURM scheduler is used 
+to execute the trials. This setup demonstrates how to efficiently manage parallel trial execution and integrate NePS 
+with external job schedulers.
+
+## How to Use This Script
+
+1. **Define the Search Space**:
+   The search space is defined using `neps.SearchSpace`.
+
+2. **Initialize the Optimizer**:
+   We use the `successive_halving` algorithm from NePS to optimize the search space. The optimizer is wrapped with 
+   the `AskAndTell` interface to enable manual control of the evaluation loop.
+
+3. **Submit Jobs**:
+   - The `submit_job` function submits a job to the SLURM scheduler using a generated script.
+   - The `get_job_script` function generates a SLURM job script that executes the `train_worker` function for a given trial.
+
+4. **Train Worker**:
+   - The `train_worker` function reads the trial configuration, evaluates a dummy objective function, and writes the 
+     results to a JSON file.
+
+5. **Main Loop**:
+   - The `main` function manages the optimization process:
+     - It launches initial jobs based on the number of parallel trials specified.
+     - It monitors the status of active jobs, retrieves results, and submits new trials as needed.
+     - The loop continues until all trials are completed.
+
+6. **Run the Script**:
+   - Use the command line to run the script:
+     ```bash
+     python ask_and_tell_example.py --parallel 9 --results-dir results
+     ```
+   - `--parallel`: Specifies the number of trials to evaluate in parallel initially.
+   - `--results-dir`: Specifies the directory where results will be saved.
+
+## Key Features Demonstrated
+- Custom trial execution using SLURM.
+- Integration of NePS optimizers with manual control over the evaluation loop.
+- Efficient management of parallel trials and result reporting.
+
+This script serves as a template for implementing custom trial execution workflows with NePS.
+"""
+import argparse
+import time
+from pathlib import Path
+import json
+import neps
+import os
+import subprocess
+import json, sys
+
+from neps.optimizers.ask_and_tell import AskAndTell
+
+def submit_job(pipeline_directory: Path, script: str) -> int:
+    script_path = pipeline_directory / "submit.sh"
+    print(f"Submitting the script {script_path} (see below): \n\n{script}")
+
+    # You may want to remove the below check and not ask before submitting every time
+    script_path.write_text(script)
+    os.system(f"sbatch {script_path}")
+    output = subprocess.check_output(["sbatch", str(script_path)]).decode().strip()
+    job_id = int(output.split()[-1])
+    return job_id
+
+def get_job_script(pipeline_directory, trial_file):
+    script = f"""#!/bin/bash
+    #SBATCH --job-name=mnist_toy
+    #SBATCH --partition=bosch_cpu-cascadelake
+    #SBATCH --output={pipeline_directory}/%j.out
+    #SBATCH --error={pipeline_directory}/%j.err
+    python -c "import neps.neask_andtell_example; ask_andtell_example.train_worker('{trial_file}')"
+    """
+    return script
+
+def train_worker(trial_file):
+    trial_file = Path(trial_file)
+    with open(trial_file) as f:
+        trial = json.load(f)
+
+    config = trial["config"]
+    # Dummy objective
+    loss = (config["a"] - 0.5)**2 + ((config["b"] + 2)**2) / 5
+
+    out_file = trial_file.parent / f"result_{trial['id']}.json"
+    with open(out_file, "w") as f:
+        json.dump({"loss": loss}, f)
+
+def main(parallel: int, results_dir: Path):
+    space = neps.SearchSpace(
+        {"a": neps.Integer(1, 13, is_fidelity=True), "b": neps.Float(1, 5)}
+    )
+    opt = neps.algorithms.successive_halving(space, eta=3)
+    ask_tell = AskAndTell(opt)
+
+    results_dir.mkdir(exist_ok=True, parents=True)
+    active = {}
+
+    # launch initial jobs
+    for _ in range(parallel):
+        trial = ask_tell.ask()
+        if trial is None:
+            break
+        trial_file = results_dir / f"trial_{trial.id}.json"
+        with open(trial_file, "w") as f:
+            json.dump({"id": trial.id, "config": trial.config}, f)
+        job_id = submit_job(results_dir, get_job_script(results_dir, trial_file))
+        active[job_id] = trial
+
+    # monitor loop
+    while active:
+        for job_id, trial in list(active.items()):
+            result_file = results_dir / f"result_{trial.id}.json"
+            if result_file.exists():
+                result = json.load(result_file.open())
+                ask_tell.tell(trial, {"objective_to_minimize": result["loss"]})
+                del active[job_id]
+                new_trial = ask_tell.ask()
+                if new_trial:
+                    new_file = results_dir / f"trial_{new_trial.id}.json"
+                    json.dump({"id": new_trial.id, "config": new_trial.config}, new_file.open("w"))
+                    new_job_id = submit_job(results_dir, get_job_script(results_dir, new_file))
+                    active[new_job_id] = new_trial
+        time.sleep(5)
+
+if __name__ == "__main__":
+    parser = argparse.ArgumentParser()
+    parser.add_argument(
+        "--parallel", type=int, default=9, 
+        help="Number of trials to evaluate in parallel initially"
+    )
+    parser.add_argument(
+        "--results-dir", type=Path, default=Path("results"), 
+        help="Path to save the results inside"
+    )
+    args = parser.parse_args()
+    main(args.parallel, args.results_dir)