You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The eval_tasks module provides a high-level evaluation interface that standardizes the process of evaluating trained spiking RNN models across different cognitive tasks. The system is designed to be fully extensible, automatically supporting any task registered with the ``SpikingTaskFactory``.
27
+
The eval_tasks module provides a high-level evaluation interface that standardizes the process of evaluating trained spiking RNN models across different cognitive tasks. The system is designed to be fully extensible, automatically supporting any task registered with the ``SpikingEvaluatorFactory``.
28
28
29
29
**Key Features:**
30
30
@@ -35,13 +35,10 @@ The eval_tasks module provides a high-level evaluation interface that standardiz
35
35
* **Robust Error Handling**: Graceful handling of evaluation failures
36
36
* **Flexible Visualization**: Generic visualization system for any task type
37
37
38
-
**Evaluation Layers:**
38
+
**Evaluation:**
39
39
40
-
The framework provides three levels of evaluation:
41
-
42
-
1. **Core Task Methods**: Direct task evaluation (``task.evaluate_performance()``)
@@ -141,21 +145,4 @@ The system automatically loads trained rate RNN models from `.mat` files and ext
141
145
142
146
* Network weights and connectivity matrices
143
147
* Optimal scaling factors for rate-to-spike conversion
144
-
* Task-specific parameters and configurations
145
-
146
-
**Generic Visualization:**
147
-
148
-
The visualization system uses each task's ``get_sample_trial_types()`` method to determine what trial types to generate for plotting. This allows custom tasks to specify their own visualization patterns without modifying the evaluation code.
149
-
150
-
**Error Handling:**
151
-
152
-
The evaluation system includes comprehensive error handling:
153
-
154
-
* Graceful handling of missing model files
155
-
* Validation of task names against factory registry
156
-
* Recovery from trial generation failures
157
-
* Informative error messages for debugging
158
-
159
-
**Extensibility:**
160
-
161
-
The system is designed to be fully extensible. Any task that inherits from ``AbstractSpikingTask`` and is registered with ``SpikingTaskFactory`` can be evaluated using this unified interface.
0 commit comments