User Configuration Parameters
AISBench Benchmark supports customizing the inference mode and evaluation process through two methods: Command Line Interface (CLI) Parameters and Configuration Constant File.
Command Line Parameters
The basic calling format for command line parameters [OPTIONS] is as follows:
ais_bench [OPTIONS]
Parameter Description
Based on the execution scenario, command line parameters are divided into three categories:
Common Parameters
Accuracy Evaluation Parameters (effective only when
--modeis set toall,infer,eval, orviz)Performance Evaluation Parameters (effective only when
--modeis set toperforperf_viz)
Accuracy Evaluation Parameters take effect only when the --mode parameter is specified as "all", "infer", "eval", "viz". Performance Evaluation Parameters take effect only when the --mode parameter is specified as "perf", "perf_viz". Common Parameters are not restricted by the task execution mode and can be specified in all modes.
### Common Parameters
Applicable to all modes and can be used in combination with accuracy or performance parameters.
Parameter |
Description |
Example |
|---|---|---|
|
Specifies the name of the model inference backend task (corresponding to a pre-implemented default model configuration file under the path |
|
|
Specifies the name of the dataset task (corresponding to a pre-implemented default dataset configuration file under the path |
|
|
Specifies the name of the result summary task (corresponding to a pre-implemented default configuration file under the path |
|
|
Running mode, optional values: |
|
|
Specifies the timestamp in an existing working directory to continue execution and overwrite original results. Used in conjunction with the |
|
|
Specifies the evaluation working directory for saving output results. Default path: |
|
|
Path to the folder where configuration files for |
|
|
Enables Debug mode. The mode is enabled if this parameter is configured, and disabled if not; disabled by default. In Debug mode, all logs are printed directly to the terminal. (In Debug mode, the |
|
|
Enables Dry Run mode (prints logs to the screen without actually running tasks). The mode is enabled if this parameter is configured, and disabled if not; disabled by default. |
|
|
Reserved parameter; not currently supported. |
|
|
Enables merged inference for datasets of the same type (runs multiple datasets for the same task together). |
|
|
Specifies the number of test cases for the dataset (selected in dataset order). A positive integer must be passed. If the number exceeds the total number of cases in the dataset or no value is specified, the entire dataset is used for testing. |
|
|
Number of parallel tasks, range: |
|
|
Number of warm-up runs before sending requests. Data is selected in dataset order for testing. When |
|
### Accuracy Evaluation Parameters
Valid only when the mode is all, infer, eval, or viz.
Parameter |
Description |
Example |
|---|---|---|
|
Toggle to dump details of the evaluation process. Enabled if configured, disabled if not; disabled by default. |
|
|
Toggle to dump evaluation speed data. Enabled if configured, disabled if not; disabled by default. |
|
### Performance Evaluation Parameters
Valid only when the mode is perf or perf_viz.
Parameter |
Description |
Example |
|---|---|---|
|
Switch to enable performance pressure testing mode. Effective only when |
|
|
Duration of pressure testing. Only takes effect when |
|
Configuration Constant File Parameters
Some global constants are not restricted by task type, and it is recommended to keep their default values. If customization is required, edit the constant file: global_consts.py for configuration.
The currently supported parameter configurations are as follows:
Parameter Name |
Description |
Value Range / Requirements |
|---|---|---|
|
Number of processes used for sending requests. The default value is 0, which means automatic allocation based on the maximum number of concurrent requests configured by the user. (Invalid when the command-line parameter |
[0, number of CPU cores] |
|
Maximum cache size for a single chunk returned by the streaming inference model backend. The default value is 65535 bytes (64KB). |
|
|
Timeout period for the client to wait for a response after sending a request. The default value is None, meaning infinite waiting (always waiting for the model to return results). |
|
|
Log level, optional values: |
|