Search

The MLflow UI and API support searching runs within a single experiment or a group of experiments using a search filter API. This API is a simplified version of the SQL WHERE clause.

Table of Contents

Syntax
Programmatically Searching Runs
- Python
- R
- Java

Syntax

A search filter is one or more expressions joined by the AND keyword. The syntax does not support OR. Each expression has three parts: an identifier on the left-hand side (LHS), a comparator, and constant on the right-hand side (RHS).

Example Expressions

Search for the subset of runs with logged accuracy metric greater than 0.92.
```
metrics.accuracy > 0.92
```
Search for runs created using a Logistic Regression model, a learning rate (lambda) of 0.001, and recorded error metric under 0.05.
```
params.model = "LogisticRegression" and params.lambda = "0.001" and metrics.error <= 0.05
```
Search for all failed runs.
```
attributes.status = "FAILED"
```

Identifier

Required in the LHS of a search expression. Signifies an entity to compare against.

An identifier has two parts separated by a period: the type of the entity and the name of the entity. The type of the entity is metrics, params, attributes, or tags. The entity name can contain alphanumeric characters and special characters.

This section describes supported entity names and how to specify such names in search expressions.

In this section:

Entity Names Containing Special Characters
Entity Names Starting with a Number
Run Attributes
MLflow Tags

Entity Names Containing Special Characters

When a metric, parameter, or tag name contains a special character like hyphen, space, period, and so on, enclose the entity name in double quotes or backticks.

Examples

params."model-type"

metrics.`error rate`

Entity Names Starting with a Number

Unlike SQL syntax for column names, MLflow allows logging metrics, parameters, and tags names that have a leading number. If an entity name contains a leading number, enclose the entity name in double quotes. For example:

metrics."2019-04-02 error rate"

Run Attributes

You can search using two run attributes contained in mlflow.entities.RunInfo: status and artifact_uri. Both attributes have string values. Other fields in mlflow.entities.RunInfo are not searchable.

Note

The experiment ID is implicitly selected by the search API.
A run’s lifecycle_stage attribute is not allowed because it is already encoded as a part of the API’s run_view_type field. To search for runs using run_id, it is more efficient to use get_run APIs.

Example

attributes.artifact_uri

MLflow Tags

You can search for MLflow tags by enclosing the tag name in double quotes or backticks. For example, to search for the name of an MLflow run, specify tags."mlflow.runName" or tags.`mlflow.runName`.

Examples

tags."mlflow.runName"

tags.`mlflow.parentRunId`

Comparator

There are two classes of comparators: numeric and string.

Numeric comparators (metrics): =, !=, >, >=, <, and <=.
String comparators (params, tags, and attributes): = and !=.

Constant

The search syntax requires the RHS of the expression to be a constant. The type of the constant depends on LHS.

If LHS is a metric, the RHS must be an integer or float number.
If LHS is a parameter or tag, the RHS must be a string constant enclosed in single or double quotes.

Programmatically Searching Runs

The MLflow UI supports searching runs contained within the current experiment. To search runs across multiple experiments, use one of the client APIs.

Python

Use the mlflow.tracking.MlflowClient.search_runs() or mlflow.search_runs() API to search programmatically. You can specify the list of columns to order by (for example, “metrics.rmse”) in the order_by column. The column can contain an optional DESC or ASC value; the default is ASC. The default ordering is to sort by start_time DESC, then run_id.

For example, if you’d like to identify the best active run from experiment ID 0 by accuracy, use:

from mlflow.tracking.client import MlflowClient
from mlflow.entities import ViewType

run = MlflowClient().search_runs(
  experiment_ids="0",
  filter_string="",
  run_view_type=ViewType.ACTIVE_ONLY,
  max_results=1,
  order_by=["metrics.accuracy DESC"]
)[0]

To get all active runs from experiments IDs 3, 4, and 17 that used a CNN model with 10 layers and had a prediction accuracy of 94.5% or higher, use:

from mlflow.tracking.client import MlflowClient
from mlflow.entities import ViewType

query = "params.model = 'CNN' and params.layers = '10' and metrics.`prediction accuracy` >= 0.945"
runs = MlflowClient().search_runs(experiment_ids=["3", "4", "17"], filter_string=query, run_view_type=ViewType.ACTIVE_ONLY)

To search all known experiments for any MLflow runs created using the Inception model architecture:

from mlflow.tracking.client import MlflowClient
from mlflow.entities import ViewType

all_experiments = [exp.experiment_id for exp in MlflowClient().list_experiments()]
runs = MlflowClient().search_runs(experiment_ids=all_experiments, filter_string="params.model = 'Inception'", run_view_type=ViewType.ALL)

R

The R API is similar to the Python API.

library(mlflow)
mlflow_search_runs(
  filter = "metrics.rmse < 0.9 and tags.production = 'true'",
  experiment_ids = as.character(1:2),
  order_by = "params.lr DESC"
)

Java

The Java API is similar to Python API.

List<Long> experimentIds = Arrays.asList("1", "2", "4", "8");
List<RunInfo> searchResult = client.searchRuns(experimentIds, "metrics.accuracy_score < 99.90");