Bilevel Optimization Benchmark


Results can be consulted at https://benchopt.github.io/results/benchmark_bilevel.html

BenchOpt is a package to simplify comparisons of optimization algorithms and to make them more transparent and reproducible. This benchmark is dedicated to solvers for bilevel optimization:

$$\min_x f(x, z^*(x)) \quad \text{with} \quad z^*(x) = \arg\min_z g(x, z),$$

where $g$ and $f$ are two functions of two variables.
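
To make this structure concrete, here is a minimal sketch, with toy quadratic objectives chosen purely for illustration, of how the bilevel objective can be evaluated: solve the inner problem numerically for a fixed $x$, then plug the solution $z^*(x)$ into $f$. None of this is the benchmark's actual code.

```python
# Minimal illustrative sketch (not part of the benchmark): evaluate the
# bilevel objective f(x, z*(x)) by solving the inner problem numerically
# for a fixed outer variable x. The toy objectives below are assumptions.
import numpy as np
from scipy.optimize import minimize

def g(x, z):
    # inner objective: a strongly convex quadratic in z
    return 0.5 * np.sum((z - x) ** 2) + 0.5 * np.exp(x[0]) * np.sum(z ** 2)

def f(x, z):
    # outer objective, evaluated at the inner solution z*(x)
    return 0.5 * np.sum(z ** 2)

def value_function(x, z0):
    # z*(x) = argmin_z g(x, z), computed here with a generic solver
    z_star = minimize(lambda z: g(x, z), z0).x
    return f(x, z_star)

print(value_function(np.zeros(2), z0=np.ones(2)))
```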

Different problems

This benchmark currently implements two bilevel optimization problems: regularization selection and hyper data cleaning.

1 - Regularization selection

In this problem, the inner function $g$ is defined by

$$g(x, z) = \frac{1}{n} \sum_{i=1}^{n} \ell(d_i; z) + \mathcal{R}(x, z)$$

where $d_1, \dots, d_n$ are training data samples, $z$ are the parameters of the machine learning model, and the loss $\ell(d_i; z)$ measures how well the model parameters $z$ predict the sample $d_i$. The regularization $\mathcal{R}(x, z)$, parametrized by the regularization strengths $x$, aims at promoting a certain structure in the parameters $z$.

The outer function $f$ is defined as the unregularized loss on unseen data,

$$f(x, z) = \frac{1}{m} \sum_{j=1}^{m} \ell(d'_j; z)$$

where $d'_1, \dots, d'_m$ are new samples from the same dataset as above.
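
As a rough illustration of this $(g, f)$ pair, the following sketch assumes a generic per-sample loss; the names loss, reg, train, and val are hypothetical placeholders, not the benchmark's API.

```python
# Hedged sketch of the regularization-selection pair (g, f).
# loss(d, z) and reg(x, z) are illustrative placeholders for the
# per-sample loss ell and the regularization R of the text.
import numpy as np

def inner_objective(x, z, train, loss, reg):
    # g(x, z): mean training loss plus the regularization R(x, z)
    return np.mean([loss(d, z) for d in train]) + reg(x, z)

def outer_objective(x, z, val, loss):
    # f(x, z): unregularized mean loss on held-out samples
    return np.mean([loss(d, z) for d in val])
```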

There are currently two datasets for this regularization selection problem.

Covtype

Homepage : https://archive.ics.uci.edu/dataset/31/covertype

This is a logistic regression problem, where the data is of the form $d_i = (a_i, y_i)$, with $a_i \in \mathbb{R}^p$ the features and $y_i = \pm 1$ the binary target. For this problem, the loss is $\ell(d_i; z) = \log(1 + \exp(-y_i a_i^\top z))$, and the regularization is simply given by $\mathcal{R}(x, z) = \frac{1}{2} \sum_{j=1}^{p} \exp(x_j) z_j^2$: each coefficient of $z$ is regularized independently with strength $\exp(x_j)$.
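
A minimal sketch of this inner objective, assuming an illustrative feature matrix A and label vector y (not the benchmark's actual code):

```python
# Hedged sketch of the Covtype inner objective g(x, z): logistic loss
# with one regularization strength exp(x_j) per coordinate of z.
import numpy as np

def inner_objective(x, z, A, y):
    # mean logistic loss: log(1 + exp(-y_i * a_i^T z)), computed stably
    margins = y * (A @ z)
    loss = np.mean(np.logaddexp(0.0, -margins))
    # per-coordinate regularization: 0.5 * sum_j exp(x_j) * z_j^2
    reg = 0.5 * np.sum(np.exp(x) * z ** 2)
    return loss + reg
```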

Ijcnn1

Homepage : https://www.openml.org/search?type=data&sort=runs&id=1575&status=active

This is a multiclass logistic regression problem, where the data is of the form $d_i = (a_i, y_i)$, with $a_i \in \mathbb{R}^p$ the features and $y_i \in \{1, \dots, k\}$ the integer target, with $k$ the number of classes. For this problem, the loss is $\ell(d_i; z) = \mathrm{CrossEntropy}(z a_i, y_i)$, where $z$ is now a $k \times p$ matrix. The regularization is given by $\mathcal{R}(x, z) = \frac{1}{2} \sum_{j=1}^{k} \exp(x_j) \|z_j\|^2$: each row of $z$ is regularized independently with strength $\exp(x_j)$.
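
A corresponding sketch for the multiclass case, again with illustrative names and with labels assumed to be 0-indexed:

```python
# Hedged sketch of the Ijcnn1 inner objective: multiclass logistic
# regression with one strength exp(x_j) per row of the (k, p) matrix z.
import numpy as np
from scipy.special import logsumexp

def inner_objective(x, z, A, y):
    # z: (k, p) parameters, A: (n, p) features, y: (n,) labels in {0,...,k-1}
    logits = A @ z.T  # (n, k) class scores
    # cross-entropy: logsumexp over classes minus the true-class logit
    loss = np.mean(logsumexp(logits, axis=1) - logits[np.arange(len(y)), y])
    # per-row regularization: 0.5 * sum_j exp(x_j) * ||z_j||^2
    reg = 0.5 * np.sum(np.exp(x) * np.sum(z ** 2, axis=1))
    return loss + reg
```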

2 - Hyper data cleaning

This problem was first introduced by [Fra2017]. Here, the data is the MNIST dataset. The training set has been corrupted: with probability $p$, the label of an image, $y \in \{1, \dots, 10\}$, is replaced by another random label between 1 and 10. We do not know beforehand which samples have been corrupted. We also have a clean test set, which has not been corrupted. The goal is to fit a model on the corrupted training data that performs well on the test set. To do so, a set of weights, one per training sample, is learned jointly with the model parameters. Ideally, we would like a weight of 0 for corrupted samples and a weight of 1 for uncorrupted ones. The problem is cast as a bilevel problem with $g$ given by

$$g(x, z) = \frac{1}{n} \sum_{i=1}^{n} \sigma(x_i) \, \ell(d_i; z) + \frac{C}{2} \|z\|^2$$

where the $d_i$ are the corrupted training data, $\ell$ is the loss of a CNN parameterized by $z$, $\sigma$ is a sigmoid function, and $C$ is a small regularization constant. Here, the outer variable $x$ is a vector of dimension $n$, and the weight of sample $i$ is given by $\sigma(x_i)$. The outer function is the loss on the test set,

$$f(x, z) = \frac{1}{m} \sum_{j=1}^{m} \ell(d'_j; z)$$

where the $d'_j$ are the uncorrupted test data.
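
A minimal sketch of the weighted inner objective; the per_sample_loss array stands in for the CNN losses $\ell(d_i; z)$ and is an assumption of this example:

```python
# Hedged sketch of the data-cleaning inner objective: sample i is
# weighted by sigmoid(x_i) before averaging. The array of per-sample
# CNN losses is taken as an input here for simplicity.
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def inner_objective(x, z, per_sample_loss, C=1e-3):
    # x: (n,) outer variable, one weight logit per training sample
    # per_sample_loss: (n,) losses ell(d_i; z) of the CNN on each sample
    weighted = np.mean(sigmoid(x) * per_sample_loss)
    return weighted + 0.5 * C * np.sum(z ** 2)
```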

Install

This benchmark can be run using the following commands:

$ pip install -U benchopt
$ git clone https://github.com/benchopt/benchmark_bilevel
$ benchopt run benchmark_bilevel

Options can be passed to benchopt run to restrict the benchmark to some solvers or datasets, e.g.:

$ benchopt run benchmark_bilevel -s solver1 -d dataset2 --max-runs 10 --n-repetitions 10

You can also use config files to set up the benchmark run:

$ benchopt run benchmark_bilevel --config config/X.yml

where X.yml is a config file; see https://benchopt.github.io/index.html#run-a-benchmark for an example. Note that a config file may launch a large grid search. When available, you can instead use the corresponding file X_best_params.yml to run each solver with a single set of parameters.

Use benchopt run -h for more details about these options, or visit https://benchopt.github.io/api.html.

Cite

If you use this benchmark in your research project, please cite the following paper:

@inproceedings{saba,
   title = {A Framework for Bilevel Optimization That Enables Stochastic and Global Variance Reduction Algorithms},
   booktitle = {Advances in {{Neural Information Processing Systems}} ({{NeurIPS}})},
   author = {Dagr{\'e}ou, Mathieu and Ablin, Pierre and Vaiter, Samuel and Moreau, Thomas},
   year = {2022}
}

References

[Fra2017] Franceschi, Luca, et al. "Forward and reverse gradient-based hyperparameter optimization." International Conference on Machine Learning (ICML). PMLR, 2017.