Avoid costly re-building of pipelines #443

mfeurer · 2018-03-21T09:57:33Z

Currently, SMAC suggests hyperparameter configurations which are independent of the dataset size. For example, the hyperparameter classifier:max_features which is specified between zero and one is transformed according to max_features = int(n_features ** classifier:max_features). Assuming the dataset in question has only 10 features, SMAC does not know that most values of the tuned hyperparameter map to the same hyperparameter applied to the actual model. Therefore, one needs to track the 'actual' hyperparameters after transformation and check whether they are re-used, and return a cached function value to SMAC if done so.

Initial experiments suggest that 1-2% of the overall runs are actually re-optimizations.

The text was updated successfully, but these errors were encountered:

github-actions · 2021-05-05T01:52:14Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs for the next 7 days. Thank you for your contributions.

mfeurer added the Good first issue label Mar 20, 2019

franchuterivera added the enhancement label Feb 17, 2021

github-actions bot added the stale label May 5, 2021

mfeurer removed the stale label May 6, 2021

Nov	JUL	Aug
	29
2020	2021	2022

automl / auto-sklearn

Avoid costly re-building of pipelines #443

Avoid costly re-building of pipelines #443

mfeurer commented Mar 21, 2018

github-actions bot commented May 5, 2021

automl / auto-sklearn

Avoid costly re-building of pipelines #443

Avoid costly re-building of pipelines #443

Comments

mfeurer commented Mar 21, 2018

github-actions bot commented May 5, 2021