Showing 1–10 of 10 results for author: Bergman, E

Searching in archive cs.
  1. arXiv:2506.06143  [pdf, ps, other]

    cs.LG

    carps: A Framework for Comparing N Hyperparameter Optimizers on M Benchmarks

    Authors: Carolin Benjamins, Helena Graf, Sarah Segel, Difan Deng, Tim Ruhkopf, Leona Hennig, Soham Basu, Neeratyoy Mallik, Edward Bergman, Deyao Chen, François Clément, Matthias Feurer, Katharina Eggensperger, Frank Hutter, Carola Doerr, Marius Lindauer

    Abstract: Hyperparameter Optimization (HPO) is crucial to develop well-performing machine learning models. In order to ease prototyping and benchmarking of HPO methods, we propose carps, a benchmark framework for Comprehensive Automated Research Performance Studies allowing to evaluate N optimizers on M benchmark tasks. In this first release of carps, we focus on the four most important types of HPO task ty…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2407.05789  [pdf, other]

    cs.LG cs.AI

    CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

    Authors: Philipp Bordne, M. Asif Hasan, Eddie Bergman, Noor Awad, André Biedenkapp

    Abstract: High-dimensional action spaces remain a challenge for dynamic algorithm configuration (DAC). Interdependencies and varying importance between action dimensions are further known key characteristics of DAC problems. We argue that these Coupled Action Dimensions with Importance Differences (CANDID) represent aspects of the DAC problem that are not yet fully explored. To address this gap, we introduc…

    Submitted 17 September, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 5 pages main paper, 11 pages references and appendix, 9 figures, to be published in: Proceedings of the Third International Conference on Automated Machine Learning (AutoML 2024), Workshop Track

  3. arXiv:2405.03389  [pdf, other]

    cs.LG cs.AI

    Don't Waste Your Time: Early Stopping Cross-Validation

    Authors: Edward Bergman, Lennart Purucker, Frank Hutter

    Abstract: State-of-the-art automated machine learning systems for tabular data often employ cross-validation; ensuring that measured performances generalize to unseen data, or that subsequent ensembling does not overfit. However, using k-fold cross-validation instead of holdout validation drastically increases the computational cost of validating a single configuration. While ensuring better generalization…

    Submitted 2 August, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted at Third International Conference on Automated Machine Learning (AutoML 2024); for code, see https://github.com/automl/DontWasteYourTime-early-stopping

  4. arXiv:2404.16795  [pdf, other]

    cs.LG

    In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter Optimization

    Authors: Herilalaina Rakotoarison, Steven Adriaensen, Neeratyoy Mallik, Samir Garibov, Edward Bergman, Frank Hutter

    Abstract: With the increasing computational costs associated with deep learning, automated hyperparameter optimization methods, strongly relying on black-box Bayesian optimization (BO), face limitations. Freeze-thaw BO offers a promising grey-box alternative, strategically allocating scarce resources incrementally to different configurations. However, the frequent surrogate model updates inherent to this ap…

    Submitted 12 August, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Published at the 41st International Conference on Machine Learning (ICML), Vienna, Austria

  5. arXiv:2403.01888  [pdf, other]

    cs.AI cs.LG

    Fast Benchmarking of Asynchronous Multi-Fidelity Optimization on Zero-Cost Benchmarks

    Authors: Shuhei Watanabe, Neeratyoy Mallik, Edward Bergman, Frank Hutter

    Abstract: While deep learning has celebrated many successes, its results often hinge on the meticulous selection of hyperparameters (HPs). However, the time-consuming nature of deep learning training makes HP optimization (HPO) a costly endeavor, slowing down the development of efficient HPO tools. While zero-cost benchmarks, which provide performance and runtime without actual training, offer a solution fo…

    Submitted 19 August, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to AutoML Conference 2024 ABCD Track

  6. arXiv:2306.12370  [pdf, other]

    cs.LG

    PriorBand: Practical Hyperparameter Optimization in the Age of Deep Learning

    Authors: Neeratyoy Mallik, Edward Bergman, Carl Hvarfner, Danny Stoll, Maciej Janowski, Marius Lindauer, Luigi Nardi, Frank Hutter

    Abstract: Hyperparameters of Deep Learning (DL) pipelines are crucial for their downstream performance. While a large number of methods for Hyperparameter Optimization (HPO) have been developed, their incurred costs are often untenable for modern DL. Consequently, manual experimentation is still the most prevalent approach to optimize hyperparameters, relying on the researcher's intuition, domain knowledge,…

    Submitted 15 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  7. Can Fairness be Automated? Guidelines and Opportunities for Fairness-aware AutoML

    Authors: Hilde Weerts, Florian Pfisterer, Matthias Feurer, Katharina Eggensperger, Edward Bergman, Noor Awad, Joaquin Vanschoren, Mykola Pechenizkiy, Bernd Bischl, Frank Hutter

    Abstract: The field of automated machine learning (AutoML) introduces techniques that automate parts of the development of machine learning (ML) systems, accelerating the process and reducing barriers for novices. However, decisions derived from ML models can reproduce, amplify, or even introduce unfairness in our societies, causing harm to (groups of) individuals. In response, researchers have started to p…

    Submitted 20 February, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Journal ref: Journal of Artificial Intelligence Research 79 (2024) 639-677

  8. Mind the Gap: Measuring Generalization Performance Across Multiple Objectives

    Authors: Matthias Feurer, Katharina Eggensperger, Edward Bergman, Florian Pfisterer, Bernd Bischl, Frank Hutter

    Abstract: Modern machine learning models are often constructed taking into account multiple objectives, e.g., minimizing inference time while also maximizing accuracy. Multi-objective hyperparameter optimization (MHPO) algorithms return such candidate models, and the approximation of the Pareto front is used to assess their performance. In practice, we also want to measure generalization when moving from th…

    Submitted 9 February, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

  9. arXiv:2206.03493  [pdf, other]

    cs.LG

    DeepCAVE: An Interactive Analysis Tool for Automated Machine Learning

    Authors: René Sass, Eddie Bergman, André Biedenkapp, Frank Hutter, Marius Lindauer

    Abstract: Automated Machine Learning (AutoML) is used more than ever before to support users in determining efficient hyperparameters, neural architectures, or even full machine learning pipelines. However, users tend to mistrust the optimization process and its results due to a lack of transparency, making manual tuning still widespread. We introduce DeepCAVE, an interactive framework to analyze and monito…

    Submitted 11 July, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Workshop on Adaptive Experimental Design and Active Learning in the Real World (ReALML@ICML'22)

  10. arXiv:2006.12328  [pdf]

    cs.LG cs.AI stat.ML

    Siamese Meta-Learning and Algorithm Selection with 'Algorithm-Performance Personas' [Proposal]

    Authors: Joeran Beel, Bryan Tyrell, Edward Bergman, Andrew Collins, Shahad Nagoor

    Abstract: Automated per-instance algorithm selection often outperforms single learners. Key to algorithm selection via meta-learning is often the (meta) features, which sometimes though do not provide enough information to train a meta-learner effectively. We propose a Siamese Neural Network architecture for automated algorithm selection that focuses more on 'alike performing' instances than meta-features.…

    Submitted 23 June, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: 7th Workshop on Automated Machine Learning (AutoML 2020)