This project explores lightweight hyperparameter optimization (HPO) for Convolutional Neural Networks (CNNs).
The approach combines:
- Stochastic Best Improvement (SBI) – a local search method that samples neighboring hyperparameter configurations and keeps the best improving one.
- Progressive Halving (PH) – an adaptive resource allocation scheme inspired by Hyperband that trains promising candidates longer while pruning poor ones (a rough sketch follows this list).
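
A single SBI-PH iteration can be sketched roughly as follows. This is illustrative only: `sample_neighbor` and `train_and_evaluate` are hypothetical placeholders rather than the actual functions in `src/optim` or `src/neighborhood`, and the pool size, epoch budgets, and keep ratio are arbitrary example values.

```python
def sbi_with_progressive_halving(initial_config, sample_neighbor, train_and_evaluate,
                                 n_neighbors=8, budgets=(2, 5, 10), keep_ratio=0.5):
    """Illustrative sketch of one SBI-PH iteration (not the project's actual code)."""
    best_config = initial_config
    best_score = train_and_evaluate(best_config, budgets[-1])

    # SBI step: sample a pool of neighboring configurations around the incumbent.
    candidates = [sample_neighbor(best_config) for _ in range(n_neighbors)]

    # PH step: train survivors on growing epoch budgets, pruning the weakest half each rung.
    for epochs in budgets:
        scored = sorted(((train_and_evaluate(c, epochs), c) for c in candidates),
                        key=lambda pair: pair[0], reverse=True)
        candidates = [c for _, c in scored[:max(1, int(len(scored) * keep_ratio))]]

    # Accept the surviving candidate only if it improves on the incumbent.
    top_score, top_config = scored[0]
    if top_score > best_score:
        best_config, best_score = top_config, top_score
    return best_config, best_score
```

A full local search would repeat this step, re-centering the neighborhood on each new best configuration.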
The search is applied to MobileNetV3-Small, with block-wise transfer learning: early layers are frozen (using pretrained ImageNet weights), while later layers are optimized. Although a surrogate modeling module was explored, it was ultimately abandoned due to computational constraints.
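
Block-wise freezing of the pretrained backbone can be set up along these lines with torchvision; the number of frozen blocks below is an arbitrary illustration, not the split used in the experiments.

```python
import torch.nn as nn
from torchvision import models
from torchvision.models import MobileNet_V3_Small_Weights

# Load MobileNetV3-Small with pretrained ImageNet weights.
model = models.mobilenet_v3_small(weights=MobileNet_V3_Small_Weights.IMAGENET1K_V1)

# Freeze the first N feature blocks (N is illustrative here); the remaining
# blocks and the classifier stay trainable and are what the search can adapt.
n_frozen_blocks = 6
for block in model.features[:n_frozen_blocks]:
    for param in block.parameters():
        param.requires_grad = False

# Replace the classifier head for CIFAR-10 (10 classes).
model.classifier[3] = nn.Linear(model.classifier[3].in_features, 10)
```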
Repository layout:

```
├── notebooks/           # Notebooks for experiments in cloud environments (e.g. Google Colab)
├── scripts/             # Training and experimentation scripts
├── src/
│   ├── optim/               # SBI + Progressive Halving implementation
│   ├── neighborhood/        # Neighbor sampling strategies
│   ├── loading/             # Model and data loaders
│   ├── schema/              # Structural representations of model blocks and parameters
│   ├── training/            # Training utilities
│   ├── surrogate_modeling/  # (Deprecated) surrogate-based accuracy prediction
│   └── utils/               # Helper functions
├── dataset/             # CIFAR-10 data
├── docs/                # Approach and environment setup documentation
├── environment.yaml     # Conda environment specification
└── tests/               # Unit tests
```
To reproduce the experiments, follow these guidelines:
```bash
# Pretrained MobileNetV3 fine-tuning
python scripts/pretrained_training.py

# Run Stochastic Best Improvement with Progressive Halving
python src/optim/sa_optimization/main.py
```

The proposed Stochastic Best Improvement with Progressive Halving (SBI-PH) method was tested on MobileNetV3-Small using the CIFAR-10 dataset. Each optimization run was compared to the baseline pretrained model.
| Run | Accuracy | Accuracy Change vs Baseline | Parameters | Parameter Reduction vs Baseline |
|---|---|---|---|---|
| Baseline | 82.45 % | – | 1.528 M | – |
| Initial HPO Run | 82.90 % | +0.45 pp | 1.528 M | 0 % |
| Second HPO Run | 86.80 % | +4.35 pp | 1.019 M | −33 % |
| Third HPO Run | 86.48 % | +4.03 pp | 0.574 M | −62 % |
The best configuration achieved 86.8 % accuracy, a gain of roughly 4 percentage points over the baseline, while reducing the model size by about one-third. A more compact variant (0.57 M parameters, roughly 60 % smaller) maintained nearly the same accuracy.
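
The parameter counts in the table can be checked with a short snippet like the one below; it is shown for a stock MobileNetV3-Small with a 10-class head, while the searched variants would be built by the project's own loaders.

```python
from torchvision import models

# Example: count trainable parameters and report them in millions,
# matching the units used in the "Parameters" column above.
model = models.mobilenet_v3_small(num_classes=10)
n_trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"{n_trainable / 1e6:.3f} M trainable parameters")
```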
If you find this interesting, you can read the following:
- Literature Review: Convolutional Neural Network (CNN) Architecture and Hyper-Parameter Optimization (HPO) for Image Classification
- [Stochastic Best Improvement with Progressive Halving Hyperparameter Optimization for Image Classification](TO BE PUBLISHED)
This project was carried out as the semester project for the "Intelligent Systems & Data" option at the École nationale Supérieure d'Informatique (ESI).
This project is released under the MIT License.