Repository for the Vinnova project "Mixture of Experts models Tailored for Fleet Intelligence"
This repository is designed for collaborative development with multiple people working simultaneously:
architecture/
- Shared, reusable model components (backbone, experts, router, MoE)training/
- Shared training utilities and frameworkspipelines/
- Experiment-specific implementations that use the shared modules
python3 -m venv .venv
source .venv/bin/activate
pip3 install -e . -U
- Shared Components: Add reusable model components to
architecture/
or training utilities totraining/
- New Experiments: Create new experiment directories in
research_pipelines/
(e.g.,research_pipelines/zod_fl/
,research_pipelines/cifar10/
) - Experiment Structure: Each experiment should have:
config.py
- Experiment-specific configurationdataset.py
- Dataset loading and preprocessingmodel.py
- Model building using shared componentstrain.py
- Training script, preferably using shared trainer
- Established Experiments: Established experiments can be moved to
production_pipelines/
(with good test coverage). Add aREADME.md
with key findings and learnings.