Warning
This repository is a work in progress. If you have any questions, feel free to contact the author by email or open an issue on GitHub.
This repository contains the source code for experiments on Multi-Agent Reinforcement Learning applied to workload distribution in a FaaS-Edge computing system, with a focus on Decentralized FaaS (DFaaS).
If you are looking for the source code of the "Multi-Agent Reinforcement
Learning for Workload Distribution in FaaS-Edge Computing
Systems" article published at
IPDPSW 2025, please check the
paise2025 branch.
If instead you are here for my master's thesis (presented in October 2024), see
the
petriglia-thesis-2024
branch. The thesis, summary, and presentation slides are hosted on a dedicated
GitLab repository.
- configs: contains the specific configuration for the environment, models, and other aspects of the experiments.
- dataset: contains the dataset used to obtain real function invocation traces for the DFaaS environment (instead of generated ones).
- notebooks: Marimo notebooks used mainly to show plots of the experiments and to do some simple prototyping or experiments.
- patches: custom patches to Ray RLlib or other libraries required to run the experiments.
- plots: non-interactive Python scripts to generate plots from experiments.
- results: default directory where the experiment data is stored. This folder is not tracked in the repository because it contains ephemeral data.
- tests: Python scripts used to test Ray RLlib, the DFaaS environment, or other miscellaneous functionality.
The experiments are run and tested on Ubuntu 24.04 using Python 3.12. For a
reproducible development environment, it is preferable to install the
dependencies in a virtual environment (see the
venv module). The venv
module is not installed by default on Ubuntu; install it with sudo apt install python3.12-venv.
To run the experiments, the main dependencies are:
- Ray RLlib: a reinforcement learning library used to define the custom DFaaS environment and to run the experiments by training the models with the implemented algorithms. The version is pinned to 2.40.0.
- PyTorch: a library for deep learning on GPUs and CPUs. It is used by Ray RLlib when training models with deep reinforcement learning algorithms.
When you install these dependencies using pip, other dependencies are installed
automatically; some of them are used directly by the project (such as NumPy,
Gymnasium, Pandas, and NetworkX).
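To quickly verify that the main dependencies are importable and match the expected versions, a minimal sanity check (not part of the repository) can be run from the activated virtual environment:

```python
# Sanity check (a hedged example, not project code): import the main
# dependencies and print their versions. Ray should report 2.40.0, the
# version pinned in the requirements files.
import ray
import torch
import numpy
import gymnasium

print("Ray:", ray.__version__)
print("PyTorch:", torch.__version__)
print("NumPy:", numpy.__version__)
print("Gymnasium:", gymnasium.__version__)
```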
The following dependencies are required for plotting or running notebooks:
- Matplotlib: a plot generation library used by the scripts in the plots and notebooks directories.
- Marimo: Marimo notebooks are used to create plots and explore results and statistics from the experiments. The notebooks are stored in the notebooks directory. We use Marimo as a replacement for Jupyter Notebook.
- orjson: a JSON library that is faster than the standard library. It is used because the experiments generate large JSON files that slow down encoding and decoding (see the minimal example after this list).
- tqdm: a small module that enriches the log output during training.
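As an illustration of why orjson is preferred, here is a minimal, generic usage sketch (not taken from the repository); note that orjson.dumps() returns bytes rather than str:

```python
# Generic orjson usage (a hedged sketch, not project code).
import orjson

data = {"agent_0": [0.1, 0.2, 0.3], "agent_1": [0.4, 0.5, 0.6]}

encoded = orjson.dumps(data)   # returns bytes, unlike json.dumps()
decoded = orjson.loads(encoded)
assert decoded == data
```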
There are two requirements files for pip in the repository:
- requirements.base.txt: contains only the dependencies listed above with fixed versions,
- requirements.txt: contains the full list of dependencies with fixed versions.
Run the following commands to set up the development environment with Ubuntu:
$ sudo apt install python3.12-venv
$ git clone https://github.com/unimib-datAI/marl-dfaas.git
$ cd marl-dfaas
$ python3.12 -m venv .env
$ source .env/bin/activate
$ pip install --requirement requirements.base.txt
Or, for perfect reproducibility:
$ pip install --requirement requirements.txt
Please note that the requirements file expects a machine with an NVIDIA GPU and CUDA (at least 12.4) installed for PyTorch. PyTorch can also run on a CPU; in that case, follow the instructions on the official website.
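To check which device the installed PyTorch build will use, a quick check (not part of the repository) is:

```python
# Check whether PyTorch can see a CUDA-capable GPU. If this prints False,
# install the CPU build of PyTorch following the official instructions.
import torch

print(torch.cuda.is_available())
```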
The requirements.txt also contains some development tools:
- ruff: a source code linter and formatter for Python code and Marimo notebooks,
- pre-commit: to run hooks when doing a Git commit (for example, to check the notebooks before committing them).
You can find the DFaaS environment in the dfaas_env.py
file as the DFaaS class. You can use this environment independently of any
experiment. To configure it, use the DFaaSConfig class from
dfaas_env_config.py. DFaaSConfig follows a builder
pattern, and you can call the build() method to obtain a fully working DFaaS
environment. For more details, refer to DFaaSConfig's source code.
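As a rough sketch of standalone usage (the default DFaaSConfig construction and the exact reset()/step() signatures are assumptions based on the builder pattern described above and on the usual Gymnasium/RLlib multi-agent conventions, so check the source code for the real API):

```python
# Hedged sketch of using the DFaaS environment outside of an experiment.
# Assumptions: DFaaSConfig() can be built with defaults, and the resulting
# environment follows the Gymnasium multi-agent reset()/step() convention
# used by Ray RLlib's MultiAgentEnv.
from dfaas_env_config import DFaaSConfig

env = DFaaSConfig().build()  # build() is documented above

obs, info = env.reset()
done = False
while not done:
    # Sample a random action for every agent (edge node).
    actions = {agent: env.action_space[agent].sample() for agent in obs}
    obs, rewards, terminateds, truncateds, infos = env.step(actions)
    done = terminateds.get("__all__", False) or truncateds.get("__all__", False)
```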
The environment structure is largely inspired by the work "QoS-aware offloading policies for serverless functions in the Cloud-to-Edge continuum" by G. Russo Russo, D. Ferrarelli, D. Pasquali et al. DOI: https://doi.org/10.1016/j.future.2024.02.019. See Section 6.1 for more information.
There are some differences compared to the work of Russo Russo et al.:
- In DFaaS, only edge nodes are present,
- In DFaaS, the network bandwidth for each link follows a trace that can be generated from the dataset/5G_trace.csv file (see the sketch after this list),
- In DFaaS, there is only a single function,
- The DFaaS environment is based on groups of function invocations to be observed and acted upon, while in the cited article nodes can make decisions for each individual function invocation.
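To get a feel for the bandwidth trace data, a quick peek (a hedged example assuming only that dataset/5G_trace.csv is a plain CSV file readable by pandas):

```python
# Inspect the 5G bandwidth trace shipped in the dataset directory.
# Assumption: the file is a plain CSV; the column names depend on the dataset.
import pandas as pd

trace = pd.read_csv("dataset/5G_trace.csv")
print(trace.columns.tolist())
print(trace.head())
```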
You can run an example DFaaS episode by executing the dfaas_env.py file as a
script:
$ python dfaas_env.py
Episode configuration saved to 'results/dfaas_episode_42_config.yaml'
Episode statistics saved to 'results/dfaas_episode_42_stats.csv.gz'
Run dfaas_env.py with the --help option to see available options.
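The statistics file can be inspected, for example, with pandas (a hedged example assuming only that the file is a gzipped CSV, as the extension suggests):

```python
# Load the episode statistics produced by the example run above.
# pandas decompresses .csv.gz files transparently.
import pandas as pd

stats = pd.read_csv("results/dfaas_episode_42_stats.csv.gz")
print(stats.head())  # the available columns depend on the environment version
```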
Warning
Work in progress section!
Important: always run Python scripts from the project root directory to
allow loading of commonly used modules (dfaas_env.py...). For example, if you
need to run a test script:
$ python tests/env/local_strategy.py
Run the dfaas_train.py Python script.
Example:
$ python dfaas_train.py --env-config configs/env/three_agents.yaml --exp-config configs/exp/ppo.yaml three
To run an evaluation, run the dfaas_evaluate.py Python script.
Just run:
$ marimo edit notebooks/ --port 9090 --headless --no-token
Then open http://localhost:9090/ in a Web browser. You can export a notebook as HTML directly from the web editor.
The selected version of Ray RLlib needs to be patched to fix some bugs or
undesirable behaviour that has not yet been addressed upstream. The patches are
collected in the patches directory and can be applied using the
patch command:
patch -p0 < patches/NAME.patch
The patches have only been tested with Ray 2.40.0. They will only work if the
virtual environment is named .env and the Python version is 3.12, as the file
path is hardcoded into the patch file.
Note: The patch binary is required and preinstalled on Ubuntu. If not
available, it can be installed with sudo apt install patch.
The patches are created using the standard
diff tool:
diff -Naru .env/.../rllib/example.py .env/.../rllib/example_new.py > patches/NAME.patch
See this reply on StackExchange for more information.
Copyright © 2024-2025 Emanuele Petriglia
The source code in this repository is licensed under the Apache License, version 2.0. See the LICENSE file for more information.
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.