Merge pull request #217 from flux-framework/add/flux-sched-container-tutorial

mergify[bot] · web-flow · commit 93dab21f7c32 · 2023-03-02T14:45:29.000Z
add flux-sched container tutorial
diff --git a/auto_examples/auto_examples_jupyter.zip b/auto_examples/auto_examples_jupyter.zip
diff --git a/auto_examples/auto_examples_python.zip b/auto_examples/auto_examples_python.zip
diff --git a/quickstart.rst b/quickstart.rst
@@ -57,6 +57,7 @@ default CMD by supplying args to ``docker run``:
    $ flux getattr size
    4
 
+Note that a more complete Docker tutorial is available at :ref:`flux-sched-container`.
 If you are developing Flux and want to compile the modified code and run
 our testsuite within a docker container, you can use our helper script:
 
diff --git a/tutorials/containers/flux-sched-container.rst b/tutorials/containers/flux-sched-container.rst
@@ -0,0 +1,213 @@
+.. _flux-sched-container:
+
+====================================
+Quick Start with Flux in a Container
+====================================
+
+Do you want to submit a job to Flux? Here's a short tutorial on how to do so via a container!
+The reason this works is because a Flux instance can be run anywhere...
+
+    | You can run it on a cluster... 🥜️
+    | You can run it alongside Lustre! 📁️
+    | You can run it on a share... 🤗️
+    | You can run it anywhere! 🏔️
+    | You can run it in a container... 📦️
+    | Using Flux? Total no-brainer! 🧠️
+
+Flux has a regularly updated Docker image on `Docker Hub <https://hub.docker.com/u/fluxrm>`_. 
+You will need to `install docker <https://docs.docker.com/engine/install/>`_ first.
+
+----------------------
+Starting the Container
+----------------------
+
+Run the container as follows:
+
+.. code-block:: bash
+
+  # This says "run an interactive terminal"
+  $ docker run -it fluxrm/flux-sched:latest
+
+There is a bit of a trick going on with the entrypoint so that when you run the container as shown above,
+you will shell into a Flux instance of size 1. 
+
+
+.. raw:: html
+    :file: include/entrypoint-details.html
+
+.. code-block:: shell
+
+  # What is the size of the current Flux instance (i.e. number of flux-broker processes)
+  $ flux getattr size
+  1
+
+  # What resources do we have, and what are their states?
+  $ flux resource list
+       STATE NNODES   NCORES    NGPUS NODELIST
+        free      1        4        0 f9004106893d
+   allocated      0        0        0 
+        down      0        0        0 
+
+
+In the above, because we are running in a single container, we only see one node with
+four cores. Note that you could also imagine an example with docker compose (and we will
+be adding this example shortly). Let's look at more Flux interactions, first to see Flux 
+environment variables:
+
+.. code-block:: shell
+
+  # Where are different Flux things located (the environment)
+  $ flux env
+
+------------
+Running Jobs
+------------
+
+``flux submit`` submits a job which will be scheduled and run in the background. 
+It prints the jobid to standard output upon successful submission. To run a job interactively,
+use ``flux run`` which submits and then attaches to the job, displays job output in real time,
+and does not exit until the job finishes.
+
+.. code-block:: shell
+
+    $ flux submit hostname
+    ƒM5k8m7m
+    $ flux run hostname
+    f9004106893d
+
+You can inspect ``--help`` for each of the commands above to see it's possible
+to customize the number of nodes, tasks, cores, and other variables.
+There are many different options to customize your job submission. For further
+details, please see :core:man1:`flux-submit` or run ``flux help submit``.
+
+The identifier shown above is a :ref:`jobid<fluid>` (e.g., ``ƒM5k8m7m``). This kind of identifer
+(or similar) is returned for every job submitted, and will be how you interact with your job moving forward. 
+Let's throw in a few more sleep jobs, and immediately ask to see them with ``flux jobs``:
+
+.. code-block:: shell
+
+    $ flux submit sleep 360
+    $ flux submit sleep 360
+
+
+.. code-block:: shell
+
+    $ flux jobs
+           JOBID USER     NAME       ST NTASKS NNODES     TIME INFO
+        ƒq3d755R fluxuser sleep       R      1      1   2.811s 848fc387afd7
+        ƒpcByPgK fluxuser sleep       R      1      1   3.805s 848fc387afd7
+
+
+But wait, what happened to our first two jobs? ``flux jobs`` only shows "active" jobs by default,
+add an ``-a`` for "all" to see all the jobs.
+
+.. code-block:: shell
+
+    $ flux jobs -a
+           JOBID USER     NAME       ST NTASKS NNODES     TIME INFO
+        ƒq3d755R fluxuser sleep       R      1      1   39.98s 848fc387afd7
+        ƒpcByPgK fluxuser sleep       R      1      1   40.97s 848fc387afd7
+        ƒDi2hxdm fluxuser hostname   CD      1      1   0.019s 848fc387afd7
+        ƒBmTaVZ9 fluxuser hostname   CD      1      1   0.025s 848fc387afd7
+
+.. note::
+
+   See if you can figure out how to list jobs by a particular status, e.g., ``R`` in the output 
+   above means "running." Try ``flux jobs --help`` or ``flux help jobs``.
+
+
+
+Did you figure it out? It would be ``flux jobs --filter=RUNNING``. What if you were running a long 
+process, and you wanted to check on output? Let's do that. Here is script to loop, print, and sleep.
+
+.. code-block:: bash
+
+    #!/bin/bash
+    # Save this as loop.sh
+    for i in {0..10}; do
+        echo "Hello I am loop iteration $i."
+        sleep ${i}
+    done
+
+Now make the script executable, and submit the job with flux.
+
+.. code-block:: shell
+
+    $ chmod +x ./loop.sh
+    $ flux submit ./loop.sh
+
+
+To see output (and wait until completion) use ``flux job attach``:
+
+
+.. code-block:: shell
+
+    $ flux job attach ƒ4evXWb9Z
+    Hello I am loop iteration 0.
+    Hello I am loop iteration 1.
+    Hello I am loop iteration 2.
+    Hello I am loop iteration 3.
+    Hello I am loop iteration 4.
+    Hello I am loop iteration 5.
+
+See ``flux help job`` or :core:man1:`flux-job` for more information on ``flux job attach``.
+
+------------
+Viewing Jobs
+------------
+
+Aside from listing jobs with ``flux jobs`` there are other ways to get metadata about jobs.
+For your running jobs, you can use ``flux pstree`` to see exactly that - a tree of jobs.
+Let's say we run another sleep job:
+
+
+.. code-block:: shell
+
+    $ flux submit sleep 350
+    ƒ744GwLs
+
+The tree will show us that one sleep!
+
+
+.. code-block:: shell
+
+    $ flux pstree
+    flux
+    └── sleep
+
+Submit the same command a few more times? We see that reflected in the tree!
+
+.. code-block:: shell
+
+    $ flux submit sleep 350
+    ƒAivupEb
+    $ flux submit sleep 350
+    ƒB621bj5
+
+    $ flux pstree
+    flux
+    └── 3*[sleep]
+
+
+And have you heard of a flux jobspec? This is a data structure that describes the resources, tasks,
+and attributes of a job. You can see one doing the following:
+
+.. code-block:: shell
+
+    $ flux job info ƒB621bj5 jobspec | jq
+
+Finally, ``flux top`` is a cool way to see a summary of your jobs:
+
+
+.. code-block:: shell
+
+    ƒ                                    ƒ63WcEKAP                        3.6e+04d⌚
+        nodes [                        0/1]                         0 pending
+        cores [                        0/4]                         0 running
+         gpus [                        0/0]        3 complete,      0 failed     ♡
+
+     size: 1   depth: 1            uptime: 6.7m               0.47.0-148-ge2b96308f
+           JOBID     USER ST NTASKS NNODES RUNTIME NAME                             
+
+Akin to vim, you can hit ``q`` to exit. And that's it!
+If you have any questions, please `let us know <https://github.com/flux-framework/flux-docs/issues>`_.
diff --git a/tutorials/containers/include/entrypoint-details.html b/tutorials/containers/include/entrypoint-details.html
@@ -0,0 +1,17 @@
+<details>
+
+<summary>Click here to learn more about the entrypoint</summary>
+
+<p>More specifically, the entrypoint is <code class="docutils literal notranslate"><span class="pre">/usr/bin/flux</span> <span class="pre">start</span> <span class="pre">/bin/bash</span></code>.
+This means if you were to shell into a container without this entrypoint, you could run <code class="docutils literal notranslate"><span class="pre">flux</span> <span class="pre">start</span></code>
+on your own to also shell into an instance.  As an example:</p>
+
+<div class="highlight-bash notranslate"><div class="highlight"><pre id="__code_34"><span></span><button class="md-clipboard md-icon" title="Copy to clipboard" data-clipboard-target="#__code_34 > code"></button><code>
+    $<span class="w"> </span>docker<span class="w"> </span>run<span class="w"> </span>-it<span class="w"> </span>--entrypoint<span class="w"> </span>bash<span class="w"> </span>fluxrm/flux-sched:latest
+    $<span class="w"> </span>flux<span class="w"> </span>start<span class="w"> </span>--test-size<span class="o">=</span><span class="m">4</span>
+    </code></pre></div>
+    </div>
+
+<p>For this tutorial, assume that when you shell in, the Flux instance is already started.</p>
+
+</details>
diff --git a/tutorials/containers/index.rst b/tutorials/containers/index.rst
@@ -0,0 +1,15 @@
+.. _command-tutorials:
+
+Container Tutorials
+===================
+
+These are "quick start" container tutorials to get you up-and-running Flux in no time!
+
+ - ``fluxrm/flux-sched:focal``: :ref:`flux-sched-container`
+ - ``flux with docker-compose``: 🥑️ *coming soon* 🥑️
+
+.. toctree::
+   :maxdepth: 2
+   :caption: Container Tutorials
+
+   flux-sched-container
diff --git a/tutorials/index.rst b/tutorials/index.rst
@@ -13,5 +13,6 @@ find a tutorial of interest.
 
    ../auto_examples/index
    commands/index
+   containers/index
    lab/index
    integrations/index

Original file line number	Diff line number	Diff line change
@@ -57,6 +57,7 @@ default CMD by supplying args to ``docker run``:
`57`	`57`	`$ flux getattr size`
`58`	`58`	`4`
`59`	`59`
	`60`	+Note that a more complete Docker tutorial is available at :ref:`flux-sched-container`.
`60`	`61`	`If you are developing Flux and want to compile the modified code and run`
`61`	`62`	`our testsuite within a docker container, you can use our helper script:`
`62`	`63`