Compose lets you define AI models as core components of your application.
## Prerequisites
- Docker Compose v2.38 or later
- A platform that supports Compose models such as Docker Model Runner (DMR) or compatible cloud providers.

  If you are using DMR, see the [requirements](/manuals/ai/model-runner/_index.md#requirements).
## What are Compose models?
Compose `models` are a standardized way to define AI model dependencies in your application. By using the [`models` top-level element](/reference/compose-file/models.md) in your Compose file, you can:
- Declare which AI models your application needs
- Specify model configurations and requirements
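
For instance, a minimal sketch of such a definition (service and model names are illustrative):

```yaml
services:
  app:
    image: my-app
    models:
      - llm

models:
  llm:
    model: ai/smollm2
```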
Common configuration options include:
- `model` (required): The OCI artifact identifier for the model. This is what Compose pulls and runs via the model runner.
- `context_size`: Defines the maximum token context size for the model.

  > [!NOTE]
  > Each model has its own maximum context size. When increasing the context length,
  > consider your hardware constraints. In general, try to keep context size
  > as small as feasible for your specific needs.
- `runtime_flags`: A list of raw command-line flags passed to the inference engine when the model is started.
  For example, if you use llama.cpp, you can pass any of [the available parameters](https://github.com/ggml-org/llama.cpp/blob/master/tools/server/README.md).
- Platform-specific options may also be available via extension attributes `x-*`, as sketched below.
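
An illustrative sketch combining these options (the `x-` key is hypothetical; real extension attributes are defined by each platform):

```yaml
models:
  llm:
    model: ai/smollm2
    context_size: 4096
    runtime_flags:
      - "--no-prefill-assistant"
    # hypothetical extension attribute; actual x-* keys depend on the platform
    x-acme-tier: gpu-small
```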
## Service model binding
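
A service can reference a model with either a short or a long syntax; a sketch of both, reusing names from the examples on this page:

```yaml
services:
  app:
    image: my-app
    models:
      # short syntax: bind the model by name
      - llm

  chat-app:
    image: my-chat-app
    models:
      # long syntax: customize the injected variable names
      llm:
        endpoint_var: AI_MODEL_URL
        model_var: AI_MODEL_NAME

models:
  llm:
    model: ai/smollm2
```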
One of the key benefits of using Compose models is portability across different platforms.
### Docker Model Runner
When [Docker Model Runner is enabled](/manuals/ai/model-runner/_index.md):
```yaml
services:
  chat-app:
    image: my-chat-app
    models:
      llm:
        endpoint_var: AI_MODEL_URL
        model_var: AI_MODEL_NAME

models:
  llm:
    model: ai/smollm2
    context_size: 4096
    runtime_flags:
      - "--no-prefill-assistant"
```
Docker Model Runner will:
- Pull and run the specified model locally
- Provide endpoint URLs for accessing the model
- Inject environment variables into the service
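
If you omit `endpoint_var` and `model_var`, Compose derives default variable names from the model key; assuming the usual convention, a model named `llm` would be exposed as `LLM_URL` and `LLM_MODEL`:

```yaml
services:
  chat-app:
    image: my-chat-app
    models:
      # short syntax: assumed defaults LLM_URL and LLM_MODEL are injected
      - llm

models:
  llm:
    model: ai/smollm2
```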
#### Alternative configuration with provider services
> [!TIP]
>
> This approach is deprecated. Use the [`models` top-level element](#basic-model-definition) instead.
You can also use the `provider` service type, which allows you to declare platform capabilities required by your application.
For AI models, you can use the `model` type to declare model dependencies.
To define a model provider:
```yaml
services:
  chat:
    image: my-chat-app
    depends_on:
      - ai_runner

  ai_runner:
    provider:
      type: model
      options:
        model: ai/smollm2
        # note: provider options use hyphenated keys, unlike the
        # underscored context_size/runtime_flags of the models element
        context-size: 1024
        runtime-flags: "--no-prefill-assistant"
```
### Cloud providers
The same Compose file can run on cloud providers that support Compose models.
`content/manuals/desktop/release-notes.md`
- Docker Model Runner is now available on x86 Windows machines with NVIDIA GPUs.
- You can now [push models](/manuals/ai/model-runner.md#push-a-model-to-docker-hub) to Docker Hub with Docker Model Runner.
- Added support for Docker Model Runner's model management and chat interface in Docker Desktop for Mac and Windows (on hardware supporting Docker Model Runner). Users can now view, interact with, and manage local AI models through a new dedicated interface.
- [Docker Compose](/manuals/ai/compose/models-and-compose.md) and Testcontainers [Java](https://java.testcontainers.org/modules/docker_model_runner/) and [Go](https://golang.testcontainers.org/modules/dockermodelrunner/) now support Docker Model Runner.
- Introducing Docker Desktop in the [Microsoft App Store](https://apps.microsoft.com/detail/xp8cbj40xlbwkx?hl=en-GB&gl=GB).