Make sure you have installed the NVIDIA Container Toolkit:
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html
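The install guide covers the package repository setup in detail; on Ubuntu, the final steps look roughly like this (a sketch based on that guide, verify the exact commands there):

```shell
# Install the toolkit (after adding NVIDIA's apt repository per the guide)
sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit

# Let the toolkit configure Docker's runtime, then restart Docker
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
```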
Check that everything is configured:
sudo docker run --gpus all nvidia/cuda:12.8.0-base-ubuntu22.04 nvidia-smi

Troubleshooting:
If the check above fails with this message:
Failed to initialize NVML: Unknown Error
On Ubuntu 22.04, after following the steps above, I still needed to change a setting in /etc/nvidia-container-runtime/config.toml.
Change no-cgroups to false
no-cgroups = false
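The edit above can also be scripted; this is just a sketch, check the file manually before and after:

```shell
# Flip no-cgroups from true to false in the runtime config
sudo sed -i 's/^no-cgroups = true/no-cgroups = false/' \
  /etc/nvidia-container-runtime/config.toml

# Restart Docker so the change takes effect
sudo systemctl restart docker
```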
Open a terminal and run the following command to start the Ollama container:
cd ~/workspace/ai/ollama
docker run -it --rm --gpus=all \
--name ollama \
-v ./data:/root/.ollama \
-v ./shared:/root/shared \
-p 11434:11434 \
ollama/ollama

Now you can run Ollama commands inside the container. For example, to list downloaded models:
docker exec -it ollama ollama list

To run a model, use the following command:
docker exec -it ollama ollama run llama3

If the model is not downloaded yet, Ollama will automatically download it for you.
See the catalog: https://ollama.com/search
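Besides the interactive CLI, the container also exposes Ollama's REST API on the published port 11434, so prompts can be scripted with curl (the example assumes llama3 is already downloaded):

```shell
# Quick health check: returns the server version as JSON
curl http://localhost:11434/api/version

# One-shot generation ("stream": false returns a single JSON response)
curl http://localhost:11434/api/generate -d '{
  "model": "llama3",
  "prompt": "Why is the sky blue?",
  "stream": false
}'
```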
You can set a system prompt for the model using the following command:
/set system "You are a helpful assistant."

To prompt a model with a file, use the following command:
docker exec -it ollama ollama run llava:7b \
'What is the Motor Nr of this image: ' \
< ./shared/motor-info.png

nvidia-smi is a CLI tool reporting GPU use, VRAM, temperature, power draw, and memory.
Real-time refresh with:
nvidia-smi --loop=1

Check https://github.com/open-webui/open-webui for more information on how to run the web UI.
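As a sketch, Open WebUI can be started alongside the Ollama container above with something like the following (the image name, ports, and OLLAMA_BASE_URL variable are taken from that repository's README; double-check there for the current command):

```shell
# Run Open WebUI on http://localhost:3000, pointing it at the
# Ollama API published on the host at port 11434
docker run -d -p 3000:8080 \
  --add-host=host.docker.internal:host-gateway \
  -e OLLAMA_BASE_URL=http://host.docker.internal:11434 \
  -v open-webui:/app/backend/data \
  --name open-webui \
  ghcr.io/open-webui/open-webui:main
```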