A FastAPI-based service that provides text embeddings using various Sentence Transformer models. This service offers a simple API to generate embeddings for text inputs, supporting both single strings and batches.
- Multiple model support (`all-MiniLM-L6-v2`, `all-mpnet-base-v2`, `paraphrase-multilingual-MiniLM-L12-v2`)
- OpenAI-compatible API format
- Batched inference support
- Docker support
- Comprehensive test suite
- GitHub Actions CI/CD pipeline
- Token usage tracking
docker run -d --name embedding-service -p 8000:8000 ghcr.io/shaharia-lab/embedding-service:latest
# Build the image
docker build -t embedding-service .
# Run the container
docker run -p 8000:8000 embedding-service
- Create a virtual environment:
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
- Install dependencies:
pip install -r requirements-dev.txt
- Run the server:
uvicorn app.main:app --reload
Once running, the API will be available at http://localhost:8000. You can visit http://localhost:8000/docs for interactive API documentation.
curl -X POST http://localhost:8000/v1/embeddings \
-H "Content-Type: application/json" \
-d '{
"input": "Hello world",
"model": "all-MiniLM-L6-v2"
}'
import requests
url = "http://localhost:8000/v1/embeddings"
payload = {
"input": "Hello world",
"model": "all-MiniLM-L6-v2" # optional, defaults to all-MiniLM-L6-v2
}
headers = {"Content-Type": "application/json"}
response = requests.post(url, json=payload)
result = response.json()  # full OpenAI-compatible response, not just the raw vectors
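`response.json()` returns the whole OpenAI-compatible response object, so the embedding vector and token counts are read out of it. A minimal sketch, assuming the standard OpenAI embeddings response shape (`data`, `usage`):
vector = result["data"][0]["embedding"]  # list of floats for "Hello world"
print(len(vector))                       # embedding dimensionality
print(result.get("usage"))               # token usage tracking (e.g. prompt_tokens)
For batch processing, pass a list of strings as the input: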
payload = {
"input": ["Hello world", "Another text"],
"model": "all-MiniLM-L6-v2"
}
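Each input string gets its own embedding object in the response. Assuming the same OpenAI-compatible shape, the results can be read back in order (a sketch):
response = requests.post(url, json=payload, headers=headers)
batch_result = response.json()
for item in batch_result["data"]:  # one entry per input, ordered by "index"
    print(item["index"], len(item["embedding"]))
The service also works with the official OpenAI SDKs. For example, in JavaScript/TypeScript: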
import OpenAI from "openai";
const openai = new OpenAI({
apiKey: "",
baseURL: "http://localhost:8000/v1"
});
const embedding = await openai.embeddings.create({
model: "all-MiniLM-L6-v2",
input: "Your text string goes here",
encoding_format: "float",
});
console.log(embedding);
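The same works from Python with the official openai client (v1+) by pointing base_url at the service. A minimal sketch, assuming the openai package is installed; the placeholder API key mirrors the empty apiKey in the JavaScript example above:
from openai import OpenAI

client = OpenAI(api_key="unused", base_url="http://localhost:8000/v1")  # placeholder key, not validated here

resp = client.embeddings.create(
    model="all-MiniLM-L6-v2",
    input="Your text string goes here",
)
print(resp.data[0].embedding[:5])  # first few dimensions of the vector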
Once the server is running, you can access:
- Interactive API documentation: http://localhost:8000/docs
- OpenAPI schema: http://localhost:8000/openapi.json
# Run tests
pytest app/tests -v
# Run tests with coverage
pytest app/tests -v --cov=app --cov-report=term-missing
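New tests can follow the existing suite's pattern by exercising the endpoint through FastAPI's TestClient. A minimal sketch, assuming app.main exposes the app instance (as the uvicorn app.main:app command above implies):
from fastapi.testclient import TestClient
from app.main import app

client = TestClient(app)

def test_embeddings_endpoint():
    resp = client.post(
        "/v1/embeddings",
        json={"input": "Hello world", "model": "all-MiniLM-L6-v2"},
    )
    assert resp.status_code == 200
    # OpenAI-compatible responses return embeddings under "data"
    assert resp.json()["data"][0]["embedding"]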
embedding-service/
├── app/
│ ├── __init__.py
│ ├── main.py
│ └── tests/
│ ├── __init__.py
│ └── test_main.py
├── requirements.txt
├── requirements-dev.txt
├── Dockerfile
└── README.md
The project uses GitHub Actions for:
- Running tests on pull requests and pushes to main
- Building and publishing Docker images on releases
- Automated testing and validation
MIT License
- Fork the repository
- Create your feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add some amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request