Feat/api embeddings #263

voorhs · 2025-09-03T20:19:30Z

No description provided.

voorhs · 2025-09-09T07:21:18Z

I've grown somewhat less happy with how dump and load are implemented for Embedder and VectorIndex, and with their configs in general. As far as I can tell, this design makes it impossible to specify the config from a YAML file.

I'll fix this in a separate PR.

Samoed · 2025-09-10T07:00:00Z

src/autointent/_wrappers/embedder/embedder.py

+        """
+        return self._backend.get_hash()
+
+    def train(self, utterances: list[str], labels: ListOfLabels, config: EmbedderFineTuningConfig) -> None:


Maybe add train to BaseEmbeddingBackend and call that?

I don't think it can be made any cleaner.

It lives here because EmbedderFineTuningConfig contains parameters specific to sentence-transformers.

In that case I'd suggest simply renaming train -> train_st and noting in the docstring that this method only works with SentenceTransformersEmbeddingBackend.

Either way, the embedding fine-tuning feature isn't integrated into our EmbeddingNode yet.
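A minimal sketch of the rename suggested in this thread, assuming simplified stand-ins for the real classes. The class bodies, the `OpenaiEmbeddingBackend` name, and the plain `dict` used as a config are hypothetical and only illustrate the backend guard; this is not the PR's implementation.

```python
from __future__ import annotations


class BaseEmbeddingBackend:
    """Stand-in for the PR's base class; the real interface is not shown here."""


class SentenceTransformersEmbeddingBackend(BaseEmbeddingBackend):
    """Stand-in backend that records training calls instead of fine-tuning."""

    def __init__(self) -> None:
        self.train_calls: list[tuple[list[str], list[int], dict]] = []

    def train(self, utterances: list[str], labels: list[int], config: dict) -> None:
        # Hypothetical: the real backend would fine-tune the model here.
        self.train_calls.append((utterances, labels, config))


class OpenaiEmbeddingBackend(BaseEmbeddingBackend):
    """Hypothetical API-based backend without fine-tuning support."""


class Embedder:
    def __init__(self, backend: BaseEmbeddingBackend) -> None:
        self._backend = backend

    def train_st(self, utterances: list[str], labels: list[int], config: dict) -> None:
        """Fine-tune the embedding model.

        Works only with SentenceTransformersEmbeddingBackend.
        """
        if not isinstance(self._backend, SentenceTransformersEmbeddingBackend):
            msg = "train_st requires SentenceTransformersEmbeddingBackend"
            raise TypeError(msg)
        self._backend.train(utterances, labels, config)
```

Failing early with a `TypeError` keeps the sentence-transformers-specific config out of the base class while still making misuse with an API backend obvious.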

Samoed · 2025-09-10T07:10:37Z

src/autointent/_wrappers/embedder/openai.py

+            batch = utterances[i : i + self.config.batch_size]
+
+            # Prepare API call parameters
+            kwargs: EmbeddingsCreateKwargs = {


We could add truncate_prompt_tokens. I don't know whether OpenAI has such an option (as far as I remember, their tokenization handling is rather messy overall).

I'm not really familiar with what truncate_prompt_tokens is.

Could you explain it in more detail later, or open an issue?

vLLM has a truncate_prompt_tokens parameter that truncates the input to a given number of tokens; if you send more than the model's max_tokens, you get an error. If you send more than 8191 tokens to OpenAI, you likewise just get an error.
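To make the idea concrete, here is a rough sketch of client-side truncation by token count, applied before a request is sent. The function name and the `encode`/`decode` callables are hypothetical; a real implementation would need the model's own tokenizer, and the whitespace tokenizer in the usage note is only a stand-in. This version keeps the first `max_tokens` tokens; which side to cut from is a design choice and is not meant to mirror any particular server's behavior.

```python
from __future__ import annotations

from typing import Callable, Sequence


def truncate_to_max_tokens(
    text: str,
    max_tokens: int,
    encode: Callable[[str], Sequence],
    decode: Callable[[Sequence], str],
) -> str:
    """Return text cut down to at most max_tokens tokens.

    encode/decode are supplied by the caller (hypothetical interface).
    """
    if max_tokens <= 0:
        msg = "max_tokens must be positive"
        raise ValueError(msg)
    tokens = encode(text)
    if len(tokens) <= max_tokens:
        # Short inputs pass through untouched.
        return text
    # Keep the leading tokens and drop the rest.
    return decode(tokens[:max_tokens])
```

With a whitespace stand-in tokenizer, `truncate_to_max_tokens("a b c d", 2, str.split, " ".join)` keeps only the first two words.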

Samoed · 2025-09-10T07:12:03Z

src/autointent/_wrappers/embedder/openai.py

+        kwargs: EmbeddingsCreateKwargs = {
+            "input": batch,
+            "model": self.config.model_name,
+        }
+        if self.config.dimensions is not None:
+            kwargs["dimensions"] = self.config.dimensions


This can be extracted into a function.

Into the arguments of the _process_embeddings_sync function?

The construction of kwargs can be extracted into a function, since it is duplicated across functions.
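One way the extraction could look, as a sketch only: the `TypedDict` below is a stand-in for the PR's `EmbeddingsCreateKwargs` (its real fields are assumed from the diff), and `build_embeddings_kwargs` is a hypothetical helper name.

```python
from __future__ import annotations

from typing import TypedDict


class EmbeddingsCreateKwargs(TypedDict, total=False):
    """Stand-in for the PR's type of the same name; fields assumed from the diff."""

    input: list[str]
    model: str
    dimensions: int


def build_embeddings_kwargs(
    batch: list[str],
    model_name: str,
    dimensions: int | None = None,
) -> EmbeddingsCreateKwargs:
    """Build the request parameters once so every call site can share them."""
    kwargs: EmbeddingsCreateKwargs = {"input": batch, "model": model_name}
    # Only send dimensions when it is explicitly configured.
    if dimensions is not None:
        kwargs["dimensions"] = dimensions
    return kwargs
```

Each call site would then reduce to a single call such as `build_embeddings_kwargs(batch, self.config.model_name, self.config.dimensions)`.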

src/autointent/configs/_embedder.py

src/autointent/_wrappers/embedder/openai.py

src/autointent/_wrappers/embedder/sentence_transformers.py

voorhs and others added 25 commits September 3, 2025 19:48

add mypy pydantic plugin settings

642b803

implement base class and interface class

20e155a

refactor embedder config

db9d0f1

add sentence transformer embedding backend

29981f6

add openai embedding backend

89b313d

re-refactor embedder configs

c5b08e1

re-refactor dump/load

1bc59b8

add proper dump/load to Embedder

1091044

handle default embedder config usage

7176660

fix some typing errors

a0bb255

fix a couple more

266b45c

fix some more typing errors

2cd16bb

one more error

b0a48b6

is it all?

3160621

Update optimizer_config.schema.json

913987b

bug fix

1b4a3ee

fix some tests

51f6d0e

temporary way to fix tests

f911ab1

refactor embedder tests

489c182

fix some tests

67872c2

Update optimizer_config.schema.json

cb24d83

try to fix dynamic schema issues

7c20546

Update optimizer_config.schema.json

5a6917d

upd vector index tests

6b80fb2

upd inference test

1534056

voorhs marked this pull request as ready for review September 9, 2025 07:19

voorhs requested a review from Samoed September 9, 2025 07:19

upd tutorials

5c3f81a

Samoed reviewed Sep 10, 2025

voorhs mentioned this pull request Sep 20, 2025

Design of dump and load in Embedder and VectorIndex #265

Open

voorhs added 6 commits September 20, 2025 13:35

ignore ds store

813c179

set similarity_fn default to None

5ce3e5d

upd callback test

d5442dc

remove unnecessary import

7263359

run code formatter

4ecb061

remove unnecessary import

a4217da

voorhs requested a review from Samoed September 20, 2025 11:17

voorhs added 4 commits September 21, 2025 11:35

add openai base url option

181aac9

remove openai api key everywhere for security reasons

77f0287

ignore extra envs in mcp server

88b339a

add typed marker

b9e4994
