Releases: LlamaEdge/rag-api-server

LlamaEdge-RAG 0.3.8

28 Apr 07:42

Major change:

  • Update the chat-prompts dependency to 0.6.2

LlamaEdge-RAG 0.3.7

26 Apr 12:58

Major change:

  • Post-process the generation of the phi-3-chat model in non-stream mode.

LlamaEdge-RAG 0.3.6

24 Apr 10:20

Major changes:

  • Post-process the generation of the llama-2-chat and llama-3-chat models in non-stream mode.

LlamaEdge-RAG 0.3.5

24 Apr 10:18

LlamaEdge-RAG 0.3.4

22 Apr 17:13

Major changes:

  • Support models without a system prompt in RAG

LlamaEdge-RAG 0.3.3

19 Apr 05:00

LlamaEdge-RAG 0.3.2

12 Apr 11:34

Major changes:

  • Adapt to llama-core v0.7.1
  • Improve the log messages

LlamaEdge-RAG 0.3.1

11 Apr 13:06

LlamaEdge-RAG 0.3.0

10 Apr 06:54

Major changes:

  • (BREAKING) Update CLI options

    • Keep RAG-related CLI options
    • Replace --system-prompt with --rag-prompt option
    • Add --chunk-capacity option
  • Adapt to llama-core v0.8.0
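
For launch scripts written before 0.3.0, the option rename above could be applied mechanically. The following is a minimal sketch, not part of the project: `migrate_args` is a hypothetical helper, and it assumes the old prompt text is still suitable as a value for --rag-prompt and that options may appear either as `--flag value` or `--flag=value`. Neither assumption is confirmed by these notes.

```python
OLD_FLAG = "--system-prompt"
NEW_FLAG = "--rag-prompt"


def migrate_args(argv):
    """Return a copy of argv with --system-prompt renamed to --rag-prompt.

    Hypothetical helper for illustration only. Values are passed through
    unchanged; whether the old prompt text suits the new option is an
    assumption to verify against the project's README.
    """
    migrated = []
    for arg in argv:
        if arg == OLD_FLAG:
            # Space-separated form: the value is the next list element.
            migrated.append(NEW_FLAG)
        elif arg.startswith(OLD_FLAG + "="):
            # Assumed `--flag=value` form.
            migrated.append(NEW_FLAG + arg[len(OLD_FLAG):])
        else:
            migrated.append(arg)
    return migrated
```

All other arguments, including the retained RAG-related options, pass through untouched.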

LlamaEdge-RAG 0.2.0

07 Apr 13:07

Major changes:

  • Rename /v1/rag/embeddings to /v1/create/rag
  • Remove /v1/rag/query
  • Update README