You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+3-1Lines changed: 3 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -220,7 +220,7 @@ The embeddings returned are like below:
220
220
221
221
#### `/v1/create/rag` endpoint
222
222
223
-
`/v1/create/rag` endpoint provides users a one-click way to convert a text or markdown file to embeddings directly. The effect of the endpoint is equivalent to running `/v1/files` + `/v1/chunks` + `/v1/embeddings` sequently.
223
+
`/v1/create/rag` endpoint provides users a one-click way to convert a text or markdown file to embeddings directly. The effect of the endpoint is equivalent to running `/v1/files` + `/v1/chunks` + `/v1/embeddings` sequently. Note that the `--chunk-capacity` CLI option is required for the endpoint. The default value of the option is `100`. You can set it to different values while starting LlamaEdge-RAG API server.
224
224
225
225
<details> <summary> Example </summary>
226
226
@@ -372,6 +372,8 @@ To check the CLI options of the `rag-api-server` wasm app, you can run the follo
372
372
Max number of retrieved result [default: 3]
373
373
--qdrant-score-threshold <QDRANT_SCORE_THRESHOLD>
374
374
Minimal score threshold for the search result [default: 0.4]
375
+
--chunk-capacity <CHUNK_CAPACITY>
376
+
Maximum number of tokens each chunk contains [default: 100]
0 commit comments