-
Notifications
You must be signed in to change notification settings - Fork 2.1k
fix: Make sure AI docs are up-to-date and do some cleanup #13418
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
2 commits
Select commit
Hold shift + click to select a range
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -22,6 +22,14 @@ Currently implemented tools: | |
|
||
* Example prompt: *"List the latest messages in my conversation with Andrew"* | ||
|
||
* Send a message to a talk conversation | ||
|
||
* Example prompt: *"Can you send a joke to Andrew in talk?"* | ||
|
||
* Create a public talk conversation | ||
|
||
* Example prompt: *"Can you create a new public talk conversation titled 'Press conference'?"* | ||
|
||
* Find a person in the user's contacts | ||
|
||
* Example prompt: *"What is Andrew's Email address?"* | ||
|
@@ -34,21 +42,39 @@ Currently implemented tools: | |
|
||
* Example prompt: *"What is the company's sick leave process?"* | ||
|
||
* Transcribe a media file | ||
|
||
* Example prompt: *"Can you transcribe the following file? https://mycloud.com/f/9825679"* (Can be selected via smart picker.) | ||
|
||
* Generate documents | ||
|
||
* Example prompt: *"Can you generate me a slide deck for my presentation about cats?"* | ||
* Example prompt: *"Can you generate me a spreadsheet with some plausible numbers for countries and their population count?"* | ||
* Example prompt: *"Can you generate me a pdf with an outline about what to see in Berlin?"* | ||
|
||
* Generate images | ||
|
||
* Example prompt: *"Can you generate me an image of a cartoon drawing of a roman soldier typing something on a laptop?"* | ||
|
||
* Get coordinates for an Address from Open Street Maps Nomatim | ||
|
||
* Example prompt: *"List my calendars"* | ||
* Example prompt: *"What are the coordinates for Berlin, Germany?"* | ||
|
||
* Get the URL for a map of a location using Open Street Maps | ||
|
||
* Example prompt: *"Can you show me a map of New York, please"* | ||
|
||
* Get the current weather at a location | ||
|
||
* Example prompt: *"How is the weather in Berlin?"* | ||
|
||
* Schedule an event in the user's calendar | ||
|
||
* Example prompt: *"Make schedule an event with Andrew tomorrow at noon."* | ||
* Example prompt: *"Schedule an event with Andrew tomorrow at noon."* | ||
|
||
* Send a message to a talk conversation | ||
* Find free times in users' calendar | ||
|
||
* Example prompt: *"Can you send a joke to Andrew in talk?"* | ||
* Example prompt: *"Find a free 1-hour slot for a meeting with me and Marco next week."* | ||
|
||
* Create a deck card | ||
|
||
|
@@ -66,10 +92,42 @@ Currently implemented tools: | |
|
||
* Example prompt: *"Show me the youtube video of the Nextcloud hub 10 launch."* | ||
|
||
* Search Duckduckgo | ||
|
||
* Example prompt: *"Show me search results for quick pasta recipes, please."* | ||
|
||
* Send an email via Nextcloud Mail | ||
|
||
* Example prompt *"Send a test email from [email protected] to [email protected] from my account with id 12"* | ||
* (The account ID will soon be irrelevant) | ||
* Example prompt: *"Send a test email from my [email protected] account to [email protected]"* | ||
|
||
* Get contents of a file | ||
|
||
* Example prompt: *"Can you summarize the following file in my documents? Design/Planning.md"* | ||
|
||
* Generate a public share link for a file | ||
|
||
* Example prompt: *"Can create a share link for the following file in my documents? Design/Planning.md"* | ||
|
||
* Get the folder tree of the user's files | ||
|
||
* Example prompt: *"Can you show me the folder tree of my files?"* | ||
|
||
* Determine public transport routes | ||
|
||
* Example prompt: *"How can I get from Würzburg Hauptbahnhof to Berlin Hauptbahnhof?"* | ||
|
||
* List all projects in OpenProject | ||
|
||
* Example prompt: *"List all my projects in OpenProject, please"* | ||
|
||
* List all available assignees of a project in OpenProject | ||
|
||
* Example prompt: *"List all available assignees for the 'Product launch' project in OpenProject"* | ||
|
||
* Create a new work package in a given project in OpenProject | ||
|
||
* Example prompt: *"Create a work package called 'Publish release video' in the 'Product launch' project in OpenProject"* | ||
|
||
|
||
These tools can also be combined by the agent to fulfil tasks like the following: | ||
|
||
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,69 @@ | ||
============================================== | ||
App: Local Text-To-Speech (text2speech_kokoro) | ||
============================================== | ||
|
||
.. _ai-app-text2speech_kokoro: | ||
|
||
The *text2speech_kokoro* app is one of the apps that provide Text-To-Speech functionality in Nextcloud and act as a speech generation backend for the :ref:`Nextcloud Assistant app<ai-app-assistant>` and :ref:`other apps making use of the core `Text-To-Speech Task type<t2s-consumer-apps>`. The *text2speech_kokoro* app specifically runs only open source models and does so entirely on-premises. Nextcloud can provide customer support upon request, please talk to your account manager for the possibilities. | ||
|
||
This app uses `Kokoro <https://github.com/hexgrad/kokoro>`_ under the hood. | ||
|
||
The used model supports the following languages: | ||
|
||
* American English | ||
* British English | ||
* Spanish | ||
* French | ||
* Italian | ||
* Hindi | ||
* Portuguese | ||
* Japanese | ||
* Mandarin | ||
|
||
Requirements | ||
------------ | ||
|
||
* Minimal Nextcloud version: 31 | ||
* This app is built as an External App and thus depends on AppAPI v2.3.0 | ||
* Nextcloud AIO is supported | ||
* We currently support x86_64 CPUs | ||
* We do not support GPUs | ||
|
||
* CPU Sizing | ||
|
||
* The more cores you have and the more powerful the CPU the better, we recommend around 10 cores | ||
* The app will hog all cores by default, so it is usually better to run it on a separate machine | ||
* 800MB RAM | ||
|
||
Installation | ||
------------ | ||
|
||
0. Make sure the :ref:`Nextcloud Assistant app<ai-app-assistant>` is installed | ||
1. :ref:`Install AppAPI and setup a Deploy Demon<ai-app_api>` | ||
2. Install the *text2speech_kokoro* "Local Text-To-Speech" ExApp via the "Apps" page in the Nextcloud web admin user interface | ||
|
||
|
||
Scaling | ||
------- | ||
|
||
It is currently not possible to scale this app, we are working on this. Based on our calculations an instance has a rough capacity of 4h of transcription throughput per minute (measured with 8 CPU threads on an Intel(R) Xeon(R) Gold 6226R). It is unclear how close to real-world usage this number is, so we do appreciate real-world feedback on this. | ||
|
||
App store | ||
--------- | ||
|
||
You can also find this app in our app store, where you can write a review: `<https://apps.nextcloud.com/apps/text2speech_kokoro>`_ | ||
|
||
Repository | ||
---------- | ||
|
||
You can find the app's code repository on GitHub where you can report bugs and contribute fixes and features: `<https://github.com/nextcloud/text2speech_kokoro>`_ | ||
|
||
Nextcloud customers should file bugs directly with our customer support. | ||
|
||
Known Limitations | ||
----------------- | ||
|
||
* We currently only support languages supported by the underlying Kokoro model | ||
* The Kokoro models perform unevenly across languages, and may show lower accuracy on low-resource and/or low-discoverability languages or languages where there was less training data available. | ||
* Make sure to test the language model you are using it for whether it meets the use-case's quality requirements | ||
* Customer support is available upon request, however we can't solve false or problematic output, most performance issues, or other problems caused by the underlying model. Support is thus limited only to bugs directly caused by the implementation of the app (connectors, API, front-end, AppAPI) |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Both are now available in llm2