Restructuring #2

ultronozm · 2025-03-22T13:26:24Z

No description provided.

skissue

Thanks so much! One other nitpick—would you be willing to edit the commit messages to adhere to the format of the previous commits (based on Conventional Commits)?

llm-tool-collection.el

ultronozm · 2025-03-22T14:05:51Z

These all sound good and I’ll implement them later, although for the commit messages, I’d have a slight preference for adopting the GNU ChangeLog style, mainly out of laziness - that style is what Emacs and nearby packages (gptel, …) use, and the built-in support for drafting such messages is good (log-edit-generate-changelog-from-diff, etc), while I wouldn’t know how to be comparably lazy with the conventions you mention - do you?

…

On Sat, 22 Mar 2025 at 14:41, Ad ***@***.***> wrote: ***@***.**** commented on this pull request. Thanks so much! One other nitpick—would you be willing to edit the commit messages to adhere to the format of the previous commits (based on Conventional Commits <https://www.conventionalcommits.org/>)? ------------------------------ In llm-tool-collection.el <#2 (comment)> : > - ***@***.***) - (push ',sym llm-tool-collection--all-tools)))) - +ARGS is a list where each element is of the form + + (ARGNAME :type TYPE :description \"DESCRIPTION\"). + +Arguments after the special symbol `&optional' are marked with +`:optional t`. + +BODY contains the function body. + +This macro creates a constant with the tool's specs and defines a +function under `llm-tc/NAME' whose docstring is the value of the spec +`:description'. The tool is then added to +`llm-tool-collection--all-tools'." If this is in the docstring, should we make it into a public variable (as in, llm-tool-collection-all-tools)? I originally intended for this to be an internal variable only (for llm-tool-collection-get-all), but I could see how a list of the symbols could be useful too. ------------------------------ In llm-tool-collection.el <#2 (comment)> : > -(defmacro llm-tool-collection-deftool (name specs arguments &rest body) +(defvar llm-tool-collection-after-tool-define-hook nil The Emacs Lisp manual recommends <https://www.gnu.org/software/emacs/manual/html_node/emacs/Hooks.html> that we use -functions instead of -hook for "abnormal" hooks (those that take arguments). ------------------------------ In llm-tool-collection.el <#2 (comment)> : > + (setq arg-syms (reverse arg-syms) + arg-specs (reverse arg-specs)) + (let* ((sym (llm-tool-collection--name-to-symbol name)) + (name-spec (unless (plist-get specs :name) + `(:name ,(llm-tool-collection--make-llm-name name))))) + `(progn + (defconst ,sym + ***@***.*** + ***@***.*** + :args ,arg-specs + :function ,sym)) + (defun ,sym ,arg-syms + ,(concat (plist-get specs :description) "\n\n" + "Definition generated by `llm-tool-collection'.") + ***@***.***) + (add-to-list 'llm-tool-collection--all-tools ',sym) The docstring for add-to-list states: This is meant to be used for adding elements to configuration variables, such as adding a directory to a path variable like load-path, but please do not abuse it to construct arbitrary lists in Elisp code, where using push or cl-pushnew will get you more efficient code. Thus, would it be better to use cl-pushnew instead? I know you explicitly wanted to avoid the dependency on cl-lib, but I don't think it's a big deal, given that it would be a compile-time dependency. However, I'm willing to defer to popular convention here—if add-to-list is commonly used in packages, then we can follow others' example. I suppose this is also a minor, small use, so perhaps I'm overreacting :). — Reply to this email directly, view it on GitHub <#2 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APC5ZXOHZ5OONXQV7TWTH5L2VVSAFAVCNFSM6AAAAABZR4LCBWVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDOMBYGAZTCOJTHE> . You are receiving this because you authored the thread.Message ID: ***@***.***>

skissue · 2025-03-22T14:40:34Z

for the commit messages, I’d have a slight preference for adopting the GNU ChangeLog style

To be completely honest, I didn't know this was a thing! I've been using Conventional Commits purely out of habit and experience—I'll definitely take a look at the GNU style when I have a moment.

* llm-tool-collection.el (llm-tool-collection-deftool): Redefine macro to accept a simpler argument format and convert it internally to the required specs. Add edebug instrumentation. (read-file, list-directory, create-file, create-directory): Update all tool definitions to use the new argument specification format.

* llm-tool-collection.el (llm-tool-collection-deftool): Add support for arguments after &optional to be marked with :optional t in argument specs. (view-buffer): Add new tool for viewing buffer contents with optional offset and limit parameters.

* llm-tool-collection.el (llm-tool-collection-deftool): Use llm-tool-collection--make-llm-name when generating parameter names.

* llm-tool-collection.el: Add Imenu support for LLM tools defined with `llm-tool-collection-deftool'.

* llm-tool-collection.el (llm-tool-collection-deftool): Replace push with cl-pushnew when registering new tools in llm-tool-collection--all-tools, to avoid duplication.

* llm-tool-collection.el (llm-tool-collection--name-to-symbol) (llm-tool-collection--make-llm-name): Move these helper functions inside an eval-and-compile form to make them available during macro expansion. (llm-tool-collection-deftool): Fix variable name conflict by renaming 'name' variable to 'name-spec' for clarity. (llm-tool-collection-get-category, llm-tool-collection-get-all): Add autoload cookies. Escape apostrophes in usage examples. (view-buffer): Reformat arguments to avoid long lines.

* llm-tool-collection.el (llm-tool-collection-deftool): Use unquoted function symbol in the :function property of the generated tool spec instead of a quoted function value.

ultronozm · 2025-03-22T16:49:23Z

for the commit messages, I’d have a slight preference for adopting the GNU ChangeLog style

To be completely honest, I didn't know this was a thing! I've been using Conventional Commits purely out of habit and experience—I'll definitely take a look at the GNU style when I have a moment.

Very good. These are described in the CONTRIBUTE file in my Emacs installation, and maybe also yours. I'll emphasize that I'm just "following the herd" here and have no strong opinions on commit conventions (besides wanting to be lazy and fit in).

* llm-tool-collection.el (llm-tool-collection-post-define-functions): New variable to hold functions run after tool definition. (llm-tool-collection-deftool): Call it, passing the tool's plist definition as argument.

ultronozm · 2025-03-22T18:02:33Z

I made one further tweak just now, converting category strings to symbols. The rationale is that they're enum-like (belonging to some finite collection of values rather than being general strings), so it's more idiomatic (and more efficient, I suppose) to treat them as symbols rather than strings. I'll be happy to revise if you feel otherwise

ultronozm · 2025-03-22T18:47:30Z

OK, one more little tweak (and sorry for the spam). Since :description is an essentially mandatory field for functions and args, it seemed natural to drop the :description key and instead place its value immediately after the symbol name. This makes it look more like an actual docstring...

skissue · 2025-03-22T22:37:57Z

I made one further tweak just now, converting category strings to symbols. The rationale is that they're enum-like (belonging to some finite collection of values rather than being general strings), so it's more idiomatic (and more efficient, I suppose) to treat them as symbols rather than strings. I'll be happy to revise if you feel otherwise

I agree that symbols would be more appropriate. However, the reason that it is a string is because :category is actually a recognized keyword by gptel, and gptel takes it as a string. Thus, our hand is forced here. I am planning to add support for "tags" as an organization scheme as well; since that will be arbitrary, I will likely make those symbols.

OK, one more little tweak (and sorry for the spam). Since :description is an essentially mandatory field for functions and args, it seemed natural to drop the :description key and instead place its value immediately after the symbol name. This makes it look more like an actual docstring...

No worries—I agree, these are good changes!

* llm-tool-collection.el (llm-tool-collection-deftool): Accept a separate docstring parameter as second argument, for both functions and their args. (read-file, list-directory, create-file, create-directory) (view-buffer): Update all tool definitions to use the new format.

ultronozm · 2025-03-23T03:29:04Z

I agree that symbols would be more appropriate. However, the reason that it is a string is because :category is actually a recognized keyword by gptel, and gptel takes it as a string. Thus, our hand is forced here. I am planning to add support for "tags" as an organization scheme as well; since that will be arbitrary, I will likely make those symbols.

OK, I see, thanks for clarifying. I've switched it back to strings.

I've checked that the changes needed in gptel to work with symbols (or both) are quite straightforward, so if you'd like, I could submit a PR there and see how karthink feels about it. I'd also be happy to leave things as they are, if you prefer.

One point concerning optional arguments might be worth discussing. With the LLM's, I think any of the arguments may be marked as required or optional? With Emacs, the required must come before the optional. I updated the docstring of llm-tool-collection-deftool to hopefully clarify that point a bit.

ultronozm · 2025-03-23T04:48:51Z

One other thought that came to mind is whether it would make sense to include commands in llm-tool-collection that make it easier to work on tools by having frontends update as soon as the deftool form is evaluated.

For instance, the following code should do the trick with gptel:

(defun llm-tool-collection-register-with-gptel (tool-spec)
  "Register a tool defined by TOOL-SPEC with gptel.
TOOL-SPEC is a plist that can be passed to `gptel-make-tool'."
  (when (featurep 'gptel)
    (declare-function gptel-make-tool "gptel")
    (declare-function gptel-tool-name "gptel")
    (defvar gptel-tools)
    (let ((tool (apply #'gptel-make-tool tool-spec)))
      (setq gptel-tools
            (cons tool (seq-remove
                        (lambda (existing)
                          (string= (gptel-tool-name existing)
                                   (gptel-tool-name tool)))
                        gptel-tools))))))

(add-hook 'llm-tool-collection-post-define-functions
          #'llm-tool-collection-register-with-gptel)

It's not clear to me whether such code belongs in the README, in some docstring, as part of llm-tool-collection, as part of gptel, or wherever else, but figured I'd share it for the sake of discussion.

For my personal package ai-org-chat.el, the corresponding code is

(add-hook 'llm-tool-collection-post-define-functions #'ai-org-chat-register-tool-spec)

karthink · 2025-03-23T23:50:16Z

I agree that symbols would be more appropriate. However, the reason that it is a string is because :category is actually a recognized keyword by gptel, and gptel takes it as a string. Thus, our hand is forced here. I am planning to add support for "tags" as an organization scheme as well; since that will be arbitrary, I will likely make those symbols.

OK, I see, thanks for clarifying. I've switched it back to strings.

I've checked that the changes needed in gptel to work with symbols (or both) are quite straightforward, so if you'd like, I could submit a PR there and see how karthink feels about it. I'd also be happy to leave things as they are, if you prefer.

I made the category a string to give the user freedom to name tool groups however they wanted. While I expect that a few standard categories will arise over time, users can also have category names like

"retrieval (local, text)"
"retrieval (Roam db + Pocket)"
"retrieval (web sources, jira)"
"slack and irc"
"filesystem, read-only"
"filesystem, read-write"
"eval (watch out!)"

Users have different ways to tag, and not all of these can be represented as symbols. So I wasn't thinking of it as an enum.

I agree that tools contributed here should be categorized using some semi-standard set (like an enum).

karthink · 2025-03-23T23:55:44Z

It's not clear to me whether such code belongs in the README, in some docstring, as part of llm-tool-collection, as part of gptel, or wherever else, but figured I'd share it for the sake of discussion.

It can be part of gptel if required. Unlike the more commercial offerings, we're developing this suite of functionality split across several projects (gptel, llm, llm-tool-collection, mcp) so I'm in favor of providing glue code in all these packages to make it easy for the user to integrate them.

karthink · 2025-03-23T23:58:01Z

OK, one more little tweak (and sorry for the spam). Since :description is an essentially mandatory field for functions and args, it seemed natural to drop the :description key and instead place its value immediately after the symbol name. This makes it look more like an actual docstring...

See also karthink/gptel#685

As an aside, for elisp tools it usually makes sense to have a function docstring that's different from the description intended for the LLM. The latter focuses on different things. For example, it can contain instructions for the LLM on which tool (or set of tools) should be called next depending on the result of this one. This helps guide chained tool use with LLMs, as in this video and the previous one in this series.

* gptel.el (gptel-register-tool): New function to register a tool with gptel, replacing any existing tool with the same name. Following skissue/llm-tool-collection#2 (comment)

ultronozm · 2025-03-24T08:46:38Z

As an aside, for elisp tools it usually makes sense to have a function docstring that's different from the description intended for the LLM. The latter focuses on different things. For example, it can contain instructions for the LLM on which tool (or set of tools) should be called next depending on the result of this one. This helps guide chained tool use with LLMs, as in this video and the previous one in this series.

Thanks for this and your other comments. That's a good point about the different purposes of the docstrings, so I guess it'd make sense to add a further argument specifying the docstring. It's not clear to me what would be the best ergonomics for this, so I'll leave it for now.

skissue · 2025-03-24T16:14:39Z

One point concerning optional arguments might be worth discussing. With the LLM's, I think any of the arguments may be marked as required or optional? With Emacs, the required must come before the optional. I updated the docstring of llm-tool-collection-deftool to hopefully clarify that point a bit.

That's a good point. I don't think it should be an issue—is there ever a scenario where required and optional arguments need to be intermixed?

One other thought that came to mind is whether it would make sense to include commands in llm-tool-collection that make it easier to work on tools by having frontends update as soon as the deftool form is evaluated.

It's not clear to me whether such code belongs in the README, in some docstring, as part of llm-tool-collection, as part of gptel, or wherever else, but figured I'd share it for the sake of discussion.

Personally, I don't think I'll be adding this to the codebase itself. To me, users that are only consuming—not iterating on—the tools here should never have a reason to be rapidly evaluating the same tool many times. They can instead add all tools of interest via functions such as llm-tool-collection-get-all. Additionally, I'd like this collection to stay reasonably client-agnostic (though the :category tag shows that sometimes that does have to be traded for pragmatism). However, this is definitely a good thing to have for development, and I will certainly add this to the README when I get around to writing that (hopefully after merging this PR!).

As an aside, for elisp tools it usually makes sense to have a function docstring that's different from the description intended for the LLM. The latter focuses on different things. For example, it can contain instructions for the LLM on which tool (or set of tools) should be called next depending on the result of this one. This helps guide chained tool use with LLMs, as in this video and the previous one in this series.

Thanks for this and your other comments. That's a good point about the different purposes of the docstrings, so I guess it'd make sense to add a further argument specifying the docstring. It's not clear to me what would be the best ergonomics for this, so I'll leave it for now.

Perhaps this is naive or reductive of me, but I am of the (tentative) opinion that, in this specific case, we can simply forgo the Elisp docstring entirely. To me, the most important benefit of the conventions of an Elisp docstring is ensuring that the interface is well defined: arguments, return value, and context. I feel that LLM-friendly descriptions almost always end up being a superset of the information one would have gleaned from an Elisp docstring, since the former also requires documenting context, arguments, and the return value. However, I'm open to examples of this assumption being false.

skissue · 2025-03-24T16:16:43Z

llm-tool-collection.el

+  "Functions called after defining a new LLM tool.
+Each function is called with one argument, the tool's plist definition.")
+
+;;;###autoload


Is there a reason for this macro to be autoloaded? I'm not sure if there is application outside of this package—I was under the impression that this was a macro solely to simplify making definitions for us (and contributors).

This seemed necessary at some point to get font-lock and
indentation for the deftool macros working when editing
llm-tool-collection.el before loading the package, but I haven't been able
to reproduce that issue, so I'll be happy to remove the autoload (but will
first wait for your feedback on the other commits).

Other commits look good to me; there's a little wording I'd like to tweak, but it's not an issue at all and I'm happy to do it post-merge. Should be ready to merge after this 🎉!

Thanks, updated, should be all set

ultronozm · 2025-03-24T17:15:07Z

Sounds good on keeping things package agnostic, and forgoing Elisp docstrings for now.

* llm-tool-collection.el (llm-tool-collection-font-lock-keywords): New constant.

* llm-tool-collection.el (llm-tool-collection-deftool): Expand docs. Clarify that optional arguments are specified via the &optional keyword, rather than with `:optional t'. Add link to the tool spec.

* llm-tool-collection.el (llm-tool-collection-deftool): Reorder macro arguments to put description last, after specs and args. Expand docstring to explain SPECS and its keywords. The rationale is to more closely mimick the ordering for 'defun' and to put the :category specification on the line below the tool name, so that one can survey tools by category using C-1 M-x occur. (read-file, list-directory, create-file, create-directory) (view-buffer): Update existing tool definitions to follow the new argument order.

* llm-tool-collection.el (edit-buffer): New tool.

skissue · 2025-03-24T18:48:10Z

Thank you so much!

skissue self-assigned this Mar 22, 2025

skissue reviewed Mar 22, 2025

View reviewed changes

llm-tool-collection.el Outdated Show resolved Hide resolved

llm-tool-collection.el Outdated Show resolved Hide resolved

llm-tool-collection.el Outdated Show resolved Hide resolved

ultronozm added 7 commits March 22, 2025 17:23

Convert dashes to underscores in tool parameter names

7109844

* llm-tool-collection.el (llm-tool-collection-deftool): Use llm-tool-collection--make-llm-name when generating parameter names.

Add llm-tool-collection tools to Imenu

6f13cab

* llm-tool-collection.el: Add Imenu support for LLM tools defined with `llm-tool-collection-deftool'.

Use cl-pushnew instead of push for tool registration

e351b6a

* llm-tool-collection.el (llm-tool-collection-deftool): Replace push with cl-pushnew when registering new tools in llm-tool-collection--all-tools, to avoid duplication.

Fix function symbol reference in generated tool specs

fab4311

* llm-tool-collection.el (llm-tool-collection-deftool): Use unquoted function symbol in the :function property of the generated tool spec instead of a quoted function value.

ultronozm force-pushed the restructuring branch from 889cf31 to 9e1fe56 Compare March 22, 2025 16:47

Add hook to run after defining an LLM tool

5c1423f

* llm-tool-collection.el (llm-tool-collection-post-define-functions): New variable to hold functions run after tool definition. (llm-tool-collection-deftool): Call it, passing the tool's plist definition as argument.

ultronozm force-pushed the restructuring branch from 9e1fe56 to 5c1423f Compare March 22, 2025 17:45

ultronozm force-pushed the restructuring branch from 46af7be to 09681d5 Compare March 22, 2025 18:03

ultronozm force-pushed the restructuring branch from a4f971e to 3858b89 Compare March 23, 2025 03:05

ultronozm force-pushed the restructuring branch from 775dc55 to cf39aeb Compare March 23, 2025 04:12

ultronozm mentioned this pull request Mar 24, 2025

Add gptel-register-tool function karthink/gptel#738

Closed

skissue reviewed Mar 24, 2025

View reviewed changes

ultronozm added 4 commits March 24, 2025 18:45

Add fontification for llm-tool-collection-deftool macro

4b5f306

* llm-tool-collection.el (llm-tool-collection-font-lock-keywords): New constant.

Expand documentation concerning tool specs

f4cbc1c

* llm-tool-collection.el (llm-tool-collection-deftool): Expand docs. Clarify that optional arguments are specified via the &optional keyword, rather than with `:optional t'. Add link to the tool spec.

Add edit-buffer tool

1d70fdd

* llm-tool-collection.el (edit-buffer): New tool.

ultronozm force-pushed the restructuring branch from 7b21946 to 1d70fdd Compare March 24, 2025 17:47

skissue merged commit 1d70fdd into skissue:restructuring Mar 24, 2025

Restructuring #2

Restructuring #2

Uh oh!

Conversation

ultronozm commented Mar 22, 2025

Uh oh!

skissue left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ultronozm commented Mar 22, 2025 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

skissue commented Mar 22, 2025

Uh oh!

ultronozm commented Mar 22, 2025

Uh oh!

ultronozm commented Mar 22, 2025

Uh oh!

ultronozm commented Mar 22, 2025

Uh oh!

skissue commented Mar 22, 2025

Uh oh!

ultronozm commented Mar 23, 2025

Uh oh!

ultronozm commented Mar 23, 2025

Uh oh!

karthink commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

karthink commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

karthink commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ultronozm commented Mar 24, 2025

Uh oh!

skissue commented Mar 24, 2025

Uh oh!

skissue Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

ultronozm Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

skissue Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

ultronozm Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

ultronozm commented Mar 24, 2025 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

skissue commented Mar 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

skissue left a comment •

edited

Loading

ultronozm commented Mar 22, 2025 via email •

edited

Loading

karthink commented Mar 23, 2025 •

edited

Loading

karthink commented Mar 23, 2025 •

edited

Loading

karthink commented Mar 23, 2025 •

edited

Loading

ultronozm commented Mar 24, 2025 via email •

edited

Loading