Hello,

Re-reading it, I'd say you are correct. It seems Sebastian had it inverted.
That is: a larger base makes the frequencies decrease faster across dimensions, so for the same position the rotation angles are smaller (the rotation is slower) than with a smaller base.
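
A quick way to check this numerically is the minimal sketch below. It assumes the standard RoPE formulation, where dimension pair `i` gets the frequency `base ** (-2i / head_dim)` and the rotation angle is `position * frequency`. The helper name `rope_inv_freq`, `head_dim = 64`, `position = 100`, and the two base values are my own illustrative choices, not taken from any particular implementation.

```python
def rope_inv_freq(base, head_dim):
    """Per-pair RoPE frequencies: base ** (-2i / head_dim), i = 0 .. head_dim/2 - 1."""
    return [base ** (-2 * i / head_dim) for i in range(head_dim // 2)]

head_dim = 64    # assumed head dimension, for illustration only
position = 100   # same token position for both bases

small = rope_inv_freq(10_000, head_dim)    # smaller base
large = rope_inv_freq(500_000, head_dim)   # larger base

# Rotation angle (in radians) applied at `position` for a few dimension pairs
for i in (0, 8, 16, 31):
    angle_small = position * small[i]
    angle_large = position * large[i]
    print(f"pair {i:2d}: base=1e4 -> {angle_small:.6f} rad, "
          f"base=5e5 -> {angle_large:.6f} rad")
```

Pair 0 rotates at the same rate for both bases (the frequency is 1 regardless of the base), and for every later pair the larger base gives the smaller angle, which matches the reading above.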

This needs to be double-checked, but it would make sense given that larger LLMs tend to increase the base to better adapt to longer contexts, minimizing overlaps by slowing the rotations down.

Edit: a little Desmos plot is even better for visualizing this.

Answer selected by rasbt