Let's start by looking at what's happening over at https://github.com/ggerganov/ggml/tree/master/examples/mnist, down to the flow of PRs and the kinds of discussions folks are having there (e.g. ggml-org/ggml#982).
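Before digging into those PR discussions, it helps to recall the basic shape of that example: a tiny fully connected network expressed as a ggml compute graph. Below is a minimal sketch of how such a graph gets wired up, not the example's actual code; the layer sizes, memory budget, and thread count are illustrative, and depending on the ggml revision some entry points (e.g. the CPU compute call) may live in `ggml-cpu.h` rather than `ggml.h`.

```c
// Minimal sketch (not the actual example code): an MNIST-style MLP as a ggml graph.
// Layer sizes and the memory budget are assumptions for illustration only.
#include "ggml.h"

int main(void) {
    struct ggml_init_params params = {
        /*.mem_size   =*/ 16 * 1024 * 1024,  // scratch arena for tensors + graph metadata
        /*.mem_buffer =*/ NULL,
        /*.no_alloc   =*/ false,
    };
    struct ggml_context * ctx = ggml_init(params);

    // 784 -> 500 -> 10 fully connected network (sizes chosen for illustration)
    struct ggml_tensor * input = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 784);
    struct ggml_tensor * w1    = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 784, 500);
    struct ggml_tensor * b1    = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 500);
    struct ggml_tensor * w2    = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 500, 10);
    struct ggml_tensor * b2    = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 10);

    // build the forward pass symbolically; nothing is computed yet
    struct ggml_tensor * h      = ggml_relu(ctx, ggml_add(ctx, ggml_mul_mat(ctx, w1, input), b1));
    struct ggml_tensor * logits = ggml_add(ctx, ggml_mul_mat(ctx, w2, h), b2);
    struct ggml_tensor * probs  = ggml_soft_max(ctx, logits);

    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, probs);

    // evaluate the graph on the CPU (weights and input are left uninitialized here)
    ggml_graph_compute_with_ctx(ctx, gf, /*n_threads=*/1);

    ggml_free(ctx);
    return 0;
}
```

The interesting part of the current work is everything this sketch leaves out: building the backward graph and driving an optimizer over it, which is exactly what the PRs linked above are discussing.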
The work being done there is essentially a large-scale reworking of an earlier attempt at llama.cpp finetuning, which was ripped out some time ago while waiting for the ggml-side research above to conclude. The previous work lives around ggml-org/llama.cpp#8669 and ggml-org/llama.cpp#2632.
It may actually make sense to look back at the last release of llama.cpp that still shipped that functionality, just to appreciate the scope a little.