GGUF progressing nicely, ggerganov is back on it tomorrow!

github.com
GGUF by ggerganov · Pull Request #2398 · ggerganov/llama.cpp

There's been a lot of good work done over the past week by several pivotal members, and now the boss is back and focused on it. It's going to be a very breaking change, but I'm really excited about where this will lead us!
What is GGUF? Is it a model format? Why is this important?
It standardizes a universal GGML format, which would mean no more breaking changes going forward when new formats are worked on. It also brings the same functionality as llama.cpp to all GGML model types (falcon, mpt, starcoder, etc.)
I definitely wouldn't count on that.
But it does make some changes much easier, like adding or changing model-specific fields, which previously would have required a format change. Things like changing or dropping support for existing quantizations would still break stuff, independent of the model format itself.
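To make the point about model-specific fields concrete, here's a rough toy sketch of why key/value metadata helps. This is not the actual GGUF layout: the byte format, the key names (`arch`, `context_length`, `rope_scale`) and the helpers `write_metadata`, `read_metadata` and `old_loader` are all made up for illustration. The idea is just that a loader which looks fields up by name can ignore keys it doesn't know about, so adding one doesn't need a new format version.

```python
import io
import struct


def write_metadata(fields: dict[str, str]) -> bytes:
    """Toy layout (hypothetical): uint32 count, then length-prefixed key and value strings."""
    buf = io.BytesIO()
    buf.write(struct.pack("<I", len(fields)))
    for key, value in fields.items():
        for text in (key, value):
            raw = text.encode("utf-8")
            buf.write(struct.pack("<I", len(raw)))
            buf.write(raw)
    return buf.getvalue()


def read_metadata(blob: bytes) -> dict[str, str]:
    """Read every key/value pair back without needing to know the keys in advance."""
    buf = io.BytesIO(blob)
    (count,) = struct.unpack("<I", buf.read(4))
    fields: dict[str, str] = {}
    for _ in range(count):
        pair = []
        for _ in range(2):
            (size,) = struct.unpack("<I", buf.read(4))
            pair.append(buf.read(size).decode("utf-8"))
        fields[pair[0]] = pair[1]
    return fields


def old_loader(blob: bytes) -> dict:
    """An 'older' loader: uses only the keys it knows, silently skips the rest."""
    meta = read_metadata(blob)
    return {
        "arch": meta["arch"],
        "context_length": int(meta.get("context_length", "2048")),
    }


# Same loader, two files: the second one carries an extra model-specific field.
v1 = write_metadata({"arch": "llama", "context_length": "4096"})
v2 = write_metadata({"arch": "llama", "context_length": "4096", "rope_scale": "2.0"})

print(old_loader(v1))
print(old_loader(v2))
```

The old loader reads both files the same way. With a fixed positional header, the extra field would have shifted everything after it and forced a format bump. Note this only covers metadata: if the tensor data itself changes (say a quantization type is dropped), no amount of key/value flexibility saves old files.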