Yeah, if you already have it then it’s not really an extra cost. But the smaller models perform less well and less reliably.
In order to write a book that’s convincing enough to fool at least some buyers, I wouldn’t expect a Llama2 7B to do the trick, based on what I see in my work (ML engineer). But even at work, I run Llama2 70B quantized at most, not the full size one. Full size unquantized requires 320 GPU vram, and that’s just quite expensive (even more so when you have to rent it from cloud providers).
Although if you already have a GPU that size at home, then of course you can run any LLM you like :)
You’d think they would make it less obvious than that if they wanted to hide a conspiracy.
Funny how they always think that there are some intricate hidden conspiracies, yet that they are obvious enough that some dumbasses on Facebook can figure it out.
What kind of monster stacks pizzas like that?!