Learn
-

Best Prompt Settings for Local LLMs: Temperature, Top-p, Min-p
Somewhere on Reddit, three years ago, someone posted their sampling settings for a Llama-2 finetune. Those settings spread like a…
Read More » -

Tokens Per Second (t/s) Explained: Beginner’s Guide to LLM Speed
You’ve watched it happen. You type a question into ChatGPT or your local LLM, hit enter, and the answer starts…
Read More »