I have experience in running servers, but I would like to know if it’s possible to do it, I just need a GPT 3.5 like private LLM running.
I have experience in running servers, but I would like to know if it’s possible to do it, I just need a GPT 3.5 like private LLM running.
Yeah, I did see something related to what you mentioned and I was quite interested. What about quantized models?
I don’t have any experience with them honestly so I can’t help you there
Appreciate you 👍👍