
2023-06-28 16:50:19

Model is https://huggingface.co/TheBloke/airoboros-65B-gpt4-1.2-GGML

Software is https://github.com/ggerganov/llama.cpp

I won't pretend the response was fast. A 30B or even 13B model might be faster than Pygmalion.

llama.cpp can offload layers to the GPU.

Koboldcpp can use llama.cpp.
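
For the layer offloading mentioned above, here is a rough sketch of how one might pick a layer count and build the llama.cpp command line. It assumes a llama.cpp build with GPU support and its `--n-gpu-layers` option; the model file name, the sizes, and the "file size divided by layer count" estimate are illustrative assumptions, not measurements.

```python
import shlex


def layers_that_fit(model_file_gib, n_layers, free_vram_gib, reserve_gib=1.0):
    """Crude guess at how many layers fit in VRAM.

    Assumption: every layer takes an equal share of the model file, and
    reserve_gib is kept free for scratch buffers. Real usage will differ.
    """
    per_layer_gib = model_file_gib / n_layers
    usable_gib = max(free_vram_gib - reserve_gib, 0.0)
    return min(n_layers, int(usable_gib // per_layer_gib))


def llama_cpp_command(model_path, n_gpu_layers, prompt, binary="./main"):
    """Build the argument list for llama.cpp's example binary."""
    return [
        binary,
        "-m", model_path,                      # path to the GGML model file
        "--n-gpu-layers", str(n_gpu_layers),   # layers to offload to the GPU
        "-p", prompt,
    ]


if __name__ == "__main__":
    # Hypothetical numbers: a 40 GiB quantized file, 80 layers, 11 GiB free VRAM.
    n = layers_that_fit(model_file_gib=40.0, n_layers=80, free_vram_gib=11.0)
    # Hypothetical file name, for illustration only.
    cmd = llama_cpp_command("models/model.ggml.bin", n, "Hello")
    print(shlex.join(cmd))
```

Whatever does not fit on the GPU stays on the CPU, so even a partial offload should help with speed.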

Author Public Key

npub1nmk2399jazpsup0vsm6dzxw7gydzm5atedj4yhdkn3yx7jh7tzpq842975


iru on Nostr