HACKER Q&A
📣 namrog84

When/where can we get an offline GPT3 type chatbot?


OpenAI's new ChatGPT is super neat and all, but I had lots of issues with it. Sometimes I felt like I was running into arbitrary guard rails because they are concerned about bad PR or something.

There's also the fact that you have no control over the weights or other modifications to help steer it into certain areas.

After having been majorly spoiled by StableDiffusion, with it being offline and all the community mods/changes that have been contributed to it, I now want an offline chatbot model.

I think I read that some older GPT-2 models are available offline, but also that most of them are still considered 'inefficient'. What does this mean? Is it the compute needed to run it, or the physical size of the model? Would it be at all possible to split it into groups or something (e.g. I only care about English and programming languages, not other human languages)?

I am sorry if this is common knowledge to those in the know, but could someone share some details on whether what I am asking is silly (like asking for an offline version of a search engine) or whether I am asking the wrong questions?


  👤 tripplyons Accepted Answer ✓
I think the best publicly available model that can follow instructions right now is https://huggingface.co/bigscience/bloomz.

It has 176 billion trainable parameters, though, so I think the weights alone need hundreds of gigabytes of memory (about 350 GB at 16 bits per parameter). There is a trade-off between model size and the ability of the model.
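
For a rough sense of scale, here is a back-of-the-envelope sketch of the memory needed just to hold the weights of a 176-billion-parameter model. The bytes-per-parameter figures are the usual sizes for each numeric format; the function name is made up for illustration, and the estimate ignores activations and any other inference-time overhead, so treat it as a lower bound rather than a measurement.

```python
# Approximate storage cost of one parameter in common numeric formats.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str = "fp16") -> float:
    """Return the approximate size of the weights in gigabytes (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Parameter count quoted in the answer above.
BLOOMZ_PARAMS = 176e9

for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_memory_gb(BLOOMZ_PARAMS, dtype):,.0f} GB")
```

Even at 8 bits per parameter the weights alone exceed the memory of any single consumer GPU, which is why the smaller model suggested next is the more practical choice for running offline.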

On most GPUs, you should be able to run https://huggingface.co/google/flan-t5-large. It is pretty good and is trained to follow instructions.
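
A minimal sketch of what running it locally might look like, assuming the Hugging Face `transformers` library with PyTorch is installed (plus `sentencepiece` for the tokenizer). The `ask` helper and its defaults are illustrative, not part of any library. The weights are downloaded on first use; after that they should be served from the local cache.

```python
MODEL_ID = "google/flan-t5-large"  # the model linked in the answer above


def ask(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a reply to an instruction-style prompt (hypothetical helper)."""
    # Imported lazily so that defining the function does not require the
    # heavy dependencies to be installed.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    # Use the GPU when one is available, otherwise fall back to the CPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"

    # For simplicity the model is reloaded on every call; a real script
    # would load it once and reuse it.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID).to(device)

    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling something like `ask("Translate to German: Where is the train station?")` should return a short text answer. Expect it to handle single instructions rather than a ChatGPT-style conversation.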


👤 freediver
Here is an example of a general-purpose open-source LLM, probably the best you can get:

https://github.com/EleutherAI/gpt-neox

To manage your expectations: it is nowhere near as good as ChatGPT.

If you are interested in programming only:

https://github.com/salesforce/CodeGen

is decent.


👤 touringa
Here's a full list of the big ones:

https://lifearchitect.ai/models/

https://lifearchitect.ai/timeline/

There are another 97,000 Transformer models on Hugging Face (HF):

https://huggingface.co/models