If I'm not mistaken these will be some of the best sota language models to try:
- Flan-T5
- OPT
- Distilbert
- GPT-J
Look into PEFT to fit finetuning on a consumer-grade device. You'll have to roll your own RLHF though.. :)
https://old.reddit.com/r/programming/comments/szqq5m/gptj_is...