Merge of Xwin and Euryale, particularly good at story-telling and role-play.
Here's a summary of a benchmark by the great WolframRavenwolf. Goliath dominated this benchmark, specifically this RP oriented quant.
Very good at ERP from my experience. Descriptive and relatively creative, but still lacking in some areas.
Best local model I've ever used. +Has sense of humor(!!!), this is the first model that joked unprompted +Very creative +Very smart +Juicy -Has some GPTisms in it. If you spot one, remove ASAP, otherwise the model will keep spitting them out -Has sometimes troubles with following the prompt, solved by editing
From my experience using it more, it clearly has significant flaws others tend to overlook. If you are constantly having to reroll or edit with 70B+, something is wrong, and unfortunately this model tends to face such issues. There is likely an over focusing on isolated examples of prose and not the overall consistency, accuracy, and coherency of the model, especially as context grows.
While it can produce some creative outputs, this likely is due to token probabilities being unstable due to how it was merged or stitched together, which also likely causes a lot of the issues mentioned.
All in all, getting creative good prose some of the time cannot off set these fairly constant issues for, especially given the massive size of the model and how slow it is compared to alternatives that do not have these issues.
Goliath is oustanding in it's ability to handle complicated characters with a high level of coherency. It often follows a train of thought well and very rarely deviates from instructions (given that they are provided in the correct format). It does lack in it's inherent ability to write good NSFW content, often providing top-level summaries of what's happening.
It depicts a beautiful world without the preachy feel of GPT. Excellent understanding and no jailbreak required. Large models may be greatly affected by quantization. Someone hosted this on Horde a long time ago and there were no errors or grammar mistakes. I had a wonderful time for about a week. Nowadays, I almost have no choice but to use that API, but I feel that the performance is a little lower than it was then. I don't know if it's a difference in some settings or an effect of quantization. Maybe it's just my imagination. However, if you swipe it a few times, it will still give you the kind of depiction that only this model can do. I have several characters that I created myself that I only use with this model. I only use it when I want to see them. It would be a waste to use it for ERP. Simply because the usage fee is high.
This is just another garbage merge model. So many 13B and 7B role-play models beat this for many reasons.
This always messes up its spelling, or goes off topic / changes my role-play topic. It's quite annoying.