The examples are pretty scary/impressive. The demo is less scary. It doesn’t seem to have a typical LLM’s proficiency in understanding concepts and direction. A test with “synthwave” and “1980s” produced a vaguely 2010ish pop rap mashup.
So I think this is impressive, especially in understanding verse/chorus/refrain sections and maintaining coherency. But it still has a long way to go.
Luckily they’ve already released LoRA training code, so people can fix some of these problems.
Good point, that may bridge the gap.