Gave this one a go: zhvng/open-musiclm on GitHub, an implementation of MusicLM (the text-to-music model published by Google Research) with a few modifications.
It has some pretrained models, but they don't work for me, and there hasn't really been any response in the issues section. Loading them fails with:
size mismatch for...
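
For anyone hitting the same thing, here is a rough sketch of how to narrow down which parameters disagree. It is not from the open-musiclm repo. It assumes you can get a name-to-shape mapping from both the model and the checkpoint; in PyTorch that would usually be something like `{k: tuple(v.shape) for k, v in model.state_dict().items()}`, and the same over the loaded checkpoint if it is a plain state dict. The helper name and the example shapes below are made up for illustration.

```python
def find_shape_mismatches(model_shapes, checkpoint_shapes):
    """Return {name: (model_shape, checkpoint_shape)} for shared names whose shapes differ."""
    mismatches = {}
    for name, ckpt_shape in checkpoint_shapes.items():
        model_shape = model_shapes.get(name)
        # Only compare parameters that exist on both sides.
        if model_shape is not None and tuple(model_shape) != tuple(ckpt_shape):
            mismatches[name] = (tuple(model_shape), tuple(ckpt_shape))
    return mismatches


# Hypothetical shapes for illustration only; not taken from open-musiclm.
model_shapes = {"embed.weight": (1024, 512), "proj.weight": (512, 512)}
checkpoint_shapes = {"embed.weight": (2048, 512), "proj.weight": (512, 512)}

for name, (expected, found) in find_shape_mismatches(model_shapes, checkpoint_shapes).items():
    print(f"{name}: model expects {expected}, checkpoint has {found}")
```

If the mismatched names all sit in one submodule, that usually points at a config value (codebook size, hidden dimension, and so on) that differs from the one the checkpoint was trained with, though that is a guess until the shapes are compared.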