New post Need visibility? Apply for a FREE post for your Startup.  Apply Here

Artificial IntelligenceNews

Google’s MusicLM: an AI that can create music from text prompts

2 Mins read

Researchers working with Google have recently developed an AI model called MusicLM. MusicLM is equipped with the capability to generate minutes-long musical pieces from text prompts. The AI was also made with the ability to transform a whistled or hummed melody into different instruments, similar to how systems like DALL-E generate images from written prompts.

Several samples of works produced by MusicLM has been uploaded by the company. The samples includes 30-second snippets of what sound like actual songs created from paragraph-long descriptions. The descriptions used to create the models include directions of the genre, vibe, and even specific instruments.

Another model include five-minute-long pieces generated from one or two words such as “melodic techno”. The model also includes musical interpretations of phrases such as “futuristic club” and “accordion death metal”.

Here’s a music made from the “story mode” prompts, where the model is basically given a script to morph between prompts:

The AI model also has the ability to simulate human vocals, although the quality of the vocals is not quite perfect yet, as they sound rainy or staticky, like the one here. The technology is still in its early stages and Google has no plans to release the model at this point, citing potential risks of “potential misappropriation of creative content” and potential cultural appropriation or misrepresentation.

AI-generated music has been around for a while. There has been systems used to compose pop songs, copying Bach better than a human could in the 90s, and accompanying live performances. One recent version uses AI image generation engine, StableDiffusion, to turn text prompts into spectrograms that are then turned into music.

According to the paper on MusicLM, the model can outperform other systems in terms of its “quality and adherence to the caption,” as well as the fact that it can take in audio and copy the melody. This ability is perhaps one of the most impressive features of MusicLM. This is obvious in the demo where you can play the input audio, where someone hums or whistles a tune, then hear how the model reproduces it as an electronic synth lead, string quartet, guitar solo, etc.

Read also: 10 platforms and tools Web Developers should know about

The release of MusicLM is a significant step forward for AI-generated music. It’s impressive how the model can generate music from text prompts and even copy melodies from humming or whistling. Although the quality of the vocals is not perfect yet, it’s still a remarkable achievement. Google’s decision to hold on on the release of MusicLM is understandable, given the potential risks of “potential misappropriation of creative content” and potential cultural appropriation or misrepresentation.

In the meantime, Google has publicly released a dataset with around 5,500 music-text pairs, which could help when training and evaluating other musical AI.

598 posts

About author
When I'm not reading about tech, I'm writing about it, or thinking about the next weird food combinations to try. I do all these with my headphones plugged in, and a sticky note on my computer with the words: "The galaxy needs saving, Star Lord."
Related posts
Big StoryNews

WhatsApp is working to allow users to Pin Messages in a chat list

2 Mins read
WhatsApp is reported to be currently working on a “Pin Message” feature, the ability to pin down a message. FYI what we…
Big StoryNews

Twitter Begins Testing Government ID-Based Verification for Blue Subscribers

2 Mins read
Twitter is reportedly testing a feature that allows subscribers to submit government-issued ID to have their profiles verified on the platform. Code-level…
Artificial IntelligenceNow you knowRandom

Do You Know You Can Lose Your Study Visa For Using ChatGPT?

1 Mins read
The use of ChatGPT is causing concern among educators, with some schools outrightly banning the AI-powered tool from their networks. Such bans…
Get powered up with Techpadi Newsletter

Be the first to know what's happening in the African tech space

Leave a Reply

Now you knowRandomStartups

San Francisco Crypto Integration Platform Hatchfi, Raises $1.2M In Pre-Seed Funding