Nairametrics

Nvidia unveils ‘Fugatto’ AI model for music and audio generation 

Nvidia, the world’s largest supplier of AI chips and software, has unveiled Fugatto, a new artificial intelligence model capable of generating and modifying music, sound effects, and other audio.

According to reports by Reuters, this innovative tool is designed for professionals in the music, film, and video game industries.

However, Nvidia has stated there are no immediate plans to make Fugatto publicly available.

The Fugatto model, an acronym for Foundational Generative Audio Transformer Opus 1, is a leap forward in audio AI technology. Unlike many AI models, it can both create unique sounds from text descriptions and modify existing audio recordings.

Bryan Catanzaro, Nvidia’s Vice President of Applied Deep Learning Research, highlighted how computers and synthesizers have transformed music over the past 50 years and emphasized that generative AI will unlock new creative possibilities for music, video games, and everyday creators.

“If we think about synthetic audio over the past 50 years, music sounds different now because of computers, because of synthesizers. I think that generative AI is going to bring new capabilities to music, to video games, and to ordinary folks that want to create things,” Catanzaro stated.

Features of Fugatto 

The model can: 

Comparison with other AI models 

Nvidia’s Fugatto joins a growing list of AI technologies developed by companies such as Meta Platforms and startups like Runway, which also generate audio or video content from text prompts.

“Any generative technology always carries some risks, because people might use that to generate things that we would prefer they don’t. We need to be careful about that, which is why we don’t have immediate plans to release this,” Catanzaro said.

What you should know 
