Udio Book.
   HOME

TheInfoList



OR:

Udio is a
generative artificial intelligence Generative artificial intelligence (Generative AI, GenAI, or GAI) is a subfield of artificial intelligence that uses generative models to produce text, images, videos, or other forms of data. These models Machine learning, learn the underlyin ...
model that produces music based on simple text prompts. It can generate vocals and instrumentation. Its free
beta version The software release life cycle is the process of developing, testing, and distributing a software product (e.g., an operating system). It typically consists of several stages, such as pre-alpha, alpha, beta, and release candidate, before the fi ...
was released publicly on April 10, 2024. Users can pay to subscribe monthly or annually to unlock more capabilities such as audio inpainting. Founded in December 2023 by a team of former researchers for
Google DeepMind DeepMind Technologies Limited, trading as Google DeepMind or simply DeepMind, is a British–American artificial intelligence research laboratory which serves as a subsidiary of Alphabet Inc. Founded in the UK in 2010, it was acquired by Goo ...
headed by Udio's CEO, David Ding, the program received financial backing from the
venture capital Venture capital (VC) is a form of private equity financing provided by firms or funds to start-up company, startup, early-stage, and emerging companies, that have been deemed to have high growth potential or that have demonstrated high growth in ...
firm
Andreessen Horowitz AH Capital Management, LLC (commonly known as Andreessen Horowitz, or a16z) is an American privately held venture capital firm, founded in 2009 by Marc Andreessen and Ben Horowitz. The company is headquartered in Menlo Park, California. As of M ...
and musicians
will.i.am William James Adams Jr. (born March 15, 1975), known professionally as will.i.am (pronounced "will I am"), is an American rapper, singer, songwriter, record producer and actor. He is the frontman of the musical group Black Eyed Peas, which he ...
and
Common Common may refer to: As an Irish surname, it is anglicised from Irish Gaelic surname Ó Comáin. Places * Common, a townland in County Tyrone, Northern Ireland * Boston Common, a central public park in Boston, Massachusetts * Cambridge Com ...
, among others. Critics praised its ability to create realistic-sounding vocals while others raised concerns over the possibility that its training data contained copyrighted music.


History

Udio was created in December 2023 by a team of four former researchers for
Google DeepMind DeepMind Technologies Limited, trading as Google DeepMind or simply DeepMind, is a British–American artificial intelligence research laboratory which serves as a subsidiary of Alphabet Inc. Founded in the UK in 2010, it was acquired by Goo ...
, including Udio's CEO David Ding, Conor Durkan, Charlie Nash, Yaroslav Ganin, as well as Andrew Sanchez under the name of Uncharted Labs. The
venture capital Venture capital (VC) is a form of private equity financing provided by firms or funds to start-up company, startup, early-stage, and emerging companies, that have been deemed to have high growth potential or that have demonstrated high growth in ...
firm
Andreessen Horowitz AH Capital Management, LLC (commonly known as Andreessen Horowitz, or a16z) is an American privately held venture capital firm, founded in 2009 by Marc Andreessen and Ben Horowitz. The company is headquartered in Menlo Park, California. As of M ...
; the music distributor UnitedMasters; musicians
will.i.am William James Adams Jr. (born March 15, 1975), known professionally as will.i.am (pronounced "will I am"), is an American rapper, singer, songwriter, record producer and actor. He is the frontman of the musical group Black Eyed Peas, which he ...
, Tay Keith, and
Common Common may refer to: As an Irish surname, it is anglicised from Irish Gaelic surname Ó Comáin. Places * Common, a townland in County Tyrone, Northern Ireland * Boston Common, a central public park in Boston, Massachusetts * Cambridge Com ...
; investor
Kevin Wall Kevin Wall is an American entrepreneur, investor, activist and Emmy Award-winning producer of international events such as the benefit concert series Live Earth and Live 8. His first media company, Radio Vision International, produced internati ...
;
Instagram Instagram is an American photo sharing, photo and Short-form content, short-form video sharing social networking service owned by Meta Platforms. It allows users to upload media that can be edited with Social media camera filter, filters, be ...
cofounder
Mike Krieger Michel Krieger (born March 4, 1986) is a Brazilian entrepreneur and software engineer who co-founded Instagram along with Kevin Systrom, and served as its CTO. During Krieger's tenure as CTO, Instagram's user base expanded from a few million to 1 ...
; and DeepMind researcher Oriol Vinyals all provided financial backing for Udio, and it was valued at $10 million in
seed funding Seed money, also known as seed funding or seed capital, is a form of securities offering in which an investor puts capital in a startup company in exchange for an equity stake or convertible note stake in the company. The term ''seed'' suggests ...
(plus the original $8.5 million raised previously). It spent several months in a
closed beta The software release life cycle is the process of developing, testing, and distributing a software product (e.g., an operating system). It typically consists of several stages, such as pre-alpha, alpha, beta, and release candidate, before the fi ...
phase before being publicly released in its beta phase on April 10, 2024 on the Udio website. , it allows users to generate 600 songs per month for free. Sanchez described it as "enabl ng musiciansto create great music and... to make money off of that music in the future". Udio's release followed the releases of other text-to-music generators such as
Suno AI Suno AI, or simply Suno, is a generative artificial intelligence Music and artificial intelligence, music creation program designed to generate realistic songs that combine vocals and instrumentation, or are purely instrumental. Suno has been wi ...
and Stability Audio. Udio was used to create "
BBL Drizzy "BBL Drizzy" (released as the file name "BBL DRIZZY BPM 150.mp3") is a " diss track beat" by American record producer Metro Boomin. It was released on May 5, 2024, in response to the Drake–Kendrick Lamar feud which consisted of multiple diss ...
" by Willonius Hatcher, a parody song that went viral in the context of the
Drake–Kendrick Lamar feud The Canadian rapper Drake (musician), Drake and the American rapper Kendrick Lamar have been involved in a rap feud since 2013, when Drake responded to Lamar's verse on the Big Sean song "Control (Big Sean song), Control". It escalated in 202 ...
, with over 23 million views on Twitter and 3.3 million streams on
SoundCloud SoundCloud is a German audio streaming service owned and operated by SoundCloud Global Limited & Co. KG. The service enables its users to upload, promote, and share audio. Founded in 2007 by Alexander Ljung and Eric Wahlforss, SoundCloud is ...
the first week. In August 2024, ''Verknallt in einen Talahon'' (''In Love with a Talahon'') a song generated with Udio by Austrian producer
Butterbro Josua Waghubinger is an Austrian producer living in Germany. He is known by his stage name Butterbro (a blend word of Butterbrot and Bro). His single "Verknallt in einen Talahon" about a German girl in love with an Arab immigrant, a stereotypi ...
became the first AI-generated song in the German Top 50.


Capabilities

Udio bases the songs it creates on text prompts, which can include their
genre Genre () is any style or form of communication in any mode (written, spoken, digital, artistic, etc.) with socially agreed-upon conventions developed over time. In popular usage, it normally describes a category of literature, music, or other fo ...
(including
barbershop quartet A barbershop quartet is a group of four singers who sing music in the barbershop style, characterized by four-part harmony without instrumental accompaniment (a cappella). The four voices are: the lead, the vocal part which typically carries t ...
,
country A country is a distinct part of the world, such as a state, nation, or other political entity. When referring to a specific polity, the term "country" may refer to a sovereign state, state with limited recognition, constituent country, ...
, classical,
hip hop Hip-hop or hip hop (originally disco rap) is a popular music genre that emerged in the early 1970s from the African-American community of New York City. The style is characterized by its synthesis of a wide range of musical techniques. Hip- ...
,
German German(s) may refer to: * Germany, the country of the Germans and German things **Germania (Roman era) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizenship in Germany, see also Ge ...
pop Pop or POP may refer to: Arts, entertainment, and media * Pop music, a musical genre Artists * POP, a Japanese idol group now known as Gang Parade * Pop! (British group), a UK pop group * Pop! featuring Angie Hart, an Australian band Album ...
, and
hard rock Hard rock or heavy rock is a heavier subgenre of rock music typified by aggressive vocals and Distortion (music), distorted electric guitars. Hard rock began in the mid-1960s with the Garage rock, garage, Psychedelic rock, psychedelic and blues ...
, among others),
lyrics Lyrics are words that make up a song, usually consisting of verses and choruses. The writer of lyrics is a lyricist. The words to an extended musical composition such as an opera are, however, usually known as a "libretto" and their writer, ...
, story direction, and other artists to base their sound on. Its lyrics are created with a
large language model A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are g ...
(LLM), while the process used to generate the music itself, , has not been disclosed. The program generates two songs based on the prompts and users can "remix" their songs with further text prompts. Songs are first generated as roughly 30 second-long pieces, and can be extended by additional 30 second increments. Paying subscribers can access advanced functionality such as audio inpainting.


Reception

Mark Hachman, the senior editor of ''
PC World ''PC World'' (stylized as PCWorld) is a global computer magazine published monthly by IDG. Since 2013, it has been an online-only publication. It offers advice on various aspects of PCs and related items, the Internet, and other personal tec ...
'', compared Udio to AI art generators and praised its ability to turn "a few rather poor lyrics" into a "rather catchy" song, also calling the vocals it generated "incredibly realistic and even emotional". Sabrina Ortiz of '' ZDNET'' described the songs it generated as being "impressive" and sounding "as though they were produced professionally". She also called them "fuller and richer" than those of other text-to-music generators, which she said it had "more personalization options" than. '' Tom's Guide''s Ryan Morrison wrote that Udio had "an uncanny ability to capture emotion in synthetic vocals" and was the only AI music generator "to have captured the passion, pain and spirit of a vocal performance". He added that the program was geared toward "people with no or minimal musical ability". Brian Hiatt of ''
Rolling Stone ''Rolling Stone'' is an American monthly magazine that focuses on music, politics, and popular culture. It was founded in San Francisco, California, in 1967 by Jann Wenner and the music critic Ralph J. Gleason. The magazine was first known fo ...
'' wrote that Udio was "more customizable but also perhaps less intuitive to use" than Suno AI and added that "some early users have suggested that on average, Udio's output may sound crisper than Suno's". For ''
Ars Technica ''Ars Technica'' is a website covering news and opinions in technology, science, politics, and society, created by Ken Fisher and Jon Stokes in 1998. It publishes news, reviews, and guides on issues such as computer hardware and software, sci ...
'', Benj Edwards wrote that Udio's generation capability was imperfect and "less impressive" than Suno AI's, noting that its songs were substantially shorter than Suno AI's. He also called the songs it produced "half-baked and almost nightmarish". In response to the company's announcement of Udio's beta release on
Twitter Twitter, officially known as X since 2023, is an American microblogging and social networking service. It is one of the world's largest social media platforms and one of the most-visited websites. Users can share short text messages, image ...
,
Telefon Tel Aviv Telefon Tel Aviv is an American electronic music act formed in 1999 by musicians Charles Cooper and Joshua Eustis. Since Cooper's accidental death in 2009, Telefon Tel Aviv has continued with Eustis as the sole official member. History Telefon ...
member Joshua Eustis tweeted that Udio was "an app to replace musicians" and called into question the data that it used. Udio has also been criticized online as "soulless" and for having the potential to create
audio deepfake Audio deepfake technology, also referred to as voice cloning or deepfake audio, is an application of artificial intelligence designed to generate speech that convincingly mimics specific individuals, often speech synthesis, synthesizing phrases or ...
s. Lucas Ropek of ''
Gizmodo ''Gizmodo'' () is a design, technology, science, and science fiction website. It was originally launched as part of the Gawker Media network run by Nick Denton. ''Gizmodo'' also includes the sub-blogs ''io9'' and ''Earther'', which focus on pop ...
'' stated that Udio was "full of acoustical nonsense" and that its songs were "extraordinarily bad".


Copyright concerns

Critics of Udio have questioned what data was used to train it and if that data consisted of copyrighted music. ''Rolling Stone'' wrote that there was "substantial reason to believe" that both Udio and Suno AI were trained with copyrighted music, while Benj Edwards of ''Ars Technica'' wrote that its training data was "likely filled with copyrighted material". Udio does not directly recreate copyrighted songs if prompted. Ding has stated that Udio has "extensive automated copyright filters" and that the company is "continually refining tssafeguards".
Stability AI Stability AI Ltd is a UK-based artificial intelligence company, best known for its text-to-image model Stable Diffusion. History and founding Stability AI was founded in 2019 by Emad Mostaque and by Cyrus Hodes. In August 2022 Stability AI r ...
took a different approach with Stable Audio 2.0, and used an explicitly licensed dataset of music called AudioSparx. In June 2024, a lawsuit, lead by the
Recording Industry Association of America The Recording Industry Association of America (RIAA) is a trade organization that represents the music recording industry in the United States. Its members consist of record labels and distributors that the RIAA says "create, manufacture, and/o ...
, was filed against Udio and Suno alleging widespread infringement of copyrighted sound recordings. The lawsuit sought to bar the companies from training on copyrighted music, as well as damages of up to $150,000 per work from infringements that have already taken place.


See also

*
Suno AI Suno AI, or simply Suno, is a generative artificial intelligence Music and artificial intelligence, music creation program designed to generate realistic songs that combine vocals and instrumentation, or are purely instrumental. Suno has been wi ...


References


External links

* {{Music streaming services 2024 software Artificial intelligence art Music software