Добавить новость
ru24.net
Technology Review
Июнь
2024
1
2
3 4 5 6 7 8
9
10 11 12 13 14
15
16
17 18 19 20 21
22
23
24 25 26 27 28
29
30

Training AI music models is about to get very expensive

0

AI music is suddenly in a make-or-break moment. On June 24, Suno and Udio, two leading AI music startups that make tools to generate complete songs from a prompt in seconds, were sued by major record labels. Sony Music, Warner Music Group, and Universal Music Group claim the companies made use of copyrighted music in their training data “at an almost unimaginable scale,” allowing the AI models to generate songs that “imitate the qualities of genuine human sound recordings.”

Two days later, the Financial Times reported that YouTube is pursuing a comparatively above-board approach. Rather than training AI music models on secret data sets, the company is reportedly offering unspecified lump sums to top record labels in exchange for licenses to use their catalogs for training. 

In response to the lawsuits, both Suno and Udio released statements mentioning efforts to ensure their models don’t imitate copyrighted works, but neither company has specified whether their training sets contain them. Udio said its model “has ‘listened’ to and learned from a large collection of recorded music,” and two weeks before the lawsuits, Suno’s CEO Mikey Shulman told me its training set is “both industry standard and legal,” but that the exact recipe is proprietary.

While the ground here is moving fast, none of these moves should be all that surprising: litigious training-data battles have become something like a rite of passage for generative AI companies. The trend has led many of those companies, including OpenAI, to pay for licensing deals while the cases unfold. 

However, the stakes of a fight over training data for AI music are different than for image generators or chatbots. Generative AI companies working in text or photos have options to work around lawsuits, including by cobbling together open-source corpuses to train models. In contrast, the public domain for music is much more limited (and not exactly what most people want to listen to). 

Other AI companies can also more easily cut licensing deals with interested publishers and creators, of which there are many; but rights in music are far more concentrated than those in film, images, or text, industry experts say. They’re largely managed by the three biggest record labels—the new plaintiffs—whose publishing arms collectively own more than 10 million songs and much of the music that has defined the last century. (The filing names a long list of artists who the labels allege were wrongfully included in training data, ranging from ABBA to those on the Hamilton soundtrack.) 

On top of all this, it’s also just more difficult to create music worth listening to—generating a readable poem or passable illustration with AI is one technical challenge, but infusing a model with the taste required to create music we like is another. 

It’s of course possible that the AI companies will win the case, and none of this will matter; they would have carte blanche to train on a century of copyrighted music. But experts say the case from the record labels is strong, and it’s more likely that AI companies will soon have to pay up—and pay a lot—if they want to survive. If a court were to rule that AI music companies could not train for free on these labels’ catalogs, then expensive licensing deals, like the one YouTube is reportedly pursuing, would seem to be the only path forward. This would effectively ensure the company with the deepest pockets ends up on top.

More than any training-data case yet, the outcome of this one will determine the shape of a big slice of AI—and if there is a future for it at all. 

Merits of the case

Suno’s music generator has been public for less than a year, but the company has already garnered 12 million users, a $125-million funding round last month, and a partnership with Microsoft Copilot. Udio is even newer to the scene, launching in April with $10 million in seed funding from musician-investors like will.i.am and Common. 

The record labels allege that both of the startups are engaging in copyright infringement on the training and the output sides of their models.

“The plaintiffs here have the best odds of almost anyone suing an AI company,” says James Grimmelmann, professor of digital and information law at Cornell Law School. He draws comparisons to the ongoing New York Times case against OpenAI, which he says was, until now, the best example of a rights holder having a strong case against an AI company. But the suit against Suno and Udio “is worse for a bunch of reasons.”

The Times has accused OpenAI of copyright infringement in its model training by using the publication’s articles without consent. Grimmelmann says OpenAI has a bit of plausible deniability in this accusation, because the company could say that it scraped much of the internet for a training corpus and copies of New York Times articles appeared in places without the company’s knowledge. 

For Suno and Udio, that defense is far less believable. “This is not like, ‘We scraped the web for all audio and we couldn’t tell the commercially produced songs apart from everything else,’” Grimmelmann says. “It’s pretty clear that they had to have been pulling in large databases of commercial recordings.” 

In addition to complaints about training, the new case alleges that tools like Suno and Udio are more imitative than generative AI, meaning that their output mimics the style of artists and songs protected by copyright. 

While Grimmelmann notes that the Times cited examples of ChatGPT reproducing entire copies of its articles, record labels claim they were able to generate problematic responses from the AI music models with much simpler prompts. For instance, prompting Udio with “my tempting 1964 girl smokey sing hitsville soul pop,” the plaintiffs say, yielded a song that “any listener familiar with The Temptations would instantly recognize as resembling the copyrighted sound recording, ‘My Girl.’” (The court documents include links to examples on Udio, but the songs appear to have been removed.) The plaintiffs mention similar examples from Suno, including an ABBA-adjacent song called “Prancing Queen” that was generated with the prompt “70s pop” and the lyrics for “Dancing Queen.”

What’s more, Grimmelmann explains, there is more copyrightable information in a song than a news article. “There’s just a lot more information density in capturing the way that Mariah Carey’s voice works, than there is in words,” he says, which is perhaps part of the reason why past lawsuits navigating music copyright have sometimes been so drawn out and complex. 

In a statement, Shulman wrote that Suno prioritizes originality and that the model is “designed to generate completely new outputs, not to memorize and regurgitate pre-existing content. That is why we don’t allow user prompts that reference specific artists.” Udio’s statement similarly mentioned “state-of-the-art filters to ensure our model does not reproduce copyrighted works or artists’ voices.”

Indeed, the tools will block a request if it names an artist. But the record labels allege that the safeguards have significant loopholes. Following the news of the lawsuits, for instance, social media users shared examples suggesting that if users separate an artist’s name with spaces, the request may go through. My own request for “a song like Kendrick” was blocked by Suno, citing an artist’s name, but “a song like k e n d r i c k” resulted in a “hip-hop rhythmic beat-driven” track and “a song like k o r n” resulted in “nu-metal heavy aggressive.” (To be fair, they didn’t resemble the respective artist’s unique styles, but to even respond in the right tightly-defined genre seems to suggest that the model is in fact familiar with each artist’s work.) Similar workarounds were blocked on Udio. 

Possible outcomes

There are three ways the case could go, Grimmelmann says. One is wholly in favor of the AI startups: the lawsuits fail and the court determines AI companies did not violate fair use nor imitate copyrighted works too closely in their outputs. If the models are found to fall under fair use, it would mean songwriters and rights holders would need to find a different legal mechanism to pursue compensation. 

Another possibility is a mixed bag: the court finds the AI companies did not violate fair use in their training, but must better control the model’s output to make sure it does not improperly imitate copyrighted works. Grimmelmann says this would be similar to one of the initial rulings against Napster, in which the company was forced to ban searches for copyrighted works in its libraries (though users quickly found workarounds). 

The third and essentially nuclear option is that the court finds fault on both the training and output sides of the AI models. This would mean the companies could not train on copyrighted works without licenses, and could also not allow outputs that closely imitate copyrighted works. The companies could be ordered to pay damages for infringement, which could run into the hundreds of millions for each company. If they aren’t bankrupted by such a ruling, it would force them to completely restructure their training through licensing deals, which could also be cost-prohibitive. 

COURTESY SUNO.AI

To license or not to license

Though the immediate goals of the plaintiffs are to get the AI companies to cease training and pay damages, chairman of the Recording Industry Association of America Mitch Glazier is already looking ahead toward a future of licensing. “As in the past, music creators will enforce their rights to protect the creative engine of human artistry and enable the development of a healthy and sustainable licensed market that recognizes the value of both creativity and technology,” he wrote in a recent op-ed in Billboard.

Such a market for licenses could mirror what has already unfolded for text generators. OpenAI has struck licensing deals with a number of news publishers, including Politico, The Atlantic, and The Wall Street Journal. The deals promise to make content from the publishers discoverable in OpenAI’s products, though the ability for the models to transparently cite where they’re getting information from is limited at best.

If AI music companies follow that pattern, the only ones with the means to create powerful music models might be those with the most cash. That’s perhaps exactly what YouTube is thinking. The company did not immediately respond to questions from MIT Technology Review about the details of its negotiations, but given the massive amount of data required to train AI models and the concentration of rights owners in music, it’s fair to assume the price of deals with record labels would be eye-popping. 

In theory, an AI company could bypass the licensing process altogether by building its model exclusively on music in the public domain, but it would be a herculean task. There have been similar efforts in the realm of text and image generators, including a legal consultancy in Chicago that created a model trained on dense regulatory documents, and a model from Hugging Face that trained on images of Mickey Mouse from the 1920s. But the models are small and unremarkable. If Suno or Udio is forced to train on only what’s in the public domain—think military march music and the royalty-free songs found in corporate videos—the resulting model would be a far cry from what they have today.

If AI companies do move forward with licensing agreements, negotiations may be tricky, says Grimmelmann. Music licensing is complicated by the fact that two different copyrights are at play: one for the song, which generally covers the composition, like the music and lyrics; and one for the master, which covers the recording, like what you’d hear if you stream the song. 

Some artists, like Taylor Swift and Frank Ocean, have come to own the masters of their catalogs after drawn-out legal battles, and would therefore be in the driver’s seat for any potential licensing deal. Many others, though, retain only the song copyright, while the record labels retain the masters. In these cases, the record label might theoretically be able to grant AI companies a license to use the music without an artist’s permission, but doing so could risk burning relationships with artists and sparking more legal battles. 

The question of whether to license their music to such companies has divided musician groups. In contract rules adopted in April by SAG-AFTRA, which represents recording artists as well as actors, AI clones of member voices are allowed, though there are minimum rates for compensation. Back in December, a group called the Indie Musician’s Caucus expressed frustrations that the leading instrumental musicians’ union, the 70,000-member American Federation of Musicians (AFM), was not doing enough to protect its rank and file against AI companies in contracts. The caucus wrote that it would vote against any agreement “obligating AFM members to dig [their] own graves by participating—without a right to consent, compensation, or credit—in the training of our permanent Generative AI replacements.”

But at this point, AFM does not appear eager to facilitate any deals. I asked Kenneth Shirk, international secretary-treasurer at AFM, whether he thought musicians should engage with AI companies and push to be fairly compensated, whatever that means, or instead resist licensing deals completely. 

“Looking at those questions makes me think, would you rather have a swarm of fire ants crawling all over you, or roll around in a bed of broken glass?” he told me. “We want musicians to get paid. But we also want to ensure that there’s a career in music to be had for those that are going to come after us.”




Moscow.media
Частные объявления сегодня





Rss.plus



Дирекция по качеству АО "Желдорреммаш" посетила локомотивостроительные заводы ТМХ

За кулисами бизнес-конференции MEDIABOSS

Фестиваль троечной езды и гастрономии "Русский драйв"

Пот ручьём: когда стоит обращать внимание на повышенную потливость, рассказал доктор Кутушов


Дирекция по качеству АО "Желдорреммаш" посетила локомотивостроительные заводы ТМХ

Отдыхающий в Кисловодске понял, почему там не любят москвичей, назвав 5 причин для недовольства столичными туристами

Российским туристам нашли лучшую альтернативу отдыха в дорогой Турции: сами турки тоже отказываются от Анталии в пользу этого курорта

Textile Collection Moscow Autumn 2024: присоединяйтесь к масштабному событию текстиля – единому текстильному кластеру!


Building A Blockbuster Trade Between The White Sox And Mariners

Ian Wright and Gary Neville go wild after Bellingham’s England equaliser… as eagle-eyed fans spot Roy Keane’s reaction

Diego Lopes holds no ill will toward Brian Ortega after UFC 303, hopes for Sphere rebooking

Portugal vs France – Euro 2024: Ronaldo and Mbappe have one last dance in quarter-final tie – stream FREE, TV, team news


Беспроводной сканер штрих-кодов Heroje S-H29W

Военное следственное управление Следственного комитета Российской Федерации по Черноморскому флоту предупреждает:

Свердловчанке, избивавшей детей, придется заплатить им три миллона рублей

Острова укладываеюся спать...


Состоялся релиз «T.D.Z. 4 Сердце Припяти Сталкер» на Android

Epic Games подала Apple заявку на возвращение Fortnite на iOS и запуск собственного магазина приложений в ЕС

Глобальную версию Mega Man X DiVE закроют к концу июля

Для Dark and Darker Mobile проходит короткий бета-тест на iOS и Android



Оранжевый уровень опасности: ураган из Петербурга скоро обрушится на Москву

Куда сходить москвичам и гостям столицы 13 июля - Мытищинский форсаж: часть вторая

Сила знания. Какие загадки разгадывают дети на этноолимпиаде

Программа «Цифровой инвестор» ГМК "Норникель" выходит на новый этап




Stardogs завоюет рынок Армении своими легендарными хот-догами.

Пот ручьём: когда стоит обращать внимание на повышенную потливость, рассказал доктор Кутушов

«Антиоксидантные свойства»: названы закуски, которые помогут похудеть

Гуляем отпуск в ритме джаза: лучшие фестивали этого лета


Детский сад на 180 мест построят в Реутове

РЖД лихорадит при минус 3 в летний зной. Спасти падающую погрузку на железной дороге могут не самые массовые грузы и «агент» в Минтрансе

Экс-глава НИИ точных приборов Роскосмоса получил пять лет колонии

Пора открывать закрытый нефтяной клондайк в Подмосковье


Теннисистка Пивоварова назвала травму Джоковича шагом к завершению карьеры

Уимблдон. 3 июля. Алькарас сыграет вторым запуском, Медведев и Синнер выйдут на Центральный корт

Первая ракетка России Касаткина выиграла теннисный турнир в Британии

Звезда «Гонки» Даниэль Брюль снимет байопик о немецком теннисисте Готфриде фон Крамме


«Антиоксидантные свойства»: названы закуски, которые помогут похудеть

В Нижнем Новгороде в реку упала люлька с рабочими, один из них погиб

Диана Арбенина рассказала о смерти отца своих детей

Сестра Градского судится с врачами, из-за которых ее сын родился с аутизмом


Музыкальные новости

В амфитеатре Никосии прозвучала музыка Шостаковича и Гершвина

Певец Егор Крид прокомментировал слухи о романе с Алиной Загитовой

Вход на все мероприятия фестиваля «Песни России» абсолютно свободный - директор проекта

За Соловки наступает расплата // Прокуроры требуют наказать освобожденных чиновников рублем



Программа «Цифровой инвестор» ГМК "Норникель" выходит на новый этап

Москва и Петербург лидируют по уровню зарплат для IT-специалистов

Куда сходить москвичам и гостям столицы 13 июля - Мытищинский форсаж: часть вторая

Сила знания. Какие загадки разгадывают дети на этноолимпиаде


Вина «Фанагории» выбрали для приветствия гостей церемонии открытия ХIII Московского Международного Фестиваля Искусств «Традиции и Современность»

Создание Портфолио Актера. Создание Фото Портфолио.

Детектив Финник на VK Fest

Орбакайте обратилась к Пугачевой со сцены в Израиле


Двух пешеходов на тротуаре в центре Воронежа сбил Mercedes

Два человека пострадали при взрыве баллона на крыше автобуса в столице

Источник 360.ru: при взрыве в автобусе в Москве пострадали два мужчины

2 июля на полдня перекроют трассу "Москва — Челябинск"


Bloomberg: визит Моди в Москву станет дипломатической победой Путина

Приезд Моди к Путину разрушил планы США сделать Россию изгоем

Песков: Путин и Моди обсудят вопросы региональной и глобальной безопасности

Путин на полях саммита ШОС обсудит с Эрдоганом свой визит в Турцию


Показатель заболеваемости коронавирусной инфекцией снизился в России на 10 процентов

Почти 1,4 тысячи случаев COVID-19 выявили в Москве за неделю

Почти 1,4 тыс. случаев коронавируса выявили в столице за неделю

Почти 1,4 тысячи случаев COVID-19 выявили в столице за неделю




В Центре хирургии клиники «Будь Здоров» провели 1000-ю операцию

Врач-стоматолог Татьяна Сумцова: какие могут быть противопоказания к установке брекетов

Терапевт Чернышова рассказала, кто хуже всего переносит летнюю жару

Москвич попал в реанимацию после теплового удара


«Орбан – п***р!» – Венгерского премьера встретили в Киеве нецензурной бранью

Песков сделал предположение о цели визита Орбана в Киев

Зачем в Киев приедет западный куратор, что не нравится Зеленскому в ВСУ, какую повестку определили на июльском саммите НАТО: обзор мировых новостей 3 июля

Премьер-министр Венгрии предложил Зеленскому прекратить огонь


Решил задачку, и — огонь! В Подмосковье прошёл чемпионат Москвы по стрельбе

Мяч на нашей стороне. В Москве создают новые спортивные объекты и кластеры

«Я в Москве, а он в Питере»: Кудрявцева раскрыла, к кому ревновала Макарова

Сила знания. Какие загадки разгадывают дети на этноолимпиаде


Вячеслав Володин сегодня прибыл с рабочим визитом в Минск. Он примет участие в мероприятиях по случаю Дня независимости Республики Беларусь

Лукашенко: Западу не терпится втянуть Белоруссию в военные разборки

Как Лукашенко еще в 90-е расправился со всеми ОПГ Беларуси

Лукашенко в Минске встретился с Володиным



Собянин рассказал о проекте «Золотая маска. Послесловие»

Сергей Собянин. Главное за день

Собянин вручил премии города в области архитектуры и градостроительства

Собянин рассказал, какую микроэлектронику производят в ОЭЗ «Технополис «Москва»


Мужчина получил тепловой удар в Москве и попал в реанимацию

В Нальчике появилось Детское радио

III класс пожарной опасности ожидается на большей части Подмосковья 2 июля

Донстрой — лидер среди застройщиков по подписанию сделок через «Госключ»


«У меня не осталось любви к России»: как живет актер Александр Збруев 28 июня 270K прочитали

Сестра Градского судится с врачами, из-за которых ее сын родился с аутизмом

Россия и Британия начали обсуждать новую архитектуру безопасности в Евразии

В Благовещенске умер известный спортсмен и тренер Абдурашид Раджабов


Острова укладываеюся спать...

Николай Нестеров: «Архангельск ─ старейший порт России»

Создание в Архангельске галереи северного стиля обсудили на площадках международного фестиваля «Белый июнь»

Архангельская область подключилась к проекту "Императорский маршрут"


Михаил Ведерников поздравил владыку Тихона с днем рождения

Музыкально-поэтическая композиция «Спасибо, жизнь, за всех родных людей, живущих на таком огромном свете»

Ретро-выставка «Актер на сцене должен жить»

Войны древних славян с греками


Альбина Джанабаева впервые показала сына Луку от Валерия Меладзе

«У меня не осталось любви к России»: как живет актер Александр Збруев 28 июня 270K прочитали

Губернатор Андрей Травников представил туристический потенциал Новосибирской области на выставке-форуме «Россия»

Российская молодежь больше всего тратит на покупки на маркетплейсах












Спорт в России и мире

Новости спорта


Новости тенниса
Уимблдон

Уимблдон. 2 июля. Джокович сыграет вторым запуском на Центральном корте, Маррей – третьим, турнир начнут Рублев, Сафиуллин, Швентек, Самсонова






Опубликовано точное расписание первых семи туров РПЛ

Сотрудников могут запретить наказывать за опоздание на работу из-за жары

Антарктида, Северный Полюс и автопробеги: какой отдых выбирают хайнеты и ультрахайнеты

Предпринимателям Бурятии помогут выйти на зарубежные рынки