ru24.net
News in English
August 2024

Nonprofit scrubs illegal content from controversial AI training dataset


After Stanford Internet Observatory researcher David Thiel found links to child sexual abuse materials (CSAM) in an AI training dataset, tainting image generators, the controversial dataset was immediately taken down in 2023.

Now, the LAION (Large-scale Artificial Intelligence Open Network) team has released a scrubbed version of the LAION-5B dataset called Re-LAION-5B and claimed that it "is the first web-scale, text-link to images pair dataset to be thoroughly cleaned of known links to suspected CSAM."

To scrub the dataset, LAION partnered with the Internet Watch Foundation (IWF) and the Canadian Centre for Child Protection (C3P) to remove 2,236 links that matched hashed images in the online safety organizations' databases. Removals include all the links flagged by Thiel, as well as content flagged by LAION's partners and other watchdogs, like Human Rights Watch, which warned of privacy issues after finding photos of real kids included in the dataset without their consent.
