Apple has denied using an unethically collected dataset from EleutherAI to train its flagship artificial intelligence (AI) product, Apple Intelligence. However, the company says it did use the dataset for another AI model.
After it was revealed this week that EleutherAI had built an AI training dataset that includes captions from more than 170,000 YouTube videos, Apple spoke to AppleInsider, denying that EleutherAI’s ‘the Pile’ was used to train Apple Intelligence.
However, they confirmed that ‘the Pile’ was used when developing the open-source OpenELM models released earlier this year.
What is EleutherAI’s ‘the Pile’?
EleutherAI is a non-profit organization that wants to make AI research and development more accessible to organizations outside the large tech firms, such as OpenAI, that do most of the work on large AI models.
One of the ways they do this is by providing training datasets for large language models and other AI applications. However, instead of paying licensing fees to access data, or entering into partnerships to use data from sources, EleutherAI scrapes the web to obtain its data. This includes the captions from over 170,000 YouTube videos.
‘The Pile’ is the result of this: a huge corpus of unethically sourced training data intended to lower the barrier to entry for smaller firms entering the AI market. However, larger companies have also made use of the dataset.
What is Apple’s OpenELM?
Apple says it did not use ‘the Pile’ to train Apple Intelligence, claiming that the Apple Intelligence models were trained “on licensed data, including data selected to enhance specific features, as well as publicly available data collected by our web crawler.” It has, however, admitted to using the dataset to develop its OpenELM models.
Apple released OpenELM in April. It was created for research purposes and is not used to power any of Apple Intelligence’s functions or features. Apple has told 9to5Mac that they have no plans to expand on OpenELM or release any further versions of the tool.