#trainingdata

6 Essential Data Annotation Techniques that Drive Computer Vision

Our latest video on the 6 common types of annotation in Computer Vision reveals how the perfect blend of human intelligence and cutting-edge data annotation techniques can significantly enhance the performance and scalability of your AI and ML models.

youtube.com/watch?v=EHXVzz7VHvo

“Suno, for those of you not familiar, is an #AI #SongGenerator: enter a text prompt (such as “a jazz, reggae, EDM pop song about my imagination”) and a song comes back. Like many #GenerativeAI companies, it is also being sued by all and sundry for ingesting #copyrighted #material. The parties in the suit — including major labels and the #RIAA — don’t have a smoking gun, since they can’t directly peek at Suno’s #TrainingData. But they have managed to generate some suspiciously similar-sounding AI generated materials, #mimicking (among others) “Johnny B. Goode,” “Great Balls of Fire,” and Jason Derulo’s habit of singing his own name.

#Suno essentially admits these songs were #regurgitated from #copyrighted source material, but it says such use was legal. “It is no secret that the tens of millions of #recordings that Suno’s model was trained on presumably included recordings whose rights are owned by the Plaintiffs in this case,” it says in its own legal filing. Whether AI training data constitutes fair use is a common but unsettled legal argument, and the plaintiffs contend Suno still amounts to “pervasive #illegal #copying” of artists’ works.”

#NYA / #music / #ElizabethLopatto / #amazon / #DataTheft <neilyoungarchives.com/news/3/a>

neilyoungarchives.com · Neil Young Archives

“Broad didn’t train his #AI on #Rothko; he didn’t train it on any #data at all. By hacking a #NeuralNetwork, and locking elements of it into a #recursive #loop, he was able to induce this AI into producing #images without any #TrainingData at all — no inputs, no influences.

Depending on your perspective, Broad’s art is either a pioneering display of pure artificial creativity, a look into the very soul of AI, or a clever but meaningless electronic by-product, closer to guitar feedback than music.

In any case, his work points the way toward a more creative and ethical use of #GenerativeAI beyond the large-scale manufacture of #DerivativeSlop now oozing through our visual culture.”

#Art / #TerenceBroad / #UnstableEquilibrium <theverge.com/ai-artificial-int>

The Verge · What happens when you feed AI nothing · By Franklin Schneider

🐢 Oh, look! A thrilling 120-page #novella about Claude 4 System Cards, because in the #AI world, longer is clearly better... right? 📜 Filled with steaming details for those who miss "Person of Interest" fan fiction and enjoy deciphering cryptic training data. 🎭 Meanwhile, landing pages remain a mythical creature in Anthropic's universe. 🦄
simonwillison.net/2025/May/25/ #Claude4 #FanFiction #TrainingData #MythicalCreatures #HackerNews #ngated

Simon Willison’s Weblog · System Card: Claude Opus 4 & Claude Sonnet 4 · Direct link to a PDF on Anthropic's CDN because they don't appear to have a landing page anywhere for this document. Anthropic's system cards are always worth a look, and …

Open Web Crawl is such a security vulnerability that I don’t know why it isn’t at the top of the news every day.

If you turn on a general suction hose, how do you not realise there’s going to be a party of attackers right there feeding it all the #propaganda they possibly can?

How can you be so nonchalant about it? How do you not realise you created the biggest attack vector in the history of computing?