fosstodon.org is one of the many independent Mastodon servers you can use to participate in the fediverse.
Fosstodon is an invite-only Mastodon instance open to anyone interested in technology, particularly free and open source software. If you wish to join, contact us for an invite.

#dataquality

5 posts · 5 participants · 0 posts today

#DataFest2025 #KODAQS #DataQuality
Data Fest 2025, which takes place from 28 to 30 March at the Ludwig-Maximilians-Universität in Munich, is getting closer. KODAQS is officially taking part for the first time this year. The competition gives students the opportunity to work on extensive data sets in teams of 3-5 people within 48 hours. KODAQS will contribute a team of experts to measure and analyse data quality.
datafest.de/home

Why should you care that the software you use is open-source?

With OSS you get peace of mind through:

- Transparency into how things work
- Ability to contribute and improve the software
- Community-driven innovation

With Recce you can see exactly how each diff works:

@ChrisMayLA6 A key ingredient in AI is data. I have spent much of my career helping organisations manage #dataquality and it is fair to say, the quality of data related to most organisations is pretty poor at best. Feeding poor data into any LLM or AI tool will not deliver the results anticipated, but as often observed, may come up with a plausibly wrong answer. Treat all outputs of AI with extreme caution!

Dealing with a supplier for a repair.
Them: “And that will be done in the Gorey Depot which is the nearest one to you.”
Me: “I’m nowhere near Gorey”
Them: “But your eircode says your nearest depot is in Gorey”
Me: “Nope. It’s 3 minutes from me in Wexford”
Them: “OK. I’ll change that”
#DataQuality

Data analytics, big data, and AI algorithms are revolutionizing data extraction, usage, and decision-making. Web scraping is advancing rapidly, with AI algorithms enabling the quick extraction of massive data volumes. This provides businesses with immediate, actionable insights to stay ahead.

AI web scrapers are transforming data handling. They navigate dynamic websites, enhance accuracy, and scale effortlessly. With a 17.8% CAGR forecast by 2033, this $3.3B market improves adaptability, data quality, and speed. Companies adopting AI tools secure a competitive edge in the information economy.

forbes.com/councils/forbestech

#ai #webscraping #data #dataanalytics #bigdata #algorithms #aialgorithms #dynamicweb #datacollection #dataquality #dataprivacy #datascience #aiforbusiness #techinretail #dataefficiency #digitaltransformation #informationeconomy #scalabletech #dataeconomy

I released some updates for #ABECTO: github.com/fusion-jena/abecto/
ABECTO is a tool that compares #RDF data to spot errors 📑 and assess completeness 📏.
Recent changes:
➡️ adjust result export for #Wikidata Mismatch Finder to changed format (phabricator.wikimedia.org/T313)
➡️ add reporting of qualifier mismatches to Wikidata Mismatch Finder export
➡️ suppress illegal empty external values in Wikidata Mismatch Finder export

@nightrose #DataQuality #Ontologies #KnowledgeGraphs

GitHub: Releases · fusion-jena/abecto. An ABox Evaluation and Comparison Tool for Ontologies.
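
To make the idea of comparing RDF data "to spot errors and assess completeness" concrete, here is a minimal Python sketch. It is not ABECTO's implementation or API. It assumes two sources reduced to plain (subject, predicate, object) string tuples, and the function names and the `ex:` identifiers are hypothetical, made up for illustration.

```python
from collections import defaultdict


def index_by_subject_predicate(triples):
    """Group object values by (subject, predicate) pair."""
    index = defaultdict(set)
    for subject, predicate, obj in triples:
        index[(subject, predicate)].add(obj)
    return index


def compare(source_a, source_b):
    """Compare two triple collections.

    Returns:
      mismatches   -- pairs present in both sources with different values
      missing_in_a -- pairs only source B has (completeness gap in A)
      missing_in_b -- pairs only source A has (completeness gap in B)
    """
    a = index_by_subject_predicate(source_a)
    b = index_by_subject_predicate(source_b)
    shared = a.keys() & b.keys()
    mismatches = {key: (a[key], b[key]) for key in shared if a[key] != b[key]}
    missing_in_a = sorted(b.keys() - a.keys())
    missing_in_b = sorted(a.keys() - b.keys())
    return mismatches, missing_in_a, missing_in_b


if __name__ == "__main__":
    # Made-up example data, not taken from any real data set.
    source_a = [
        ("ex:A", "ex:label", "Alpha"),
        ("ex:A", "ex:size", "1"),
        ("ex:B", "ex:label", "Beta"),
    ]
    source_b = [
        ("ex:A", "ex:label", "Alpha"),
        ("ex:A", "ex:size", "2"),
    ]
    mismatches, missing_in_a, missing_in_b = compare(source_a, source_b)
    print("mismatches:", mismatches)
    print("missing in A:", missing_in_a)
    print("missing in B:", missing_in_b)
```

A real comparison also has to decide which resources in the two sources refer to the same thing and how to treat differing literal formats. This sketch skips both by assuming identical identifiers and exact string equality.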