#datalake

HackerNoon<p>It is not possible to eliminate the risk of failures, but it is possible to mitigate them by making failures explainable, detectable, and manageable. <a href="https://hackernoon.com/diving-deep-into-data-lake-observability-why-it-matters-more-than-ever" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackernoon.com/diving-deep-int</span><span class="invisible">o-data-lake-observability-why-it-matters-more-than-ever</span></a> <a href="https://mas.to/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a></p>
Winbuzzer<p>Microsoft Unveils Sentinel Data Lake to Power AI Defenses and Cut Security Costs</p><p><a href="https://mastodon.social/tags/Cybersecurity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cybersecurity</span></a> <a href="https://mastodon.social/tags/Microsoft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Microsoft</span></a> <a href="https://mastodon.social/tags/MicrosoftSentinel" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MicrosoftSentinel</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/CloudSecurity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CloudSecurity</span></a> <a href="https://mastodon.social/tags/SIEM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SIEM</span></a> <a href="https://mastodon.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a></p><p><a href="https://winbuzzer.com/2025/07/22/microsoft-unveils-sentinel-data-lake-to-power-ai-defenses-and-cut-security-costs-xcxwbn" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/07/22/micro</span><span class="invisible">soft-unveils-sentinel-data-lake-to-power-ai-defenses-and-cut-security-costs-xcxwbn</span></a></p>
Graylog<p>⬆️ Data volumes continue to rise. In fact, within industries like <a href="https://infosec.exchange/tags/engineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>engineering</span></a> and <a href="https://infosec.exchange/tags/finance" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>finance</span></a>, the volume and volatility of log data have even outpaced the capacity of traditional <a href="https://infosec.exchange/tags/SIEM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SIEM</span></a> and analytics tools. 😰 What this means is... with orgs facing high costs and fatigue, the ones that thrive will be the ones that treat storage and retrieval as distinct functions. 🤔 </p><p>This is where selective retrieval comes in—the ability to triage, park, and later selectively ingest high-volume data from a centralized repository for forensic or compliance-driven investigation. 🙌 </p><p>Read this excellent article by <a href="https://infosec.exchange/tags/Graylog" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Graylog</span></a>'s Adam Abernethy in BigDATAwire to learn about:<br>🌏 Selective retrieval examples in the real world<br>⚠️ Risk coverage without always-on cost<br>🔒 Flexibility without architectural lock-in<br>💻 The technological shifts that are converging to make selective retrieval possible and necessary<br>↔️ How selective retrieval bridges the gap between data engineering complexity and <a href="https://infosec.exchange/tags/security" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>security</span></a> usability<br>💼 The business case for selective retrieval, especially for mid-size IT teams<br>🛂 Regaining control over data sprawl<br>➕ More</p><p><a href="https://www.bigdatawire.com/2025/07/14/rethinking-risk-the-role-of-selective-retrieval-in-data-lake-strategies/" rel="nofollow noopener" translate="no" target="_blank"><span 
class="invisible">https://www.</span><span class="ellipsis">bigdatawire.com/2025/07/14/ret</span><span class="invisible">hinking-risk-the-role-of-selective-retrieval-in-data-lake-strategies/</span></a> <a href="https://infosec.exchange/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://infosec.exchange/tags/logdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>logdata</span></a> <a href="https://infosec.exchange/tags/datamanagement" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datamanagement</span></a> <span class="h-card" translate="no"><a href="https://infosec.exchange/@bigabe" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>bigabe</span></a></span> <span class="h-card" translate="no"><a href="https://bird.makeup/users/bigdatawirenews" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>bigdatawirenews</span></a></span></p>
Sebastian Lauwers<p>New project alert! Comparqter, a tool that compacts Parquet files and optimises file sizes.</p><p><a href="https://codeberg.org/unticks/comparqter" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">codeberg.org/unticks/comparqter</span><span class="invisible"></span></a></p><p><a href="https://mastodon.online/tags/rust" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rust</span></a> <a href="https://mastodon.online/tags/parquet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>parquet</span></a> <a href="https://mastodon.online/tags/s3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>s3</span></a> <a href="https://mastodon.online/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a></p>
Salar Rahmanian :verified: :scala: :swift: :nix:<p>🎉 Huge thanks to the LanceDB CEO / cofounder Chang She for delivering an incredible talk on "Search, Retrieval, Training, and Analytics with Modern AI Data Lake" at <a href="https://social.softinio.com/tags/dataandaiengineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataAndAIEngineering</span></a> <a href="https://social.softinio.com/tags/sanfrancisco" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SanFrancisco</span></a> <a href="https://social.softinio.com/tags/meetup" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>meetup</span></a> !</p><p>📹 Great news - the recording is now available! Check it out if you missed it or want to revisit the key concepts. 👇</p><p><a href="https://watch.softinio.com/w/mVkLgtcQw8Qv5vA4v8SDHB" rel="nofollow noopener" target="_blank">https://watch.softinio.com/w/mVkLgtcQw8Qv5vA4v8SDHB</a></p><p><a href="https://social.softinio.com/tags/dataengineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://social.softinio.com/tags/aiengineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIEngineering</span></a> <a href="https://social.softinio.com/tags/sanfrancisco" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SanFrancisco</span></a> <a href="https://social.softinio.com/tags/lancedb" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LanceDB</span></a> <a href="https://social.softinio.com/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a> <a href="https://social.softinio.com/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://social.softinio.com/tags/vectordb" class="mention hashtag" rel="nofollow noopener" 
target="_blank">#<span>VectorDB</span></a> <a href="https://social.softinio.com/tags/database" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Database</span></a> <a href="https://social.softinio.com/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://social.softinio.com/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialIntelligence</span></a></p>
Data Quine<p>"Centralize Your Data Lake: Apache Polaris Supports Apache Iceberg and Now Delta Lake"</p><p>BTW 'Polaris' used to be the name of the UK nuclear deterrent pre 1996. 😬</p><p><a href="https://snowflake.com/en/engineering-blog/apache-polaris-supports-iceberg-delta-lake/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">snowflake.com/en/engineering-b</span><span class="invisible">log/apache-polaris-supports-iceberg-delta-lake/</span></a></p><p><a href="https://datasci.social/tags/ApacheIceberg" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ApacheIceberg</span></a> <a href="https://datasci.social/tags/ApachePolaris" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ApachePolaris</span></a> <a href="https://datasci.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a></p>
Data Quine<p>First I thought I'd found the Loch Ness Monster...turns out to be Nessie instead. 🦕 </p><p>Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics<br>"Nessie is to Data Lakes what Git is to source code repositories..."</p><p><a href="https://projectnessie.org/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">projectnessie.org/</span><span class="invisible"></span></a></p><p><a href="https://datasci.social/tags/ProjectNessie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ProjectNessie</span></a> <a href="https://datasci.social/tags/LochNessMonster" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LochNessMonster</span></a> <a href="https://datasci.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://datasci.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a></p>
N-gated Hacker News<p>Ah, the $10/month Lakehouses: because who wouldn't want a bargain-basement data lake with all the charm of a timeshare in purgatory? 🤔💸 Just add a sprinkle of buzzwords like "DuckLake" and "time travel" and voilà, you've got a tech article that feels like a 2-hour <a href="https://mastodon.social/tags/infomercial" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>infomercial</span></a> for something you'll never use. 📈🔮<br><a href="https://tobilg.com/the-age-of-10-dollar-a-month-lakehouses" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">tobilg.com/the-age-of-10-dolla</span><span class="invisible">r-a-month-lakehouses</span></a> <a href="https://mastodon.social/tags/Lakehouses" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Lakehouses</span></a> <a href="https://mastodon.social/tags/DuckLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DuckLake</span></a> <a href="https://mastodon.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a> <a href="https://mastodon.social/tags/TechTrends" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechTrends</span></a> <a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/ngated" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ngated</span></a></p>
Graylog<p><a href="https://infosec.exchange/tags/TBT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TBT</span></a>... to an entire week ago at <a href="https://infosec.exchange/tags/RSAC" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RSAC</span></a> where Seth Goldhammer had the chance to demo Graylog's data telemetry pipeline management! 🖥️ ⭐ </p><p>Join Seth as he talks about data lakes, data lake previews, getting your data back when you need it, and more. </p><p>Wanna learn more about this topic? Here you go: <a href="https://graylog.org/post/security-data-lake-strategy/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">graylog.org/post/security-data</span><span class="invisible">-lake-strategy/</span></a> <a href="https://infosec.exchange/tags/RSA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RSA</span></a> <a href="https://infosec.exchange/tags/RSAC2025" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RSAC2025</span></a> <a href="https://infosec.exchange/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://infosec.exchange/tags/datamanagement" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datamanagement</span></a> <a href="https://infosec.exchange/tags/datapipeline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datapipeline</span></a></p>
Habr<p>Secrets of Spark in Arenadata Hadoop: how we sped up building data marts for ML tasks</p><p>Hi, Habr! I'm Dmitry Zhikharev, CPO of the RAISA artificial intelligence platform at the AI Lab of RSHB-Intech. In this article, Alexander Ryndin @aryndin9999, the architect of our platform, and I explain how we set up the interaction between the AI Platform and the Data Lake to work with data marts for machine learning models using Spark.</p><p><a href="https://habr.com/ru/companies/rshb/articles/904072/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">habr.com/ru/companies/rshb/art</span><span class="invisible">icles/904072/</span></a></p><p><a href="https://zhub.link/tags/spark" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>spark</span></a> <a href="https://zhub.link/tags/arenadata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>arenadata</span></a> <a href="https://zhub.link/tags/hadoop" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hadoop</span></a> <a href="https://zhub.link/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://zhub.link/tags/%D0%B2%D0%B8%D1%82%D1%80%D0%B8%D0%BD%D0%B0_%D0%B4%D0%B0%D0%BD%D0%BD%D1%8B%D1%85" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>витрина_данных</span></a> <a href="https://zhub.link/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://zhub.link/tags/%D0%BF%D0%BB%D0%B0%D1%82%D1%84%D0%BE%D1%80%D0%BC%D0%B0" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>платформа</span></a> <a href="https://zhub.link/tags/livy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>livy</span></a></p>
InfoQ<p>Shifting Left isn’t just a buzzword - it’s the foundation for efficiency in your organization!</p><p>By making clean, reliable, and accessible data available across your organization, you reduce complexity and unlock time to focus on higher-value work.</p><p>💡 Data products are the foundation of this <a href="https://techhub.social/tags/ShiftLeft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ShiftLeft</span></a>, enabling healthy, scalable data communication.</p><p>📖 Dive into the details in the <a href="https://techhub.social/tags/InfoQ" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>InfoQ</span></a> article: <a href="https://bit.ly/3WHjxsf" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">bit.ly/3WHjxsf</span><span class="invisible"></span></a> </p><p><a href="https://techhub.social/tags/SoftwareArchitecture" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareArchitecture</span></a> <a href="https://techhub.social/tags/DataMesh" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataMesh</span></a> <a href="https://techhub.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a> <a href="https://techhub.social/tags/DataPipelines" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataPipelines</span></a> <a href="https://techhub.social/tags/ETL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ETL</span></a></p>
Gytis Repečka<p>Attended an event <em>Brewing Data with Snowflake</em> yesterday in Vilnius :blobcatnerd:</p><p>Some of the key insights:</p><ul><li>Medallion Architecture (good or bad) is widespread.</li><li>Snowflake and Databricks are clear competitors, targeting a similar landscape.</li><li>Open formats are trending: file format, table format, catalog, etc. - the more of them are open source, the better.</li><li>The time travel feature is important; many users have already used it for disaster recovery.</li><li>Clear distinction of <strong>Storage</strong> from <strong>Compute</strong> (generic cloud approach).</li></ul><p>Full text of one of the slides presented:</p><blockquote><p>Strategic Architecture Outlook</p><ul><li>Agility &amp; Future-Proofing - Open, portable data means you can adopt new technologies or switch platforms with minimal friction. No single vendor can hold your data hostage, so you can evolve your architecture as needed.</li><li>Multi-Cloud and Hybrid - An open data layer can span clouds and on-prem seamlessly. You avoid cloud vendor lock-in and leverage best-of-breed services on different clouds using the same data. This flexibility is key for resilience and optimization.</li><li>Accelerating Innovation - When any team can access data with the tools of their choice, experimentation flourishes. Open data fosters AI/ML and cross-domain analytics since data isn't locked in silos - more innovation and insights from the same data.</li><li>Vendor Leverage - Strategically, using open standards increases your leverage in vendor negotiations. 
You can opt in or out of services more freely, pushing vendors to provide value (since you're not irreversibly locked to them).</li></ul></blockquote><p><a href="https://social.gyt.is/tags/data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>data</span></a> <a href="https://social.gyt.is/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://social.gyt.is/tags/datalakehouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalakehouse</span></a> <a href="https://social.gyt.is/tags/medallion" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>medallion</span></a> <a href="https://social.gyt.is/tags/architecture" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>architecture</span></a> <a href="https://social.gyt.is/tags/snowflake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>snowflake</span></a> <a href="https://social.gyt.is/tags/vilnius" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vilnius</span></a> <a href="https://social.gyt.is/tags/lithuania" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lithuania</span></a> <a href="https://social.gyt.is/tags/bigdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bigdata</span></a> <a href="https://social.gyt.is/tags/event" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>event</span></a> <a href="https://social.gyt.is/tags/meetup" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>meetup</span></a></p>
Justin Buzzard<p>A Data Lake in the software world is essentially where raw data is taken and turned into something tangible like reports, often using AI/machine learning, and then put into the Data Warehouse. <a href="https://mastodon.social/tags/software" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>software</span></a> <a href="https://mastodon.social/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://mastodon.social/tags/datawarehouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datawarehouse</span></a></p>
Bernhard Luecke<p>🟢 Demo: SAP Business Data Cloud | SAP Business Unleashed <a href="https://youtu.be/OkwQimWDeos?si=UNGdcAVobyMNCkUm" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">youtu.be/OkwQimWDeos?si=UNGdcA</span><span class="invisible">VobyMNCkUm</span></a> via @YouTube <br>(and find related Videos in the SAP channel - see below)</p><p><a href="https://techhub.social/tags/SAP" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SAP</span></a> <a href="https://techhub.social/tags/SAPBDC" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SAPBDC</span></a> <a href="https://techhub.social/tags/GenAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenAI</span></a> <a href="https://techhub.social/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://techhub.social/tags/DataCloud" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataCloud</span></a> <a href="https://techhub.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a> <a href="https://techhub.social/tags/SAPChampions" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SAPChampions</span></a> <a href="https://techhub.social/tags/SAPBW" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SAPBW</span></a> <a href="https://techhub.social/tags/SAPDatasphere" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SAPDatasphere</span></a> <span class="h-card" translate="no"><a href="https://a.gup.pe/u/sap" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>sap</span></a></span></p>
Sarah Lea<p>There is no need to move data. Data latency is minimised. Data can be transformed and analysed within a single platform.</p><p>Let me know what you know about Zero-ETL :blobcoffee: </p><p>"Why ETL-Zero? Understanding the shift in Data Integration" by Sarah Lea on Medium: <a href="https://medium.com/towards-data-science/why-etl-zero-understanding-the-shift-in-data-integration-as-a-beginner-d0cefa244154" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">medium.com/towards-data-scienc</span><span class="invisible">e/why-etl-zero-understanding-the-shift-in-data-integration-as-a-beginner-d0cefa244154</span></a></p><p><a href="https://techhub.social/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a> <a href="https://techhub.social/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://techhub.social/tags/cloudcomputing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cloudcomputing</span></a> <a href="https://techhub.social/tags/etl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>etl</span></a> <a href="https://techhub.social/tags/zeroetl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>zeroetl</span></a> <a href="https://techhub.social/tags/salesforce" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>salesforce</span></a> <a href="https://techhub.social/tags/data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>data</span></a> <a href="https://techhub.social/tags/tech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tech</span></a> <a href="https://techhub.social/tags/technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>technology</span></a> <a href="https://techhub.social/tags/datawarehousing" class="mention 
hashtag" rel="nofollow noopener" target="_blank">#<span>datawarehousing</span></a> <a href="https://techhub.social/tags/datalakehouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalakehouse</span></a></p>
InfoQ<p>A <a href="https://techhub.social/tags/ShiftLeft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ShiftLeft</span></a> approach to <a href="https://techhub.social/tags/DataProcessing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataProcessing</span></a> relies on data products, which form the basis of data communication across the business.</p><p>This addresses many flaws in traditional data processing and makes data more relevant, complete, and trustworthy.</p><p><a href="https://techhub.social/tags/InfoQ" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>InfoQ</span></a> article: <a href="https://bit.ly/3WHjxsf" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">bit.ly/3WHjxsf</span><span class="invisible"></span></a> </p><p><a href="https://techhub.social/tags/SoftwareArchitecture" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareArchitecture</span></a> <a href="https://techhub.social/tags/DataMesh" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataMesh</span></a> <a href="https://techhub.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a> <a href="https://techhub.social/tags/DataPipelines" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataPipelines</span></a> <a href="https://techhub.social/tags/ETL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ETL</span></a></p>
Stefan Ziegler<p>The house at the lake, Teil 3 - The Dashboard Diaries: <a href="https://blog.sogeo.services/blog/2025/01/26/house-at-the-lake-03.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.sogeo.services/blog/2025/</span><span class="invisible">01/26/house-at-the-lake-03.html</span></a> <a href="https://mstdn.social/tags/Trino" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Trino</span></a> <a href="https://mstdn.social/tags/SQL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SQL</span></a> <a href="https://mstdn.social/tags/datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalake</span></a> <a href="https://mstdn.social/tags/datalakehouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datalakehouse</span></a> <a href="https://mstdn.social/tags/lakehouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lakehouse</span></a> <a href="https://mstdn.social/tags/duckdb" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>duckdb</span></a> <a href="https://mstdn.social/tags/apacheiceberg" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>apacheiceberg</span></a></p>
InfoQ<p><a href="https://techhub.social/tags/ApacheHudi" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ApacheHudi</span></a> 1.0 is now generally available!</p><p>The release introduces new features aimed at transforming data lakehouses into what the project community considers a fully-fledged "Data Lakehouse Management System" (DLMS).</p><p>Details on <a href="https://techhub.social/tags/InfoQ" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>InfoQ</span></a> 👉 <a href="https://bit.ly/3E5AXZi" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">bit.ly/3E5AXZi</span><span class="invisible"></span></a> </p><p><a href="https://techhub.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://techhub.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a> <a href="https://techhub.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://techhub.social/tags/DataAnalytics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataAnalytics</span></a></p>
Thilo Dotzel 🤓(Mr. Storage )<p>All in one.<br>Massively scalable, software defined storage (<a href="https://techhub.social/tags/SDS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SDS</span></a>) for modern workloads with support for file, block and object based applications:<br>➡️ <a href="https://ibm.com/products/ceph" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">ibm.com/products/ceph</span><span class="invisible"></span></a> </p><p>👁🐝Ⓜ️<br><a href="https://techhub.social/tags/IBM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IBM</span></a> <a href="https://techhub.social/tags/RedHat" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RedHat</span></a><br><a href="https://techhub.social/tags/IBMStorage" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IBMStorage</span></a> <br><a href="https://techhub.social/tags/IBMStorageCeph" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IBMStorageCeph</span></a> <a href="https://techhub.social/tags/DataLake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLake</span></a><br><a href="https://techhub.social/tags/IBMtechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IBMtechnology</span></a> <a href="https://techhub.social/tags/technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>technology</span></a><br><a href="https://techhub.social/tags/IBMStorageRocks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IBMStorageRocks</span></a>🚀</p>
Stefan Ziegler<p>The house at the lake, Teil 2 - Start your engines: <a href="https://blog.sogeo.services/blog/2025/01/12/house-at-the-lake-02.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.sogeo.services/blog/2025/</span><span class="invisible">01/12/house-at-the-lake-02.html</span></a> <a href="https://mstdn.social/tags/Spark" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spark</span></a> <a href="https://mstdn.social/tags/ApacheIceberg" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ApacheIceberg</span></a> <a href="https://mstdn.social/tags/SQL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SQL</span></a> <a href="https://mstdn.social/tags/Datalake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Datalake</span></a> <a href="https://mstdn.social/tags/Lakehouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Lakehouse</span></a> <a href="https://mstdn.social/tags/DuckDB" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DuckDB</span></a></p>