Data Governance in Lakehouse Using Open Source Tools 👩‍💻

🦾Plus: 👨‍⚕️ OpenAI launched HealthBench

In partnership with

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition: What's Shaping the Future of Data?

  • 🧠A hands-on introduction to Agentic RAG

  • 🚀Top 20 Artificial Intelligence Platforms for 2025

  • 💻 Visual Studio Code beefs up AI coding features.

  • 🤝Open Agent-User Interaction Protocol

  • 🧠 Sakana teaches AI to think with time

  • 📱Samsung launches thin S25 Edge

  • 💡AI Tutorial:Create interactive crosswords from your lessons

  • 🤖 AI Tools and Data Tools to checkout

Before we get into why Agentic RAG is such a big deal, it’s worth taking a step back. RAG (Retrieval-Augmented Generation) is a technique for building LLM-powered applications that can access external knowledge sources. The idea is simple: instead of relying only on what the LLM “remembers” from training, you give it relevant context at runtime - reducing hallucinations without needing to finetune the model.

This guide is your go-to resource for streamlining payments, improving cash flow, and keeping your business running smoothly.

What’s inside:

✔️ An actionable 8-step framework to create a seamless payment process

✔️ Expert strategies to reduce late payments and enhance your professional image

A well-structured payment system leads to smoother operations, happier clients, and long-term financial success.

In 2021 as new updates and improvements take over the field of AI, the ranking of these AI platforms will have to shift. We look at the 20 best artificial intelligence platforms for 2021, and why we think they deserve to be on the list.

Visual Studio Code 1.100, the latest release of Microsoft’s code editor, has arrived with several upgrades to its AI chat and AI code editing capabilities. Highlighting the list are support for Markdown-based instructions and prompt files, faster code editing in agent mode, and more speed and accuracy in Next Edit Suggestions.

CopilotKit has released AG-UI, an open, lightweight, event-based protocol that standardizes how AI agents connect to front-end applications. Think of it as a universal translator for AI-driven systems- no matter what language an agent speaks: AG-UI ensures fluent communication. It’s built to support real-time agent-user collaboration, live state streaming, and frontend tool use, without forcing you to change your agent backend.

As organizations adopt the Lakehouse architecture which blends the flexibility of data lakes with the reliability of data warehouses, the need for robust data governance becomes critical. But good governance doesn’t have to mean expensive vendor tools. With a smart selection of open-source tools, you can enforce policies, ensure data quality, and maintain compliance across your entire data platform.

You’re already enjoying free shipping and exclusive shows, but Amazon Prime has a lot more to offer. From early access to exclusive deals to unlimited photo storage, these 10 hidden perks can transform your shopping experience and help you get the most out of your membership. Don’t miss out on valuable benefits that can save you time, money, and effort. Start unlocking these powerful features today. You’re paying for them – now it’s time to make them work for you.

👨‍💻 Data Tools, Libraries

StarGuard (GitHub Repo)

Fake stars are rampant and supply chain attacks are rising. StarGuard is a command-line tool for detecting fake-star campaigns, dependency hijacks, license red flags, and other signals of open-source risk.

Airweave (GitHub Repo)

Airweave is an MCP-compatible tool that lets agents semantically search any app.

Simple Agent API: a robust, production-ready application for serving Agents as an API. A minimal, open-source setup for serving Agents using FastAPI and Postgres. Built for speed, clarity, and dev happiness.

AI News:

ChatGPT at Work: Free Resource Bundle

Power up your productivity with Mindstream's exclusive ChatGPT toolkit, designed for professionals who want to work smarter, not harder.

Your free bundle includes:

  1. ChatGPT Decision Flowchart

  2. Advanced Prompt Templates

  3. 2025 AI Productivity Guide

  4. Task Automation Framework

  5. Industry-Specific Use Cases

Join thousands of AI-powered professionals by subscribing to our daily newsletter. Get the complete bundle instantly after signup - no extra steps required.

OpenAI released HealthBench, a benchmark created with 262 physicians to evaluate how AI systems perform in health conversations — and establish a new standard for measuring AI’s safety and effectiveness in medical contexts. The benchmark tests models across several themes (like emergency referrals and global health) and behaviors (accuracy, communication quality, etc.).

Through Squarespace’s cutting-edge features that combine automation, design presets, creative guidance, and generative AI, Design Intelligence makes it easy to build a beautiful and impactful website. With just a few pieces of information, Blueprint AI generates an entire website customized based off your brand’s goals, name, and personality. It’s AI speed, with Squarespace’s 20+ years of design expertise in website building. 

Sakana AI unveiled Continuous Thought Machines (CTMs), a new type of model that makes AI more brain-like by allowing it to “think” step-by-step over time instead of making instant decisions like current AI systems do. Unlike most AI that processes information in a static, one-shot way, the CTM considers how its internal activity unfolds over time, much like our brains do.

Apple is working on versions of the AirPods and Apple Watch that incorporate a camera, and the devices could be ready to launch sometime around 2027, reports Bloomberg. Apple has developed a chip codenamed "Nevis" that will be used for its camera-equipped Apple Watch, while a chip codenamed "Glennie" will be incorporated into the AirPods.

Samsung has unveiled a thin version of its flagship smartphone called the Samsung Galaxy S25 Edge. The device is 5.8 millimeters thin and weighs 163 grams. It will go on sale on May 30, starting at $1,099. Samsung may be trying to get ahead of Apple, which is gearing up to launch the iPhone 17 Air.

Google just launched the AI Futures Fund to supercharge startup innovation with early access to advanced AI models, hands-on support and huge Google Cloud credits. Apply today.

Did you know that the World Wide Web was born in Geneva, Switzerland? Indeed, the first version of the Internet cropped up at CERN in 1989. Today the world-renowned center is home to the largest particle accelerator and to the CERN Science Gateway – a must-see hub for science enthusiasts that features hands-on exhibits, immersive virtual reality experiences, and live demonstrations.

AI Tutorial

🧩 Create interactive crosswords from your lessons

In this tutorial, you will learn how to turn any lesson material into engaging crossword puzzles by combining NotebookLM's AI analysis with CrosswordLabs' puzzle generator.

Step-by-step:

  1. Visit NotebookLM and click "Create new" to start a fresh notebook for your lesson materials.

  2. Upload your content by clicking “Add” in the Sources section: PDFs, documents, and audio files all work great.

  3. Use the prompt “Create [number] clues for a crossword in the following style. Do not add any bullets or formatting: Dog man’s best friend…” in the chat section.

  4. Copy the generated word-clue pairs and paste them directly into CrosswordLabs to automatically build your puzzle.

🔥Top AI tools to increase productivity: 

  1. iyiai.com is an AI-powered dialogue platform where users can interact with historical figures, mythological characters

  2. MyLooks.AI, your personal beauty and style coach powered by the advanced capabilities of GPT-4.

  3. Find AI- AI-powered research engine for companies and people

  4. aHelp AI Essay Writer is an easy-to-use learning platform that uses AI technology to simplify complex tasks

  5. WorkHQ – your strategic partner for redefining talent acquisition.

  6. aimusic.one is ALL In One AI Music Generator and lyrics Generator Platform, make unique MP3 songs, free to use

  7. ⏱️ Martin - A tool for productivity and time management

View our database of all the best AI tools for your needs: aitoolsup.com

Have cool resources to share? Submit AI tool

A.I. Generated Image of the Day

👀 Eldritch Drainage Tunnels

AI Tools Up NewsletterReceive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 10000 + professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.