📊 Dreaming of Graphs in the Open Lakehouse

🦾Plus: 💰 Microsoft Joins the $4T Club

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition: What's Shaping the Future of Data?

  • 🧠Computational Data Science vs. Data Science

  • 💻Alibaba Releases Opensource Qwen3 Coder Flash

  • 📚ML System Design Case Studies Repository

  • 📄RAGFlow - RAG Engine for Deep Document Understanding

  • 🎆 BFL, Krea tackle ‘AI look’ with new image model

  • 📊 Anthropic takes enterprise AI lead as spending surges

  • 💡 AI Tutorial:Create an AI agent with Mistral AI

  • 🤖 AI Tools and Data Tools to checkout

While Open Lakehouse platforms now natively support tables, geospatial data, vectors, and more, property graphs are still missing. In the age of AI and growing interest in Graph RAG, graphs are becoming especially relevant – there’s a need to deliver Knowledge Graphs to RAG systems, with standards, ETL, and frameworks for different scenarios.

Through Squarespace’s cutting-edge features that combine automation, design presets, creative guidance, and generative AI, Design Intelligence makes it easy to build a beautiful and impactful website. With just a few pieces of information, Blueprint AI generates an entire website customized based off your brand’s goals, name, and personality. It’s AI speed, with Squarespace’s 20+ years of design expertise in website building. 

A computational data scientist might tackle challenges like, “How can we build a real-time fraud detection system that processes millions of transactions per second?” or “How can we simulate protein folding to accelerate drug discovery?”

China’s Alibaba just dropped another opensource code model that matches Claude Sonnet 4's performance while running locally on consumer hardware. Qwen3-Coder-Flash brings agentic coding capabilities to a 30B parameter model that needs just 33GB of RAM, making it accessible to developers who can't afford massive cloud bills or don't want to send their code to third-party APIs.

This repository is a comprehensive collection of 500+ case studies from over 80 leading companies, showcasing practical applications and insights into machine learning (ML) system design. Companies like Netflix, Airbnb, and Doordash have shared their experiences, providing a valuable resource for anyone interested in learning how ML is used to improve products and processes…

A new open-source RAG engine built to handle complex data formats with ease.RAGFlow makes it easier to extract insights from real-world documents PDFs, spreadsheets, scans, images by deeply understanding their structure.

Veterinarians nationwide reported that corporate managers pushed clinics to focus on profit, with vets often paid based on revenue. This encouraged them to see more pets, order more tests, and upsell services, creating a growing burden for uninsured pet owners. Pet insurance could help you offset some of these rising costs, with some providing up to 90% reimbursement. View Money’s top pet insurance picks to see plans starting at only $10/month.

👨‍💻 Data Tools, Libraries

Venator (GitHub Repo)

Venator is a flexible threat detection system that simplifies rule management and deployment. It is optimized for Kubernetes deployment but can run standalone or with other job schedulers.

Zero (GitHub Repo)

Zero is a set of types and functions that enables JSX to be transpiled into DOM nodes.

🐉 rust-gpu (GitHub Repo)

rust-gpu is a project still in an early stage that aims to make Rust a first-class language and ecosystem for GPU shaders.

AI News:

Microsoft’s stock price rose so much today that it passed a $4 trillion market valuation for the first time in its 50-year history. The software maker is the second company to be valued at such a feat after Nvidia reached a market cap of over $4 trillion earlier this month.

Whether you’re looking to change careers or just learn something new, Codecademy can help. With over 600 interactive courses, plus portfolio projects and industry certification prep, you'll get hands-on experience using in-demand tech skills. Big Data News Weekly readers can use code SKILLUP15 to save 15% on a year of Codecademy Pro.

Menlo Ventures just released its mid-year LLM market report, revealing that enterprise AI spending is continuing to surge, with Anthropic emerging as the new market leader over OpenAI with a 32% model usage share. The report surveyed 150 technical leaders, finding that enterprises doubled their LLM API spending to $8.4B in the last 6 months.

AI image startup Black Forest Labs and creative platform Krea just released FLUX.1 Krea, an open-weight image model focused on eliminating the typical “AI look” with upgraded photorealism and quality. The model was trained on a diverse, curated dataset to avoid common AI outputs like waxy skin, blurry backgrounds, and oversaturated colors.

Elon Musk’s AI startup xAI has agreed to sign the “Safety and Security” chapter of the EU’s AI Code of Practice, a voluntary framework designed to help AI companies align with the upcoming EU AI law. While this move signals a partial endorsement of European AI oversight, xAI criticized other parts of the code as harmful to innovation and overly restrictive.

Figma has finally begun trading on the New York Stock Exchange after a long delay. The stock soared so quickly that trading was halted for a short time due to market volatility. Shareholders originally sold at the IPO price of $33/share, but the price now is between $101-112 with a mid-day market cap of $45 billion. Pretty nice.

Mood’s new rapid onset THC gummies deliver gentle body relaxation with a strong but balanced, clear-headed euphoria — all in as little as five minutes.

Plus, Mood gummies are USA-grown and tested by third-party labs to ensure they meet federal legal and health standards.

AI Tutorial

Create an AI agent with Mistral AI

  1. Go to Le Chat and select “Agents.”

  2. Provide detailed instructions in the instruction chatbox. For example, if you want an agent that writes like you, describe your style and paste examples of your writing.

*Note: If it says you need an active subscription, you can simply subscribe to the free plan.

  1. Adjust other settings like guardrails, tone, and knowledge accordingly.

  2. You can test your agent in the chat on the right.

  3. To finalize, click “Open New Chat” and you can start chatting with your agent.

  4. When you're in Le Chat, type “@” to view the list of your created agents.

You can further edit your agent at any time. Just go to “Agents”, click on the one you want to change, and select “Customize” to make edits.

Create How-to Videos in Seconds with AI

Stop wasting time on repetitive explanations. Guidde’s AI creates stunning video guides in seconds—11x faster.

  • Turn boring docs into visual masterpieces

  • Save hours with AI-powered automation

  • Share or embed your guide anywhere

How it works: Click capture on the browser extension, and Guidde auto-generates step-by-step video guides with visuals, voiceover, and a call to action.

🔥Top AI tools to increase productivity: 

  1. Law Blocks is a legal tech ecosystem, offering secure, accessible and affordable legal solutions with the help of AI

  2. SheetMagic: Elevate Google Sheets with AI-driven content and web scraping.

  3. StartKit.AI is a boilerplate designed to speed up the development of AI projects.

  4. CollaborativeAI - Build custom self-hosted AI assistants using the latest models like GPT-4, GPT-3.5, Claude and Gemini.

  5. Clarity AI is a state-of-the-art AI Image Upscaler & Enhancer that supports upscaling from 64px to 37 megapixels

  6. Eazl.ai represents a paradigm shift in the way professionals approach their work.

  7. airapgenerators.com -  Create unique AI Raps now, free to use.

View our database of all the best AI tools for your needs: aitoolsup.com

Have cool resources to share? Submit AI tool

A.I. Generated Image of the Day

👀 Cities from dreams

AI Tools Up NewsletterReceive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 10000 + professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.