Data Engineering is Not Software Engineering 👨‍💻

🦾Plus: 👀 Broadcom-OpenAI $10B AI Chip Deal

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition: What's Shaping the Future of Data?

  • 🌟The Data Stack Report 2025

  • 📚Datasets for Natural Language Processing

  • 📊Understanding Multilevel Modeling

  • 🤖AI Overlords: Redesigning Data Systems to be Agent-First

  • 👀FTC launches probe into AI chatbot risks

  • 📱 TikTok Tops 200M Monthly Users in Europe

  • 💡 AI Tutorial:Transform photos into 3D-style visuals

  • 🤖 AI Tools and Data Tools to checkout

We, at Metabase, asked 338 teams in our community about how they build and use their data stacks, from tool choices to AI adoption, and built a community resource for data stack decisions in 2025…Some of our findings: - Postgres wins it all: #1 transactional DB and #1 analytics storage - 50% of teams skip warehouses/lakes - Data teams stay small:

For quality online learning at a price ranked in the bottom third compared to our online competitors in 2025, think Liberty University. Liberty is a nonprofit institution that offers a world-class Christian education online so you can pursue your goals wherever you have internet. Discounts for service members and first responders can make learning at Liberty even more accessible. Experience online convenience with on-campus benefits when you enroll at Liberty University today!

In the realm of Natural Language Processing (NLP), datasets are the bedrock upon which the foundations of language understanding and communication between humans and machines are built. NLP, a subfield of artificial intelligence, thrives on the rich and intricate tapestry of human language

In recent years, it would appear that data engineering is converging with DevOps. Both have embraced cloud infrastructure, containerization, CI/CD, and GitOps to deliver reliable digital products to their customers. The convergence on a subset of tooling has led many to the opinion that there is no significant distinction between data engineering and software engineering.

Multilevel modeling (also known as hierarchical linear modeling or mixed-effects modeling) extends the panel data framework to handle nested data structures that are ubiquitous in health research. While panel data considers observations across entities and time, multilevel models address situations where individual observations are nested within higher-level units, creating natural hierarchies in the data structure….

Large Language Model (LLM) agents, acting on their users' behalf to manipulate and analyze data, are likely to become the dominant workload for data systems in the future…We argue that data systems need to adapt to more natively support agentic workloads. We take advantage of the characteristics of agentic speculation that we identify, i.e., scale, heterogeneity, redundancy, and steerability - to outline a number of new research opportunities for a new agent-first data systems architecture

What if the solution to muscle spasms, bladder issues, fatigue, and brain fog…Was hiding in your water?

And in just seconds a day you could turn back the clock on your body!?

Without painful treatments, expensive doctor bills, or even leaving your house…In one study, participants experienced a difference in as little as 2 hours!

👨‍💻 Data Tools, Libraries

Foundations (GitHub Repo)

Foundations is a modular Rust library designed to help scale programs for distributed production-grade systems.

Proton (GitHub Repo)

Proton is a streaming SQL engine powered by ClickHouse. It can help developers solve streaming data processing and routing and analytics challenges and send aggregated data to downstream systems

ViroReact (GitHub Repo)

ViroReact is a library for building augmented reality and virtual reality experiences. It can run React Native code natively across all mobile VR and AR platforms.

AI News:

If you had Broadcom stock, it was indeed a happy Friday. The 15% jump Broadcom closed up 15% after securing a $10 billion custom AI chip order, reportedly from OpenAI. This solidifies its position as a serious contender to Nvidia in the AI infrastructure race. Analysts are bullish on Broadcom’s custom silicon strategy, which could drive AI-related revenue above $40 billion in fiscal 2026.

LeafGuard replaces your entire system with a design that's earned the Good Housekeeping Seal for 15 consecutive years. AI News Roundup readers qualify for 75% off installation plus $200 off. Custom-manufactured for your home with lifetime protection.

Top U.S. tech leaders, including Google’s Sundar Pichai, IBM’s Arvind Krishna, and OpenAI’s Sam Altman, joined government officials in Washington to launch the “Presidential AI Challenge,” a new initiative focused on bringing artificial intelligence into K-12 education. The event emphasized AI’s role as a powerful tool for preparing students for the future, with leaders highlighting its potential to transform classrooms, enhance learning, and inspire the next generation of innovators.

The U.S. Federal Trade Commission is preparing a major study into the risks posed by AI-powered chatbots from companies like OpenAI, Google, and Meta, focusing on privacy, data handling, and potential harms to children and other users. Using its 6(b) authority, the FTC will compel the nine largest chatbot providers to share information

Roughly one-third of the EU population is now on TikTok daily. The app still faces scrutiny under the Digital Services Act (DSA), where it’s classified as a Very Large Online Platform (VLOP) and must comply with stricter transparency, moderation, and algorithmic accountability rules. As regulatory pressure builds, TikTok’s ability to scale while maintaining compliance will be a key watchpoint for marketers and global tech operators

Users can now view live translations while scrolling thanks to Google’s newest update to its “Circle to Search” feature, eliminating the need to pause and reselect text. This enhancement makes browsing foreign-language content more seamless across apps, particularly useful for travel, research, and global shopping.

Here’s how it works:

  1. Take our questionnaire and get matched with a therapist.

  2. Schedule a time to meet and communicate on your terms.

  3. Reach out to your therapist anytime, from anywhere.

AI Tutorial

📷 Transform photos into 3D-style visuals

In this tutorial, you will learn how to use Google’s Nano Banana model to recreate any room or environment in isometric view, giving you a bird's-eye perspective that reveals hidden details and creates visuals for content/design mockups.

Step-by-step:

  1. Go to gemini.google.com, toggle on "Tools", and select "Create Images" (with the banana icon)

  2. Upload any room photo and prompt: "Recreate this image in isometric view" —suddenly see details that weren't visible before

  3. Refine elements: "Make the room bigger," "Add punk rock theme with minimalist chandelier" — Nano Banana edits without regenerating the image

  4. Swap environments: "Change cityscape window to ocean view" or "Add natural sunlight and a door to another room" — perfect for testing interior design ideas

  5. Push further with VEO: Upload your edited image and prompt "Make this room lively by adding two dogs running through" to create a video with sound effects

Your boss will think you’re a genius

Optimizing for growth? Go-to-Millions is Ari Murray’s ecommerce newsletter packed with proven tactics, creative that converts, and real operator insights—from product strategy to paid media. No mushy strategy. Just what’s working. Subscribe free for weekly ideas that drive revenue.

🔥Top AI tools to increase productivity: 

  1. Looksmax AI analyzes your physical appearance, and shares AI-generated self-improvement tips

  2. PhotoPacks.AI is a platform that enables generating high-quality professional headshots

  3. Growth Makers is a team of AI agents that finds growth hacking strategies for your business.

  4. ContentPieAI, say goodbye to the hassle of juggling multiple tools and spending hours on end crafting content

  5. Glowup AI, your personalized AI beauty companion.

  6. AI Humanize is a tool that transforms AI-generated text into writing that closely resembles human text

  7. Robopic by StackForward LLC transforms your digital photography experience

  8. airapgenerators.com -  Create unique AI Raps now, free to use.

View our database of all the best AI tools for your needs: aitoolsup.com

Have cool resources to share? Submit AI tool

A.I. Generated Image of the Day

👀 What are they plotting?

AI Tools Up NewsletterReceive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 10000 + professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.