šŸ¤–Sensitive Data Discovery with LLMs

🦾Plus: 🧠 OpenAI invests in Altman’s Neuralink rival

In partnership with

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition: What's Shaping the Future of Data?

  • šŸ”How to Get Started Monitoring Your ML Models in Production

  • šŸ“ Using Trapezoids to Visualize Matrix Multiplication

  • šŸ“± Replit launches mobile ā€œvibe codingā€ support

  • āš–ļø Real-world Claude data cuts Anthropic’s productivity claims in half

  • 🚨 Thinking Machines loses co-founders to OpenAI

  • 🦻 ChatGPT Translate just dropped

  • šŸ’” AI Tutorial:How to use Claude Cowork to perform non-technical tasks

  • šŸ¤– AI Tools and Data Tools to checkout

Can you point an LLM at your database and ask it to tag any sensitive data? I’ve been benchmarking the performance of the latest LLMs on this task, and, to spoil the surprise, the answer is quite close to "yes". Frontier models achieve ~80% recall and >80% precision tagging complex, realistic database schemas.

Introducing the first AI-native CRM

Connect your email, and you’ll instantly get a CRM with enriched customer insights and a platform that grows with your business.

With AI at the core, Attio lets you:

  • Prospect and route leads with research agents

  • Get real-time insights during customer calls

  • Build powerful automations for your complex workflows

Join industry leaders like Granola, Taskrabbit, Flatfile and more.

Machine learning model monitoring refers to tracking and understanding your production models’ performance from a science and operation point of view. In other words, it’s the tracking of a Machine learning model during production so that you can fix any potential issues that may negatively impact your business.

I started using diagrams that have helped me internalize matrix products, and it has made the bookkeeping less cumbersome. In this article, I’ll describe the approach, which I call the trapezoid diagram. After going over some examples from linear algebra, I’ll show how it can be applied to various matrix products in data science: principal component analysis

Artificial intelligence coding startup Replit is now letting users create and publish mobile apps for Apple  devices using only natural language prompts, the latest evolution in so-called vibe coding. Additionally, Replit is nearing a new round of funding that would value the startup at $9 billion, a source familiar with the matter told CNBC.

The fourth Economic Index tracked one million conversations from November 2025 using five metrics: task complexity, skill level, purpose, autonomy, and success rates. The findings: Claude accelerates high-skill, college-level tasks 12Ɨ faster than humans, but success rates dip slightly for complex work. Adjusted for reliability, AI could lift U.S. labor productivity by 1–1.2 percentage points annually which is significant, but well below early hype.

AI can help you move faster, but real leadership still requires human judgment.

The free resource 5 Traits AI Can’t Replace explains the traits leaders must protect in an AI-driven world and why BELAY Executive Assistants are built to support them.

šŸ‘Øā€šŸ’» Data Tools, Libraries

This repository contains the React components for the Cloudscape Design System. Cloudscape is a design system for building web applications that offers interactive guidelines, frontend components, design resources, and development tools

DwarFS is a read-only file system with very high compression ratios for very redundant data.

kvass is a personal key-value store.

AI News:

Mira Murati's Thinking Machines Lab just parted ways with co-founder and CTO Barret Zoph amid misconduct allegations (H/T to Kylie Robison for breaking the news), with Zoph and several other former staffers returning to OAI just hours later. Murati announced the split at an all-hands meeting and on X, with Zoph reportedly accused of sharing proprietary information with competitors.

What investment is rudimentary for billionaires but ā€˜revolutionary’ for 70,571+ investors entering 2026?

Imagine this. You open your phone to an alert. It says, ā€œyou spent $236,000,000 more this month than you did last month.ā€

If you were the top bidder at Sotheby’s fall auctions, it could be reality.

Sounds crazy, right? But when the ultra-wealthy spend staggering amounts on blue-chip art, it’s not just for decoration.

The scarcity of these treasured artworks has helped drive their prices, in exceptional cases, to thin-air heights, without moving in lockstep with other asset classes.

The contemporary and post war segments have even outpaced the S&P 500 overall since 1995.*

Now, over 70,000 people have invested $1.2 billion+ across 500 iconic artworks featuring Banksy, Basquiat, Picasso, and more.

How? You don’t need Medici money to invest in multimillion dollar artworks with Masterworks.

Thousands of members have gotten annualized net returns like 14.6%, 17.6%, and 17.8% from 26 sales to date.

*Based on Masterworks data. Past performance is not indicative of future returns. Important Reg A disclosures: masterworks.com/cd

OpenAI announced a new seed investment into Merge Labs, a brain-computer interface startup co-founded by Sam Altman that emerged from stealth alongside its $252M raise, with the AI giant becoming the company’s largest backer. Merge is aiming to boost BCI bandwidth using ultrasound and engineered proteins, skipping the surgical brain implants required by rivals like Neuralink.

President Donald Trump has signed a proclamation imposing a 25% tariff on advanced AI semiconductors, including Nvidia's H200 chips, that are produced outside the US and pass through the country before being exported to customers in China.

Wikimedia has announced enterprise partnerships with Amazon, Meta, Microsoft, Perplexity and others as part of Wikipedia's 25th anniversary. The deals give these companies streamlined, high-throughput API approach to Wikipedia's 65M articles in exchange for commercial fees that help offset rising infrastructure amounts.

OpenAI rolled out ChatGPT Translate, a standalone tool supporting 50+ languages that lets users adjust tone and context after translating. Try it here.

Know what works before you spend.

Discover what drives conversions for your competitors with Gethookd. Access 38M+ proven Facebook ads and use AI to create high-performing campaigns in minutes — not days.

AI Tutorial

How to use Claude Cowork to perform non-technical tasks

  • Open Claude Cowork on your laptop

  • Select a folder (Downloads, Notes, Project files, Screenshots, etc.)

  • Claude can now read what’s inside

  • Tell Claude what you want done

  • Claude edits, creates, and organizes directly in your files

  • You can use it to auto-organize your folders, create spreadsheets from screenshots, draft reports from scattered notes, and more

šŸ”„Top AI tools to increase productivity: 

  1. Nectar AI is an AI companion platform where users can create and roleplay

  2. FeatureShark is an all-in-one platform designed to revolutionize how you collect and manage customer feedback.

  3. BeadPattern, AI-powered perler bead pattern maker with smart color matching.

  4. Brandmaven is a brand intelligence platform for marketers, powered by AI.

  5. SEOzilla - Let AI Transform Your SEO: Publish Articles That Rank Every Day

  6. SellerPic is an AI SaaS platform purpose-built for e-commerce sellers

  7. Undress AI Tool is a website that offers a deepnude application, allowing users to create modified images

  8. Screenloop is the ultimate Talent Operations Platform, seamlessly integrating a next-gen ATS

View our database of all the best AI tools for your needs: aitoolsup.com

Have cool resources to share? Submit AI tool

A.I. Generated Image of the Day

šŸ‘€ Information density

AI Tools Up NewsletterReceive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 20000 + professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.