Using Amazon SageMaker Lakehouse with DuckDB 🤖

🦾Plus: 🧸 OpenAI, Mattel partner on AI toys

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition: What's Shaping the Future of Data?

  • 💻 Top 30 Agentic IDEs for Programmers in 2025

  • 🔗 Entity-Resolved Knowledge Graphs with Kuzu & Senzin

  • 🏗 Scalable Lakehouse Architecture with Iceberg & Polaris

  • 🔐 Securing AI Agents: The Future of MCP Authentication & Authorization

  • 🌪️ Google Launches 1-Minute, 15-Day Hurricane Forecasting AI

  • ⚡ AMD Reveals Next-Gen AI Chips, Partners with OpenAI

  • 🎬 Kalshi’s AI Ad Premieres During the NBA Finals

  • 💡 AI Tutorial: How to Create PDFs in Grok"

  • 🤖 AI Tools and Data Tools to checkout

To use the Amazon SageMaker Lakehouse with DuckDB, you first have to create a S3 Table bucket, a namespace and an actual S3 Table. All those steps are described in my other blog post “Query S3 Tables with DuckDB”, so please make sure you followed the outlined (manual) steps of the Setting up a S3 Table section before continuing with this blog post.

Whether you’re looking to change careers or just learn something new, Codecademy can help. With over 600 interactive courses, plus portfolio projects and industry certification prep, you'll get hands-on experience using in-demand tech skills. Big Data News Weekly readers can use code SKILLUP15 to save 15% on a year of Codecademy Pro.

An Integrated Development Environment (IDE) is a software suite that provides programmers with the tools needed to develop applications efficiently. IDEs typically include features like debuggers and compilers, simplifying the coding process by bringing all essential development components into one platform.

Investigative graph analyses involve using a variety of graph queries and network analysis techniques to uncover patterns, relationships, and insights within complex data represented as a graph. They are commonly used in domains like social networks, finance, cybersecurity, and biology to discover useful relationships and structures within the data.

As AI agents like Claude, Cursor, and other intelligent assistants become integral to enterprise workflows, a critical security challenge has emerged: How do we safely allow AI agents to access sensitive enterprise resources on behalf of users?

How can modern data teams achieve massive scalability, flexibility, and efficiency while avoiding the pitfalls of fragmented data lakes? This session explores Taktile's journey from a complex mix of S3, Glue, and Snowflake to a fully integrated, pythonic Lakehouse with Apache Iceberg, Polaris Catalog, and dlt.

Through Squarespace’s cutting-edge features that combine automation, design presets, creative guidance, and generative AI, Design Intelligence makes it easy to build a beautiful and impactful website. With just a few pieces of information, Blueprint AI generates an entire website customized based off your brand’s goals, name, and personality. It’s AI speed, with Squarespace’s 20+ years of design expertise in website building. 

👨‍💻 Data Tools, Libraries

Twilio Segment: Your data, built your way.. For data you can depend on. Twilio Segment was purpose-built so that you don’t have to worry about your data. Forget the data chaos, dissolve the silos between teams and tools, and bring your data together with ease. So that you can spend more time innovating and less time integrating.

VectorChord (GitHub Repo)

VectorChord is a PostgreSQL extension designed for scalable, high-performance, and disk-efficient vector similarity search.

Sequin (GitHub Repo)

Sequin is a tool for change data capture in Postgres that supports native sinks, making it easy to stream Postgres rows and changes to streaming platforms and queues.

AI News:

OpenAI and Mattel just announced a new strategic partnership to create AI-powered toys and experiences, bringing the tech to franchise brands like Barbie, Hot Wheels, American Girl, and more. The collaboration will integrate OpenAI's tech into Mattel's product development, with the first AI-powered product expected later this year.

Meet the #1 gamified treadmill that makes working out something you’ll actually look forward to. The Victory Treadmill combines immersive gameplay with industry-leading hardware to make every workout feel fun, fresh, and effective. Explore scenic trails, take on epic quests, or challenge friends in multiplayer games – all while staying consistent and seeing real results. With Aviron, hitting your goals feels less like work and more like play.

ByteDance just released Seedance 1.0, a new video generation model that ranks first on benchmarks for both text-to-video and image-to-video tasks, outperforming SOTA options from Google, Kuaishou, and OpenAI.

A new AI model from DeepMind just beat every traditional hurricane forecasting system in both accuracy and speed, and it only takes one minute to predict storms up to 15 days in advance. Weather Lab, its interactive platform, shows 5-day forecasts with 140 km closer to real storm paths.

Learn AI in 5 minutes a day

What’s the secret to staying ahead of the curve in the world of AI? Information. Luckily, you can join 1,000,000+ early adopters reading The Rundown AI — the free newsletter that makes you smarter on AI with just a 5-minute read per day.

Advanced Micro Devices (AMD) has unveiled new details about its next-generation AI chips. The Instinct MI400 series will ship next year. They can be assembled into full server racks with thousands of chips. AMD's rack-scale setup will make the chips look to users like one system, which is important for most customers who develop large language models.

Prediction market Kalshi aired one of the first instances of an AI-generated commercial during Game 3 of the NBA Finals, running an “unhinged” 30-second spot created with clips created using Google's Veo 3 video model on ABC.

56% of workers say scheduling a meeting is the only way to get information. With Jira, use AI to automatically add work from Slack, create subtasks, or attach relevant resources. So instead of scheduling a meeting, check the status in Jira. Easy.

AI Tutorial

How to create PDFs in Grok

How To Create a PDF in Grok (It’s This Simple)

  1. Go to Grok.com on a desktop browser and log in.

  2. Type the prompt for the content you want in your PDF.

Example: “Write a guide for job seekers on how to use ChatGPT to improve their resumes.”

  1. Once you have a reply, just type “Turn this into a PDF.”

  2. You’ll get a preview of your pdf.

  3. You can ask Grok to make changes, like adding a new section.

  4. You can also click the CODE button at the top to edit the HTML yourself. When you go back to the preview, you’ll see the changes you just made.

  5. Once you’re done, download the PDF.

With car insurance premiums projected to reach a record $2,101 annually in 2025, it's more important than ever to make sure you're not overpaying. In fact, switching car insurance providers could save drivers over $1,300 a year, according to a 2024 survey.

🔥Top AI tools to increase productivity: 

  1. Robopic transforms your digital photography experience, allowing you to create hyper-realistic AI photos and videos

  2. MindShow is an AI-powered slide creation tool designed to elevate your presentations effortlessly.

  3. Robopost is an all-in-one social media management tool designed to help freelancers, entrepreneurs, small businesses

  4. Talkiemate - Connect with custom virtual characters for engaging conversations.

  5. Deckee.AI is an AI website, Ethereum web3, and nft marketplace builder that helps influencers

  6. Stylar is an AI-powered design partner that revolutionizes image generation

  7. Postlyy - All in one platform to create, schedule, and analyze content on X and LinkedIn

View our database of all the best AI tools for your needs: aitoolsup.com

Have cool resources to share? Submit AI tool

A.I. Generated Image of the Day

👀 child soldier as a mech pilot.

AI Tools Up NewsletterReceive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 10000 + professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.