- Big Data News Weekly
- Posts
- đź§Š Modern Lakehouse Stack with Lance and Iceberg
đź§Š Modern Lakehouse Stack with Lance and Iceberg
🦾Plus: 👓 China’s answer to Meta’s smart glasses

Hey folks! Let’s get into Big Data and AI craziness…
In today's edition: What's Shaping the Future of Data?
đź§ AI Agents: The Future of Intelligent Automation
đź§±Building a Durable Execution Engine With SQLite
📚Learning Deep Representations of Data Distributions
🧬 OpenLineage: Open Framework for Data Lineage
🤖 DeepSeek’s new reasoner crushes IMO 2025
‼️ OpenAI’s API user data leaked in third-party breach
đź’ˇ AI Tutorial:Create your Clothing Line Presentation with AI
🤖 AI Tools and Data Tools to checkout

The modern, composable data stack has evolved around the idea of the lakehouse — a unified system that blends the flexibility of data lakes (i.e., object stores designed to hold data in open file formats) with the analytical performance and reliability of data warehouses. Projects like Apache Iceberg have been pivotal in making this vision a reality, offering transactional guarantees and schema evolution at scale
Shoppers are adding to cart for the holidays
Peak streaming time continues after Black Friday on Roku, with the weekend after Thanksgiving and the weeks leading up to Christmas seeing record hours of viewing. Roku Ads Manager makes it simple to launch last-minute campaigns targeting viewers who are ready to shop during the holidays. Use first-party audience insights, segment by demographics, and advertise next to the premium ad-supported content your customers are streaming this holiday season.
Read the guide to get your CTV campaign live in time for the holiday rush.

Artificial Intelligence (AI) is one of the most transformative technologies of the 21st century. Within the broad field of AI, one particularly impactful concept is that of the AI agent. As businesses, governments, and individuals increasingly adopt AI-powered systems, understanding what AI agents are, how they function, and what their implications are has become essential.

Lately, there has been a lot of excitement around Durable Execution (DE) engines. The basic idea of DE is to take (potentially long-running) multi-step workflows, such as processing a purchase order or a user sign-up, and make their individual steps persistent.

A modern fully open-source textbook exploring why and how deep neural networks learn compact and information-dense representations of high-dimensional real-world data…
Data lineage is the foundation for a new generation of powerful, context-aware data tools and best practices. OpenLineage enables consistent collection of lineage metadata, creating a deeper understanding of how data is produced and used…
You shouldn’t be. Get paid up to 2 days early and make your money go further with 4% interest on savings,* up to $200 in free overdraft coverage,** and more.
👨‍💻 Data Tools, Libraries
Coresignal’s Free Webinar: How to Evaluate External Data Quality with Confidence. Join Coresignal’s free, vendor-neutral webinar on December 2, where Data Analyst Egidijus Griska will share practical tools and metrics you need to be confident when assessing B2B data.
kit (GitHub Repo)
kit is a toolkit for codebase mapping, symbol extraction, code search, and building LLM-powered developer tools, agents, and workflows. It can build things like code reviewers, code generators, and IDEs.
Styleframe (Website)
Styleframe's powerful TypeScript CSS API helps developers compose design systems in minutes.
AI News:

DeepSeek just released DeepSeek-Math-V2, an open-source MoE model that achieves gold-medal performance at IMO 2025, democratizing “research-level” mathematical reasoning that was previously locked behind proprietary walls. The model scored 118/120 on the 2024 Putnam competition (beating the top human score) and solved 5 of 6 IMO 2025 problems, hitting the gold standard.
Is your Shopify Brand ready for Agentic Commerce this Q4?
Agentic Commerce is transforming ecommerce.
Zipchat.ai is the AI Agent built for Shopify brands — converting visitors, recovering carts, and automating support 24/7. Trusted by Police, TropicFeel, and Jackery, it works whether you have 10k visitors/month or millions, so you can win Q4 without extra headcount.
Use code NEWSLETTER10 for 10% off forever.

OpenAI just revealed that its analytics vendor Mixpanel suffered a security incident, with an attacker exporting some of its API users’ profile information — although no chat data, API keys, payment details, or credentials were compromised. The breach occurred on November 9, covering Mixpanel’s systems that provided web analytics on the frontend interface of OpenAI’s API product.

Alibaba's Quark S1 glasses, powered by the company's Qwen AI models, are now available in China, with international versions coming next year. The glasses feature built-in translucent displays that superimpose contextual information on the wearer's view of their surroundings. They are equipped with cameras, bone conduction microphones, and swappable batteries rated to last 24 hours.

German software giant SAP has launched EU AI Cloud, a European-focused platform that gives organizations complete control over their AI and cloud infrastructure while keeping data within EU borders. The service includes partnerships with Cohere, Mistral AI, and OpenAI, offering multiple deployment options from SAP's own data centers to on-premises installations for enterprises with strict compliance requirements.

Investor Michael Burry of "The Big Short" fame has launched a public campaign against Nvidia, taking a bearish position with put options worth over $1B. He says that Nvidia's stock compensation take shareholders $112.5B and that customers overstate GPU lifespans to justify spending. Burry also suggests high demand is an illusion, arguing AI customers are “funded by their dealers” in a circular financing arrangement.
AI Tutorial
Create your Clothing Line Presentation with AI

Log in to Designs.ai and select the Presentation Maker.
Enter a detailed prompt that defines the clothing line's goal, its audience (investors/buyers), and the required slides (e.g., Brand Vision, Target Market, Spring Collection Mood Board, Financials).
For Example:
"Create a 10-slide pitch deck for a new sustainable women's activewear line called 'MD’s Apparel.' The target audience is venture capital investors. The presentation must cover: Brand Vision, Target Market, Spring Collection Mood Board, Go-to-Market Strategy, and 5-Year Financial Projections. The tone should be modern, minimalist, and confident."
Choose the best AI-generated template that matches your brand's style (e.g., minimalist, modern).
Now go for brand Integration and upload your logo, brand colors, and specific fonts.
Add the essential Visuals by replacing all stock photos with high-resolution images of your actual clothing line, lookbook photos, and mood board collages.
Insert your specific financial data, review the copy for tone, and export as PPTX or PDF.
Q4 is the perfect window to turn this year’s numbers into a clear, actionable forecast aligned with your goals. Set your business up for a stronger 2026 with BELAY’s new guide.
🔥Top AI tools to increase productivity:
Marblism generates a fully-functional web application from a single prompt:
Clipwing A tool for cutting long videos into dozens of short clips
aiPDF is an innovative, multi-modal tool designed to work with a wide array of inputs, including ebooks, web articles, YouTube videos, podcasts.
podcast.ai, a podcast that is entirely generated by artificial intelligence
Clipwing A tool for cutting long videos into dozens of short clips
aiPDF is an innovative, multi-modal tool designed to work with a wide array of inputs, including ebooks, web articles, YouTube videos, podcasts.
Verk- Hire AI employees to add more firepower to your team, who work 24/7 to do sales, be your personal assistant, do graphic designing and more
Codetoflow enables you to understand the code in simple terms using a flowchart which enables you to understand the details
View our database of all the best AI tools for your needs: aitoolsup.com
Have cool resources to share? Submit AI tool
A.I. Generated Image of the Day
đź‘€ Animals Wedding

Recommended reading:
SPONSOR US
Get your product in front of Big Data & AI enthusiasts
Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.
Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today
What did you think of today's email?Your feedback helps me create better emails for you! |



