📊 Deep Learning for Tabular Data
🦾 Plus: 🤝 OpenAI announces new deal with Pentagon

Hey folks! Let’s get into Big Data and AI craziness…
In today's edition: What's Shaping the Future of Data?
⚙️Build and deploy custom AI models using Lambda and Oumi
🧠How to stay in control when doing EDA with coding agents
🧪Frontier model training methodologies
🤝How can software engineers and data scientists work together?
💬 ChatGPT Reaches 900M Weekly Active Users
🤖Microsoft’s Copilot Tasks AI uses its own computer to get things done
💡 AI Tutorial: How to turn a product photo into a full campaign shoot
🤖 AI Tools and Data Tools to check out

It’s been nearly four years since I first summarized the state of deep learning (DL) for tabular data, and about three years since my follow-up post. Back then, the verdict was pretty clear: for most tabular data scenarios, especially those with heterogeneous features and even very large sample sizes, gradient boosting methods like XGBoost, LightGBM, and CatBoost were still the pragmatic and performant choice.

Data scientists and engineers must share responsibility for any issue that arises and work to resolve it at every step of the project. Continuous communication ensures that possible discrepancies are caught early.

Today, we’re announcing a partnership between Oumi and Lambda to provide global enterprises with a complete solution for end-to-end model development and deployment. With Oumi, AI teams can build custom models dramatically faster and more easily than ever before. They can then immediately deploy them on Lambda, powered by NVIDIA AI infrastructure, to achieve the speed, scale, and reliability that production demands.

In this blog post, I share how coding agents can supercharge data analysis, but only if we stay in control. By slowing down, asking the right questions, and structuring sessions with journals and artifact gating, we avoid chaos and keep our scientific thinking sharp.

How do labs train a frontier, multi-billion parameter model? We look towards seven open-weight frontier models: Hugging Face’s SmolLM3, Prime Intellect’s Intellect 3, Nous Research’s Hermes 4, OpenAI’s gpt-oss-120b, Moonshot’s Kimi K2, DeepSeek’s DeepSeek-R1, and Arcee’s Trinity series. This blog is an attempt at distilling the techniques, motivations, and considerations used to train their models with an emphasis on training methodology over infrastructure.
👨‍💻 Data Tools, Libraries
Thesys: Build conversational analytics agents without setting up data pipelines or building dashboards manually.
docker-pgautoupgrade: A PostgreSQL Docker container that automatically upgrades your database.
Lets-Plot: An open-source plotting library for statistical data.
pg_embedding: Hierarchical Navigable Small World (HNSW) algorithm for vector similarity search in PostgreSQL.
AI News:

Sam Altman said late Friday night that OpenAI reached an agreement for the Pentagon to use its AI models, after the Defense Department agreed to safety red lines similar to those of rival Anthropic. Why it matters: The Pentagon has blasted Anthropic for days, contending that its red lines for AI use in the military (mass surveillance and autonomous weapons) are philosophical and "woke."

OpenAI finalized a $110 billion funding round at a $730 billion valuation. Amazon led with $50 billion, while SoftBank and Nvidia each put in $30 billion. OpenAI will use Amazon’s in-house AI chips and spend another $100 billion on AWS over eight years.

AI music generator Suno hit 2 million paid subscribers to the tune of roughly $300 million in annual recurring revenue, signaling strong appetite for AI-assisted creative tools. Just three months ago, Suno announced a $250 million funding round that valued the company at $2.45 billion.

Not to be overshadowed by its big funding news, OpenAI also shared that ChatGPT now has ~900 million weekly active users globally, plus about 50 million paying subscribers, keeping the product near the top of the AI app charts. The new weekly active user figure marks a jump of 100 million users from the 800 million that OpenAI reported in October 2025.

Microsoft is previewing a new AI system, Copilot Tasks, that it says is designed to take care of busywork for you in the background, the company announced on Thursday. The feature takes the load off your device using its own cloud-based computer and browser, allowing it to handle a variety of jobs ranging from scheduling appointments to generating study plans while you do something else.
AI Tutorial
How to turn a product photo into a full campaign shoot

Go to Pomelli and click on ‘Create Photoshoot’.
Drop in your company’s website link. Pomelli extracts your logo and brand voice.
Upload a product photo and pick a template from the list.
Generate professional-grade campaign images.
You can edit the header, description, or image.
Choose your format and download instantly.
🔥Top AI tools to increase productivity:
Alice - A native app that offers a fast and reliable experience with models (OpenAI, Perplexity, Claude and more)
Linktopia - Community link-building for bloggers, entrepreneurs and startup brands to grow
VerifactAI - An AI fact-checking tool that lets you fact-check your articles within a minute
Undress AI Tool is a website that offers a deepnude application, allowing users to create modified images
Screenloop is the ultimate Talent Operations Platform, seamlessly integrating a next-gen ATS
Postlyy - All in one platform to create, schedule, and analyze content on X and LinkedIn
View our database of all the best AI tools for your needs: aitoolsup.com




