- Big Data News Weekly
- Posts
- ⚔ The Revenge of the Data Scientist
⚔ The Revenge of the Data Scientist
🦾Plus: 🚀 SpaceX Files For $1.75 TRILLION IPO

Hey folks! Let’s get into Big Data and AI craziness…
In today's edition: Data scientists are back on top. SpaceX is going public at a valuation that dwarfs most economies. OpenAI is paying humans to teach it their jobs. This week belongs to the builders — and the data👇
🌿Green Cloud Computing – The Sustainable Way to Use the Cloud
⚡Developer rewrites leaked Claude Code from scratch overnight
⚖️Regulating AI Agents
🔧How to fine-tune an LLM for structured data extraction
🎭 OpenAI taps freelancers to teach ChatGPT their jobs
💡 AI Tutorial:How to Use ChatGPT to Build Playlists and find Music
🤖AI Tools and Data Tools to checkout

Training models was never most of the job. The bulk of the work is setting up experiments to test how well the AI generalizes to unseen data, debugging stochastic systems, and designing good metrics. Calling an LLM over an API does not make this work go away…I recently gave a talk titled “The Revenge of the Data Scientist” at PyAI Conf to make that case with examples rather than assertions alone. Below is an annotated version of that presentation…
The fastest-growing repo on GitHub is a one person team!
OpenClaw went from 9K to 185K GitHub stars in 60 days — the fastest-growing repo in history.
Their docs? One person, plus Claude. They scaled to the top 1% of all Mintlify sites, shipping 24 documentation updates a day.

Energy-efficient solutions are necessary to minimize the impact of cloud computing on the environment. Green cloud computing, also known as green information technology, is a potential solution to aide in the reduction of energy consumption.

Following Anthropic's accidental source code leak, the lab rushed to issue DMCA takedowns across GitHub. That's when Korean developer Sigrid Jin stepped in. Waking up at 4 AM, Jin rewrote the entire codebase in Python before sunrise and launched it as claw-code, making it the fastest repo ever to cross 50K stars. Since it's a clean-room rewrite, it remains protected from Anthropic's legal reach.

The European Union’s AI Act - promulgated prior to the development and widespread use of AI agents, the EU AI Act faces significant obstacles in confronting the governance challenges arising from this transformative technology, such as performance failures in autonomous task execution, the risk of misuse of agents by malicious actors, and unequal access to the economic opportunities afforded by AI agents.
You'll learn how to fine-tune a small language model (Gemma 3) locally using Hugging Face. This end-to-end tutorial teaches how to format custom datasets, train a model to extract structured data, and deploy the final result as a live, interactive web demo.
Most “AI assistants” just talk. This one does the work.
ElevenLabs Agents v2 lets you create real-time voice agents that don’t just respond, but execute tasks across your systems.
Talk, chat, and act in real-time across channels
Connect to your tools, APIs, and workflows
Deploy globally with multilingual voice support
👨💻 Data Tools, Libraries
Thesys.dev: Change how you interact with your data. Thesys Agent Builder lets anyone ask questions in natural language and get instant charts, tables, and dashboards - no SQL, no new infrastructure, no code required.
EmDash (GitHub Repo)
EmDash is a full-stack TypeScript CMS that takes the ideas that made WordPress dominant and rebuilds them on serverless, type-safe foundations.
JSON parser: This Go library lets you cut through messy JSON payloads in microseconds.
AI News:

Well, this is shaping up to be a truly out of this world IPO. Elon Musk’s technology conglomerate SpaceX reportedly filed disclosures confidentially with the U.S. Securities and Exchange Commission ahead of an initial public offering. According to Bloomberg, SpaceX could seek a valuation of $1.75 trillion.
Learn how to code faster with AI in 5 mins a day
You're spending 40 hours a week writing code that AI could do in 10.
While you're grinding through pull requests, 200k+ engineers at OpenAI, Google & Meta are using AI to ship faster.
How?
The Code newsletter teaches them exactly which AI tools to use and how to use them.
Here's what you get:
AI coding techniques used by top engineers at top companies in just 5 mins a day
Tools and workflows that cut your coding time in half
Tech insights that keep you 6 months ahead
Sign up and get access to the Ultimate Claude code guide to ship 5X faster.

A new report from Business Insider just revealed “Project Stagecraft,” an internal OpenAI effort paying as many as 4K freelancers at least $50/hr to build occupation-specific training data across a variety of jobs. The project runs through Handshake AI, with freelancers from jobs including commercial aviation, pharmacists, plant scientists, and HR specialists.

Twitter founder and Block CEO Jack Dorsey just co-authored a post arguing AI can replace middle management, framing Block’s recent 40% workforce cut as the opening move in a massive workplace restructure for the AI era. Block cut over 4K employees in February, over 40% of its staff — with Dorsey calling it a bet on AI, not a response to weakness.

OpenAI shares have dropped in value on the secondary market as investors pivot to Anthropic. Investors are, in some cases, unable to sell their shares. Meanwhile, buyers have indicated that they have $2 billion in cash ready to deploy to Anthropic. Anthropic and OpenAI don't allow investors to trade shares on secondary markets without permission

Snapchat’s Spotlight shortform video feed is its version of Instagram Reels… and now it really is. In a video featuring co-founder and CEO Evan Spiegel, Snap has renamed the feature ‘Reals’ and describes it as “a place where real people share real moments. Really.”
Most coverage tells you what happened. Fintech Takes is the free newsletter that tells you why it matters. Each week, I break down the trends, deals, and regulatory shifts shaping the industry — minus the spin. Clear analysis, smart context, and a little humor so you actually enjoy reading it. Subscribe free.
AI Tutorial
How to Use ChatGPT to Build Playlists and Discover New Music

Go to ChatGPT and sign in.
Click on Apps in the left sidebar and search for Spotify. Then link your ChatGPT account with your Spotify account.
Now ChatGPT will be able to analyze your listening habits and recommend the right tracks for you.
Start a new chat, and to make sure ChatGPT is considering Spotify, click the plus button on the left, followed by More, and then click the Spotify icon.
Example prompt:
"Can you make me a playlist for my early morning runs, with high-energy tracks that keep my pace up and help me lock into a steady rhythm, using my Spotify taste as inspiration?"
ChatGPT will provide you with a playlist that incorporates some of your favorite artists as well as songs you've never heard before.
Whether you want a playlist for certain activities or just to discover new music, this is a great way to get recommendations that take your taste into consideration.
AI can help you move faster, but real leadership still requires human judgment.
The free resource 5 Traits AI Can’t Replace explains the traits leaders must protect in an AI-driven world and why BELAY Executive Assistants are built to support them.
🔥Top AI tools to increase productivity:
Zoice is the single platform for every creator. Transcribe, generate, and animate your content.
Floowed is a flexible, no-code AI credit workflow automation platform
BookSwift is a modern appointment booking platform for providers
Marketsy.ai provides a smart e-commerce experience supported by a powerful admin panel.
StrideFuel - Built for weight loss success—especially GLP-1 users
WorldEngen is an AI copilot for 3D production that helps professional teams
AppWizzy is an AI tool that helps you build and host full-stack web applications
SongGuru.AI: An AI-Based Music Creation and Audio Processing Platform
View our database of all the best AI tools for your needs: aitoolsup.com
Have cool resources to share? Submit AI tool
A.I. Generated Image of the Day
👀 Deep in the ancient forest

Recommended reading:
SPONSOR US
Get your product in front of Big Data & AI enthusiasts
Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.
Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today
What did you think of today's email?Your feedback helps me create better emails for you! |





