- Big Data News Weekly
- Posts
- š Python Libraries for Analytics Engineers
š Python Libraries for Analytics Engineers
š¦¾Plus: āāļø Perplexity launches Comet browser for free

Hey folks! Letās get into Big Data and AI crazinessā¦
In today's edition: What's Shaping the Future of Data?
āļø Why Complex Projects Need More Than Smart Engineers
š Battle-Tested Modeling Techniques for Tabular Data
ā”A Note on the Dirichlet Distribution
ā” Inside Husky's Query Engine: Real-Time Access to 100 Trillion
š± OpenAI's Sora soars to No. 3 on the US App Store
š Google pushes Jules coding agent into terminals
š” AI Tutorial:Turn any logo into a 3D design with AI
š¤ AI Tools and Data Tools to checkout

Analytics engineers sit at the intersection of data engineering and data analysis. While data engineers focus on infrastructure and data scientists focus on modeling, analytics engineers concentrate on the "middle layer", transforming raw data into clean, reliable datasets that other data professionals can use.
You shouldnāt be. Get paid up to 2 days early and make your money go further with 4% interest on savings,* up to $200 in free overdraft coverage,** and more.

Every engineer starts by getting really good at their specific area. Software developers learn programming languages and algorithms. Hardware engineers master circuit design and component selection. But when projects get big enough, knowing your own piece isnāt enough anymore. You need to understand how your work connects to everyone elseās work.

Over hundreds of Kaggle competitions, weāve refined a playbook that consistently lands us near the top of the leaderboardāno matter if weāre working with millions of rows, missing values, or test sets that behave nothing like the training dataā¦Below are seven of our most battle-tested techniques, each one made practical through GPU accelerationā¦

In this post, I explore some properties of the Dirichlet distribution and illustrate the behavior of the symmetric Dirilichet distribution as alpha, the concentration parameter, varies. Understanding this behavior may be helpful in constructing an informative prior for the multinomial distributionā¦

At Datadog, we process more than 100 trillion events and billions of queries every dayāacross logs, traces, network data, and more. To support that scale, we built Husky, our third-generation event store. We detailed its architecture in a series of posts on exactly-once ingestion and multi-tenancy and massively parallel compaction.
An estimated 75 percent of Americans are chronically dehydrated. The cause can be as simple as not drinking enough water, or from taking certain medications, and consequently, your cells are unable to function properly.
NativePathās Hydrate drink mix is made with 100% clean Ingredients, zero sugar, and high bioavailability. It includes essential mineralsālike sodium, potassium, chloride, magnesium, and calciumāthat are vital to many key functions in the body, along with Amino Acids to enhance muscle recovery and all 9 essential amino acids.
Restore Whole-Body Hydration with NativePathās Hydrate.
šØāš» Data Tools, Libraries
spyglass
A personal search engine, crawl & index websites/files you want with a simple set of rules
trigger.dev
The developer-first open source Zapier alternative.
DocsGPT
DocsGPT is a cutting-edge open-source solution that streamlines the process of finding information in project documentation
With Observable, you can fast-track data exploration, analysis, and visualization at scale.
AI Security Summit: Snyk is proud to be a founding partner of The AI Security Summit, where forward-thinking leaders, practitioners, and security teams are coming together to confront the "AI security chasm" head-on and build a foundation of trust for AI initiatives.
Join the FREE AI:ROI Conference - Featuring Scott Galloway - Sept 25. The virtual conference where AI hype meets hard numbers ā and you leave with a clear path to ROI.
AI News:

Perplexity AI has rolled out its Comet browser globally at no cost. The browser acts as a personal assistant, able to search, organize tabs, draft emails, shop and more. Originally reserved for $200/month Max subscribers, Comet quickly attracted a waitlist of millions. Now available for free, it positions Perplexity against rivals like Googleās Gemini in Chrome, OpenAIās Operator and Anthropicās browser agents.
Growing up comes with plenty of firsts. With a Cash App Card, teens have a safe way to practice saving, managing money, and spendingāall with their own debit card, and you as their guide.
Cash App is a financial services platform, not a bank. Banking services are provided by Cash Appās bank partner(s). Prepaid debit cards issued by Sutton Bank, Member FDIC. See Terms and Conditions.

OpenAI's Sora app saw 56,000 downloads on its first day. It is now ranked third Top Overall app on the US App Store, despite being invite-only and limited to users in the US and Canada. The ChatGPT app and Google's Gemini iOS apps had stronger launches, with each reaching at least 80,000 downloads on day one, but since Sora is invite-only, that may not be a fair comparison.

Andreessen Horowitz released its AI Spending Report, analyzing transaction patterns from fintech startup Mercuryās 200,000+ customers to show which AI companies are capturing real startup dollars versus just generating traffic. OpenAI took the top spot, with Anthropic in the second place, and Perplexity (No. 12) and Merlin AI (No. 30) rounding out the list of general assistants.

Google launched Jules Tools, a new command-line interface and public API for its autonomous coding agent, allowing developers to trigger tasks and monitor progress from terminals rather than switching to separate browser windows. Developers can now control Jules through typed commands in terminal windows, automating repetitive tasks or creating coding assignments.

Anthropic has hired Rahul Patil, former Chief Technical Officer (CTO) of Stripe, as its new CTO to help strengthen its AI infrastructure and improve the speed, reliability, and safety of its AI platforms. Patil, who is replacing Anthropic co-founder Sam McCandlish (who is now Anthropicās Chief Architect and will primarily work on AI model training), will focus on compute, infrastructure, and other engineering tasks.
Economic pressure is rising, and doing more with less has become the new reality. But surviving a downturn isnāt about stretching yourself thinner; itās about protecting what matters most.
BELAY matches leaders with fractional, cost-effective support ā exceptional Executive Assistants, Accounting Professionals, and Marketing Assistants ā tailored to your unique needs. When you're buried in low-level tasks, you lose the focus, energy, and strategy it takes to lead through challenging times.
BELAY helps you stay ready for whatever comes next.
AI Tutorial
Turn any logo into a 3D design with AI

Go to ChatGPT and sign in.
Upload a clean black-and-white version of your logo or icon.
Search Google Images for āCinema 4D materialsā or āBlender textures,ā or look for specific materials like marble, metal, wood, or glass; save the PNG.
Upload both the logo and texture into ChatGPT-4o and prompt:
āCreate a 3D version of this logo (left image) using the texture from the right image. Cinema 4D style, 8K quality, realistic lighting and shadows, black background.ā
If needed, refine the output by asking for tweaks like āMake it more metallicā, āAdd more depthā, or āChange the lighting angle.ā
Tip: experiment with unusual textures like fabric, ice, or even food to make your design stand out.
Save instantly with offers for groceries, coffee, rides, and places you love with a Cash App Card. Itās a secure debit card that works online and in person.
Cash App is a financial services platform, not a bank. Banking services provided by Cash App's bank partner(s). Prepaid debit cards issued by Sutton Bank, Member FDIC. See Terms and Conditions. Offers provided by Cash App, a Block, Inc. brand.
The best marketing ideas come from marketers who live it. Thatās what The Marketing Millennials delivers: real insights, fresh takes, and no fluff. Written by Daniel Murray, a marketer who knows what works, this newsletter cuts through the noise so you can stop guessing and start winning. Subscribe and level up your marketing game.
š„Top AI tools to increase productivity:
Looksmax AI analyzes your physical appearance, and shares AI-generated self-improvement tips
PhotoPacks.AI is a platform that enables generating high-quality professional headshots
Growth Makers is a team of AI agents that finds growth hacking strategies for your business.
ContentPieAI, say goodbye to the hassle of juggling multiple tools and spending hours on end crafting content
Rolemantic is an innovative platform that creates personalized AI companions
AutoRFP.ai - From automating routine operations to uncovering valuable insights,
Chatsistant.com- Your ultimate Large Language Model Framework. Streamline your AI workflows with multi-agent capabilities
SnapHeadshots-Create professional headshots from the comfort of your home
WriteGo.ai - Generate research papers, essays, and articles effortlessly.
View our database of all the best AI tools for your needs: aitoolsup.com
Have cool resources to share? Submit AI tool
A.I. Generated Image of the Day
š I wonder how long they are going to last

Recommended reading:
SPONSOR US
Get your product in front of Big Data & AI enthusiasts
Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.
Interested in Sponsoring the Big Data News Weekly Newsletter?Get in touch today
What did you think of today's email?Your feedback helps me create better emails for you! |