Pipeline Design Patterns for Data Engineers 👨💻
🦾 Plus: 🥭 Meta’s Next AI Push Takes Shape

Hey folks! Let’s get into Big Data and AI craziness…
In today's edition: What's Shaping the Future of Data?
🛠️ How to Fine-Tune Local Mistral or Llama 3 Model on Your Own Dataset
🔮 Top 12 Strategic Technology Trends for 2026
🧠 A Complete Guide to Spherical Equivariant Graph Transformers
📘 Great Ideas in Theoretical Computer Science
🌐 Claude Moves Into Your Browser
👨💻 Cursor continues acquisition spree with Graphite deal
💡 AI Tutorial: How to analyze and visualize data in ChatGPT
🤖 AI Tools and Data Tools to check out

In this tutorial, you will learn how to fine-tune two open-source large language models, Mistral 7B and Llama 3 8B, for customer support using Unsloth and QLoRA. It works from a customer support question-and-answer dataset and covers dataset preparation, training, testing, and comparison…

Here are the 12 strategic technology trends that, according to Gartner, will act as enhancers of digital business and innovation over the next 3 to 5 years. Gartner also predicts that by 2026, generative AI will significantly alter 70% of the design and development effort for new web applications and mobile apps.

Data pipelines are the backbone of moving and processing information from multiple sources so businesses can make better decisions. In this post, we cover: a) What is a data pipeline, and b) 10 key design patterns, their principles, and practical applications for building effective data pipelines…
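As a rough illustration of the idea (not taken from the linked post), a data pipeline can be sketched as a chain of extract, transform, and load stages that pull records from several sources and produce a summary a business could act on. The function names and the sample records below are hypothetical.

```python
from typing import Dict, Iterable, Iterator

Record = Dict[str, object]


def extract(sources: Iterable[Iterable[Record]]) -> Iterator[Record]:
    """Read records from every source, one after another."""
    for source in sources:
        yield from source


def transform(records: Iterable[Record]) -> Iterator[Record]:
    """Drop records without an amount and normalize amounts to float."""
    for record in records:
        if record.get("amount") is None:
            continue
        yield {**record, "amount": float(record["amount"])}


def load(records: Iterable[Record]) -> Dict[str, float]:
    """Aggregate the cleaned records into total amount per customer."""
    totals: Dict[str, float] = {}
    for record in records:
        customer = str(record["customer"])
        totals[customer] = totals.get(customer, 0.0) + float(record["amount"])
    return totals


def run_pipeline(sources: Iterable[Iterable[Record]]) -> Dict[str, float]:
    """Compose the three stages; generators keep memory use low."""
    return load(transform(extract(sources)))


# Hypothetical sample data standing in for two separate source systems.
crm = [{"customer": "acme", "amount": "10.5"}, {"customer": "globex", "amount": None}]
shop = [{"customer": "acme", "amount": 4}]

totals = run_pipeline([crm, shop])
print(totals)
```

Because each stage is a generator, records stream through one at a time, and a stage can be swapped out without touching the others.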

A 2.5-hour breakdown of spherical equivariant graph neural networks (EGNNs) and a deconstruction of the SE(3)-Transformer model… This article will focus on a specific type of geometric GNN called Spherical Equivariant GNNs (Spherical EGNNs), which are extremely useful in tasks dealing with geometric graph representations of objects with rotational symmetries, like molecules and proteins.

This course is about the rigorous study of computation, which is a fundamental component of our universe, the societies we live in, the new technologies we discover, as well as the minds we use to understand these things. Therefore, having the right language and tools to study computation is important.
👨💻 Data Tools, Libraries
pg_embedding enables the use of the Hierarchical Navigable Small World (HNSW) algorithm for vector similarity search in PostgreSQL.
NativePHP (Website)
NativePHP is a framework for building desktop applications using PHP. It allows PHP developers to create cross-platform, native apps using familiar tools and technologies.
TypeScript Execute is Node.js enhanced with esbuild to run TypeScript and ESM.
AI News:

Meta reportedly has two new models in the works for 2026: “Mango” for image and video understanding, and “Avocado” for coding and reasoning. The timing is tricky, though, as leadership and researcher exits shake up its AI division.

From one major company to the next. Starbucks announced the hire of its new Chief Technology Officer, Amazon veteran Anand Varadarajan, who most recently led Amazon's grocery technology and supply chain. The hire comes after Deb Hall Lefevre, Starbucks’ former CTO, departed in September as the company underwent a second round of layoffs and announced a $1 billion restructuring plan as part of its ongoing turnaround.

Big products require big partnerships. To expand its strategic partnership and deepen engineering collaboration with Google Cloud, cybersecurity company Palo Alto Networks is migrating key internal workloads to Google Cloud as part of a new multibillion-dollar agreement.

AI coding assistant Cursor announced that it has acquired Graphite, a startup that uses AI to review and debug code. Although the terms of the deal were not disclosed, Axios reported that Cursor paid “way over” Graphite’s last valuation of $290 million, which was set when the five-year-old company raised a $52 million Series B earlier this year.

OpenAI is in talks to raise up to $100 billion in a funding round that could value the ChatGPT maker at up to $830 billion, The Wall Street Journal reported Thursday, citing anonymous sources. The funding would come as OpenAI commits to spend trillions of dollars and strikes deals around the world as the company tries to stay ahead in the race to develop AI technology.
AI Tutorial
How to analyze and visualize data in ChatGPT

Go to ChatGPT
Start a new chat
Upload your data or spreadsheet
Use the following prompt: Analyze the data attached, generate key trends and insights, and generate charts to visualize and understand the data
Press enter and wait to get your output
Note: Please be careful when uploading any sensitive company or personal data to any AI app. Take necessary precautions and anonymize data where appropriate.
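One way to act on the note above is to replace sensitive columns with opaque tokens before uploading a spreadsheet. The following is a minimal sketch using only the Python standard library; the column names, the sample rows, and the `demo-salt` value are hypothetical. Salted hashing is pseudonymization rather than full anonymization, so treat it as one precaution and not a guarantee.

```python
import csv
import hashlib
import io
from typing import List


def pseudonymize(value: str, salt: str) -> str:
    """Map a sensitive value to a short, stable token via a salted SHA-256 hash."""
    digest = hashlib.sha256((salt + value).encode("utf-8")).hexdigest()
    return digest[:12]


def anonymize_csv(text: str, sensitive_columns: List[str], salt: str) -> str:
    """Return the CSV text with the given columns replaced by tokens."""
    reader = csv.DictReader(io.StringIO(text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        for column in sensitive_columns:
            if row.get(column):
                row[column] = pseudonymize(row[column], salt)
        writer.writerow(row)
    return out.getvalue()


# Hypothetical sample data; keep the real salt secret and out of the upload.
raw = (
    "email,region,revenue\n"
    "ana@example.com,EU,120\n"
    "bob@example.com,US,80\n"
    "ana@example.com,EU,40\n"
)
cleaned = anonymize_csv(raw, ["email"], salt="demo-salt")
print(cleaned)
```

The same input value always maps to the same token, so per-customer trends survive in the analysis while the raw identifiers stay out of the uploaded file.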
🔥Top AI tools to increase productivity:
Alice - A native app that offers a fast and reliable experience with models (OpenAI, Perplexity, Claude and more)
Linktopia - Community link-building for bloggers, entrepreneurs and startup brands to grow
VerifactAI is an AI fact-checking tool that allows you to fact-check your articles within a minute.
Undress AI Tool is a website that offers a deepnude application, allowing users to create modified images
Screenloop is the ultimate Talent Operations Platform, seamlessly integrating a next-gen ATS
Postlyy - All in one platform to create, schedule, and analyze content on X and LinkedIn
Auto Streamer, your digital alchemist turning ideas into engaging learning experiences
BeeDone is an innovative productivity app that marries AI technology with the art of gamification
View our database of all the best AI tools for your needs: aitoolsup.com
A.I. Generated Image of the Day
👀 Eidolon




