🤖 Deploying ML/AI systems to production

🦾Plus: 🚀 Runway Gen-3 Alpha is now public

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition:

  • 🔍Profiling Big Datasets With Apache Spark and Deequ

  • 📈How To Organize Continuous Delivery of ML/AI Systems

  • 💾Introduction to Kafka Tiered Storage at Uber

  • 🗣️ ElevenLabs launches ‘Iconic Voices’

  • 🎨 Meta's AI Transforms Text into 3D Art

  • 💡 AI Tutorial: Create talking photos with just a selfie

  • 🤖 AI Tools and Data Tools to check out

Data science is a field that revolves around gaining insights from data, both structured and unstructured. A data scientist uses mathematical, statistical, and other scientific methods and computer algorithms to analyze big data and extract knowledge from it.

Profiling large datasets with Apache Spark and Deequ is a valuable skill for anyone working in data analysis, SEO, or similar fields that require a deep dive into digital content.
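To give a feel for what "profiling" means here, below is a minimal sketch in plain Python. It is not Deequ's API: the function name `profile_column` and the returned keys are made up for illustration. It only shows the kind of per-column statistics (completeness, distinct values, min/max) that a profiler reports; Deequ computes statistics like these on Spark DataFrames at scale.

```python
from typing import Any, Dict, List, Optional


def profile_column(values: List[Optional[Any]]) -> Dict[str, Any]:
    """Compute a few basic profile statistics for one column.

    Hypothetical helper for illustration only; None marks a missing value.
    """
    total = len(values)
    present = [v for v in values if v is not None]
    # Only real numbers take part in min/max (bool is excluded on purpose).
    numeric = [
        v for v in present
        if isinstance(v, (int, float)) and not isinstance(v, bool)
    ]
    return {
        # Share of rows that actually have a value.
        "completeness": len(present) / total if total else 0.0,
        # Exact distinct count; fine for a small in-memory list.
        "distinct_count": len(set(present)),
        "min": min(numeric) if numeric else None,
        "max": max(numeric) if numeric else None,
    }


if __name__ == "__main__":
    print(profile_column([3, None, 7, 3]))
```

On a real dataset the same questions apply to every column: how much is missing, how many distinct values there are, and what range the numbers fall in.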

This article outlines ten stages of operational maturity for deploying ML/AI systems to production. Which stage are you at? Every production-oriented ML/AI team grapples with the same challenge: how to work with data, code, and models effectively so that projects are readily deployable to production.

Uber’s Data Pipeline

Apache Kafka® is the cornerstone of Uber’s tech stack. It powers several critical use cases and is the foundation for batch and real-time systems at Uber. Kafka stores messages in append-only log segments on the broker’s local storage. Each topic can be configured with a target retention based on size or time.
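The retention idea can be sketched in a few lines of Python. This is a simplified model, not Kafka's actual implementation: the `Segment` class and `segments_to_delete` function are hypothetical, and the two limits only loosely mirror Kafka's `retention.ms` and `retention.bytes` topic settings. The sketch assumes segments are ordered oldest first and that the last one is still being written to, so it is never removed.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    base_offset: int       # offset of the first message in the segment
    size_bytes: int        # space used on the broker's local storage
    max_timestamp_ms: int  # timestamp of the newest message in the segment


def segments_to_delete(
    segments: List[Segment],
    now_ms: int,
    retention_ms: int = -1,
    retention_bytes: int = -1,
) -> List[Segment]:
    """Return the oldest closed segments that retention would drop.

    A limit of -1 disables that rule (an assumption of this sketch).
    """
    closed = segments[:-1]  # the last segment is active and always kept
    count = 0

    # Time rule: drop old segments whose newest message is past the limit.
    if retention_ms >= 0:
        while (count < len(closed)
               and now_ms - closed[count].max_timestamp_ms > retention_ms):
            count += 1

    # Size rule: drop oldest segments while what remains still meets the limit.
    if retention_bytes >= 0:
        remaining = sum(s.size_bytes for s in segments[count:])
        while (count < len(closed)
               and remaining - closed[count].size_bytes >= retention_bytes):
            remaining -= closed[count].size_bytes
            count += 1

    return closed[:count]


if __name__ == "__main__":
    log = [
        Segment(0, 100, 1000),
        Segment(10, 100, 2000),
        Segment(20, 100, 3000),
        Segment(30, 50, 4000),  # active segment
    ]
    dropped = segments_to_delete(log, now_ms=5000, retention_ms=2500)
    print([s.base_offset for s in dropped])
```

Because the log is append-only, retention only ever trims whole segments from the old end, which is what makes moving older segments to cheaper storage a natural next step.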

So, what is Natural Language Processing? NLP involves several steps that help computers process language similarly to humans. Learn more about Natural Language Processing, what it entails and its benefits in general.

👨‍💻 Data Tools, Libraries 

Mako (GitHub Repo)

Mako is a production-grade web bundler. It is used by companies like Ant Group and projects like Umi and Father to bundle web apps and websites. Mako is extremely fast and built in Rust.

SmoothMQ (GitHub Repo)

SmoothMQ is a drop-in replacement for SQS. It has a much smoother developer experience, a functional UI, observability, tracing, message scheduling, and rate-limiting.

OmniParse (GitHub Repo)

OmniParse is a completely local platform that ingests and parses unstructured data into structured, actionable data optimized for GenAI applications.

AI News:

Runway Gen-3 Alpha is now public. It generates high-quality, realistic video from text, images, or videos. It is open to everyone, but requires a paid account.

AI audio company ElevenLabs just announced a new ‘Iconic Voices’ feature for its recently released Reader App, allowing users to have text read by AI-generated voices of famous Hollywood stars.

Meta's 3D Gen leverages two advanced models: AssetGen and TextureGen. AssetGen crafts a 3D object from text in about 30 seconds, while TextureGen refines or creates textures in 20 seconds. This two-step process, supporting Physically Based Rendering (PBR), ensures high-quality, relightable 3D assets.

Figma temporarily disables its new AI design tool following accusations of copying Apple's Weather app, highlighting the ethical challenges in AI-driven design.

Meta plans to bring more generative AI tech into games, specifically VR, AR and mixed reality games, as the company looks to reinvigorate its flagging metaverse strategy.

AI Tutorial

🗣️ Create talking photos with just a selfie

Hedra Labs now transforms static photos into dynamic talking images, bringing your selfies to life with synchronized speech and facial animations.


  1. Visit Hedra Labs' website and create a free account.

  2. Upload a high-quality, clear photo of yourself facing the camera directly.

  3. Generate audio by typing text (up to 300 characters) or import your own audio file.

  4. Click "Generate video" to create your talking photo.

  5. Preview, download, or share your animated selfie!

 🔥Top AI tools to increase productivity: 

  1. Competitor Research keeps you ahead of the game by monitoring your rivals’ websites and social media

  2. Undress AI offers a compelling solution for digital image transformation, driven by advanced AI

  3. Tablepad – Connect any data source and magically generate stunning charts, transform data, and get insights with AI.

  4. Context Data automates the process of deploying data platforms and reduces the time it takes

  5. Secta Labs – Reinventing Photography with authentic AI.

  6. Panem – There’s no excuse for spending unnecessary budget on SaaS subscriptions with Panem.

  7. DreamPetsAI is a cutting-edge tool that leverages advanced AI to create stunning, lifelike portraits of pets

View our database of all the best AI tools for your needs: aitoolsup.com 

Have cool resources to share? Submit an AI tool 


A.I. Generated Image of the Day

👀 Avengers selling fruits in crowded Indian market..! (source)

AI Tools Up Newsletter

Receive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 8,000+ professionals from Google, OpenAI, Notion, Apple, and more.


Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter? Get in touch today

Read news on Big Data | Data Science | AI | ML | NoSQL | ChatGPT | IoT | Cloud

What did you think of today's email?

Your feedback helps me create better emails for you!
