🤖Beat Proprietary LLMs With Smaller Open Source Models

🦾Plus: 🔍 Is OpenAI bribing publishers?

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition:

  • 🦆When duckdb meets dplyr!

  • 📈 High-level tools to simplify visualization in Python

  • 🔄Patterns and anti-patterns of data analysis reuse

  • 📑Read the 2024 State of AI Readiness Report

  • 🔍Is OpenAI bribing publishers?

  • 📊 BYOAI: Microsoft Corp. and LinkedIn Release 2024 Report

  • 🤖 AI Tools and Data Tools to check out

From small startups to multinational corporations, the fusion of AI with software engineering is rapidly shaping how programs are developed and what software can achieve. Integrating AI into software development brings a host of benefits and unlocks new capabilities.

Become an AI & ChatGPT Genius in just 3 hours for FREE!  (Early Easter Sale)

Join ChatGPT & AI Workshop (worth $199) at no cost (Offer valid for first 100 people only) 🎁

I like DuckDB 🦆. I am excited to see that it is now possible to use it with dplyr using the fantastic duckplyr package, which gives us another way to bridge dplyr with DuckDB… In this short post, I will show how duckplyr can be used to query parquet files hosted on an S3 bucket.

HoloViz provides a set of Python packages that make viz easier, more accurate, and more powerful: Panel for making apps and dashboards for your plots from any supported plotting library, hvPlot to quickly generate interactive plots from your data, HoloViews to help you make all of your data instantly visualizable, GeoViews to extend HoloViews for geographic data, and Datashader for rendering even the largest datasets.

In this article, we explore the unique advantages of open source LLMs, and how you can leverage them to develop AI applications that are not just cheaper and faster than proprietary LLMs, but better too.

Every data analyst / data scientist role I’ve worked in has had a strong theme of redoing variations of the same analysis. I expect this is something of an industry-wide trend. If you’re in marketing, you’re ranking prospects and A/B testing their responses.

Nice report with good insights and cool charts. The research team at Scale AI interviewed 1,800 AI/ML practitioners on the latest AI trends, applied AI, and what it takes beyond “adopting AI.”

👨‍💻 Data Tools, Libraries 

pragmatic-drag-and-drop
Fast drag and drop for any experience on any tech stack. 

Perplexica
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

tscircuit
React for Circuits.

lunatik
Lunatik is a framework for scripting the Linux kernel with Lua.

Captable
Open-source Captable, an alternative to Carta, Pulley, AngelList and others.

AI News:

A leaked slide deck has revealed how OpenAI is wooing global publishers, such as The FT and Le Monde, to partner with it so that it can use their content to train its GPT models and legally display information without getting sued.

Microsoft Corp. and LinkedIn have released a 2024 report about AI at work. Here are the main points:

AI Usage Growing Fast: AI use at work has almost doubled in six months. Many leaders say they wouldn't hire someone without AI skills, yet worry their companies lack a clear AI vision. Despite this, 75% of knowledge workers use AI, and 78% bring their own AI tools to work.

The Biden administration is considering new regulations to limit the export of proprietary AI software to countries like China and Russia. This initiative, part of a broader strategy to control the spread of sophisticated AI capabilities, targets the underlying software of systems such as ChatGPT.

Apple is gearing up to enhance its AI capabilities by deploying M2 Ultra chips in its data centers for complex AI tasks, while simpler AI operations will remain on-device. Future plans include transitioning to more advanced M4 chips.

Google DeepMind and Isomorphic Labs just introduced AlphaFold 3, the newest version of the groundbreaking AI model that can predict the structure of proteins, DNA, and other molecules with extreme accuracy.

SoundHound AI just announced a new partnership with Perplexity, aiming to integrate the company’s online LLM capabilities with real-time web knowledge across voice assistants in cars and devices.

AI Tutorial

How to use the new Midjourney Web

The Midjourney Web is now open to nearly everyone, enabling you to create AI-generated images through a simple web interface. Here's how to get started:

  1. Join the Midjourney Discord Server if you haven't already.

*Note: To access the Alpha version of Midjourney Web, you now only need to have generated over 100 images on Discord, compared to the previous requirement of 1,000 images.

  2. Go to alpha.midjourney.com and sign up or log in with your existing Midjourney credentials.

  3. Once you have access, start creating images by entering your prompt in the "Imagine" textbox at the top of the page.

  4. Adjust parameters to customize your image.

  5. Click the ‘Generate’ button.

  6. Once the images are ready, you can view them on the same page or in the 'Create' tab. You can also download them; doing so gives you all four images.

 🔥Top AI tools to increase productivity: 

  1. SopCreator.com - An AI-enabled platform designed to help students create compelling Statement of Purpose (SOP) essays

  2. Udioai - An app for music creation and sharing that allows you to generate amazing music

  3. ChatSweetie - A chatbot that lets you chat online with your chosen virtual character anytime

  4. Exploding Insights - The #1 Market Research Tool. Finding the right idea used to be hard

  5. StudyGPT is your personal AI study assistant

  6. Suno-top, your ultimate companion for music creation and sharing!

View our database of all the best AI tools for your needs: aitoolsup.com 

Have cool resources to share? Submit AI tool 

 

A.I. Generated Image of the Day

👀 This image is AI-generated! (source)

AI Tools Up Newsletter

Receive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 8,000 professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in Sponsoring the Big Data News Weekly Newsletter? Get in touch today

Read news on Big Data | Data Science | AI | ML | NoSQL | ChatGPT | IoT | Cloud
 
