DATA Pill #077 - Snowflake + Snowpark + Streamlit + Vanna AI, How to reduce Docker build times by 40%

Hi,

Another week is behind us!

Let’s dive deep into the best, meaty content.

Enjoy!


ARTICLES

How to leverage intent-based automation with AI/ML for 5G RAN | 8 min | AI/ML | Ayodele Damola, Mathias Sintorn, Ajay Gautam | Ericsson Blog

The journey toward a zero-touch network parallels the evolution from cars to self-driving vehicles. Given the complexity of 5G, automating Radio Access Networks (RANs) becomes vital. Intent-based automation in RANs is pivotal in refining operations, boosting network performance, and enabling innovative services. This approach paves the way to a zero-touch network, changing how networks are managed and introducing new service offerings.

Revolutionizing Real-Time Streaming Processing: 4 Trillion Events Daily at LinkedIn | 7 min | Real-Time Streaming | Bingfeng Xia, Xinyu Liu | LinkedIn Engineering Blog

LinkedIn relies heavily on Apache Beam to process over 4 trillion events daily through thousands of pipelines, enabling real-time data processing across crucial services for their 950 million members. The case study details how Apache Beam's unified framework has significantly impacted streaming processing, reducing latency, optimizing costs, enabling real-time ML feature generation, improving user experiences through personalized services and improving abuse detection.

The Future of Business Analytics: Artificial Intelligence and Data Streaming | 10 min | AI | Zach Hamilton | Confluent Blog

In this blog, you’ll get to discover some of the problems, challenges and solutions facing the field of AI/ML-driven business analytics, focused on what is at its core—data. You’ll walk away with a better understanding of what you’re up against when it comes to doing real-time business analytics with AI/ML, optimizing your data infrastructure, and preparing analyst teams and technologies to use the highest quality data for the job.

In MORE LINKS you will read about: How we reduced our Docker build times by 40%

{ MORE LINKS }



TUTORIALS

Using undocumented AWS APIs | 6 min | Cloud | Jacco Kulman | Xebia Tech Blog

Discover how Jacco developed an 80-line Python script to automate the process, providing comprehensive insights into services, actions and more for policy validation and potential tool development.


Using Snowflake + Snowpark + Streamlit + Vanna AI to chat with your database | 5 min | Data Engineering | Zain Hoda | Personal Blog

This tutorial shows how to use Snowpark stored procedures to chat with your Snowflake data through Snowflake’s new built-in Streamlit integration.


In MORE LINKS you will read about: using Snowflake with Amazon MWAA to orchestrate data pipelines

{ MORE LINKS }



NEWS

Lakehouse Monitoring | 3 min | Data Engineering | Databricks

Databricks Lakehouse Monitoring allows teams to oversee their entire data pipelines, from data and features to ML models, all without needing extra tools. Using Unity Catalog, it ensures data and AI assets are high-quality, accurate and dependable by providing detailed insights into their lineage.



TOOLS

K9s | Data Engineering

K9s is a terminal-based UI designed to interact with your Kubernetes clusters. The aim of this project is to make it easier to navigate, observe and manage your deployed applications in the wild. K9s continually watches Kubernetes for changes and offers subsequent commands to interact with your observed resources.

In MORE LINKS you will find Kor

{ MORE LINKS }



DATA TUBE

Next-generation data analytics with BigQuery and PaLM | 38 min | LLM | Chris Crosbie, Seamus Abshere | Google Cloud Tech

Learn about BigQuery's built-in machine learning capabilities and how to use Vertex AI's LLMs directly within BigQuery. You'll see how these tools make tasks like sentiment analysis and entity extraction easier. Plus, there'll be a live demo showing how Faraday uses these powerful models in BigQuery to predict customer behavior.



PODCAST

Data, engineering and analytics at Pleo | 27 min | Data Engineering | Adam Kawa, Agnieszka Bomersbach | Radio DaTa Podcast

Topics that we talk about include:

  • What Pleo is, who uses it and how it works
  • How data is collected and utilized by Pleo
  • Pleo's data tech stack, including GCP, BigQuery, Kafka, Metabase and Looker
  • Pleo's approach to Generative AI
  • Differences in working with data at Pleo, Acast, and Skyscanner

and more…



CONFS, EVENTS, AND MEETUPS

Big Data Technology Warsaw Summit 2024 | Call For Papers | 10th-11th April 2024

We invite you to submit a speaking proposal; you will find the necessary details in the link. The 10th edition of the Big Data Technology Warsaw Summit takes place on 10th-11th April 2024. More than 500 professionals will attend the conference to listen to technical presentations given by dozens of speakers from top data-driven companies.


GenAI: How to get the most for your business from the latest AI revolution | Webinar | 9th November 2023

Join our upcoming webinar, where we dive into Large Language Models (LLMs) and GenAI. Discover the new way of interacting with LLMs, gain insights into the current LLM landscape, explore their novel possibilities, learn about GenAI’s branches and modalities, and address potential challenges. We’ll also showcase practical use cases, making this a must-attend event for AI enthusiasts and professionals.

________________________

Have any interesting content to share in the DATA Pill newsletter?

Join us on GitHub

Dig into previous editions of DATA Pill


Adam from GetInData | Part of Xebia

