DATA Pill #047 - Leaving Amazon after 7.5 years, a new method for Kubernetes integration, and more

DATA Pill #047 - Leaving Amazon after 7.5 years, a new method for Kubernetes integration, and more

Hi,


After the first quarter of 2023, we can take a look at the newest trends.

Also, how does it feel about leaving a big company after more than 7 years?

You will find answers to all questions in DATA Pill 47.


ARTICLES

Top 5 Data Streaming Trends for 2023 | 9 min | Data Streaming | Kai Waehner| Personal Blog

Let’s see how Kai identifies five trends in the data streaming space that are expected to gain momentum over the next few years. What are they?

  1. Cloud-native lakehouses?
  2. Decentralized data mesh?
  3. Data sharing in real-time?
  4. Improved developer and user experience?
  5. Advanced data governance and policy enforcement

Kai’s point of view on these trends is waiting for you.


Maximizing Personalization: Real-Time Context and Persona Drive Better-Suited Products and Customer Experiences | 14 min | Recommendation System | Adam Kawa | GetInData | Part of Xebia Blog

We all know personalization is the key to creating a competitive advantage in today's market, where customers expect tailored experiences based on their preferences, needs, and behaviors.

But what happened in 2012 at Spotify, or how did Kcell use real-time streaming to provide help?

Read about the importance of real-time context and persona. Real-time context refers to using data and analytics to understand a customer's current situation and needs.

What use cases can you find there?

  • Advertise a relevant product
  • Personalized gifts
  • Emergency situation without a delay
  • Fast and Smart Decision-Making,
  • Increasing rare chances for a highly-profitable conversion


Tencent Data Engineer: Why We Go from ClickHouse to Apache Doris? | 13 min | Data Science | Jun Zhang, Kai Dai | Geek Culture Blog

Meaty article with cheers and tears, lessons learned and practical tips that will be helpful during transition from ClickHouse to Doris. Read about problems with ClockHouse and the step-by-step transition to Apache Doris.

No alt text provided for this image


In MORE LINKS you will read about Vault Secrets Operator: A new method for Kubernetes integration.

{ MORE LINKS }



DATA PRO

Left Amazon after 7.5+ years; Here is my honest review | 9 min | Careers | Pooya Amini | Personal Blog


How was it to work at Amazon for 7.5 years? Read a balanced view of working at Amazon and highlights the pros and cons of working for the tech giant. Pooya writes about the job roles available at Amazon and how they can sometimes be draining, the company's work culture and how it operates, and the perks of working there. The article also touches upon Amazon's hiring process, which the author points out is extensive, with multiple rounds of interviews.


In MORE LINKS you will read a complete guide about speaking at conferences.

{ MORE LINKS }



TUTORIAL

Terraform with YAML: Part 1 | 5 min | Data Engineering | Chris ter Beke | Xebia Tech Blog

In the first part of the tutorial, you will find a basic understanding of the benefits of using YAML configuration files in your Terraform code. How you can replace as much HCL code as possible with YAML, what the benefits are of doing so, and a simple example is waiting for you.




TOOLS

Fugue project | Data Engineering | Fugue

Fugue provides users with a cohesive platform to perform distributed computing, enabling the execution of Python, Pandas, and SQL code on Spark, Dask, and Ray with minimal need for adjustment.

It’s used for:??

  • Parallelizing or scaling existing Python and Pandas code by bringing it to Spark, Dask, or Ray with minimal rewrites.?
  • Using FugueSQL to define end-to-end workflows on top of Pandas, Spark, and Dask DataFrames.?
  • FugueSQL is an enhanced SQL interface that can invoke Python code.

In MORE LINKS you will read about Amazon DataZone and K8sGPT gives Kubernetes SRE superpowers to everyone.

{ MORE LINKS }



NEWS

Announcing new BigQuery inference engine to bring ML closer to your data | 5 min | Data Analytics | Amir Hormati, Abhinav Khushraj | Google Cloud Blog

Google announces BigQuery ML inference engine, which allows you to run predictions on a broad range of models hosted across multiple locations.

With this new feature, you can run ML inferences across:

  • Imported custom models trained outside of BigQuery with a variety of formats (e.g., ONNX, XGBoost, and TensorFlow)?
  • Remotely-hosted models on Vertex AI Prediction?
  • State-of-the-art pretrained Cloud AI models (e.g., Vision, NLP, Translate, and more)?



DATA TUBE

How to scale your Scrum? Nexus Framework for 30+ experts | 22 min | Stream Processing | Rafa? Zalewski | GetInData | Part of Xebia

Have you ever wondered how to scale scrum to multiple teams working on the same projects? If so, do not hesitate to watch a video about Nexus Framework! Well explained what Nexus Framework is and how it helps to integrate and coordinate work in multiple teams. Everything is based on a real example from one of the GetInData | Part of Xebia projects, where they had to scale scrum to 30+ experts in different groups.?


In MORE LINKS you will watch a video about introducing AlloyDB Omni.

{ MORE LINKS }



PODCAST

Data Journey with Jonas Bj?rk (Acast) - Data & analytics at Acast, AI & trends in the podcasting industry | 56 min | host: Adam Kawa guest: Jonas Bj?rk | Radio DaTa


Listen to the data journey with Adam Kawa and his guest Jonas Bj?rk, the CTO at Acast, in the latest episode of Radio Data. Acast, the Swedish-born podcast hosting and monetization platform, is revolutionizing the podcasting industry by allowing creators to distribute their content across various podcasting apps and monetize their shows through advertising and listener support.?

This episode includes:

  • The data collection and utilization process at Acast?
  • How measuring podcasts differs from measuring songs on platforms like Spotify Real-life analytics use cases implemented at Acast?
  • The cutting-edge cloud-managed data tech stack used by Acast, featuring AWS, Snowflake, Airflow, Python, Rust, and more?
  • How AI/ML is being used to revolutionize the podcasting industry today and tomorrow


In MORE LINKS you will listen about the Past, Present, and Future, of the Data Science Notebook with Jodie Burchell

{ MORE LINKS }



CONFS EVENTS AND MEETUPS

DevOps Enterprise Summit Amsterdam 2023 | 15-17th May | Amsterdam

Learn through experience reports from technology leaders helping their organizations win, rapidly disseminate winning tools and techniques and ways of thinking, and bring in the best experts for the problems identified by the community.

BTW, we have a great job offer for DevOps, check it out here .


________________________


Have any interesting content to share in the DATA Pill newsletter?

? Join us on GitHub

? Dig previous editions of DataPill ?



Adam from the GetInData | Part of Xebia

Adeel Imrani

$50M in Sales on Amazon & Shopify as Seller and Advisor.

1 年

Incredible insights from Q1 2023! ?? Real-time context & persona-driven personalization are game-changers for customer experiences. But here's a thought: How do we balance hyper-personalization with privacy concerns in an ever-evolving data landscape? ??

回复

要查看或添加评论,请登录

社区洞察

其他会员也浏览了