登录查看更多内容

Data Phoenix Digest - ISSUE 8.2024

Dmytro Spodarets

DevOps Architect @ Grid Dynamics | Founder of Data Phoenix - The voice of AI and Data industry

发布日期: 2024年5月30日

Welcome to this week's edition of Data Phoenix Digest!

Be active in our community and join our Slack to discuss the latest news, events of our community, research papers, articles, jobs, and more...

Join our Slack

What can good data do for you? - Twilio Segment

Great customer experiences require better data. With customer profiles that update real-time, and best in class privacy features - Segment's Customer Data Platform allows you to make good data available to every team.

Deliver personalized experiences at scale. Twilio Segment helps 25,000 companies power their most important objectives with data they can trust.

Good Data - Segment

Data Phoenix's upcoming webinar:

The challenge with financial agents successfully completing complex workflows like tabular reasoning or sentiment analysis often comes down to the reliability of executing numerous chained tasks together. Establishing the p99s necessary has to happen at the model level, yet most finance domain-specific LLMs are either only pre-training (BloombergGPT) or using supervised fine-tuning (FinBERT).

This presentation reveals how we transformed an open-source model into?Albatross , capable of performing at the top of the leaderboard on chat as well as domain-specific tasks. Our journey involved an intensive data pipeline and training regiment, incorporating a combination of continual pre-training, fine-tuning, and preference optimization, to customize the model for the intricacies of financial tasks. We'll share our insights on overcoming the execution hurdle, which is often the downfall of AI projects in specialized domains.

Key Highlights of the Webinar:

Building Domain-Specific Models:?Explore how to evolve an open-source model into a leading domain-specific model like Albatross - capable of excelling in both general and domain-specific tasks.
Model Transformation Techniques: Learn about the intensive data pipeline and training regimen that included continual pre-training, fine-tuning, and preference optimization.
Customization for Financial Tasks:?Understand the specific strategies used to tailor Albatross for financial tasks, addressing the unique intricacies of this field.
Importance of Performance Metrics:?Gain insight into why establishing high-performance benchmarks (like p99s) at the model level is crucial for success in finance-specific applications, where current financial LLMs often focus only on pre-training or supervised fine-tuning.

Explore recordings of all our past webinars to deepen your AI knowledge and enhance your learning journey:

Rakuten Symphony 3 个月前

Innovation Spotlight: 5 Ways Data Science is Shaping a…

TalentSprint 8 个月前

Data Drives Statistical Models, Not Cognitive Models

thinkbridge 1 年前

200+ AI Models. One API. 24/7 AI Solution

AI/ML API specializes in delivering a comprehensive suite of AI models, including predictive analytics, natural language processing, and image recognition, among others. Ideal for developers, tech startups, and innovation labs, this tool simplifies the integration of AI technologies into applications, enhancing functionalities and driving forward the boundaries of what's possible.

Get API Key

ARTICLES, TUTORIALS, and LECTURES

Customizing Large Language Models

In this step-by-step article, the author explains how to use the Modelfile in Ollama to change how an existing LLM (Llama2) behaves when interacting with it. He also shows how to save newly customized models to a personal namespace on the Ollama server.

Stanford Seminar: Transformers United

In this Stanford seminar, the lecturers examine the details of how transformers work and dive deep into the different kinds of transformers and how they are applied in different fields. The seminar combines instructor lectures, guest lectures, and classroom discussions.

Efficiently Finetune Llama 3 with PyTorch FSDP and Q-Lora

Unlocking the potential of LLMs often involves fine-tuning them on custom data. Fine-tuning smaller LLMs can be done on a single GPU by using Q-Lora. But efficiently fine-tuning bigger models like Llama 3 70b or Mixtral is a challenge. See how it can be done!

DragonCrawl: Generative AI for High-Quality Mobile Testing

DragonCrawl is a system that uses LLMs to execute mobile tests with the intuition of a human. It decides what actions to take based on the screen it sees and independently adapts to UI changes. Learn more about it in this article!

Building DoorDash’s Product Knowledge Graph with Large Language Models

Building an in-house attribute extraction/tagging model requires a significant amount of labeled training data. But LLMs can perform NLP with reasonable accuracy without requiring many labeled examples. See how this can be used to build a product knowledge graph!

PAPERS & PROJECTS

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

StoryDiffusion is a novel framework that helps maintain consistent content across a series of generated images. It uses Consistent Self-Attention, a new way of self-attention calculation, and Semantic Motion Predictor, a novel semantic space temporal motion prediction module, to describe a text-based story with consistent images or videos. Check it out!

Improving Diffusion Models for Authentic Virtual Try-on in the Wild

IDM-VTON is an image-based virtual try-on, which renders an image of a person wearing a curated garment, given a pair of images depicting the person and the garment, respectively. IDM-VTON, uses two different modules to encode the semantics of garment image. Learn more about them and real-world testing results!

Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting

In this paper, the authors make two fundamental contributions to the 3D scene generation field: note that lifting images to 3D with a monocular depth estimation model is suboptimal; introduce a novel depth completion model, trained via teacher distillation and self-training to learn the 3D fusion process. Explore their method in more detail!

Data Phoenix News

3,830 位关注者

要查看或添加评论，请登录

Dmytro Spodarets的更多文章

Explore This Week's AI Events with Data Phoenix (November 19th)

2024年11月19日

Explore This Week's AI Events with Data Phoenix (November 19th)

Welcome to Data Phoenix's Weekly AI Events Newsletter! We've gathered the most exciting conferences, meetups…
Explore This Week's AI Events with Data Phoenix (November 11th)

2024年11月11日

Explore This Week's AI Events with Data Phoenix (November 11th)

Welcome to Data Phoenix's Weekly AI Events Newsletter! We've gathered the most exciting conferences, meetups…

1 条评论
Explore This Week's AI Events with Data Phoenix (October 21st)

2024年10月22日

Explore This Week's AI Events with Data Phoenix (October 21st)

We've gathered the most exciting conferences, meetups, workshops, and webinars happening this week. Whether you're…
Discover Upcoming Week's AI Events with Data Phoenix Calendar (October 2nd)

2024年10月4日

Discover Upcoming Week's AI Events with Data Phoenix Calendar (October 2nd)

Welcome to Data Phoenix's Weekly AI Events Newsletter! Here, we bring you the most relevant conferences, meetups…
Discover This Week's AI Events with Data Phoenix Calendar (August 5th)

2024年8月5日

Discover This Week's AI Events with Data Phoenix Calendar (August 5th)

Welcome to Data Phoenix's Weekly AI Events Newsletter! Here, we bring you the most relevant conferences, meetups…

1 条评论
Discover This Week's AI Events with Data Phoenix Calendar (July 30th)

2024年7月30日

Discover This Week's AI Events with Data Phoenix Calendar (July 30th)

Welcome to Data Phoenix's Weekly AI Events Newsletter! Here, we bring you the most relevant conferences, meetups…
Discover This Week's AI Events with Data Phoenix Calendar (July 22nd)

2024年7月23日

Discover This Week's AI Events with Data Phoenix Calendar (July 22nd)

Welcome to Data Phoenix's Weekly AI Events Newsletter! We're excited to bring you a curated list of upcoming Data and…
Data Phoenix Digest - ISSUE 9.2024

2024年6月28日

Data Phoenix Digest - ISSUE 9.2024

Hi, everyone, and welcome to this month's edition of Data Phoenix Digest! Today, I want to reveal some exciting…

2 条评论
Webinar "Should I Use RAG or Fine-Tuning?"

2024年4月15日

Webinar "Should I Use RAG or Fine-Tuning?"

The Data Phoenix team invites you to our upcoming webinar, which will take place on May 2nd at 10 a.m.

1 条评论
Data and AI Evening: Demos & Networking

2024年3月29日

Data and AI Evening: Demos & Networking

The Data Phoenix team is thrilled to announce the launch of our new monthly event, "Data and AI Evening: Demos &…

See all articles

Data Phoenix Digest - ISSUE 8.2024

Dmytro Spodarets

DevOps Architect @ Grid Dynamics | Founder of Data Phoenix - The voice of AI and Data industry

What can good data do for you? - Twilio Segment

Data Phoenix's upcoming webinar:

Key Highlights of the Webinar:

领英推荐

200+ AI Models. One API. 24/7 AI Solution

ARTICLES, TUTORIALS, and LECTURES

PAPERS & PROJECTS

Data Phoenix News

3,830 位关注者

Dmytro Spodarets的更多文章

社区洞察

其他会员也浏览了

3 Ways to Transition Your Company Into A Data-Driven Culture

Revolutionizing Data Processing: How DSPyGen and Control Flow DSL Are Set to Save Days and Millions

Impact of Data Science in 2024

The Future of Data and Analytics

How I think about "Innovation" when building a Data Strategy

Lost in Translation: Bridging the Gap Between Data Complexity and Business Simplicity

September Edition: Top 5 Data Innovation Books for Your Reading List

Future Trends in Data Democratization: Envisioning Tomorrow's Insights

Harness The Power of Data

Is AI-Driven Productivity & Insight an Illusion? [Enterprise Edition]

What can good data do for you? - Twilio Segment

Data Phoenix's upcoming webinar:

Key Highlights of the Webinar:

领英推荐

200+ AI Models. One API. 24/7 AI Solution

ARTICLES, TUTORIALS, and LECTURES

PAPERS & PROJECTS

Data Phoenix News

3,830 位关注者

Dmytro Spodarets的更多文章

Explore This Week's AI Events with Data Phoenix (November 19th)

Explore This Week's AI Events with Data Phoenix (November 11th)

Explore This Week's AI Events with Data Phoenix (October 21st)

Discover Upcoming Week's AI Events with Data Phoenix Calendar (October 2nd)

Discover This Week's AI Events with Data Phoenix Calendar (August 5th)

Discover This Week's AI Events with Data Phoenix Calendar (July 30th)

Discover This Week's AI Events with Data Phoenix Calendar (July 22nd)

Data Phoenix Digest - ISSUE 9.2024

Webinar "Should I Use RAG or Fine-Tuning?"

Data and AI Evening: Demos & Networking

社区洞察

其他会员也浏览了

3 Ways to Transition Your Company Into A Data-Driven Culture

Revolutionizing Data Processing: How DSPyGen and Control Flow DSL Are Set to Save Days and Millions

Impact of Data Science in 2024

The Future of Data and Analytics

How I think about "Innovation" when building a Data Strategy

Lost in Translation: Bridging the Gap Between Data Complexity and Business Simplicity

September Edition: Top 5 Data Innovation Books for Your Reading List

Future Trends in Data Democratization: Envisioning Tomorrow's Insights

Harness The Power of Data

Is AI-Driven Productivity & Insight an Illusion? [Enterprise Edition]