The Unprecedented Shift in Application Architecture: AI Meets Cloud Infrastructure
Credit: Sapphire Ventures

A few weeks ago, I participated as a panelist at an AI Design & Practice mixer organized by Sapphire Ventures. The following article summarizes the answers I provided, detailing StarTree's AI journey to date.

Never before has there been such a dramatic shift in application architecture as when AI met the power of limitless cloud infrastructure. This fusion has catapulted powerful Large Language Models to the forefront, with the promise to profoundly transform a multitude of industries with their versatility and depth. The magnitude of this transformation surpasses even the pivotal shifts witnessed during the client-server and Internet eras.

The Genesis of Our AI Journey: More Than Just Automating Tasks

Our First Use Case: Automating Security Questionnaires

Our venture into generative AI began not from a purely technological challenge but from a recurring pain point: vendor security questionnaires. Each questionnaire from a prospective client was unique, making canned responses ineffective. By harnessing LLMs, we transformed a tedious task into an automated process, drawing from a vast trove of documents, from technical white papers to detailed presentations on StarTree Cloud architecture. This wasn't just about efficiency; it was a necessary evolution to keep pace with the demands of modern security compliance.
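The shape of that automation can be sketched as a retrieve-then-prompt loop: find the internal documents most relevant to a questionnaire item, then assemble a grounded prompt for the LLM. The document names, corpus contents, and keyword-overlap scoring below are illustrative stand-ins, not StarTree's actual retrieval logic:

```python
# Hypothetical sketch of the questionnaire-answering flow: retrieve relevant
# internal documents, then assemble a grounded LLM prompt from them.

def score(question: str, document: str) -> int:
    """Naive relevance score: count question words that appear in the document."""
    words = {w.lower() for w in question.split()}
    return sum(1 for w in words if w in document.lower())

def retrieve(question: str, corpus: dict[str, str], top_k: int = 2) -> list[str]:
    """Return the names of the top_k most relevant documents."""
    ranked = sorted(corpus, key=lambda name: score(question, corpus[name]),
                    reverse=True)
    return ranked[:top_k]

def build_prompt(question: str, corpus: dict[str, str]) -> str:
    """Concatenate retrieved context with the question for the LLM."""
    context = "\n\n".join(corpus[name] for name in retrieve(question, corpus))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

# Toy stand-ins for the real document trove.
corpus = {
    "whitepaper": "Data is encrypted at rest with AES-256 and in transit with TLS 1.2+.",
    "architecture": "The cloud control plane is isolated from customer data planes.",
    "onboarding": "New hires complete security awareness training in week one.",
}
prompt = build_prompt("Is customer data encrypted at rest?", corpus)
```

In production the keyword scorer would be replaced by embedding-based retrieval, but the overall flow, retrieve context then ground the model's answer in it, stays the same.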

Strategic Model Evaluation: Choosing the Right Foundation for AI

Delving Deep into Model Selection

When evaluating models, our approach is thorough and grounded in practicality:

  1. Training Content Depth: We meticulously analyze the training materials of each model. For our tech-heavy needs, it's crucial that models are well-versed in technical documentation and coding repositories. For instance, we ensure that our models are trained on a diverse range of programming languages, including Python, Java, and C++.
  2. Model Flexibility: From zero-shot capabilities to fine-tuning potential, understanding a model's adaptability helps us stay agile. We assess each model's ability to handle various tasks, such as text classification, sentiment analysis, and code generation.
  3. Performance and Precision: We rely on benchmarks like MMLU to measure model accuracy and ensure we maintain a lean set of high-performing models, balancing complexity with manageability. Our goal is to achieve optimal performance while minimizing latency and computational resources.
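The third criterion can be made concrete with a small evaluation harness: score each candidate model on a labeled set and keep the best performer. The "models" below are stand-in callables; in practice each would wrap an actual LLM endpoint, and the benchmark would be something like MMLU rather than three toy arithmetic items:

```python
# Illustrative harness for criterion 3: measure accuracy per candidate model
# on a labeled dataset, then select the top scorer.

def accuracy(model, dataset: list[tuple[str, str]]) -> float:
    """Fraction of items where the model's answer matches the expected one."""
    correct = sum(1 for prompt, expected in dataset if model(prompt) == expected)
    return correct / len(dataset)

def pick_best(models: dict, dataset: list[tuple[str, str]]) -> str:
    """Return the name of the most accurate model."""
    scores = {name: accuracy(fn, dataset) for name, fn in models.items()}
    return max(scores, key=scores.get)

# Toy benchmark and stand-in "models" (real ones would call LLM APIs).
dataset = [("2+2", "4"), ("3+3", "6"), ("5+1", "6")]
models = {
    "model-a": lambda p: str(eval(p)),  # answers arithmetic correctly
    "model-b": lambda p: "4",           # only right when the answer is 4
}
best = pick_best(models, dataset)
```

Keeping the harness this simple makes it cheap to re-run whenever a new model candidate appears, which is how we keep the set of deployed models lean.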

Vector Databases: The New Frontier

Embracing Vector DBs with Apache Pinot

Our journey with vector databases began with early experiments using FAISS and HNSW for simple applications. However, the real game-changer was Apache Pinot's announcement of version 1.1, which supports storing vector embeddings efficiently. This capability allows us to perform sophisticated similarity searches and has prompted us to migrate our existing applications to Pinot, heralding a new phase in our data handling capabilities. The following visual shows an architecture for (a) a RAG application and (b) a pipeline to build and move embeddings into Pinot.

RAG Application using Apache Pinot
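At its core, the similarity search that vector storage enables is a nearest-neighbor ranking over embeddings. The dependency-free sketch below shows the idea with cosine similarity; real embeddings would come from an embedding model and live in Pinot (or, in our earlier experiments, a FAISS/HNSW index), and the three-dimensional vectors here are placeholders:

```python
# Minimal cosine-similarity search over a toy in-memory "vector index".
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def nearest(query: list[float], index: dict, top_k: int = 1) -> list[str]:
    """Rank stored embeddings by similarity to the query vector."""
    ranked = sorted(index, key=lambda doc_id: cosine(query, index[doc_id]),
                    reverse=True)
    return ranked[:top_k]

# Placeholder embeddings; real ones are produced by an embedding model.
index = {
    "doc-pinot": [0.9, 0.1, 0.0],
    "doc-kafka": [0.1, 0.9, 0.0],
    "doc-misc":  [0.0, 0.1, 0.9],
}
hits = nearest([1.0, 0.0, 0.1], index, top_k=2)
```

A database-backed index replaces the brute-force `sorted` call with an approximate-nearest-neighbor structure such as HNSW, which is what makes this practical at scale.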


The Underrated Art of Prompt Engineering

The Craft of Prompt Engineering

Prompt engineering is perhaps the most nuanced yet underappreciated aspect of working with LLMs. Each prompt requires careful iteration and a deep understanding of the desired outcome. Our guidelines on prompt engineering focus on defining roles clearly, structuring prompts effectively, and contextualizing inputs. This iterative process is not just about technical precision; it's about creating a dialogue with the machine that feels natural and productive. Eventually, we began leveraging AI models themselves to generate more effective prompts.
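Those three guidelines, define the role, structure the sections, contextualize the input, can be captured in a small template helper. The section labels below are illustrative conventions, not a required format:

```python
# Sketch of a structured prompt builder following the guidelines above:
# explicit role, clearly delimited sections, and grounding context.

def build_prompt(role: str, context: str, task: str) -> str:
    """Assemble a role-scoped, context-grounded prompt."""
    return (
        f"Role: {role}\n\n"
        f"Context:\n{context}\n\n"
        f"Task: {task}\n"
        "Answer only from the context; say 'unknown' if it is insufficient."
    )

prompt = build_prompt(
    role="You are a solutions engineer answering a security questionnaire.",
    context="Backups are encrypted and retained for 30 days.",
    task="How long are backups retained?",
)
```

Templating the structure this way makes each iteration a change to one field rather than a rewrite of the whole prompt, which is what makes the iterative refinement loop manageable.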

Managing Risks in AI Integration

Navigating the Risks

Integrating AI into applications brings about complex challenges, particularly in ensuring the security of outputs. We implement rigorous checks to guard against potential vulnerabilities like code injections, emphasizing the importance of auditing and monitoring to maintain trust and integrity in our systems. For instance, we use techniques like input validation and sanitization to prevent malicious code from being executed.
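One of the simplest guardrails in that spirit is screening model output before it reaches an interpreter. The deny-list below is a toy illustration, not an exhaustive defense; a real deployment would layer it with sandboxed execution, auditing, and monitoring:

```python
# Illustrative output guardrail: reject generated code containing obviously
# dangerous patterns before execution. Pattern list is a toy example only.
import re

DENY_PATTERNS = [
    r"\bos\.system\b",   # shelling out
    r"\beval\s*\(",      # arbitrary code execution
    r"\bsubprocess\b",   # process spawning
    r"rm\s+-rf",         # destructive shell command
]

def is_safe(generated_code: str) -> bool:
    """Return False if any deny-listed pattern appears in the code."""
    return not any(re.search(p, generated_code) for p in DENY_PATTERNS)

safe = is_safe("print(1 + 1)")                    # benign snippet passes
unsafe = is_safe("import os; os.system('ls')")    # shelling out is rejected
```

Deny-lists are easy to bypass in isolation, which is why the checks described above pair them with validation at the input side and continuous auditing of what the system actually executed.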

Democratizing AI Development: Centralized Teams and Local Integration

Centralized Innovation, Local Execution

Our approach to AI development mirrors our broader engineering philosophy: centralize expertise but decentralize execution. By building a strong central team that develops AI frameworks and infrastructures, we empower local teams to apply these tools effectively, ensuring that AI integration is both innovative and practical. This approach enables us to strike a balance between consistency and adaptability, allowing us to respond quickly to changing requirements and user needs.

Reflecting on the Journey and Anticipating the Future

Looking Back and Gazing Forward

Reflecting on our journey, the pace at which AI technologies have evolved is nothing short of dizzying. The future holds exciting prospects, particularly in how AI will handle complex, multi-step reasoning tasks. With tools like Apache Pinot enhancing our capabilities in real-time data handling, we're on the brink of transforming how businesses leverage AI to make real-time, data-driven decisions.

In closing, by embracing this shift in application architectures, we're poised to revolutionize industries and create new possibilities. Join us on this exciting journey as we continue to explore and innovate in the world of AI and real-time analytics.
