FsOpenAI: A GPT 'chat' app for Internal Organizational Data

#chatgpt #azureopenai #semantickernel #semanticsearch #fsharp

Unsurprisingly, the demand for accessing Large Language Models (LLMs) is high because of the promise they hold. I will not belabor their benefits here; the key point is that organizational employees are clamoring to apply the power of GPT-style models to their own internal data.

Many organizations have directives that discourage employees from using the public OpenAI endpoints with internal data - which just adds fuel to the fire.

Today, the Azure OpenAI service is the only option for privately accessible GPT-style models. However, beyond models we also need services to securely store and index internal data so that it can be used effectively with the Azure-deployed models. The Azure Cognitive Search service can fulfill this need (although other options are also available).

FsOpenAI is meant to be deployed as an Azure web app. The goal of FsOpenAI is to provide a chat UI that can be quickly deployed to securely utilize the aforementioned Azure services. The configuration options are kept limited to reduce the IT deployment overhead.

Question-Answer Interactions

FsOpenAI has two chat or interaction modes:

  1. Basic chat with pre-trained GPT-style models
  2. Question and answer sessions over data stored in Cognitive Search - in conjunction with Azure OpenAI models

Most are familiar with the first. The second is of primary interest here. The diagram below shows the data flows that enable the Q&A interactions.

[Figure: Front- and backend data flows associated with the Q&A interaction mode]

The right-hand side shows the data flows associated with creating the search index, while the left shows the flows associated with a typical Q&A interaction. Internal document data is 'shredded' into ~1K-sized, possibly overlapping, chunks. An embedding vector is generated for each chunk; this vector captures the semantics of the chunk's text. The chunk text, vector, and any associated metadata are inserted into an Azure Cognitive Search index.
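The chunking step above can be sketched as follows. This is an illustrative Python sketch, not FsOpenAI's actual implementation: the chunk size, overlap value, and character-based (rather than token-based) splitting are all assumptions made for clarity.

```python
def chunk_text(text: str, size: int = 1000, overlap: int = 100) -> list[str]:
    """Split text into ~`size`-character chunks, each overlapping the previous
    chunk by `overlap` characters so that sentences cut at a boundary still
    appear whole in at least one chunk."""
    if size <= overlap:
        raise ValueError("size must exceed overlap")
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]

# Each chunk would then be embedded and uploaded to the search index as a
# document shaped roughly like (field names are hypothetical):
#   {"id": ..., "content": chunk, "contentVector": embedding, "metadata": ...}
```

The overlap is a common hedge against losing context at chunk boundaries; the right size depends on the embedding model's input limit.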

When a user inputs a query (question or instruction) against the indexed data, the query is first vectorized using the same embedding model that was used for indexing. The resulting vector is then used to perform a nearest-neighbor search of the index to find the most relevant text chunks. The retrieved chunks are combined with the original query and sent to the deployed chat model, which generates an answer grounded in the retrieved search results. See example below:

[Figure: Answer to an accounting-related query over an index created with publicly available SEC filings data]
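The query-time flow described above can be sketched in a few lines. This is a minimal illustration in Python, assuming an in-memory index of (chunk, vector) pairs and brute-force cosine similarity; Azure Cognitive Search performs the nearest-neighbor search server-side, and the prompt wording here is invented for the example.

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def top_k(query_vec: list[float], index: list[tuple[str, list[float]]], k: int = 3) -> list[str]:
    """Return the k chunk texts whose vectors are most similar to the query vector."""
    ranked = sorted(index, key=lambda rec: cosine(query_vec, rec[1]), reverse=True)
    return [text for text, _ in ranked[:k]]

def build_prompt(question: str, chunks: list[str]) -> str:
    """Combine the retrieved chunks with the user's query for the chat model."""
    context = "\n---\n".join(chunks)
    return (f"Answer the question using only the context below.\n\n"
            f"Context:\n{context}\n\nQuestion: {question}")
```

The prompt returned by `build_prompt` is what would be sent to the deployed chat model; the model's answer is thereby constrained to the retrieved internal data.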

Application Features

The FsOpenAI interface design was informed by the experience gained from performing scripted Q&A interactions over several internal and public datasets. The following are the main features worth mentioning:

  • Each chat has its own tab. Multiple concurrent chat sessions are supported.
  • The chat parameters (temperature, selected models, etc.) are specific to each chat. One can easily experiment with how the settings affect the same basic query.
  • Chats can be saved to local browser storage and are reloaded the next time the site is opened.
  • To the extent possible, the model responses are streamed back to the UI in real-time. This is key because some models, e.g., GPT-4 and GPT-4-32K, can take a while (up to a minute) to generate the full response.
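The streaming behavior in the last bullet can be illustrated with a small sketch. FsOpenAI is a Blazor/F# app, so this Python async-generator version only demonstrates the consumer pattern: render each token as it arrives instead of waiting for the full completion. The token source here is simulated, not a real model call.

```python
import asyncio

async def stream_tokens(tokens):
    """Simulate a model that yields its answer incrementally, token by token."""
    for tok in tokens:
        await asyncio.sleep(0)  # stand-in for per-token network latency
        yield tok

async def render(tokens) -> str:
    """Append each token to the 'UI' as soon as it arrives, rather than
    blocking until the full response is available."""
    shown = []
    async for tok in stream_tokens(tokens):
        shown.append(tok)  # in a real UI, update the chat bubble here
    return "".join(shown)

result = asyncio.run(render(["Hel", "lo ", "world"]))
```

For slow models such as GPT-4-32K, this pattern means the user sees the first words within seconds even when the full answer takes a minute.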

Features to be supported in the future:

  • Custom prompts
  • OpenAI plugins and functions to support more complex interactions
  • Ability to index and query documents on the fly

Summary

FsOpenAI exists to support Q&A interactions with GPT models in the context of internal organizational data. Beyond the setup required for Azure OpenAI (model deployments) and Azure Cognitive Search (vector indexes), FsOpenAI itself needs only the Azure Web App and Key Vault services to deploy.

FsOpenAI source is available under a liberal MIT license. An online version of the app - with no backend services - is available here for a limited time. You may use your own OpenAI key to quickly test the base chat interface with publicly deployed OpenAI models.
