What We Learnt from PunterGPT
www.evolvedreasoning.ai

We built PunterGPT as a light-hearted supplement to the more robust and complete statistical analysis built by the Macquarie Quant team to pick the winners of the Melbourne Cup. Using large language models, we fed in a set of punter conversations from around the track and asked the AI for its best picks, based on the often highly contextual information shared by the experts who follow horses for a living. The AI picked its top horses for very different reasons, and one of them ended up winning the race.

Here is our story.

The Beginning

It was only two weeks ago that John Conomos, Head of Global Quantitative Research, and I found ourselves talking about the upcoming Melbourne Cup. Like all good conversations, it started with banter: the quantifiable details of a horse race, and how the Quant team had turned their attention to forecasting this race using a factor-type analysis. This quantitative modelling approach casts a wide net for potential drivers and identifies the factors that are significant for success across horses. For a factor to be significant, it must be a useful driver of separation across horses. The underlying assumption is that there is a set of predictive factors, F, that work uniformly across all horses, and the role of the statistical analysis is to identify them.
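The factor approach described above can be sketched as a cross-sectional regression: each horse gets a row of factor exposures, and a single coefficient vector is fit uniformly across the whole field. A minimal sketch, assuming made-up factors and scores purely for illustration (this is not the Quant team's actual model):

```python
import numpy as np

# Illustrative factor matrix: one row per horse, one column per factor
# (e.g. recent form, barrier draw, carried weight). All values are made up.
factors = np.array([
    [0.8,  3.0, 57.5],
    [0.6, 11.0, 55.0],
    [0.9,  1.0, 58.5],
    [0.4,  7.0, 54.0],
])
# Illustrative outcome: a past performance score per horse (higher = better).
scores = np.array([0.9, 0.4, 0.7, 0.2])

# Fit one coefficient per factor, shared across all horses --
# the uniform factor set F described in the text.
coef, *_ = np.linalg.lstsq(factors, scores, rcond=None)

# Rank the field by predicted score, best horse first.
predicted = factors @ coef
ranking = np.argsort(-predicted)
print(ranking)
```

The key design choice is that the coefficients are estimated across the whole field at once, which is exactly why one-off contextual events for a single horse are hard to capture.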

We then turned to the idea that so much of horse racing, like financial markets, runs on a revolving door of news, context and bits and pieces of information. Much of it is noise: transient and irrelevant. But some of it is relevant, and specifically relevant to some horses but not others. For example, it doesn't matter whether a horse has travelled internationally shortly before the race, unless you are that horse, of course. For another contender, the issue may be a recent illness, but again, a rare occurrence across the field.

These contextual factors are hard to capture in any systematic model, and at best form a kind of 'exception' rule. But that is hard, and not usually within the realm of traditional factor-based econometrics.

How We Built PunterGPT

From that somewhat technical conversation emerged the idea of PunterGPT, an application of the language models behind ChatGPT and Claude 2. We would use these models to analyse and summarise the nuance and the contextual factors that the market of punters talks about and finds important. We would rely on the 'market experts' to determine these factors; there are no independent statistical tests. What we would use the language model for was to aggregate these factors, sift out the relevant contexts, and use (horse-racing-specific) language as a kind of gauge of relevance.

We researched the most concentrated and context-rich content online about the race. This was a quick manual task, though we could have used language models themselves to run this search and identify the input sources. Doing it by hand was critical, so that we understood where our (language) data came from.

In the end, we landed on these three websites:

This created a document of roughly 24 pages (roughly one page per horse) of punter analysis. We then fed this analysis into Claude 2 as input, and identified that element as 'PunterGPT'.

We started working with the following prompt, and used various versions of it as we investigated the punters:

You are punterGPT, an expert system that understands all of the nuance and subtle information around horses and their likely form before a big race. The following is a description from the other punters on the street. Provide the top six horses that you think are most likely to be in excellent form.
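Mechanically, the step is simple: combine the prompt above with the scraped punter document and send the result to the model. A minimal sketch of the prompt assembly, assuming an illustrative snippet in place of the real 24-page document (the actual API call would depend on the client library used and is omitted here):

```python
# The instruction used for PunterGPT, as quoted in the text above.
SYSTEM_PROMPT = (
    "You are punterGPT, an expert system that understands all of the "
    "nuance and subtle information around horses and their likely form "
    "before a big race. The following is a description from the other "
    "punters on the street. Provide the top six horses that you think "
    "are most likely to be in excellent form."
)

def build_prompt(punter_text: str) -> str:
    """Combine the instruction with the collected punter analysis."""
    return f"{SYSTEM_PROMPT}\n\n---\n\n{punter_text}"

# Illustrative stand-in for the ~24-page punter document.
punter_text = "Vauban: travelled well from Ireland, up 1kg on last year..."
prompt = build_prompt(punter_text)
print(len(prompt))
```

Variants of the experiment (the counterfactual questions later in the piece) only change the instruction text, not this assembly step.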


For each horse, the description was stunningly thoughtful, balanced and articulate. The reasons it raised, and the flow of its reasoning, were very compelling, even though we knew there was no formal science behind the analysis. (Sound familiar?)

We then wanted to use the analysis to provide counterfactual arguments: tell me why it may not be a good idea to bet on a horse. That is something often hard to find in punting magazines, unless it is implied.


The analysis once again surprised us. Here was a set of reasons and counter-reasons, also drawn from the punter information, including things that could be perceived as either negative or positive.

We were curious how we could push this further into counter-arguments. We focused on the favourite, Vauban, which looked like the horse to beat. (In the end, Vauban did not place in the top three.)


The counter-arguments were stunningly focused and readable. The way the information was crafted from the inputs was clever; point 3, on Weight, reads: "He blitzed them with 57.5kg last year but goes up an extra kilogram and the legendary Makybe Diva is the only horse to lump more than 58kg to Cup victory since Think Big in 1975."

However, it is not entirely correct. The original text suggests that two horses carried more than 58kg, and one carried 57.5kg. Therefore, three horses qualify as having won carrying at least 57.5kg, not the 55.5kg implied in the summary text. There are no other mentions of historic weights and winning in the text.
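Slips like this are easy to catch mechanically: extract the numeric claim from the model's summary and check it against the figures in the source text itself. A minimal sketch with placeholder data (the horse names and weights below are illustrative only, not actual Melbourne Cup records):

```python
# Placeholder records standing in for the winners mentioned in the
# source text; names and weights are illustrative only.
winners = [
    {"horse": "Horse A", "weight_kg": 58.5},
    {"horse": "Horse B", "weight_kg": 58.5},
    {"horse": "Horse C", "weight_kg": 57.5},
]

# The model's claim: only one winner "lumped more than 58kg".
over_58 = [r["horse"] for r in winners if r["weight_kg"] > 58.0]

# The source's claim: three winners carried at least 57.5kg.
at_least_57_5 = [r["horse"] for r in winners if r["weight_kg"] >= 57.5]

print(len(over_58), len(at_least_57_5))
```

Even a check this crude makes the point: language models paraphrase numbers fluently, so any figure in a generated summary is worth reconciling against the underlying data.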


Other interesting questions we asked:

  • What are the factors that are being considered by the punters in the market?
  • What are the factors that are not being considered, but should be?
  • What are the contextual factors that are in the following statistical model, but that are not mentioned by the punters as important?
  • What is a factor that is being (overly) focused on by the punter community?
  • What is something that is implied, but not explicitly written about each horse?


What We Learnt

Once again, language models impressed us with their ability to summarise and interrogate written text. The richness and contextual nature of the information was easy to understand and keep tabs on. As financial services professionals, this reminded us of the barrage of detail and data that lands in our inboxes each day.

  • Language models brought us closer to this content, allowing us to dive in and summarise much more than we otherwise could. They let us build cases for and against, look for hidden meaning, and analyse the market voices themselves.
  • For cross-sectional and often heterogeneous information like market summation, with highly context-specific language, we found this type of analysis extremely accessible.
  • We learnt that context window length is extremely important, and that attention wasn't always evenly split. We needed to control not only the data that we used, but its positioning relative to other data.
  • We learnt that context can be retained or lost as the conversation grows longer. This was something to keep an eye on at all times.
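The two context-window lessons above can be made concrete: given a token budget, pack the most important material first, so it is never pushed out of the window or buried where attention is weakest. A minimal sketch, assuming a rough 4-characters-per-token estimate and an illustrative document set:

```python
def approx_tokens(text: str) -> int:
    """Rough token estimate: ~4 characters per token (an assumption)."""
    return max(1, len(text) // 4)

def pack_context(documents, budget_tokens):
    """Greedily pack documents into the budget, in priority order.

    documents: list of (priority, text) pairs; a lower priority number
    means more important, and goes earlier in the prompt.
    """
    packed, used = [], 0
    for _, text in sorted(documents, key=lambda d: d[0]):
        cost = approx_tokens(text)
        if used + cost > budget_tokens:
            continue  # drop lower-priority material rather than truncate
        packed.append(text)
        used += cost
    return "\n\n".join(packed)

# Illustrative documents with explicit priorities.
docs = [
    (2, "General race-day chatter " * 50),
    (1, "Horse-specific form notes " * 50),
    (3, "Weather speculation " * 200),
]
context = pack_context(docs, budget_tokens=500)
```

Controlling position this way, rather than letting the scrape order decide, was the practical upshot of the lesson about uneven attention.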

As an exercise in using experimental technology to augment a successful and established investment process, it was a powerful one.


Amazing! Dr. Michael G. Kollo. How can I get it for the 2024 Melbourne Cup?

Mark Jones

Portfolio Manager at Resolution Capital

1y

Great insight. I look forward to this being applied to sell side research. The real interesting part of PunterGPT was the counterfactual on Vauban. Sell side research tilts to the positive, but in aggregation there may be sufficient information to allow for insights into the counterfactual negative. Keen to subscribe to StreetGPT!

John Conomos

Head of Global Quantitative Research at Macquarie Group

1y

Excellent summary Dr. Michael G. Kollo. The potential applications of this technology span across a wide range of our daily work. While scaling something like this may require a decent investment, the process of bringing PunterGPT to life was surprisingly straightforward. The key is to shift our perspective on what is possible.

James McLoughlin

App Marketing Assassin | Sports Betting & FinTech Acquisition | Coffee Snob

1y

Great content Dr. Michael G. Kollo

Jason L.

Cross Asset | Cross Function | AI Developer | MLOps | Generative AI | Deep Reinforcement Learning

1y

Thanks for sharing. Prediction, Compression :)
