Artificial Intelligence Unfolded: Article 6 - Synthetic Data, Model Training to Model Interaction
Hrishi Kulkarni
Chief Technology Officer (CTO), Executive Director, Board Member, Innovation and Change Catalyst, Strategic Technologist, Product & Data Engineering, Cloud Computing, AI/ML, GenAI, MLOps, Programme Management
Today, in the sixth article in my series on Artificial Intelligence, I thought I'd share my experience of working on a retail demand prediction project as part of my recent University of Oxford course on "Artificial Intelligence: Generative AI, Cloud and MLOps", run by Course Director Ajit Jaokar and his team. It's been a lot of fun, especially seeing everything come together in a working model, which also required me to deepen my understanding of synthetic data generation, feature engineering, various machine learning algorithms, training the model multiple times with hyperparameter tuning and, more importantly, assessing model performance before deployment. As always, I'll try and keep it simple.
The Role of Synthetic Data in Model Training
One of the biggest challenges in machine learning is obtaining large and diverse datasets that are reflective of real-world scenarios. This is where synthetic data comes in. By generating my own data, I ensured that the model training was robust and comprehensive. I created datasets that mimic real-world sales data, complete with product IDs, sales figures, and economic indicators such as GDP and inflation.
Generating Synthetic Data with ChatGPT
To kick things off, I used ChatGPT to help brainstorm and outline the types of data points relevant for a demand prediction model. Through conversation (I had to ask the right questions), we discussed various attributes like GDP growth rates, inflation rates, and typical sales figures, which helped me shape a dataset that was not only realistic but also tailored to the specific challenges of predicting product demand.
ChatGPT was able to assist me in generating a 500k-record dataset containing sales transactions across various countries, stores, products and time periods.
I had to go through several iterations to produce the data I needed, fine-tuning my prompts and giving ChatGPT improved data rules I wanted it to abide by with each iteration.
#PromptEngineering If interested, please read my previous article https://www.dhirubhai.net/pulse/artificial-intelligence-unfolded-article-5-crafting-hrishi-kulkarni-ytvse
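For readers who would rather script the data than prompt for it, below is a minimal sketch of how a dataset with a similar shape could be generated with pandas and NumPy. This is not how I produced my data (I used ChatGPT), and the column names, value ranges and the simple link between sales and the economic indicators are illustrative assumptions rather than the exact rules I iterated on.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)
n_records = 500_000  # matches the scale of the dataset; the schema below is illustrative

countries = ["UK", "US", "DE", "FR", "IN"]
dates = pd.date_range("2019-01-01", "2023-12-31", freq="D")

df = pd.DataFrame({
    "date": rng.choice(dates, n_records),
    "country": rng.choice(countries, n_records),
    "store_id": rng.integers(1, 51, n_records),
    "product_id": rng.integers(1, 201, n_records),
    # simple per-record economic indicators (the real rules were refined over several prompts)
    "gdp_growth": rng.normal(2.0, 1.0, n_records).round(2),
    "inflation": rng.normal(3.0, 1.5, n_records).round(2),
})

# sales loosely driven by the economic indicators plus noise, clipped at zero
base = 50 + 5 * df["gdp_growth"] - 3 * df["inflation"]
df["units_sold"] = np.clip(base + rng.normal(0, 10, n_records), 0, None).round().astype(int)
```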
The Crucial Steps of Model Training
Loading and Preparing Data: The first step was to load the data into a suitable format for analysis and processing. This included basic cleaning and setting up data structures that support efficient access and manipulation.
Feature Engineering: The heart of a good model lies in its features. I spent considerable time crafting features that could capture the underlying patterns in the data. From rolling averages to more complex calculations like exponential moving averages (EMA), each feature was designed to provide the model with insightful inputs (see the sketch after these steps).
Data Exploration and Feature Selection: Before diving into model building, I explored the data through visualisations to understand the distributions and relationships. This step was crucial for feature selection, ensuring that only the most relevant and impactful features were included in the final model.
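To make the feature engineering step concrete, here is a small sketch of rolling-average and EMA style features, assuming a dataframe with date, product_id and units_sold columns. The names and window sizes are illustrative, not necessarily my exact schema.

```python
import pandas as pd

# df is assumed to hold one row per product per day with columns: date, product_id, units_sold
df = df.sort_values(["product_id", "date"])

grouped = df.groupby("product_id")["units_sold"]

# 7-day rolling average of sales per product
df["rolling_mean_7"] = grouped.transform(lambda s: s.rolling(window=7, min_periods=1).mean())
# exponential moving average gives more weight to recent sales
df["ema_7"] = grouped.transform(lambda s: s.ewm(span=7, adjust=False).mean())
# lag feature so the model only sees information available before the prediction date
df["lag_1"] = grouped.shift(1)
```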
Model Development and Training
Before training, scaling (also known as feature scaling or data normalisation) was a crucial preprocessing step. Its primary purpose was to standardise the range of the independent variables or features, which helps ensure that the machine learning algorithm functions optimally.
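As a sketch of that step with scikit-learn's StandardScaler; the feature list and split settings here are illustrative, carried over from the earlier snippets.

```python
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

feature_cols = ["rolling_mean_7", "ema_7", "lag_1", "gdp_growth", "inflation"]  # illustrative
model_df = df.dropna(subset=feature_cols)
X = model_df[feature_cols]
y = model_df["units_sold"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

scaler = StandardScaler()
# fit on the training data only, then reuse the same scaler for test data and live predictions
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)
```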
Choosing the right model was key. I tried Linear Regression and a Random Forest Regressor, but settled on the Gradient Boosting Regressor for its robustness and effectiveness in handling diverse datasets. I spent a good amount of time analysing the output of each of these models before deciding which one to use; a quick comparison sketch follows below.
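A rough sketch of that comparison, using cross-validated MSE as the yardstick. The model parameters and fold count are illustrative, not my exact configuration.

```python
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

candidates = {
    "Linear Regression": LinearRegression(),
    "Random Forest": RandomForestRegressor(n_estimators=100, random_state=42),
    "Gradient Boosting": GradientBoostingRegressor(random_state=42),
}

for name, model in candidates.items():
    # negative MSE is scikit-learn's convention, so flip the sign for readability
    scores = cross_val_score(model, X_train_scaled, y_train,
                             scoring="neg_mean_squared_error", cv=3)
    print(f"{name}: mean MSE = {-scores.mean():.2f}")
```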
Using GridSearchCV was a game-changer: it automated the tuning process, trying out various combinations of hyperparameters to find the best fit. Hyperparameters are the settings of a model that are fixed before training (such as tree depth or learning rate) and significantly influence performance. GridSearchCV scores each combination using cross-validation, so the winning settings are judged on held-out folds rather than on the data they were trained on.
The fitting process itself was quite interesting. Across different runs, GridSearchCV fitted the model 96 times, 24 times and 12 times with different parameter grids. These counts essentially reflect the number of training runs needed to find the best-fit hyperparameters: the number of candidate combinations multiplied by the number of cross-validation folds.
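A minimal sketch of such a search, with an illustrative grid rather than my exact search space. Here 2 x 2 x 2 = 8 candidate combinations evaluated over 3 folds gives the "24 fits" that GridSearchCV reports; a larger grid produces counts like 96 in the same way.

```python
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import GridSearchCV

param_grid = {
    "n_estimators": [100, 200],
    "max_depth": [3, 5],
    "learning_rate": [0.01, 0.1],
}

# 8 candidates x 3 folds = 24 fits in total
search = GridSearchCV(
    GradientBoostingRegressor(random_state=42),
    param_grid,
    scoring="neg_mean_squared_error",
    cv=3,
    verbose=1,
)
search.fit(X_train_scaled, y_train)
print(search.best_params_)
```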
Hyperparameter Tuning and Retraining
I experimented with different settings, adjusting parameters like the number of estimators and the depth of the decision trees. This not only enhanced the model's accuracy but also gave me a deep dive into how each parameter impacts the model’s performance.
Another interesting observation was how the learning rate can impact model training and outcomes. I tried training models with learning rates of 0.1 and 0.01. On balance, the higher learning rate (0.1, compared with 0.01 in one of the earlier iterations), together with well-chosen tree parameters, helped regain much of the accuracy and model fit that had been lost in some of the intervening iterations.
Below are the key aspects I considered when assessing the model's suitability for deployment:
Stability and Generalisation: The model settings had to offer a good balance between accuracy and generalisation. The model parameters needed to be well tuned to prevent overfitting while still capturing the essential patterns in the data.
Model Robustness: Consistent performance across the training and testing datasets, as indicated by similar Mean Squared Error (MSE) values, was an important consideration (see the sketch after this list).
Further Tuning: While the model performed very well, there will always be ways to improve the accuracy.
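As a quick sketch of the robustness check mentioned above, comparing MSE on the training and test splits (the variable names carry over from the earlier illustrative snippets):

```python
from sklearn.metrics import mean_squared_error

best_model = search.best_estimator_
train_mse = mean_squared_error(y_train, best_model.predict(X_train_scaled))
test_mse = mean_squared_error(y_test, best_model.predict(X_test_scaled))

# a large gap between the two values is a sign of overfitting
print(f"Train MSE: {train_mse:.2f}  Test MSE: {test_mse:.2f}")
```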
From Model Training to Interaction
The final piece of the puzzle was the forecast demand function, which uses past sales data to calculate key features like EMA and feed them into the model. This function became the bridge between raw data and actionable predictions, allowing me to interact with the model in a meaningful way.
Using the exact same scaler or transformation applied during training when making predictions was fundamental for maintaining consistency and accuracy.
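Below is a simplified sketch of what such a forecast function could look like. The function name, signature and feature names are hypothetical and carried over from the earlier illustrative snippets, but it shows the key idea: recompute the engineered features from recent sales and reuse the scaler fitted at training time.

```python
import pandas as pd

def forecast_demand(past_sales, gdp_growth, inflation, model, scaler):
    """Turn recent daily sales into the engineered features and predict demand.

    `past_sales` is a list or Series of recent daily unit sales; the feature
    names and order must match those used at training time (illustrative here).
    """
    s = pd.Series(past_sales)
    features = pd.DataFrame([{
        "rolling_mean_7": s.rolling(window=7, min_periods=1).mean().iloc[-1],
        "ema_7": s.ewm(span=7, adjust=False).mean().iloc[-1],
        "lag_1": s.iloc[-1],
        "gdp_growth": gdp_growth,
        "inflation": inflation,
    }])
    # reuse the scaler fitted on the training data; never re-fit it here
    return float(model.predict(scaler.transform(features))[0])

# example call with hypothetical values
# forecast_demand([42, 38, 51, 47, 45, 50, 44], gdp_growth=2.1, inflation=3.0,
#                 model=best_model, scaler=scaler)
```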
Deploying the Best Model
After numerous training sessions and adjustments, I identified the best-performing model based on its accuracy and generalisation capabilities. Deploying this model and plugging it into a user-friendly interface was the culmination of all the hard work — a tool that not only predicts but also adapts and learns from new data.
Conclusion
This journey from data preparation to model deployment has been incredibly rewarding. I kept the model deployment simple and didn't deploy it as an endpoint. It was real fun though, and I enjoyed every step, from the nitty-gritty of tuning models to the thrill of seeing accurate predictions unfold. It's a testament to how AI can transform data into insights, and insights into actionable intelligence.
Stay tuned for more updates as I continue exploring new aspects of AI. If you're embarking on a similar journey, I suggest you get your hands dirty and understand the principles.
Kelly Coutinho Anjali Jain tagging you as felt you'd like to read this :-)