Getting Hands-on Experience with Machine Learning in Offshore Geotechnics - ISFOG2020 Prediction Exercise
Data science techniques are rapidly transforming businesses in a broad range of sectors. While marketing and social applications have received most attention to date, geotechnical engineering can also benefit from data science tools which are now readily available.
There is a lot of buzz around machine learning and AI. Some posts are claiming it will completely transform the way we do work. But what I had not yet seen was an in-depth discussion between members of the community on how we can benefit from these methods and how they can be applied in practice. Before believing that data science will fundamentally change my job, I had to see for myself what machine learning algorithms are capable of and how they can be put to good use.
In August 2020, the 4th International Symposium Frontiers in Offshore Geotechnics will take place in Austin, TX. Since data science is clearly a frontier area, the organising committee believed it would be a good idea to launch a community-driven prediction exercise. The example chosen for this is a regression problem where the number of hammer blows required for pile driving needs to be predicted. The machine learning algorithm is trained on a number of locations where the observed blowcount is available and is then used to make predictions on unseen data.
The competition is hosted on Kaggle and is open for anyone, including novice users of data science techniques. A cleaned dataset is provided and a tutorial using linear regression modelling is made available to introduce the concepts and get people started quickly.
The most important aspect of this exercise is to get discussion going on the following questions:
· How confident can we be of our predictions when using machine learning models?
· How well do machine learning models perform when trained with limited data (which is often the case in offshore geotechnical engineering)
· Are these models just black boxes or can they also be used to learn more about the physics of the problem?
· Can we somehow feed our engineering knowledge into these models?
Meanwhile, we are two months into the competition and there is broad interest from motivated individuals around the globe. Initial results look promising, with the best predictions still making use of the engineering knowledge on pile driving which was developed before machine learning toolboxes were readily available.
The competition runs until the end of 2019 so there is still plenty of time left to get involved and make a contribution. The results of the competition will be presented at the ISFOG2020 conference in Austin, TX.
Have fun predicting!
Bruno
Principal Geotechnical Engineer presso geowynd
5 年Interesting topic Bruno. I'm looking forward to see the results. The idea behind this exercise is pretty old (colleagues in D'Appolonia did a similar work in the '90!) but it should be much easier/faster now. Why not setting up a JIP collecting a large database of data and trying to develop new correlations between soil parameters, hammer properties and blow counts?
Director Offshore Energy at NGI
5 年Nice article and excellent topic Bruno Stuyts! This is a perfect challenge for ML in geotechnics and I am looking forward to seeing the results next year at ISFOG2020. One question to ponder in this age of unifying our CPT axial capacity methods and with reference to the comment on underlying physics: should it be easier to predict pile installation response than long term capacity? During installation, we remove the issue of set-up / ageing and provide a common reference at T=0 (albeit, for a more complicated loading condition).
Strategy Advisor at Heerema Marine Contractors
5 年Peter van Esch?and myself will hand in our first results soon Bruno!
Director of Geosciences at Global Maritime
5 年Bruno - how useful is this when blowcount modelling is needed well in advance of the offshore installation campaign? I guess that this could be used to validate analytical predictions during installation and perhaps refine the pile driving approach? Otherwise, to use this approach at the design and planning stage, you would need to be working on a site with construction data from an adjacent project? In this situation, you would also need to deal with geological uncertainty and spatial variation?