AI APPLICATIONS TO INCREASE NAVIGATION SAFETY

AI APPLICATIONS TO INCREASE NAVIGATION SAFETY

Today we will consider the basic problem of classification. Can the type and navigational purpose of the ships be estimated only from known data such as width, height, course, or speed? To solve this research question, the dynamic values and basic characteristics of any ship in a certain region will be discussed, and the type and purpose of the ship will be estimated by statistical models. Thus, we can predict the intentions of any ship. This situation undoubtedly is an important step to increasing the safety of navigation.

  • Briefly, What is ML?

Machine learning (ML) algorithms mimic the shape of people's learning and focus on the use of data and gradually increase accuracy, a branch of artificial intelligence and computer science.

Machine learning algorithms make statistical assumptions by training the data to make predictions. Generalizable models are revealed by detecting patterns that humans cannot see and calculating over the sample data given to the algorithm. These generalizable models, which are obtained as output, can make predictions with high accuracy when non-model data is entered. Machine learning algorithms are used in a wide variety of applications where it is difficult or impossible to develop traditional algorithms to perform needed tasks such as medicine, email filtering, speech recognition, and computer vision.

  • Briefly, What is AIS?

AIS (Automatic Identification System) device is an automatic identification system. It is an effective system for tracking ships and regulating maritime traffic. It transmits and receives data from VHF radio frequencies and signals. Ships and ship traffic services (VTS) equipped with VHF transceivers can thus reliably display the overall picture. However, the VHF range is limited to a maximum of approximately 25 nautical miles.

? There are two types of AIS devices, type A and type B. Type A is more powerful so it can send data up to 25 nautical miles. Type B AIS device is effective at a distance of 5-10 nautical miles. Type A AIS has a power of 12.5 kW, while AIS type B has a power of 2 kW. AIS signals every 2-10 seconds while cruising. On the other hand, it makes a signal update every 3-4 minutes for ships at anchor.

According to the SOLAS rules determined by the International Maritime Organization (IMO), it must be kept on ships of 300 GRT and larger and all commercial ships carrying passengers. Although it is not compulsory, some fishermen and other boats use it voluntarily due to its usefulness and benefits.

  • Handling the Classification Problem

To solve our research question mentioned above, we will use AIS information published as open source by the Danish Maritime Authority. The aforementioned data were compiled from the ships transiting the Kattegat Strait between January 1st and March 10th, 2022. AIS data is published as open source in some countries.

No alt text provided for this image

Two types of data, static and dynamic, are kept in the AIS device:

Static Information :

1. The ship's IMO number 2. The ship's MMSI number 3. The ship's Call Sign 4. The ship's name 5. The ship's type 6. What type of destination this message was received from (like Class A / Class B) 7. Width of ship 8. Length of ship 9. Draft of ship 10. Type of GPS device 11. Length from GPS to bow (Length A) 12. Length from GPS to stern (Size B) 13. Length from GPS to starboard (Size C) 14. Length from GPS to port side (Dimension D)

?Dynamic Data:

1. Time information (31/12/2015 in 23:59:59 format) 2. Latitude 3. Longitude 4. Navigational status (For example: 'Fishing', Anchored, etc.) 5. Rate of Turn (ROT) 6. Speed Over Ground (SOG) 7. Course Over Ground (COG) 8. Heading 9. Type of cargo 10. Port of Destination 11. Estimated Time of Arrival (ETA) 12. Data source type, eg. AIS

Our main goal is to build a machine learning model using AIS information in the Kattegat Strait and then to predict the ship type of any ship passing through this region using the width, length, draft, course, and speed information. For this purpose, in our model we used 'MMSI', 'Width', 'Height', 'Draft', 'Cruise', 'Road', 'Speed' and 'Heading' from AIS data as independent variables. We used the ship type as the target, that is, the dependent variable.

As can be seen, AIS data contains a lot of information, but not every ship has AIS data. As an example of this:

* Warships * Ships smaller than 300grt * Fishermen * May turn off AIS transceivers due to technical malfunctions.

Our dataset consists of 358351 AIS messages, which means we have 358351 observations. However, since a ship has more than one AIS message, the number of unique ships in our dataset is 3894. The properties of our variables are as below.

No alt text provided for this image

The other table at the bottom shows the distribution of 3894 ships detected in the Kattegat Strait between January 1st and March 10th, 2022, according to their types. Ship types are also the target variable that we will estimate.

No alt text provided for this image

We have aggregated those below 2% of the ship classes in Table 2. Because the machine learning algorithm cannot catch the pattern of this kind of low-rate classes. In other words, there is not enough data to learn these classes. Therefore, we combined high-speed boats, government vessels, pilotage boats, harbor boats, and towing/towing vessels as one class with a ratio of less than 2%. These ship types are entered in the model as a rare class. As a result of the estimation, we will get the result of the rare class.

? The correlation graph between our variables is given below. It was observed that there was a high correlation between the 'width' and 'length' variable and between the 'heading' and 'cog' variable. To solve this problem, the 'dimension' variable was created by multiplying the 'width' and 'length' numeric variables, and the 'width' and 'length' variables were removed from the data set. Also, 'heading' and 'cog' variables were removed from the data set after they were combined under the 'route' variable. Since the 'mmsi' variable is unique and specific for each ship, it has only been used for exploratory data analysis. It was removed from the dataset during the final model setup phase.

No alt text provided for this image

  • Machine Learning Model

After preparing all the variables, we can build the machine learning model. For this purpose, Light GBM Machine Learning Algorithm, which is known to give fast and successful results, was used.

Light GBM is a decision tree-based algorithm. The training time of the decision trees is directly proportional to the calculation and the number of divisions. With this method, both the training time is shortened and the resource usage is reduced.

Two strategies, level-wise or leaf-wise, can be used in decision trees. In the level-wise strategy, the tree's balance is maintained as the tree grows. In the leaf-wise strategy, division from leaves, which reduces loss, continues. Thanks to this feature, LightGBM is separated from other boosting algorithms. The model has less error rate and learns faster with the leaf-wise strategy. However, the leaf-wise growth strategy causes the model to be prone to over-learning in cases where the number of data is low. Therefore, the algorithm is more suitable for use in big data.

After the model was established, to test the model, we entered the model with 10 different values navigating in the Kattegat Strait in December 2021. As noted, we used the data from January 1 to March 10, 2022, when building the model. In other words, the 10 samples we determined for the estimation are an outside example from our data set.

No alt text provided for this image

Looking at the upper table, it was observed that 7 out of 10 predictions gave correct results. Considering that our model has an overall accuracy score of 0.67, this is an expected result. Prediction scores (confusion matrix) based on ship type are given in table-5. Models with higher precision and sensitivity values give more accurate results. The f-1 score shows us the harmonic mean of precision and sensitivity values.

No alt text provided for this image

  • Conclusion

In this study, it has been tried to predict the ship types and finally the ship's navigational intention by using the dynamic and static data obtained from the ships' AIS data. This study was carried out in a particular area. Because different routes and speeds can be used in each region. For example, in shallows, islands, islets, rocks, and coastal areas, clear routes/speeds are used by the characteristics of the ship to pass clear. The Kattegat Strait, which is the area we chose, is a heavy traffic area with narrow waterways like this.

Once we know the type of a ship, we can guess its purpose. In this case, it can be considered as a step toward increasing situational awareness on the seas and thus increasing the safety of navigation.

More data can be used in the future to increase the reliability/accuracy of the model. In addition, we used a single algorithm in our model. Using two or more machine learning algorithms in an integrated way will increase the success of the model.

?For Detailed Information:

https://www.kaggle.com/code/eminserkanerdonmez/ai-in-maritime-industsy






要查看或添加评论,请登录

Serkan Erdonmez的更多文章

  • NEW ERA FOR THE AI: TRANSFORMERS

    NEW ERA FOR THE AI: TRANSFORMERS

    Transformers mark a groundbreaking shift in AI technology, heralding a new era of innovation. They represent a…

    4 条评论
  • OCR COMPARISONS - TESSERACT, EAST, AND KERAS OCR

    OCR COMPARISONS - TESSERACT, EAST, AND KERAS OCR

    OCR (Optical Character Recognition) is a technology that enables the extraction of text and characters from scanned…

    2 条评论
  • Distributed Training And TPU Usage On TensorFlow

    Distributed Training And TPU Usage On TensorFlow

    Distributed Training TensorFlow is one of the most popular machine learning frameworks used today. One of the key…

    8 条评论
  • From A to Z How to Clone Any Sound with AI

    From A to Z How to Clone Any Sound with AI

    The changing technological conditions of our decline no longer surprise us. Although we are familiar with methods such…

    1 条评论
  • A Daily Life Example to Human Pose Estimation

    A Daily Life Example to Human Pose Estimation

    While working at a desk, sometimes when our focus shifts to the subject of work, we inevitably lose our healthy upright…

  • Belirli Anahtar Kelimeler Yard?m?yla Otomasyon Olarak Akademik Literatür Taramas?

    Belirli Anahtar Kelimeler Yard?m?yla Otomasyon Olarak Akademik Literatür Taramas?

    Günümüzde teknolojinin geli?mesi, e?itim ve ??retim sürecine dair farkl? tekniklerin kullan?lmas?na ?n ayak olmu?tur…

  • YüZ HATLARI üZER?NE ??LEMLER

    YüZ HATLARI üZER?NE ??LEMLER

    Bu yaz?mda Python ile bir insan?n yüz hatlar?n? ??karmay? ve bu durumun fonksiyonel kullan?m alanlar?n? aktaraca??m…

  • RENK SKALALARI ve STANDARTLARI

    RENK SKALALARI ve STANDARTLARI

    G?rüntü i?leme ve tasar?m a?amalar?nda kullan?lan g?rseller belirli renk kodlar?n? kullan?rlar. Bunlardan belli…

  • Extracting Data From Whatsapp

    Extracting Data From Whatsapp

    Hi everyone. Today we will extract our data from Whatsapp and make a ‘word cloud’ from the most used words.

  • Funny Sides of Regex

    Funny Sides of Regex

    Last week we had regex lessons as a bonus. To intensify my theoric knowledge about regex, I studied over our last…

社区洞察

其他会员也浏览了