Prediction of shipping delay based on AIS Data

The final baseline from Port Busan(KRPUS) to Port Long-beach(USLGB).

Project Description

The company who use transportation by ship want to know that their carriages will have arrived at the destination port. However, the transportation by ship is not stable and short. For these reasons, many companies try to predict the arrival of the ship with ship’s GPS Data(AIS), their own logistic data and various external data, but their accuracy is too low to use.

Out team including me try to predict the arrival of the ship using only AIS data. Before the prediction, we separate each path in the whole AIS Data. After separating, paths having the same start, end clustered into the reasonable number of clusters with a single represented path(baseline).

The prediction worked at a point of 70% progress of distance between start, endpoints. Using the points where the target ship is in, choose the most proper baseline and check the remaining time of the selected baseline.

For this project, the following steps:

  1. Preprocessing raw data to analyze
  2. Extract staying point nearby port in raw AIS data and make route path to move between staying point
  3. Making baseline of each route(Depart port - Arrival port)
    • Clustering extracted historical paths that have similar distance and time
    • Making major points(latitude, longitude) base on the total distance of the route
    • Fill available point instead of missing points on the baseline
  4. Calculate the average staying time and area of each port.
  5. Calculate the estimated time of arrival using baseline
    • Evaluate appropriate baseline about moving the vessel
    • Estimate time of arrival and whether or not that vessel will be delayed

For this project, using the following skills:

python scala brightics MSSQL Spark


Raw Data

Pre-process

Extract Baseline

Post-process