Data-driven models, such as deep learning models, are increasingly adopted within Maritime Autonomous Surface Ship (MASS) components for perception, prediction, and decision-making tasks. However, most existing approaches implicitly assume that operat...
Data-driven models, such as deep learning models, are increasingly adopted within Maritime Autonomous Surface Ship (MASS) components for perception, prediction, and decision-making tasks. However, most existing approaches implicitly assume that operational data are independently and identically distributed (i.i.d.), overlooking the pronounced distributional shifts induced by changing routes, environmental conditions, and operational settings across voyages. Using ship fuel consumption prediction as a representative data-driven component within MASS, this thesis demonstrates that such voyage-level distributional shifts lead to a failure of generalization in components of MASS relying on data-driven models. As a result, these components exhibit severe performance degradation when deployed under unseen operational conditions. Through systematic in-distribution and out-of-distribution experiments on real-world voyage data, we show that high predictive accuracy under idealized i.i.d. evaluation settings does not guarantee robustness under realistic deployment scenarios. To address this limitation, we propose the VaRNN, a Voyage-aware Recurrent Neural Network that disentangles voyage-invariant and task-specific representations and selectively enforces alignment across voyages using a commonality loss applied to the voyage-invariant subspace. Extensive voyage-based cross-validation experiments confirm that the proposed VaRNN consistently improves predictive performance compared to conventional machine learning and deep learning baselines, while representation analysis and ablation studies highlight the necessity of its disentangled alignment for stable learning. These findings suggest that accounting for voyage-level distributional shifts is essential for ensuring reliable behavior of data-driven components within MASS, underscoring the broader value of distribution-aware modeling under real-world maritime conditions.