Federated multi-modal learning for cross-platform image computation: A functional analysis and nonlinear optimization approach to privacy preservation
Keywords:
Federated Learning; Functional Analysis; Nonlinear Optimization; Hilbert Spaces; Privacy Preservation; Variational Problems; Multi-Modal Imaging; Image Computation.Abstract
In Federated multi-modal learning, raw data is not concentrated in a single location because it can perform distributed image computation on heterogeneous platforms. Nonetheless, it is still open to guarantee that the convergence, stability and privacy properties of such systems are mathematically rigorous. In this paper, a functional-analytic, nonlinear-optimization system of federated cross-platform image computation is developed in which local image modalities, and global learning goals are posed as nonlinear variational problems, with local image modalities modelled as an element of separable Hilbert spaces. We present a Nonlinear Federated Proximal Operator (NFPO) that provides a privacy limiting functionality by a dual functional mechanism. We prove existence and uniqueness results of the global minimizer in the presence of coercivity and strong monotonicity, convergence of the NFPO in a contractive mapping argument, and test the framework on synthetic multimodal image datasets given across a plurality of virtual platforms. Numerical experiments show that the proposed approach provides better privacy guarantees with the competitive reconstruction and classification performance. This paper introduces a mathematical based theoretical foundation of a privacy-
conserving federated image computation to cross-platform and multi-modal imaging systems.
References
Amin, A., Hasan, K. M. A., Zein-Sabatto, S., Chimba, D., Liang, H., Ahmed, I., & Islam, T. (2024). Empowering Healthcare through Privacy-Preserving MRI Analysis. SoutheastCon, 1534. https://doi.org/10.1109/southeastcon52093.2024.10500144
Aueawatthanaphisut, A. (2025). Secure Multi-Modal Data Fusion in Federated Digital Health Systems via MCP. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2510.01780
Barhoumi, E. M., Charabi, Y., & Farhani, S. (2024). Detailed guide to machine learning techniques in signal processing. Progress in Electronics and Communication Engineering, 2(1), 39–47. https://doi.org/10.31838/PECE/02.01.04
Borazjani, K., Khosravan, N., Ying, L., & Hosseinalipour, S. (2024). Multi-Modal Federated Learning for Cancer Staging Over Non-IID Datasets With Unbalanced Modalities. IEEE Transactions on Medical Imaging, 44(1), 556. https://doi.org/10.1109/tmi.2024.3450855
Byeon, G., Ryu, M., Di, Z. W., & Kim, K. (2025). FIRM: Federated Image Reconstruction using Multimodal Tomographic Data. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2501.05642
Ciupek, D., Malawski, M., & Pieciak, T. (2025). Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data. https://doi.org/10.48550/ARXIV.2503.20107
Díaz, J. S.-P., & García, Á. L. (2025). Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2501.15949
Fan, Y., Xu, W., Wang, H., Zhu, J., & Guo, S. (2024). Balanced Multi-modal Federated Learning via Cross-Modal Infiltration. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2401.00894
Guan, H., Yap, P., Bozoki, A., & Liu, M. (2024). Federated learning for medical image analysis: A survey. Pattern Recognition, 151, 110424. https://doi.org/10.1016/j.patcog.2024.110424
Haripriya, R., Khare, N., & Pandey, M. (2025). Privacy-preserving federated learning for collaborative medical data mining in multi-institutional settings. Scientific Reports, 15(1). https://doi.org/10.1038/s41598-025-97565-4
Haripriya, R., Khare, N., Pandey, M., & Biswas, S. (2025). A privacy-enhanced framework for collaborative Big Data analysis in healthcare using adaptive federated learning aggregation. Journal Of Big Data, 12(1). https://doi.org/10.1186/s40537-025-01169-8
He, N., Liu, Y., Sun, W., Ye, X., Ouyang, Y., Gao, T., & Zhang, Z. (2025). FedMMKT:Co-Enhancing a Server Textto-Image Model and Client Task Models in Multi-Modal Federated Learning. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2510.12254
Liu, J., Gao, Y., Sun, Y., Jin, Y., Chen, Y., Wang, J., & Zeng, G. (2025). FedRecon: Missing Modality Reconstruction in Heterogeneous Distributed Environments. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2504.09941
Nguyen, A. V., Klabjan, D., Ryu, M., Kim, K., & Di, Z. W. (2025). Federated Low-Rank Tensor Estimation for Multimodal Image Reconstruction. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2502.02761
Ponduri, V., & Mohan, L. (2021). Reliable multiple object detection on noisy images by using Yolov3. International Journal of Communication and Computer Technologies, 9(1), 6-9.
Poudel, P., Chhetri, A., Gyawali, P., Leontidis, G., & Bhattarai, B. (2025). Multimodal Federated Learning With Missing Modalities through Feature Imputation Network. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2505.20232
Sathish Kumar, T. M. (2024). Measurement and modeling of RF propagation in forested terrains for emergency communication. National Journal of RF Circuits and Wireless Systems, 1(2), 7–15.
Sun, G., Mendieta, M., Dutta, A., Li, X., & Chen, C. (2024). Towards Multi-modal Transformers in Federated Learning. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2404.12467
Veerappan, S. (2025). Finite element-based modeling of stress distribution in 3D-printed lattice structures. Journal of Applied Mathematical Models in Engineering, 1(1), 44–53.
Wiśniewski, K. P., Zielińska, K., & Malinowski, W. (2025). Energy efficient algorithms for real-time data processing in reconfigurable computing environments. SCCTS Transactions on Reconfigurable Computing, 2(3), 1–7. https://doi.org/10.31838/RCC/02.03.01


