CROSS-MODEL SPATIAL-SEMANTIC FUSION: INTEGRATING MRI AND HISTOPATHOLOGY FOR AUTOMATED BONE-SARCOMA SEVERITY ASSESSMENT
DOI:
https://doi.org/10.63458/ijerst.v4i2.154Keywords:
Clinical Data Integration, Cross-Modal Learning, Deep Learning, Histopathology Imaging, Multimodal Diagnosis, Bone-SarcomaAbstract
The accurate diagnosis and severity staging of bone-sarcoma inherently depend on synthesizing macroscopic radiological imaging (MRI) with microscopic histopathological analysis. However, contemporary deep learning frameworks process these modalities in strict isolation, creating a diagnostic bottleneck that limits clinical utility. In this paper, we propose the Cross-Modal Spatial-Semantic Diffusion Network (CM-SSD), a novel multi-modal fusion architecture. CM-SSD synchronously processes macroscopic tumor dissemination from MRI alongside microscopic malignancy patterns from Hematoxylin and Eosin (H&E) slides. By leveraging dual spatial-semantic encoders and a cross-modal attention mechanism, the framework adaptively aligns complementary features across both modalities. This integrated approach not only enhances automated tumor localization but also generates a quantitative, percentage-based severity score. By bridging the gap between radiological and pathological computer-aided diagnostics, CM-SSD provides a unified, highly robust framework for real-time clinical decision support.
References
A. K. Jain et al., "Deep learning in bone-sarcoma diagnostics: A comprehensive review of macroscopic and microscopic imaging modalities," IEEE Reviews in Biomedical Engineering, vol. 16, pp. 112-128, 2024.
M. Macenko et al., "A method for normalizing histology slides for quantitative analysis," in Proc. IEEE Int. Symp. Biomed. Imaging (ISBI), 2009, pp. 1107-1110. 2009
N. J. Tustison et al., "N4ITK: Improved N3 bias correction," IEEE Trans. Med. Imaging, vol. 29, no. 6, pp. 1310-1320, 2010. 2010
S. Wang, "Cross-modal attention networks for multi-modal medical image fusion," Medical Image Analysis, vol. 42, pp. 1024-1035, 2023.
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 770-778. 2016
J. Zhou et al., "Omni-Scale representation learning for histopathological image classification," IEEE Trans. Pattern Anal. Mach. Intell., vol. 43, no. 11, pp. 3810-3824, 2023.
I. Loshchilov and F. Hutter, "Decoupled weight decay regularization (AdamW)," in Proc. Int. Conf. Learn. Represent. (ICLR), 2019.
Y. Chen et al., "Bridging the semantic gap: Dual-stream spatial-semantic encoders for complex tumor microenvironments," Nature Machine Intelligence, vol. 5, pp. 412-425, 2025.
L. Zhao et al., "Quantitative severity scoring in osteoid malignancies using automated regression networks," Computer Methods and Programs in Biomedicine, vol. 230, 107321, 2024.
T. Lin, "Mitigating clinical artifacts in computational pathology via adaptive attention weighting," IEEE Trans. Biomed. Eng., vol. 71, no. 3, pp. 885-896, 2025.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 P J.Adit, C.Priya

This work is licensed under a Creative Commons Attribution 4.0 International License.
License Statement
This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
Authors retain copyright of their articles and grant International Journal of Engineering Research in Science and Technology (IJERST) the right of first publication.
This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
The journal encourages open access and supports the global exchange of knowledge.






