CROSS-MODEL SPATIAL-SEMANTIC FUSION: INTEGRATING MRI AND HISTOPATHOLOGY FOR AUTOMATED BONE-SARCOMA SEVERITY ASSESSMENT

Authors

  • P J.Adit Dr.M.G.R.Educational and Research Institute, India
  • C.Priya Dr.M.G.R.Educational and Research Institute, India

DOI:

https://doi.org/10.63458/ijerst.v4i2.154

Keywords:

Clinical Data Integration, Cross-Modal Learning, Deep Learning, Histopathology Imaging, Multimodal Diagnosis, Bone-Sarcoma

Abstract

The accurate diagnosis and severity staging of bone-sarcoma inherently depend on synthesizing macroscopic radiological imaging (MRI) with microscopic histopathological analysis. However, contemporary deep learning frameworks process these modalities in strict isolation, creating a diagnostic bottleneck that limits clinical utility. In this paper, we propose the Cross-Modal Spatial-Semantic Diffusion Network (CM-SSD), a novel multi-modal fusion architecture. CM-SSD synchronously processes macroscopic tumor dissemination from MRI alongside microscopic malignancy patterns from Hematoxylin and Eosin (H&E) slides. By leveraging dual spatial-semantic encoders and a cross-modal attention mechanism, the framework adaptively aligns complementary features across both modalities. This integrated approach not only enhances automated tumor localization but also generates a quantitative, percentage-based severity score. By bridging the gap between radiological and pathological computer-aided diagnostics, CM-SSD provides a unified, highly robust framework for real-time clinical decision support.

Author Biographies

P J.Adit, Dr.M.G.R.Educational and Research Institute, India

Department of CSE

Dr.M.G.R.Educational and Research Institute, India

C.Priya, Dr.M.G.R.Educational and Research Institute, India

Department of Computer Applications

Dr.M.G.R.Educational and Research Institute, India

References

A. K. Jain et al., "Deep learning in bone-sarcoma diagnostics: A comprehensive review of macroscopic and microscopic imaging modalities," IEEE Reviews in Biomedical Engineering, vol. 16, pp. 112-128, 2024.

M. Macenko et al., "A method for normalizing histology slides for quantitative analysis," in Proc. IEEE Int. Symp. Biomed. Imaging (ISBI), 2009, pp. 1107-1110. 2009

N. J. Tustison et al., "N4ITK: Improved N3 bias correction," IEEE Trans. Med. Imaging, vol. 29, no. 6, pp. 1310-1320, 2010. 2010

S. Wang, "Cross-modal attention networks for multi-modal medical image fusion," Medical Image Analysis, vol. 42, pp. 1024-1035, 2023.

K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 770-778. 2016

J. Zhou et al., "Omni-Scale representation learning for histopathological image classification," IEEE Trans. Pattern Anal. Mach. Intell., vol. 43, no. 11, pp. 3810-3824, 2023.

I. Loshchilov and F. Hutter, "Decoupled weight decay regularization (AdamW)," in Proc. Int. Conf. Learn. Represent. (ICLR), 2019.

Y. Chen et al., "Bridging the semantic gap: Dual-stream spatial-semantic encoders for complex tumor microenvironments," Nature Machine Intelligence, vol. 5, pp. 412-425, 2025.

L. Zhao et al., "Quantitative severity scoring in osteoid malignancies using automated regression networks," Computer Methods and Programs in Biomedicine, vol. 230, 107321, 2024.

T. Lin, "Mitigating clinical artifacts in computational pathology via adaptive attention weighting," IEEE Trans. Biomed. Eng., vol. 71, no. 3, pp. 885-896, 2025.

Downloads

Published

2026-06-25

How to Cite

P J.Adit, & C.Priya. (2026). CROSS-MODEL SPATIAL-SEMANTIC FUSION: INTEGRATING MRI AND HISTOPATHOLOGY FOR AUTOMATED BONE-SARCOMA SEVERITY ASSESSMENT. International Journal of Engineering Research and Sustainable Technologies (IJERST), 4(2), 1–7. https://doi.org/10.63458/ijerst.v4i2.154