Integration and Harmonization of Multi-Source Obstetric Data Using Rule-Based NLP for Fetomaternal Risk Modelling

Authors: Jon Barrenetxea, Elias Grünewald, Barbara Tabernig et al.

Publication: Studies in Health Technology and Informatics, Opening the Personal Gate between Technology and Health Care

Published: May 21, 2026

Source: Crossref

Back to Search View Original Cite This Article

Abstract

<jats:p>More than 80% of pregnancies in Germany are classified as high risk, leading to inefficient resource allocation. Although data-driven approaches could improve fetomaternal risk prediction, limited standardization across clinical databases remains a major barrier. We developed a rule-based Natural Language Processing (NLP) pipeline to integrate and standardize structured and unstructured obstetric data from multiple hospital IT systems. The resulting harmonized real-world fetomaternal dataset (FEMAR) comprises 123,183 unique birth deliveries and 449 fetomaternal factors, providing a foundation for developing more accurate risk prediction models for pregnancy complications.</jats:p>

Keywords

risk fetomaternal more prediction than

Integration and Harmonization of Multi-Source Obstetric Data Using Rule-Based NLP for Fetomaternal Risk Modelling

Abstract

Keywords

Related Articles

Technological impact on work life through integration of workplace learning and higher technical education learning: Stakeholders’ perspective

1597 Hybrid Additive-Subtractive Manufacturing: The Integration of 3D Printing and Precision Machining for Advanced Manufacturing

1798 Next-Generation Hybrid Post-processing of Additively Manufactured Components: Techniques, Integration, and Industry 4.0 Innovations

20 781Cloud Computing, Cloud-Datenbanksysteme und Multi-Tenancy

Complex analysis