Engineering Knowledge Graph Construction from Technical Documents via OpenIE and T5-driven Hybrid Extraction

Authors

  • Marko Babić Faculty of Informatics and Digital Technologies, University of Rijeka, 51000 Rijeka, Croatia
  • Stjepan Vuković Faculty of Informatics and Digital Technologies, University of Rijeka, 51000 Rijeka, Croatia
  • Ante Radić Faculty of Electrical Engineering, Computer Science and Information Technology, University of Osijek, 31000 Osijek, Croatia

DOI:

https://doi.org/10.64972/jiic.2026v4.140p4s:39-52

Keywords:

Knowledge Graph, Open Information Extraction, Transformer Models, Engineering Text Mining, Semantic Disambiguation

Abstract

The automatic extraction of structured engineering knowledge from unstructured technical papers is the second requirement for computer-aided engineering intelligence. In order to create a general-purpose engineering knowledge graph from a variety of text resources, this study presents a novel combination of Open Information Extraction (OpenIE) and Transformer-based semantic modeling (T5). First of all, the domain jargon and intricate technical syntax can be handled by carefully prepared preprocessing and token normalization. A T5 semantic encoder uses deep contextual representations to further disambiguate context and increase precision once OpenIE modules have extracted candidate entity-relation triples. The suggested approach outperforms the best-performing baselines, as demonstrated by experimental evaluation on a hierarchically annotated engineering corpus; it has a macro F1 score of 0.82 and a micro F1 score of 0.85, improving by 7.3% and 6.8%, respectively. Over 65% of the net performance boost is attributed to transformer-based semantic filtering, according to ablation studies. Coreference ambiguity and relation type overlap are the primary residual issues, according to error analysis. Over 90% of the original accuracy has been maintained, and several testings in various fields have confirmed that this approach can be applied in others. According to the aforementioned analysis, this approach offers great support for next technical document intelligence research and has good scalability and accuracy for the management of engineering information.

Downloads

Published

2026-01-10

How to Cite

Babić, M., Vuković, S., & Radić, A. (2026). Engineering Knowledge Graph Construction from Technical Documents via OpenIE and T5-driven Hybrid Extraction. Journal of Intelligent Information and Communication, 4, 4s:39–52. https://doi.org/10.64972/jiic.2026v4.140p4s:39-52

Issue

Section

Articles

Similar Articles

1 2 3 > >> 

You may also start an advanced similarity search for this article.