LLM Performance in Automated RDF Knowledge Graph Creation

Date:

Performance Evaluation of LLMs in Automated RDF Knowledge Graph Generation

Summary: arXiv:2603.29878v1 Announce Type: cross

Abstract: Cloud systems generate large, heterogeneous log data containing critical infrastructure, application, and security information. Transforming these logs into RDF triples enables their integration into knowledge graphs, improving interpretability, root-cause analysis, and cross-service reasoning beyond what raw logs allow. Large Language Models (LLMs) offer a promising approach to automate RDF knowledge graph generation; however, their effectiveness on complex cloud logs remains largely unexplored.

Introduction

This article evaluates multiple LLM architectures and prompting strategies for automated RDF extraction using a controlled framework with two pipelines for systematically processing semi-structured log data. The extraction pipeline integrates multiple LLMs to identify relevant entities and relationships, automatically generating subject-predicate-object triples.

Methodology

The study employed an extraction pipeline that combines various LLMs to process log data effectively. The following steps were taken:

  • Creation of a reference Log-to-KG dataset from OpenStack logs using manual annotation and ontology-driven methods.
  • Implementation of an evaluation pipeline to assess the generated RDF triples using both syntactic and semantic metrics.
  • Testing of multiple LLM architectures with different prompting strategies, including Few-Shot, One-Shot, Zero-Shot, and advanced techniques like Tree-of-Thought.

Results

The analysis revealed that Few-Shot learning emerged as the most effective strategy, with the following results:

  • Llama: Achieved a 99.35% F1 score and 100% valid RDF output.
  • Qwen, NuExtract, and Gemma: Also performed well under Few-Shot prompting.
  • Chain-of-Thought approaches: Maintained similar accuracy as Few-Shot methods.
  • One-Shot prompting: Provided a lighter but effective alternative for RDF extraction.
  • Zero-Shot and advanced strategies: Such as Tree-of-Thought, Self-Critique, and Generate-Multiple performed substantially worse.

Discussion

The results highlight the significance of contextual examples and prompt design in achieving accurate RDF extraction. The analysis also revealed model-specific limitations across different LLM architectures, suggesting that while some models excel in Few-Shot scenarios, they may not perform equally well in other prompting contexts.

Conclusion

This study underscores the potential of LLMs for automating RDF knowledge graph generation from cloud logs. By leveraging Few-Shot prompting and thorough evaluation frameworks, researchers can enhance the integration of cloud log data into knowledge graphs, thereby improving interpretability and analytical capabilities. Future work should focus on refining prompting strategies and expanding the dataset to further assess LLM performance across diverse log types.


Related AI Insights

Lazarus Omolua
Lazarus Omoluahttps://richlyai.com/blog
My mission is to make sure that people in Africa are not left behind in the global AI revolution. RichlyAI exists to give everyone — students, founders, creators, and businesses — the tools to compete globally.

Subscribe

Popular

More like this
Related

How Business Ops Teams Boost Productivity with Codex

Discover how business operations teams use Codex to streamline documentation, enhance collaboration, and improve decision-making with AI-powered automation...

OpenAI Partners with Malta to Offer ChatGPT Plus Nationwide

OpenAI and Malta team up to provide free ChatGPT Plus access and AI training to all citizens, promoting digital literacy and responsible AI use.

Critical Linux Kernel Flaw Risks SSH Host Key Theft

A critical Linux kernel flaw risks stolen SSH host keys. Learn how to protect your systems and stay secure until patches are widely available.

Top External Hard Drives 2026: Expert Reviews & Buying Guide

Discover the best external hard drives of 2026 with expert reviews. Find top picks for speed, durability, and security to suit all storage needs.