LLM Goal Extraction in Requirements Engineering: Strategies & Limits

Evaluating LLM-Based Goal Extraction in Requirements Engineering: Prompting Strategies and Their Limitations

In a recent study published on arXiv, researchers have explored the potential of Large Language Models (LLMs) in automating the Goal-Oriented Requirements Engineering (GORE) process. As the field of Requirements Engineering (RE) often involves textual and repetitive tasks, LLMs have emerged as a promising tool for enhancing the efficiency of goal extraction from software documentation.

The study outlines a systematic approach that consists of three distinct phases: actor identification, high-level goal extraction, and low-level goal extraction. Each phase plays a crucial role in refining the automation process, ultimately leading to more accurate goal identification.

Implementation Strategy

To implement the proposed functionalities, the researchers devised a chain of LLMs, which are fed with engineered prompts. The focus of their experimentation was on various configurations of in-context learning, allowing them to analyze the similarities between input data and in-context examples. This methodical investigation aimed to assess the impact of different prompting strategies on the performance of goal extraction.

Key Findings

One of the significant findings from the research was the introduction of a generation-critic mechanism, functioning as a feedback loop between two LLMs. This innovative approach allowed for continuous improvement of the goal extraction process. While the overall accuracy achieved in low-level goal identification was 61%, the researchers caution that this method should primarily serve as an enhancement tool to accelerate manual extraction rather than fully replace human effort.

Performance Insights

The study further highlighted that the feedback-loop mechanism employing Zero-shot prompting demonstrated superior performance compared to the standalone Few-shot approach. An ablation study conducted by the researchers indicated that the absence of the feedback cycle resulted in a slight degradation of performance, underscoring the importance of this mechanism in the overall goal extraction process.

However, an intriguing observation was made regarding the combination of the feedback mechanism with Few-shot prompting. The results revealed no significant performance advantage, suggesting that the primary limiting factor may lie within the prompting strategy utilized for the ‘critic’ LLM. This finding opens up avenues for further exploration into how the refinement of both the quantity and quality of Shot examples can elevate the overall effectiveness of LLMs in this context.

Future Directions

Looking ahead, the researchers propose integrating advanced methodologies such as Retrieval-Augmented Generation (RAG) and Chain-of-Thought (CoT) prompting to enhance accuracy in goal extraction. These innovative techniques are expected to address some of the limitations identified in the current study and push the boundaries of what LLMs can achieve in Requirements Engineering.

Conclusion

As the landscape of software development continues to evolve, the role of artificial intelligence in automating and streamlining processes like Requirements Engineering becomes increasingly vital. This research not only underscores the potential of LLMs in goal extraction but also highlights the importance of refining prompting strategies to maximize their efficacy. Future studies will be essential in uncovering new methodologies that could significantly advance the field.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

LLM Goal Extraction in Requirements Engineering: Strategies & Limits

Evaluating LLM-Based Goal Extraction in Requirements Engineering: Prompting Strategies and Their Limitations

Implementation Strategy

Key Findings

Performance Insights

Future Directions

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related