WALL: A Web Application for Automated Quality Assurance using Large Language Models
Abstract
WALL, a web application integrating SonarQube and LLMs, automates software issue detection, revision, and evaluation, demonstrating effectiveness in reducing human effort and improving code quality.
As software projects become increasingly complex, the volume and variety of issues in code files have grown substantially. Addressing this challenge requires efficient issue detection, resolution, and evaluation tools. This paper presents WALL, a web application that integrates SonarQube and large language models (LLMs) such as GPT-3.5 Turbo and GPT-4o to automate these tasks. WALL comprises three modules: an issue extraction tool, code issues reviser, and code comparison tool. Together, they enable a seamless pipeline for detecting software issues, generating automated code revisions, and evaluating the accuracy of revisions. Our experiments, conducted on 563 files with over 7,599 issues, demonstrate WALL's effectiveness in reducing human effort while maintaining high-quality revisions. Results show that employing a hybrid approach of cost-effective and advanced LLMs can significantly lower costs and improve revision rates. Future work aims to enhance WALL's capabilities by integrating open-source LLMs and eliminating human intervention, paving the way for fully automated code quality management.
Community
Memanto challenges the assumption that knowledge graphs are necessary for high-quality agent memory. Using a typed semantic schema, built-in conflict resolution, and Moorcheh's information-theoretic retrieval engine, we achieve 89.8% on LongMemEval and 87.1% on LoCoMo, SOTA among vector-only systems, with zero ingestion cost, single-query retrieval, and sub-90ms latency. The core finding: recall beats precision, and LLMs are better filters than pre-computed graph structures.
Try it: pip install memanto
Get this paper in your agent:
hf papers read 2509.09918 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper
