Gatelens: A reasoning-enhanced llm agent for automotive software release analytics
Journal article, 2026

Ensuring reliable data-driven decisions is crucial in domains where analytical accuracy directly impacts safety, compliance, or operational outcomes. Decision support in such domains relies on large tabular datasets, where manual analysis is slow, costly, and error-prone. While Large Language Models (LLMs) offer promising automation potential, they face challenges in analytical reasoning, structured data handling, and ambiguity resolution. This paper introduces GateLens, an LLM-based architecture for reliable analysis of complex tabular data. Its key innovation is the use of Relational Algebra (RA) as a formal intermediate representation between natural-language reasoning and executable code, addressing the reasoning-to-code gap that can arise in direct generation approaches. In our automotive instantiation, GateLens translates natural language queries into RA expressions and generates optimized Python code. Unlike traditional multi-agent or planning-based systems that can be slow, opaque, and costly to maintain, GateLens emphasizes speed, transparency, and reliability. We validate the architecture in automotive software release analytics, where experimental results show that GateLens outperforms the existing Chain-of-Thought (CoT) + Self-Consistency (SC) based system on real-world datasets, particularly in handling complex and ambiguous queries. Ablation studies confirm the essential role of the RA layer. Industrial deployment demonstrates over 80% reduction in analysis time while maintaining high accuracy across domain-specific tasks. GateLens operates effectively in zero-shot settings without requiring few-shot examples or agent orchestration. This work advances deployable LLM system design by identifying key architectural features—intermediate formal representations, execution efficiency, and low configuration overhead—crucial for domain-specific analytical applications where accuracy, traceability, and stakeholder trust are paramount.

Tabular question answering

Software release analytics

Test result interpretation

Interpretable reasoning

Large language models

Automotive software testing

Author

Arsham Gholamzadeh Khoee

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

Shuai Wang

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

Robert Feldt

Chalmers, Computer Science and Engineering (Chalmers), Software Engineering (Chalmers)

Dhasarathy Parthasarathy

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

Yinan Yu

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

Journal of Systems and Software

0164-1212 (ISSN)

Vol. 240 112961

Subject Categories (SSIF 2025)

Software Engineering

Computer Sciences

Computer Systems

DOI

10.1016/j.jss.2026.112961

More information

Latest update

6/2/2026 1