This paper presents the development and evaluation of an on-premises intelligent question-answering (QA) system designed to provide reliable and secure access to national land, infrastructure, and transport policy information. To address the limitations of existing civil inquiry services (restricted call center operating hours, long wait times, and the rigidity of rule-based chatbots), the proposed system integrates a domain-specialized large language model (LLM) with Retrieval-Augmented Generation (RAG). A customized version of Gemma-3-1B, named molit-gemma, was fine-tuned on official datasets from the Ministry of Land, Infrastructure and Transport (MOLIT) and combined with an OpenSearch-based retrieval pipeline to generate grounded, document-consistent responses. In the experiments, the proposed system outperforms the pretrained and commercial models it was compared against. The molit-gemma + RAG configuration attained a BLEU score of 0.6258 and an LLM-as-a-Judge overall score of 4.34 out of 5, indicating strong domain suitability and factual accuracy. Deploying the system on-premises addresses a critical security concern associated with cloud-based LLMs, because sensitive administrative data is never transmitted to external servers. Furthermore, the RAG architecture mitigates hallucination risk by explicitly grounding responses in retrieved policy documents, thereby enhancing reliability and explainability, both key requirements for public-sector AI services. This research demonstrates the feasibility of applying LLM-RAG hybrid architectures to government administrative services and offers a scalable model for AI-driven policy communication. The proposed system provides a foundation for 24/7 citizen-centered policy consultation and contributes to the broader advancement of AI-based digital government.
Future work will explore multimodal extensions, continuous quality improvement using Human-in-the-Loop pipelines, and cross-agency integration to expand the system's applicability across the public sector.
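To make the retrieve-then-ground idea concrete, the following is a minimal sketch, not the system's actual implementation. It assumes a toy in-memory corpus and a simple token-overlap ranker standing in for the OpenSearch retrieval pipeline; the documents, the function names (`retrieve`, `build_prompt`), and the prompt wording are all hypothetical. The resulting prompt would then be passed to the fine-tuned model, which is omitted here.

```python
import re
from collections import Counter

# Hypothetical stand-in corpus; the described system indexes MOLIT
# policy documents in OpenSearch instead of a Python dict.
DOCS = {
    "doc-1": "Housing subscription applications are submitted online through the designated portal.",
    "doc-2": "Drone flights above the permitted altitude require prior approval from the aviation authority.",
    "doc-3": "Vehicle registration transfers must be filed within the statutory period after purchase.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def retrieve(query, docs, k=2):
    """Rank documents by token overlap with the query.

    This is a toy substitute for a real search-engine query: it only
    counts shared tokens and drops documents with no overlap.
    """
    query_counts = Counter(tokenize(query))
    scored = []
    for doc_id, text in docs.items():
        doc_counts = Counter(tokenize(text))
        score = sum(min(query_counts[t], doc_counts[t]) for t in query_counts)
        if score > 0:
            scored.append((score, doc_id))
    # Highest score first; ties broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:k]]


def build_prompt(query, docs, k=2):
    """Assemble a prompt that restricts the model to retrieved excerpts."""
    hits = retrieve(query, docs, k)
    context = "\n".join(f"[{doc_id}] {docs[doc_id]}" for doc_id in hits)
    return (
        "Answer using only the policy excerpts below. "
        "If they do not cover the question, say so.\n\n"
        f"Excerpts:\n{context}\n\nQuestion: {query}\nAnswer:"
    )


if __name__ == "__main__":
    print(build_prompt("Do drone flights require approval?", DOCS))
```

Tagging each excerpt with its document identifier is one way to let an answer cite its source, which is the kind of traceability the grounding argument relies on.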