Multi-hop Question Answering (MHQA) necessitates that Large Language Models (LLMs) synthesize scattered information through iterative retrieval and reasoning. However, current iterative frameworks face structural bottlenecks: the ``context overflow'' ...
Multi-hop Question Answering (MHQA) necessitates that Large Language Models (LLMs) synthesize scattered information through iterative retrieval and reasoning. However, current iterative frameworks face structural bottlenecks: the ``context overflow'' arising from the accumulation of intermediate artifacts, and the ``error propagation'' where initial retrieval inaccuracies cascade into downstream reasoning failures. These challenges often destabilize the reasoning trajectory, leading to suboptimal final answers. To address these limitations, this study presents a lightweight, training-free adaptive pipeline designed to enhance the robustness of iterative MHQA. We first integrate the RECAP mechanism---inspired by cognitive chunking---to summarize reasoning history at each step, thereby stabilizing discourse coherence and mitigating context overflow. Building upon this stabilized baseline, we introduce a ``Similarity-Gated Query Refinement'' module. This component functions as a dynamic monitor, assessing retrieval quality via similarity score distributions and autonomously triggering query rectification only when high uncertainty is detected. This selective intervention strategy ensures corrective re-retrieval without incurring unnecessary computational overhead. Empirical evaluations on three standard benchmarks---HotpotQA, 2WikiMultihopQA, and MuSiQue---demonstrate that our adaptive pipeline consistently outperforms a robust RECAP-based baseline across diverse settings. Crucially, significant performance gains were observed not only in proprietary API-based models but also in offline environments using Llama-3.1-8B, substantiating the model-agnostic nature and generalizability of the proposed framework. While acknowledging limitations such as reliance on heuristic gating, this research underscores the efficacy of selective intervention based on lightweight uncertainty signals. By balancing computational efficiency with rigorous error correction, the proposed framework offers a practical foundation for designing robust, controllable agentic workflows in complex reasoning tasks.