1. NVIDIA. (2024). NVIDIA NeMo Guardrails Documentation. The documentation emphasizes defining explicit conversational flows and using validation to ensure agent outputs are consistent and adhere to predefined rules, which aligns with refining objectives.
2. Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., Xia, F., ... & Zhou, D. (2022). Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv preprint arXiv:2201.11903. This foundational paper demonstrates that breaking down problems into intermediate steps (subtasks) significantly improves the reliability and accuracy of LLM outputs.
3. Wang, L., Ma, W., Zhu, X., et al. (2023). A Survey on Large Language Model based Autonomous Agents. arXiv preprint arXiv:2308.11432. Section 3.2, "Planning," discusses task decomposition as a core capability for agents to handle complex tasks reliably by breaking them into simpler, executable sub-goals.