ORIGINAL RESEARCH article
Front. Artif. Intell.
Sec. Machine Learning and Artificial Intelligence
Eco-FinOps: A Causal-Agentic Framework for Energy-Efficient and Explainable Cloud Cost Optimization
- SP
Sreejindeth P
- PT
Padmavathy T
Vellore Institute of Technology, Vellore, India
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
Two converging crises—uncontrolled operational expenditure and significant environmental hazards—have been aggravated by the exponential scaling of cloud infrastructure. This has led to a greater focus on Financial Opera-tions (FinOps) efficiency. Current monitoring methods exhibit a major flaw: passive dashboards require constant human monitoring and generate alert fatigue, while deep learning models require heavy GPU computing power and act as opaque "black boxes." This lack of interpretability contradicts the sustainability goals these tools promise to support. This study pro-poses Eco-FinOps, a causal-agentic framework capable of optimizing cloud costs autonomously and energy-efficiently, designed strictly for commodity CPU environments without discrete GPU requirements. The system uses Polars-based telemetry ingestion, the PC algorithm for causal graph construction, and DoWhy for effect estimation to mathematically distinguish genuine workload fluctuations from inefficient resource usage. This study tar-gets the "zombie" resource pattern—provisioned instances utilizing merely 0.2-5% CPU while maintaining 60-90% memory consumption. Furthermore, a LangGraph-coordinated Retrieval-Augmented Generation (RAG) pipeline enables Small Language Models to generate auditable, human-verifiable remediation scripts. Evaluated on five fault-injection scenarios from the Al-ibaba Cluster Trace dataset, the framework achieves a Precision of 0.89, Recall of 0.86, and F1-score of 0.87. Causal verification successfully elim-inates 39% of false positive anomalies, raising Precision by 38 points over static threshold rules and outperforming neural autoencoder baselines by 15 F1 points.
Summary
Keywords
AIOps, anomaly detection, causal inference, cloud cost optimization, FinOps, Green cloud, LLM safety, RAG
Received
21 April 2026
Accepted
22 May 2026
Copyright
© 2026 P and T. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Padmavathy T
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.