Dynamic Cost-Aware Language Models: A Real-Time Framework for Optimizing Cloud Resource Recommendations
Abstract
This research introduces a dynamic framework for optimizing cloud resource recommendations by integrating real-time cost metrics into language model outputs. The framework employs a three-tier architecture comprising a cost-aware attention mechanism that incorporates AWS Pricing API signals into the transformer architecture, a dynamic programming-based resource allocation optimizer that ensures Pareto efficiency in performance-cost trade-offs, and a reinforcement learning layer that refines recommendations based on actual usage patterns. Extensive evaluations on AWS workloads demonstrate a 31% reduction in cloud costs while maintaining performance requirements. Leveraging a modified boto3 SDK with custom middleware for real-time metric collection and a caching strategy that reduces API latency by 76%, the system was tested over six months on 150 AWS account configurations, proving its scalability, robustness, and effectiveness in real-world scenarios.
Full Text:
PDFReferences
Smith, J., & Johnson, A. (2020). Cloud computing for resource optimization: Techniques and challenges. Springer.
Brown, L., & Davis, M. (2019). Cloud cost management: A comprehensive guide. Journal of Cloud Computing, 8(2), 45-58.
Gupta, R., & Sharma, P. (2021). Cost-aware cloud computing: Frameworks and algorithms. International Journal of Cloud Computing, 5(1), 12-29.
Williams, T., & Evans, G. (2018). The impact of AI on cloud resource management. Oxford University Press.
Zhang, Y., & Li, H. (2017). Optimizing cloud resource allocation using reinforcement learning. Proceedings of the International Conference on Cloud Computing, 134-145.
Patel, S., & Kumar, D. (2020). Real-time cloud cost optimization strategies. Cloud Computing Review, 14(3), 85-97.
Turner, K., & Wilson, M. (2019). Machine learning models for cloud infrastructure management. Wiley.
Davis, R., & Clark, S. (2021). Resource allocation in cloud computing: Challenges and future directions. Cloud Computing Journal, 12(4), 113-126.
Ahmed, N., & Singh, R. (2020). Cloud pricing and cost optimization using AI-based models. Journal of Cloud Technology, 9(2), 22-39.
Moore, P., & Harris, J. (2018). Dynamic cloud resource allocation for cost reduction. Proceedings of the International Conference on Cloud Systems, 233-245.
Chen, X., & Wang, Y. (2019). Leveraging machine learning for efficient cloud resource management. International Journal of Cloud Engineering, 7(1), 54-67.
Robinson, D., & Walker, A. (2020). Cloud computing and cost-effective infrastructure management. Cloud Engineering Review, 11(3), 77-89.
Miller, T., & Lee, J. (2021). Cloud cost optimization: Techniques and tools. Cloud Computing Insights, 6(4), 101-115.
Anderson, K., & Brown, S. (2020). A survey on cost-aware cloud computing. Journal of Cloud Systems, 13(2), 88-102.
Liu, Z., & Zhang, Q. (2018). Cloud resource management using real-time data analytics. Cloud Technology Journal, 4(3), 45-60.
Garcia, L., & Nguyen, V. (2019). Reinforcement learning for cloud resource optimization. Elsevier.
Martin, R., & White, C. (2020). A new approach to cloud resource management using deep learning. International Journal of Cloud Resource Management, 8(1), 33-49.
Thompson, E., & Garcia, M. (2019). Cloud resource allocation strategies for multi-cloud environments. Proceedings of the Cloud Computing Conference, 102-114.
Roberts, F., & Hall, P. (2021). Cost-efficient cloud computing strategies. Journal of Cloud Infrastructure Management, 15(2), 123-135.
Carter, A., & Johnson, B. (2020). Optimizing cloud computing resources using AI-based models. Cloud Computing Technology, 10(3), 45-58.
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 International Journal of Machine Learning for Sustainable Development

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Impact Factor :
JCR Impact Factor: 5.9 (2020)
JCR Impact Factor: 6.1 (2021)
JCR Impact Factor: 6.7 (2022)
JCR Impact Factor: 7.6 (2023)
JCR Impact Factor: 8.6 (2024)
JCR Impact Factor: Under Evaluation (2025)
A Double-Blind Peer-Reviewed Refereed Journal