A Mathematically Modified Adam Algorithm for Improved Convergence in Deep Neural Networks: A Mathematically Modified Adam Algorithm

Mark   Laisin; Bright O Osu; Prisca Udodiri Duruojinkeya; Chigozie Chibuisi

A Mathematically Modified Adam Algorithm for Improved Convergence in Deep Neural Networks

A Mathematically Modified Adam Algorithm

Authors

Mark Laisin https://orcid.org/0009-0003-3331-1235
Bright O Osu https://orcid.org/0000-0003-2463-430X
Prisca Udodiri Duruojinkeya https://orcid.org/0000-0002-3174-7751
Chigozie Chibuisi https://orcid.org/0009-0006-4100-7817

Keywords:

Adaptive Optimization, Gradient Descent, Deep Neural Networks, Convergence Analysis, Momentum Thresholding, Non-Convex Optimization, Step Size Scheduling, Stochastic Optimization

Abstract

This study introduces the Adaptive Moment Gradient Thresholding (AMGT) algorithm, a modified version of the Adam optimizer, aimed at enhancing convergence stability in deep neural networks. By leveraging optimization theory and addressing the limitations of Adam, AMGT was designed to tackle non-convexity, constrained environments, and gradient-based learning instability. The algorithm incorporates a diminishing step size schedule and momentum thresholding to improve performance. Theoretical analysis demonstrated that AMGT achieved linear convergence under strong convexity with a rate of , global convergence under bounded gradient approximation errors, and convergence to stationary points in non-convex scenarios. Numerical experiments on convex quadratic functions validated the theoretical predictions, highlighting the algorithm’s sensitivity to spectral properties and resilience to learning rate variations. The results indicate that AMGT surpasses standard Adam in convergence behaviour and provides theoretical guarantees often lacking in adaptive optimizers. AMGT is particularly effective in high-dimensional, noisy, or resource-constrained settings due to its support for quantized and sparsified updates. By combining theoretical rigour with empirical robustness, AMGT emerges as a dependable option for training deep learning models across diverse optimization landscapes.

Downloads

Published

2025-12-29

Issue

Vol. 6 No. 2 (2025): December-2025

Section

Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

The OJEMSTE on provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.

Open Access Statement:

The OJOMSTE permits any users to read, download, copy, distribute, print, search, or link to the full texts of the publications, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose, without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. The only constraint on reproduction and distribution, and the only role for copyright in this domain should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited.