M-Learning: Heuristic Approach for Delayed Rewards in Reinforcement Learning

The current design of reinforcement learning methods requires extensive computational resources. Algorithms such as Deep Q-Network (DQN) have obtained outstanding results in advancing the field. However, the need to tune thousands of parameters and run millions of training episodes remains a signifi...

Bibliographic Details
Main Authors:	Cesar Andrey Perdomo Charry, Marlon Sneider Mora Cortes, Oscar J. Perdomo
Format:	Article
Language:	English
Published:	MDPI AG 2025-06-01
Series:	Mathematics
Subjects:	reinforcement learning; exploration–exploitation dilemma; Q-learning; frozen lake; heuristic approach
Online Access:	https://www.mdpi.com/2227-7390/13/13/2108

Similar Items

Adaptive sequential sampling for reliability estimation of binary functions
by: Miroslav Vořechovský
Published: (2022-08-01)

Complexification through gradual involvement and reward Providing in deep reinforcement learning
by: E. V. Rulko
Published: (2024-12-01)

Strategic Decision-Making in SMEs: A Review of Heuristics and Machine Learning for Multi-Objective Optimization
by: Gines Molina-Abril, et al.
Published: (2025-07-01)

Optimizing travel time reliability with XAI: A Virginia interstate network case using machine learning and meta-heuristics
by: Navid Khorshidi, et al.
Published: (2025-09-01)

SBOA: A Novel Heuristic Optimization Algorithm
by: Qi Diao, et al.
Published: (2024-02-01)

XSQ-Learning: Adaptive Similarity Thresholds for Accelerated and Stable Q-Learning
by: Ansel Y. Rodríguez González, et al.
Published: (2025-06-01)

Adaptive Federated Learning With Reinforcement Learning-Based Client Selection for Heterogeneous Environments
by: Shamim Ahmed, et al.
Published: (2025-01-01)

Dual‐Transformer Deep Learning Framework for Seasonal Forecasting of Great Lakes Water Levels
by: Yi Chen, et al.
Published: (2025-06-01)

FADQN: A Heuristic Reinforcement Learning Mechanism for UAV Path Planning in Unknown Environment
by: Wei Sun, et al.
Published: (2025-01-01)

A Novel Heuristic Algorithm for Minimum Compliance Topology Optimization
by: Bogdan BOCHENEK, et al.
Published: (2016-11-01)

SASDL and RBATQ: Sparse Autoencoder With Swarm Based Deep Learning and Reinforcement Based Q-Learning for EEG Classification
by: Sunil Kumar Prabhakar, et al.
Published: (2022-01-01)

Analisis Usability Pada Aplikasi Allo Bank Menggunakan Heuristics Evaluation
by: Melida Ratna Utami, et al.
Published: (2022-12-01)

A Novel Hyper-Heuristic Algorithm for Bayesian Network Structure Learning Based on Feature Selection
by: Yinglong Dang, et al.
Published: (2025-07-01)

Correction: Forgetting phenomena in the Iowa Gambling Task: a new computational model among diverse participants
by: Tiancheng Yang, et al.
Published: (2025-07-01)

Forgetting phenomena in the Iowa Gambling Task: a new computational model among diverse participants
by: Tiancheng Yang, et al.
Published: (2025-06-01)

HEURISTIC METHOD OF THE TAKING THE NONPROGRAMMABLE DECISIONS IN NON-STANDARD SITUATION
by: A. V. Melehin, et al.
Published: (2016-07-01)

Artificial Bee Colony Algorithm Based on the Division Between Exploration and Exploitation and Its Application in Esophageal Cancer Prediction
by: WANG Yingcong, et al.
Published: (2025-07-01)

Distributed Heuristic Algorithm for Migration and Replication of Self-organized Services in Future Networks
by: Manar AL-jabr, et al.
Published: (2022-12-01)

Cooperative communication resources scheduling of satellite network using a mixed vector encoding heuristic algorithm
by: Jiaxuan Xie, et al.
Published: (2025-06-01)

Deep Reinforcement Learning-Based Two-Phase Hybrid Optimization for Scheduling Agile Earth Observation Satellites
by: Guanghui Zhou, et al.
Published: (2025-06-01)

Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning
by: Arvi Jonnarth, et al.
Published: (2025-01-01)

A systematic review of machine learning-based remote sensing data analysis for geological and mined materials characterisation
by: Sureka Thiruchittampalam, et al.
Published: (2025-12-01)

Rockin’ the Subsurface: Learning Geophysics with ‘Electromagnetic Odyssey’
by: Katya Alvarez-Molina, et al.
Published: (2024-12-01)

An Overview Study of Deep Learning in Geophysics: Cross-Cutting Research to Advance Geoscience
by: Zhao Wenxue, et al.
Published: (2025-01-01)

Effective Solution of University Course Timetabling using Particle Swarm Optimizer based Hyper Heuristic approach
by: Zahid Iqbal, et al.
Published: (2021-12-01)

Optimizing Access Point Placement in Industrial IoT: A Deep Reinforcement Learning Approach With Q-Learning Verification and Signal Heatmap Visualization
by: Manash Mahanta, et al.
Published: (2025-01-01)

The effect of social network, funding and productive organizational energy on the capability of organizational ambidexterity in research institution
by: Novita Dyah, et al.
Published: (2016-06-01)

The Role of Exploration, Exploitation and Dynamic Capabilities: South Korean Entertainment Firms’ Proactive Response Strategies to the COVID-19 Pandemic
by: Ahn Yoonha, et al.
Published: (2025-06-01)

M-Race: A Racing Algorithm for the Tuning of Meta-Heuristics Based on Multiple Performance Objectives
by: Christoff Jordaan, et al.
Published: (2025-07-01)

Evaluating quantum-classical heuristics for traveling salesman problem
by: Mariia A. Makarova, et al.
Published: (2025-07-01)

Evaluasi Dan Redesain Antarmuka Website megapolitanpos.com Menggunakan Metode Usability Testing Dan Heuristic Evaluation
by: Jonathan Tanuwijaya, et al.
Published: (2025-07-01)

Assessing the impact of deforestation on sedimentation using the CAMF heuristic: Application in Manicaragua, Cuba
by: Grethell Castillo Reyes, et al.
Published: (2025-07-01)

Optimizing transient gas network control for challenging real-world instances using MIP-based heuristics
by: Felix Hennings, et al.
Published: (2024-05-01)

Binary Chaotic White Shark Optimizer for the Unicost Set Covering Problem
by: Pablo Zúñiga-Valenzuela, et al.
Published: (2025-07-01)

Alzheimer’s disease multiclass detection through deep learning models and post-processing heuristics
by: Rani Ghassan Al Rahbani, et al.
Published: (2024-12-01)

Traj-Q-GPSR: A Trajectory-Informed and Q-Learning Enhanced GPSR Protocol for Mission-Oriented FANETs
by: Mingwei Wu, et al.
Published: (2025-07-01)

Machine Learning and data mining tools applied for databases of low number of records
by: Hubert Anysz
Published: (2022-01-01)

A Heuristic Approach to the Consecutive Ones Submatrix Problem
by: Rewayda Abo-Alsabeh, et al.
Published: (2023-02-01)

AI Agency in Fact-Checking: Role-Based Machine Heuristics and Publics’ Conspiratorial Orientation
by: Duo Lan, et al.
Published: (2025-05-01)

Guidelines for the handling of chilled foods, 1982
Published: (1982)