Search: DRL-based - ai.jp.net

Research Paper #Wireless Communication, Reinforcement Learning, UAV, RIS 🔬 ResearchAnalyzed: Jan 3, 2026 08:42

Throughput Optimization in UAV-Mounted RIS using DRL

Published:Dec 31, 2025 10:36

•

1 min read

•

ArXiv

Analysis

This paper addresses a practical problem in wireless communication: optimizing throughput in a UAV-mounted Reconfigurable Intelligent Surface (RIS) system, considering real-world impairments like UAV jitter and imperfect channel state information (CSI). The use of Deep Reinforcement Learning (DRL) is a key innovation, offering a model-free approach to solve a complex, stochastic, and non-convex optimization problem. The paper's significance lies in its potential to improve the performance of UAV-RIS systems in challenging environments, while also demonstrating the efficiency of DRL-based solutions compared to traditional optimization methods.

Key Takeaways

•Proposes a DRL-based solution for throughput optimization in UAV-mounted RIS systems.
•Addresses practical impairments like UAV jitter and imperfect CSI.
•Achieves higher throughput than conventional methods under severe jitter and low CSI quality.
•Offers significantly faster inference times compared to traditional optimization methods.

Reference

“The proposed DRL controllers achieve online inference times of 0.6 ms per decision versus roughly 370-550 ms for AO-WMMSE solvers.”

Permalink ArXiv

Research Paper #Robotics, Reinforcement Learning, Autonomous Navigation 🔬 ResearchAnalyzed: Jan 3, 2026 17:16

DRL for UGV Navigation in Crowded Environments

Published:Dec 30, 2025 15:17

•

1 min read

•

ArXiv

Analysis

This paper addresses the limitations of existing DRL-based UGV navigation methods by incorporating temporal context and adaptive multi-modal fusion. The use of temporal graph attention and hierarchical fusion is a novel approach to improve performance in crowded environments. The real-world implementation adds significant value.

Key Takeaways

•Proposes a DRL-based navigation framework (DRL-TH) for UGVs.
•Utilizes temporal graph attention (TG-GAT) to capture temporal context.
•Employs a graph hierarchical abstraction module (GHAM) for multi-modal fusion.
•Demonstrates superior performance compared to existing methods in simulations.
•Successfully implemented and tested on a real UGV.

Reference

“DRL-TH outperforms existing methods in various crowded environments. We also implemented DRL-TH control policy on a real UGV and showed that it performed well in real world scenarios.”

Permalink ArXiv

Research Paper #Vehicle Routing, Deep Reinforcement Learning, Optimization 🔬 ResearchAnalyzed: Jan 3, 2026 15:43

Deep RL for Fleet Size and Mix VRP

Published:Dec 30, 2025 14:26

•

1 min read

•

ArXiv

Analysis

This paper addresses the Fleet Size and Mix Vehicle Routing Problem (FSMVRP), a complex variant of the VRP, using deep reinforcement learning (DRL). The authors propose a novel policy network (FRIPN) that integrates fleet composition and routing decisions, aiming for near-optimal solutions quickly. The focus on computational efficiency and scalability, especially in large-scale and time-constrained scenarios, is a key contribution, making it relevant for real-world applications like vehicle rental and on-demand logistics. The use of specialized input embeddings for distinct decision objectives is also noteworthy.

Key Takeaways

•Proposes a DRL-based approach (FRIPN) for solving the FSMVRP.
•Focuses on computational efficiency and scalability.
•Integrates fleet composition and routing decisions.
•Uses specialized input embeddings for decision objectives.

Reference

“The method exhibits notable advantages in terms of computational efficiency and scalability, particularly in large-scale and time-constrained scenarios.”

Permalink ArXiv

Research #IoT, AI, Networking, URLLC, DRL, Bayesian Optimization 🔬 ResearchAnalyzed: Jan 4, 2026 06:49

Joint Link Adaptation and Device Scheduling Approach for URLLC Industrial IoT Network: A DRL-based Method with Bayesian Optimization

Published:Dec 29, 2025 14:32

•

1 min read

•

ArXiv

Analysis

The article proposes a DRL-based method with Bayesian optimization for joint link adaptation and device scheduling in URLLC industrial IoT networks. This suggests a focus on optimizing network performance for ultra-reliable low-latency communication, a critical requirement for industrial applications. The use of DRL (Deep Reinforcement Learning) indicates an attempt to address the complex and dynamic nature of these networks, while Bayesian optimization likely aims to improve the efficiency of the learning process. The source being ArXiv suggests this is a research paper, likely detailing the methodology, results, and potential advantages of the proposed approach.

Key Takeaways

•Focus on optimizing network performance for URLLC in industrial IoT.
•Utilizes DRL and Bayesian optimization for complex network management.
•Likely a research paper detailing a new approach.

Reference

“The article likely details the methodology, results, and potential advantages of the proposed approach.”

Permalink ArXiv

Paper #llm 🔬 ResearchAnalyzed: Jan 3, 2026 16:08

Splitwise: Adaptive Edge-Cloud LLM Inference with DRL

Published:Dec 29, 2025 08:57

•

1 min read

•

ArXiv

Analysis

This paper addresses the challenge of deploying large language models (LLMs) on edge devices, balancing latency, energy consumption, and accuracy. It proposes Splitwise, a novel framework using Lyapunov-assisted deep reinforcement learning (DRL) for dynamic partitioning of LLMs across edge and cloud resources. The approach is significant because it offers a more fine-grained and adaptive solution compared to static partitioning methods, especially in environments with fluctuating bandwidth. The use of Lyapunov optimization ensures queue stability and robustness, which is crucial for real-world deployments. The experimental results demonstrate substantial improvements in latency and energy efficiency.

Key Takeaways

•Proposes Splitwise, a DRL-based framework for adaptive LLM partitioning across edge and cloud.
•Employs Lyapunov optimization for queue stability and robustness.
•Achieves significant improvements in latency and energy efficiency compared to existing methods.
•Demonstrates performance on various hardware platforms and LLM sizes.

Reference

“Splitwise reduces end-to-end latency by 1.4x-2.8x and cuts energy consumption by up to 41% compared with existing partitioners.”

Permalink ArXiv

Throughput Optimization in UAV-Mounted RIS using DRL

Analysis

Key Takeaways

DRL for UGV Navigation in Crowded Environments

Analysis

Key Takeaways

Deep RL for Fleet Size and Mix VRP

Analysis

Key Takeaways

Joint Link Adaptation and Device Scheduling Approach for URLLC Industrial IoT Network: A DRL-based Method with Bayesian Optimization

Analysis

Key Takeaways

Splitwise: Adaptive Edge-Cloud LLM Inference with DRL

Analysis

Key Takeaways

📬 Get AI News Delivered

Browse by Category

Trending Topics

📬 Get AI News Delivered

Browse by Category

Trending Topics