Enhancing Multi-Cell Dynamic TDD with Multi-Agent Deep Reinforcement Learning

Sarala Patchala; Sai Prasanth Kanuparthy; Vullam Nagagopiraju

doi:10.14445/23488549/IJECE-V12I9P106

Enhancing Multi-Cell Dynamic TDD with Multi-Agent Deep Reinforcement Learning

International Journal of Electronics and Communication Engineering

Volume 12 Issue 9

Year of Publication : 2025

Authors : Sarala Patchala, Sai Prasanth Kanuparthy, Vullam Nagagopiraju, Vijaya Babu Burra, Banda Snv Ramana Murthy, Rohini Rajesh Swami Devnikar

10.14445/23488549/IJECE-V12I9P106

How to Cite?

Sarala Patchala, Sai Prasanth Kanuparthy, Vullam Nagagopiraju, Vijaya Babu Burra, Banda Snv Ramana Murthy, Rohini Rajesh Swami Devnikar, "Enhancing Multi-Cell Dynamic TDD with Multi-Agent Deep Reinforcement Learning," SSRG International Journal of Electronics and Communication Engineering, vol. 12, no. 9, pp. 72-83, 2025. Crossref, https://doi.org/10.14445/23488549/IJECE-V12I9P106

Abstract:

Dynamic Time Division Duplex (D-TDD) is an important feature in 5G and future 6G networks. It allows flexible allocation of Uplink (UL) and Downlink (DL) slots. This helps to manage traffic demands dynamically. However, two key challenges exist. First, the system determines the best TDD pattern to match user traffic. Second, cross-link interference occurs when different cells use different TDD configurations. This interference degrades network performance. The 3GPP standard does not provide an optimal method for TDD configuration. It does not solve cross-link interference issues. To address these gaps, we proposed a Multi-Agent Deep Reinforcement Learning (MADRL) approach. This approach models the TDD problem as a linear programming problem. Introduced the Multi-Agent Deep Reinforcement Learning-based 5G RAN TDD Pattern (MADRP) framework. This method is decentralized. Each cell has an independent agent that learns the best TDD configuration. The system reduces control latency and signaling overhead. The MADRP model monitors the buffer states of uplink and downlink data. It exchanges messages with neighboring cells to minimize cross-link interference. Each agent uses reinforcement learning to determine the best TDD allocation. The model adapts to traffic variations and prevents buffer overflows. It highlights the limitations of MADRP. Performance is degraded in high-interference environments. Future work will focus on implementing MADRP in real-world 5G systems. This aimed to integrate the model with OpenAirInterface (OAI) to demonstrate real-time adaptability. This will provide insights into practical deployment challenges. This research introduces a novel DRL-based TDD adaptation approach. It efficiently manages UL and DL allocation while minimizing cross-link interference. The method enhances performance in multi-cell 5G environments. It provides a scalable and effective alternative to static TDD configurations.

Keywords:

Deep, Multi-agent, Multi-cell, TDD, Reinforcement, Resource allocation.

References:

[1] Sher Ali et al., “New Trends and Advancement in Next Generation Mobile Wireless Communication (6G): A Survey,” Wireless Communications and Mobile Computing, vol. 2021, no. 1, pp. 1-14, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Hao Liu et al., “Receiving Buffer Adaptation for High-Speed Data Transfer,” IEEE Transactions on Computers, vol. 62, no. 11, pp. 2278-2291, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Alexander A. Ganin et al., “Resilience in Intelligent Transportation Systems (ITS),” Transportation Research Part C: Emerging Technologies, vol. 100, pp. 318-329, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Qingqing Wu, Xiaobo Zhou, and Robert Schober, “IRS-Assisted Wireless Powered Noma: Do We Really Need Different Phase Shifts in Dl and UL?,” IEEE Wireless Communications Letters, vol. 10, no. 7, pp. 1493-1497, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Adel A. Ahmed et al., “Smart Traffic Shaping based on Distributed Reinforcement Learning for Multimedia Streaming Over 5G-Vanet Communication Technology,” Mathematics, vol. 11, no. 3, pp. 1-20, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Zhuanghua Shi et al., “Effects of Packet Loss and Latency on the Temporal Discrimination of Visual-Haptic Events,” IEEE Transactions on Haptics, vol. 3, no. 1, pp. 28-36, 2009.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Hyejin Kim, Jintae Kim, and Daesik Hong, “Dynamic TDD Systems for 5G and Beyond: A Survey of Cross-Link Interference Mitigation,” IEEE Communications Surveys & Tutorials, vol. 22, no. 4, pp. 2315-2348, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Raed Abduljabbar Aljiznawi et al., “Quality of Service (QoS) for 5G Networks,” International Journal of Future Computer and Communication, vol. 6, no. 1, pp. 27-30, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Alexei V. Nikitin et al., “Impulsive Interference in Communication Channels and its Mitigation by SPART and Other Nonlinear Filters,” EURASIP Journal on Advances in Signal Processing, vol. 2012, pp. 1-29, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Haijun Zhang et al., “Coexistence of Wi-Fi and Heterogeneous Small Cell Networks Sharing Unlicensed Spectrum,” IEEE Communications Magazine, vol. 53, no. 3, pp. 158-164, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Graner Joseph Gracius, “A Performance Benchmark and Analysis of 5G Non-Standalone and Standalone,” Master’s Thesis, National Yang Ming Chiao Tung University, pp. 1-24, 2021.
[Google Scholar]
[12] Ting Yang, Jiabao Sun, and Amin Mohajer, “Queue Stability and Dynamic Throughput Maximization in Multi-Agent Heterogeneous Wireless Networks,” Wireless Networks, vol. 30, pp. 3229-3255, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Md Mehedi Hasan, Sungoh Kwon, and Jee-Hyeon Na, “Adaptive Mobility Load Balancing Algorithm for LTE Small-Cell Networks,” IEEE Transactions on Wireless Communications, vol. 17, no. 4, pp. 2205-2217, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[14] João Rocha, Peter Roebeling, and María Ermitas Rial-Rivas, “Assessing the Impacts of Sustainable Agricultural Practices for Water Quality Improvements in the Vouga Catchment (Portugal) Using the Swat Model,” Science of the Total Environment, vol. 536, pp. 48-58, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Peng Bao et al., “A Statistical-Based Approach for Decentralized Demand Response Towards Primary Frequency Support: A Case Study of Large-Scale 5G Base Stations with Adaptive Droop Control,” IEEE Transactions on Smart Grid, vol. 16, no. 3, pp. 2208-2221, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Hou-I. Liu et al., “Lightweight Deep Learning for Resource-Constrained Environments: A Survey,” ACM Computing Surveys, vol. 56, no. 10, pp. 1-42, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[17] “Physical Channels and Modulation, 3GPP TS 38.211,” ETSI, pp. 1-98, 2018.
[Google Scholar] [Publisher Link]
[18] Jeffrey G. Andrews et al., “What will 5G Be?,” IEEE Journal on Selected Areas in Communications, vol. 32, no. 6, pp. 1065-1082, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Federico Boccardi et al., “Five Disruptive Technology Directions for 5G,” IEEE Communications Magazine, vol. 52, no. 2, pp. 74-80, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Navya Kailasam et al., “Optimized Task Offloading in D2D-Assisted Cloud-Edge Networks Using Hybrid Deep Reinforcement Learning,” International Journal of Basic and Applied Sciences, vol. 14, no. 2, pp. 591-602, 2025.
[CrossRef] [Publisher Link]
[21] Mikko A. Uusitalo et al., “Hexa-X the European 6G Flagship Project.” 2021 Joint European Conference on Networks and Communications & 6G Summit (EuCNC/6G Summit), Porto, Portugal, pp. 580-585, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Koki Yakushiji et al., “Short-Range Transportation using Unmanned Aerial Vehicles (UAVs) During Disasters in Japan,” Drones, vol. 4, no. 4, pp. 1-8, 2020.
[CrossRef] [Google Scholar] [Publisher Link]

IJECE MENUS

Call for Paper - Upcoming Issues

Enhancing Multi-Cell Dynamic TDD with Multi-Agent Deep Reinforcement Learning

How to Cite?

Abstract:

Keywords:

References: