Lia Suci Waliani♦ and Soo Young Shin°Digital Precoder-Combiner and Power Allocation Optimization in MU MIMO-NOMA: A Quantum Neural Network ApproachAbstract: This paper proposes a quantum neural network (QNN) to address joint optimization in wireless communication. By utilizing the advantages of quantum entanglement and superposition in quantum computing and machine learning, QNN can solve optimization problems with lower complexity than classical neural networks, due to its parallel processing capabilities. Specifically, this study applies QNN to jointly optimize digital precoder-combiner and power allocation in a multi-user multiple-input multiple-output non-orthogonal multiple access (MU MIMO-NOMA) system. The performance of the QNN is analyzed and presented. Keywords: digital precoding , digital combiner , quantum neural network , wireless optimization Ⅰ. IntroductionThe rapid advancement of wireless communication technology has led to a demand for higher data rates, particularly as we move towards the implementation and deployment of beyond 5G (B5G) and 6G networks[1-3]. This surge in demand is driven by the growing need for faster, more reliable, and more efficient communication capabilities to support an ever-increasing number of connected devices and data-intensive applications. Multipleinput multiple-output (MIMO) technology, which leverages the use of multiple antennas at both the transmitter and receiver ends, has emerged as a highly promising and effective solution to meet this demand[4,5]. By enabling simultaneous transmission and reception of multiple data streams, MIMO technology significantly enhances spectral efficiency and overall network performance, making it an essential component in the evolution of wireless communication systems. The concept of MIMO is often enhanced by the use of beamforming. Beamforming is a signal processing technique used in wireless communication systems to direct the transmission or reception of signals in specific directions[6]. In beamforming architectures, both analog and hybrid beamforming use phase shifters, which can cause quantization errors because of their limited resolution and non-linear properties. These errors can negatively impact the system’s overall performance[7]. Fully digital precoding, on the other hand, offers greater flexibility in beam steering by allowing baseband processors to directly generate phase and magnitude values for each channel, resulting in higher resolution and improved spectral efficiency[8]. Given the potential for a higher number of users compared to transmit antenna elements, integrating nonorthogonal multiple access (NOMA) with MIMO systems can further enhance spectral efficiency[9]. Achieving maximum spectral efficiency necessitates the joint optimization of transmit precoding, receiver combining, and NOMA power allocation. However, this optimization problem is non-convex and computationally intensive due to the high dimensionality of the variables involved. Conventional methods, such as mathematical derivation[10] and the iterative algorithm[11] often struggle to find optimal solutions efficiently. In order to address this issue, classical machine learning techniques can tackle non-convex optimization problems[12]. However, the complexity of classical machine learning algorithms tends to grow significantly as the amount of data being processed increases. This escalation in complexity often leads to longer computation times and greater demand for computational resources. This is primarily due to the serial processing nature of neurons in classical computers. To address this issue, quantum machine learning has emerged as a promising solution[13]. The advantages of quantum computing, such as parallel processing, lead to lower complexity and reduced processing times. By leveraging the principles of quantum computing and machine learning, quantum machine learning can potentially reduce computational complexity and improve processing speeds, offering a more efficient and scalable approach to solving complex problems. The inherent advantages of quantum computing to tackle complex optimization problems, provide a robust framework for enhancing the efficiency and effectiveness of these critical tasks[14,15]. To the best of the authors’ knowledge, the utilization of QNN for joint optimization for both digital precodercombiner and resource allocation is still limited. Accordingly, the contribution of this study proposes the use of a quantum neural network (QNN) for the joint optimization of digital precoder-combiner and resource allocation in multi-user multiple-input multiple-output nonorthogonal multiple access (MU MIMO-NOMA) systems. The structure of this paper is as follows. Section II presents the wireless system models for digital precodercombiner and NOMA power allocation. Section III describes the proposed quantum neural network (QNN) approach. Section IV discusses and analyzes the performance of the proposed QNN. Finally, the conclusions are drawn in Section V. Notations: Transpose and conjugate transpose are denoted by [TeX:] $$(\cdot)^T \text { and }(\cdot)^H,$$ respectively. [TeX:] $$(\cdot)^{-1}$$ indicates inversion. [TeX:] $$\otimes$$ indicates Kronecker product. Normal and circular normal distributions are presented as [TeX:] $$\mathscr{N}\left(\mu, \sigma^2\right),$$ and [TeX:] $$\mathscr{C} \mathscr{N}\left(\mu, \sigma^2\right),$$ respectively, where μ is the mean and [TeX:] $$\sigma^2$$ is the variance. [TeX:] $$|\cdot| \text { and }\|\cdot\|$$ are indicating the absolute and norm values, respectively. Ⅱ. Wireless System ModelThis study is concerned with a single base station (BS) serving multiple users (UEs) that are grouped in NOMA groups. Subsequently, this study assumes that each group consists of two users. The BS employs a downlink communication MU MIMO-NOMA system. The received signal [TeX:] $$\mathbf{y}_{k, m}$$ at the k-th UE and m-th group can be expressed as
(1)[TeX:] $$\begin{aligned} & \mathbf{y}_{k, m}=\underbrace{\sqrt{\mathbf{g}_{k, m}} \mathbf{c}_{k, m} \mathbf{H}_{k, m}^H \mathbf{v}_{k, m} \mathbf{x}}_{\text {desired signal }}+ \\ & \underbrace{\sum_{i=1, i \neq k}^K \sqrt{\mathbf{g}_{i, m}} \mathbf{c}_{i, m} \mathbf{H}_{k, m}^H \mathbf{v}_{i, m} \mathbf{x}}_{\text {interference signal }}+\underbrace{\mathbf{n}_{k, m}}_{\text {noise }}, \end{aligned}$$where [TeX:] $$\mathbf{H}_{k, m} \in \mathbb{C}^{S_{\mathrm{Tx}} \times N_{\mathrm{rx}}}, \mathbf{c}_{k, m}, \mathbf{v}_{k, m}, n_{k, m}$$ denote the channel matrix information, digital precoding, receiver combiner, transmitted data stream, additive white Gaussian noise (AWGN) denoted as [TeX:] $$\mathscr{C} \mathscr{N}\left(0, \sigma^2\right),$$ at NRx receive antenna of each user, respectively. In order to serve multiple UEs, the BS employs a uniform planar antenna (UPA) with dimensions [TeX:] $$S_{\mathrm{tx}}=M_{\mathrm{Tx}} \times N_{\mathrm{Tx}},$$ consisting of two-dimensional antenna elements. Conversely, each receiving UE utilizes a uniform linear antenna (ULA) with NRx one-dimensional array antenna elements. Thus, the downlink channel matrix for the k-th UE at m-th group can be expressed as
(2)[TeX:] $$\mathbf{H}_{k, m}=\sqrt{\frac{S_{\mathrm{Tx}} N_{\mathrm{Rx}}}{U_{\mathrm{path}}}} \sum_{u=1}^{U_{\mathrm{path}}} \psi_{k, u} \mathbf{a}_{k, r}\left(\theta_{k, u}^r\right) \mathbf{a}_{k, t}^H\left(\theta_{k, u}^t, \boldsymbol{\phi}_{k, u}^t\right),$$where [TeX:] $$\psi_{k, l} \sim \mathscr{C} \mathscr{N}(0,1), \mathbf{a}_{k, r}, \text { and } \mathbf{a}_{k, t}$$ denotes a complex gain of u-th path, ULA array response vector, and UPA array response vector, respectively. The array response vector of ULA antennas can be expressed as
(3)[TeX:] $$\mathbf{a}_{k, r}\left(\theta_{k, u}^r\right)=\frac{1}{\sqrt{N_{\mathrm{Tx}}}}\left[1, e^{j \omega \cos \left(\theta_l^r\right)}, \ldots, e^{j \omega\left(N_1\right) \cos \left(\theta_u^r\right)}\right]^{\mathrm{T}},$$where [TeX:] $$\omega=\frac{2 \pi d}{\lambda} \lambda,$$ the λ denotes the wavelength, d denotes the distance between two antennas which can be defined as [TeX:] $$d=\frac{\lambda}{2}, \text { and } \theta_{k, u}^r \in[0,2 \pi]$$ is the elevation angle of arrival (AoA). For a UPA the array response vector can be expressed as
(4)[TeX:] $$\begin{aligned} \mathbf{a}_{k, t}(\theta, \phi)_{k, u}^t= & \frac{1}{\sqrt{S_{\mathrm{Tx}}}} \\ & {\left[1, e^{j \omega \sin \left(\theta_u^t\right)\left[(m-1) \cos \left(\phi_u^t\right)+(n-1) \sin \left(\phi_u^t\right), \ldots,\right.}\right.} \\ & e^{\left.j \omega \sin \left(\theta_u^t\right)\left[(M-1) \cos \left(\phi_u^t\right)+(N-1) \sin \left(\phi_u^t\right)\right]\right]^{\mathrm{T}}}, \end{aligned}$$where [TeX:] $$\left(\theta_{k, l}^t, \phi_{k, l}^t\right) \in[0,2 \pi]$$ denotes the elevation and azimuth angles of departure (AoD). 2.1 Digital PrecoderAs illustrated in Fig. 1, the transmitter BS generates digital precoding vectors to serve multiple UEs. Digital precoding is employed to enhance signal quality by directing the signal to the intended UEs. Let [TeX:] $$\mathbf{v}_{k, m}$$ be the precoding vectors for the k-th UE at the m-th group. According to [16], the digital precoding can be expressed as
(5)[TeX:] $$\mathbf{v}_{k, m}=\sqrt{\mathbf{g}_{k, m}} \frac{\left(\mathbf{I}_K+\frac{1}{\sigma^2} \sum_{k=1}^K \xi_{k, m} \mathbf{H}_{k, m} \mathbf{H}_{k, m}^H\right)^{-1} \mathbf{H}_{k, m}}{\left.\| \mathbf{I}_K+\frac{1}{\sigma^2} \sum_{k=1}^K \xi_{k, m} \mathbf{H}_{k, m} \mathbf{H}_{k, m}^H\right)^{-1} \mathbf{H}_{k, m} \|},$$where [TeX:] $$\mathbf{I}_K$$ denotes the identity matrix with [TeX:] $$K \times K$$ dimension, [TeX:] $$\mathbf{H}_{K,m}$$ denotes the channel matrix for k-th UE at the m-th group, and [TeX:] $$\xi_{k, m}$$ denotes the optimization variable that can be obtained from QNN. 2.2 Digital CombinerOn the other hand, at the receiver, the received signal undergoes processing, where a digital combiner incorporating an RF chain is employed to integrate and process the incoming signals. Let [TeX:] $$\mu_{k, m}$$ be the optimization variable that can be obtained from the QNN, thus the optimal combiner [TeX:] $$\mathbf{c}_{k, m}$$ for the k-th user at the m-th group can be expressed as [17]
(6)[TeX:] $$\mathbf{c}_{k, m}=\mu_{k, m}\left(\left(\mathbf{H}_{k, m}^H \mathbf{H}_{k, m}+\gamma_{k, m}^{-1} I\right)^{-1} \mathbf{H}_{k, m}^H\right) .$$2.3 NOMA Power AllocationLet us assume channel gains of the users in a NOMA group are sorted in descending order as follows [TeX:] $$\left|\mathbf{H}_1\right|^2 \geq \cdots \geq\left|\mathbf{H}_K\right|^2.$$ Successive interference cancellation (SIC) is applied at the receivers for multi- user detection and decoding to mitigate inter-user interference. The received signal-to-interference- plus-noise ratio (SINR) can be calculated as follows
(7)[TeX:] $$\Upsilon_{k, m}=\frac{\mathbf{g}_{k, m}\left|\mathbf{c}_{k, m} \mathbf{H}_{k, m}^H \mathbf{v}_{k, m}\right|^2}{\sum_{i=k+1}^K \mathbf{g}_{i, m}\left|\mathbf{c}_{k, m} \mathbf{H}_{k, m}^H \mathbf{v}_{i, m}\right|^2+\sigma^2},$$where the [TeX:] $$\sigma^2$$ indicates the noise variance. The power allocation for the k-th user in the m-th group can be calculated as [TeX:] $$\mathbf{g}_{k, m}=\lambda_{k, m} G_{\mathrm{tot}},$$ where [TeX:] $$\lambda_{k, m} \in [0,1]$$ is the NOMA power coefficient and [TeX:] $$G_{\mathrm{tot}}$$ is the total power in the transmitting BS. The achievable rate for the k-th user at the m-th group can be expressed as follows
(8)[TeX:] $$R_{\mathrm{sum}}=\frac{B}{M} \sum_{m=1}^M \sum_{k=1}^K \log _2\left(1+\Upsilon_{k, m}\right),$$where B is the bandwidth and M denotes the number of grouping. 2.4 Problem FormulationThe objective function is to maximize the achievable sum rate given precoder, combiner, and power allocation in MU MIMO-NOMA which can be formulated as follows:
(9a)[TeX:] $$\max _{\mathbf{v}, \mathbf{c}, \mathbf{g}} R_{\mathrm{sum}}=\frac{B}{M} \sum_{m=1}^M \sum_{k=1}^K \log _2\left(1+\Upsilon_{k, m}\right)$$
Ⅲ. Quantum Neural Network for Precoder-Combiner and Power AllocationThis section presents the proposed QNN. The proposed QNN network consists of one layer employed to obtain optimization variables. The quantum circuit of QNN is designed and presented in Fig. 2. Herein, inputs of the network in classical values are mapped into quantum states as qubits through the encoding process. In this case, the number of qubits denoted as [TeX:] $$N_{\text{qubits}},$$ possesses an identical dimension with the network inputs, thus [TeX:] $$N_{\text {qubits}}=N_{\text {inputs}} .$$ As an initial step, each qubit is placed into a quantum superposition state using a Hadamard operator which is defined as pre-processing, expressed as
(10)[TeX:] $$\mathbb{U}_{\text {pre-processing }} \triangleq \bigotimes_{n=1}^{N_{\text {inputs }}} \mathbf{H}\left(\left|u_n\right\rangle\right).$$Subsequently, the pre-processed qubits are going through a feed-forward process. The feed-forward process begins with data encoding, where the classical input is mapped onto the quantum states of qubits. The unitary for input data in the y-th layer can be expressed as
(11)[TeX:] $$\mathbb{U}_{\mathrm{w}-\mathrm{en}}^{[y]}=\bigotimes_{n=1}^{N_{\mathrm{qubit}}} \mathbf{R X}\left(\tanh \left(w_n\right)\right),$$where [TeX:] $$\boldsymbol{\mathrm{R Y}}(\bullet)$$ denotes the quantum gates for rotating qubits around the Y−axis of the Bloch sphere1) within θ phase. Afterward, identical to data encoding, weight encoding is also mapped onto the quantum states. The unitary function of weight encoding in the y-th layer can be expressed as
(12)[TeX:] $$\mathbb{U}_{\mathrm{W}-\mathrm{en}}^{[y]}=\bigotimes_{n=1}^{N_{\text {qubit }}} \mathbf{R} \mathbf{X}\left(\tanh \left(w_n\right)\right),$$where [TeX:] $$\boldsymbol{\mathrm{R X}}(\bullet)$$ denotes the quantum gates for rotating qubits around the X−axis by θ phase. Subsequently, a feed-forward process of QNN denoted by [TeX:] $$\mathbb{U}_{\mathrm{QNN}},$$ can be expressed as
(13)[TeX:] $$\begin{aligned} \mathbb{U}_{\mathrm{QNN}} \triangleq & \bigotimes_{n=1}^{N_{\text {input }}} \mathbf{R X}\left(\tanh \left[w_n\right]\right) \\ & \left.\prod_{n=1}^{N_{\text {input }}} \mathbf{C Z}\left(\left|\mathbf{u}_n\right\rangle| | \mathbf{u}_{n-1}\right\rangle\right) \otimes \ldots \otimes \\ & \mathbf{C Z}\left(\left|\mathbf{u}_{N_{\text {input }}}\right\rangle\left|\left|\mathbf{u}_{N_{\text {input }}-1}\right\rangle\right) \otimes\right. \\ & \mathbf{R Y}\left(\tanh \left[\mathbf{u}_n\right]\right), \end{aligned}$$where [TeX:] $$w_n$$ denotes the trainable weights for the n-th qubit, [TeX:] $$\mathbf{u}_n$$ denotes the classical input for the n-th qubit, and [TeX:] $$\mathbf{CZ}(\cdot)$$ denotes an operation of connecting two qubits. 1) Bloch sphere is a geometrical representation of the quantum bits (qubits). Considering the loss value is obtained as a classical value, a quantum measurement is performed at the end of the total layers to obtain outputs in real values. The quantum measurement operation, denoted by [TeX:] $$\mathbf{M}_{\mathrm{QNN}}^{(y)},$$ can be expressed as
(14)[TeX:] $$\mathbf{M}_{\mathrm{QNN}}^{(y)}=\langle 0| \mathbb{U}_{\mathrm{QNN}}(w)^{\dagger} \mathbf{H} \mathbb{U}_{\mathrm{QNN}}(w)|0\rangle .$$In order to mitigate the inherent perturbations present within quantum measurement processes because of the noise, the measurement is conducted for [TeX:] $$y_{\text {shot }}$$ times. Therefore, the measurement outputs can be expressed as
(15)[TeX:] $$y=\frac{1}{y_{\text {shot }}} \sum_{y=1}^{y_{\text {shot }}} \mathbb{M}_{\mathrm{QNN}}^{(y)}.$$Based on the outputs, the loss of QNN can be calculated as follows
(16)[TeX:] $$\mathbf{L}_{\mathrm{QNN}}(w)=-\frac{1}{N_{\text {data }}} \sum_{n=1}^{N_{\text {data }}} \Xi(y)_n,$$where [TeX:] $$N_{\text {data }}$$ denotes number of training data in a batch, while [TeX:] $$\Xi(y)$$ denotes an objective output. Subsequently, a gradient of the QNN can be calculated using parameter-shift rules[18] as follows
(17)[TeX:] $$\nabla \mathbf{L}_{\mathrm{QNN}}(w)=\frac{\mathbf{L}_{\mathrm{QNN}}(w+\theta)-\mathbf{L}_{\mathrm{QNN}}(w-\theta)}{2 \sinh (\theta)},$$where [TeX:] $$\theta \in[0, \pi]$$ denotes a shifting parameter. Finally, the trainable weights can be updated as follows
where [TeX:] $$\alpha \in(0,1]$$ denotes the learning rate. 3.1 QNN for MU MIMO-NOMA OptimizationThe details for solving the joint optimization problem in the MU MIMO-NOMA scenario can be described as follows: 1. The inputs for the QNN are the channel matrices for each UE, as described in Eq. (2). The total number of elements in the channel matrix is equal to the total number of inputs in the QNN network. 2. The problem formulation has been presented in Section II, where the optimized variables become the output of the QNN. Specifically, the optimization variables [TeX:] $$\xi_{k, m}, \mu_{k, m}, \text { and } \lambda_{k, m}$$ denote the precoders, combiner, and power allocation, respectively. Ⅳ. Numerical ResultThis section analyses the performance of QNN for digital precoder-combiner optimization. The simulation parameters are given in Table 1. It is worth noting that quantum computers are still in the early stages of development, and the number of qubits available for use is currently limited. Therefore, this study considers the low number of the antenna and users. The quantum unitary UQNN was executed utilizing a “basic simulator” backend in IBM Qiskit[19]. Table 1. Parameters
4.1 LossThe training loss with respect to the number of training episodes was presented in Fig. 3. In this study, unsupervised learning was employed, which contrasted with supervised learning, unsupervised learning performed the optimization task without needing reference data. Therefore, the loss function can be expressed as [TeX:] $$\mathrm{LQNN}=-\frac{1}{N_{\text {data }}} \sum_{n=1}^{N_{\text {data }}} R_{\text {sum }} .$$ As can be seen, the loss function showed a decreasing trend starting from the first training episode. Furthermore, the loss is converged at 3-th training episode. 4.2 Achievable SumrateThe achievable rate with respect to the transmitted SNR in MU MIMO-NOMA is presented in Fig. 4 . As shown, the achievable sum rate for QNN optimization has improved compared to the unoptimized scheme. In the unoptimized scheme, all the optimization coefficients are set to 1, meaning there is no optimization for the precoder and combiner. Additionally, the power allocation coefficient is set to 0.2 for near users and 0.8 for far users. 4.3 Complexity AnalysisThe complexity of QNN was analyzed as follows. First, pre-processing was conducted as described in Eq. (10), which utilized the [TeX:] $$\mathbf{H}(\cdot)$$ operation, resulting in [TeX:] $$\mathscr{O}(1) .$$ Second, the feed-forward for QNN was described in Eq. (13). The weight encoding data utilized the [TeX:] $$\mathbf{RY}\left(\tanh \left[\mathbf{u}_n\right]\right)$$ operation, resulting in [TeX:] $$\mathscr{O}(1) .$$ The connection between each qubit utilized the [TeX:] $$\prod_{n=1}^{N_{\text {input }}^{\text {in }}}\mathbf{CZ}\left(\left|\mathbf{u}_n\right\rangle\left|\left|\mathbf{u}_{n-1}\right\rangle\right)\right.$$ operation, resulting in [TeX:] $$\mathscr{O}\left(N_{\text {input}}\right) .$$ The input encoding data utilized [TeX:] $$\mathbf{RX}\left(\tanh \left[w_n\right]\right)$$ operation, resulting in [TeX:] $$\mathscr{O}(1) .$$ Thus, the total complexity for [TeX:] $$\mathbb{U}_{\mathrm{QNN}} \in \mathscr{O}(1)+\mathscr{O}\left(N_{\text {input}}\right)+\mathscr{O}(1) \approx \mathscr{O}\left(N_{\text {input}}\right) .$$ Finally, the overall complexity of QNN [TeX:] $$\mathscr{O}(1)+\mathscr{O}\left(N_{\text {input}}\right) \approx \mathscr{O}\left(N_{\text {input}}\right) .$$ On the other hand, based on [14], the complexity of a classical neural network can be defined as [TeX:] $$\mathscr{O}\left(\left(N_{\text {input}}\right)^2\right) .$$ It can be concluded that the complexity of the QNN is lower than the classical neural network. Ⅴ. ConclusionsThis study proposed quantum neural network (QNN) optimization for joint optimization in MU MIMO-NOMA. Specifically, the QNN was applied to the joint optimization of precoders, combiners, and power allocation. The numerical results of MU MIMO-NOMA utilizing the proposed QNN have been presented. The results show that the proposed QNN outperformed the unoptimized scheme by achieving a higher achievable sum rate. The effects of employing different quantum circuit designs can be explored in the future BiographyLia Suci WalianiFeb. 2022 : B.Eng. degree, School of Electrical Engin- eering, Telkom University, Bandung, Indonesia. Aug. 2024 : M.Eng. degree, Dept. of IT Convergence, Kumoh National Institute of Technology, Gumi, South Korea. [Research Interests] Quantum neural networks, multiple-input multiple-output (MIMO), non-orthogonal multiple access (NOMA), and error correction coding. BiographySoo Young ShinFeb. 1999 : B.Engg. degree, School of Electrical and Electronic Engineering, Seoul National University. Feb. 2001 : M.Engg. degree, School of Electrical, Seoul National University. Feb. 2006 : Ph.D. degree, School of Electical Engineering and Computer Science, Seoul National University. July 2006-June 2007 : Post Doc. Researcher, School of Electrical Engineering, University of Washington, Seattle, USA. 2007-2010 : Senior Researcher, WiMAX Design Laboratory, Samsung Electronics, Suwon, South Korea. Sept. 2010-Current : Professor, School of Electronic Engineering, Kumoh National Institute of Technology. [Research Interest] 5G/6G wireless communica- tions and networks, signal processing, the Internet of Things, mixed reality, and drone applications. [ORCID:0000-0002-2526-2395] References
|