Julian AHRENS,Lia AHRENS,and Hans D.SCHOTTEN,2
(1.German Research Center for Artificial Intelligence,Kaiserslautern 67663,Germany;2.Technical University of Kaiserslautern,Kaiserslautern 67663,Germany)
Abstract:A machine learning method for predicting the evolution of a mobile communication channel based on a specific type of convolutional neural network is developed and evaluated in a simulated multipath transmission scenario.The simulation and channel estimation are de?signed to replicate real-world scenarios and common measurements supported by reference signals in modern cellular networks.The capability of the predictor meets the requirements that a deployment of the developed method in a radio resource scheduler of a base station pos?es.Possible applications of the method are discussed.
Keywords:channel estimation; channel prediction; convolutional neural network; machine learning;multipath transmission
Today’s mobile communication networks are driven by the demand of a steadily increasing number of sub?scribers for ever higher data rates.This demand has led to the introduction of support for technologies such as millimeter wave transmissions and massive multiple-input multiple-output (MIMO) into the current 5G standards.Apart from the introduction of these new technologies,the available spectrum has to be utilised in the most efficient manner possi?ble.This has already led to the move to orthogonal frequencydivision multiple access (OFDMA) and orthogonal frequencydivision multiplexing (OFDM) in the fourth-generation mobile broadband standards,which allow for fine grained control over the utilisation of the available radio resources across both the time and frequency domains.While OFDM combats frequencyselective fading by using long symbol times,OFDMA provides further benefits by allowing multiple users to schedule trans?mission on the subcarriers which are best for them at the time[1].OFDM also allows for different encodings to be used across the available spectrum,thereby giving the scheduler fine grained control over the trade-off transmission data rate vs.sig?nal robustness.
The dynamic allocation of radio resources and its scheduling are key to achieving efficient utilisation of the available spec?trum.Since base stations manage a large number of transmis?sions,each across a different channel depending on the posi?tion and environment of the individual user equipment (UE),they are natural candidates for hosting an optimisation through dynamic scheduling of the radio resources.To achieve effi?cient radio resource management,scheduling algorithms need to have information about the current and future states of the transmission channels.In particular,two things are required:On the one hand,a mechanism for the estimation of the trans?mission channels needs to be in place,i.e.,there has to be a measurement of the channel transfer function; on the other hand,the development of the transmission channels over time has to be predicted to allow for estimates of future channel quality.
In Long Term Evolution (LTE) systems,channel estimation can be implemented by observing the Cell-Specific Reference Signals (CRS).LTE release 10 (LTE Advanced) supplemented the CRS by the introduction of Channel State Information Ref?erence Signals (CSI-RS).5G New Radio (NR)does not provide CRS,instead relying exclusively on the flexibly configurable CSI-RS.In this paper,we use a simulation of a multipath prop?agation transmission channel based on the empirical evidence and the models devised in [2].The channel is estimated by transmitting and measuring a test signal containing a similar amount of information as the LTE CRS.In particular,very sim?ilar estimates can be derived from the observation of LTE CRS.
The present work focusses on the aspect of predicting the time-variant transmission channels.A convolutional neural network(CNN)operating on the time-frequency domain and us?ing multiple time resolutions is designed in order to achieve the necessary prediction performance.The proposed CNN is a two-dimensional variant of the WaveNet network architecture proposed in [3] and uses dilated kernels on the time axis to achieve the incorporation of multiple time resolutions.A fur?ther enhancement to the WaveNet architecture presented here consists in enabling simultaneous multi-step predictions,allow?ing for the instantaneous predictions of the channel develop?ment over a period of 5 ms (one half-frame) at a resolution of 500 μs(one slot)each.This is especially useful,since the allo?cation of resource blocks can be changed at the half-frame lev?el,necessitating the prediction over at least this time period.
The remainder of this work is structured as follows:Section 2 introduces the simulation from which the transfer functions of a fading channel scenario based on real-world observations are derived.Section 3 describes the employed channel estima?tion procedure.Section 4 describes the channel predictor that is the essential part of this work.Section 5 summarises the ob?tained results.In Section 6,we provide a discussion of possi?ble applications and an outlook on future research.Section 7 concludes the paper.
Setting the position and carrier frequency of the transmitter to(0,0)∈?2andfcarrier=900 MHz,respectively,the multipath transmission is simulated by generating 256 scatterers.Each scatterer starts at a randomly chosen initial position∈?2such that the power delay profile of the resulting multipath transmission matches the typical urban scenario de?scribed in [2],and moves at a random time-invariant velocity∈?2with~N(0,σ2),σ=10 m/s,forι=0,...,63,and=0 forι≥64.The receiver is supposed to move from an initial position=(400,0)∈?2near the trans?mitter at velocityv*=∈?2with |v*|=10 m/s,arctan2~U(-π,π) where exp(iarctan2(y,x))=(x+iy)/for(x,y)∈?2.The transmissions are assumed to be conducted periodically in blocks.The time for transmitting one block is assumed to beT=500 μs,which leads to a dis?crete time simulation with step size 500 μs.The simulation time amounts to 212=4 096 time steps in total.The bandwidth of transmission is set to 12.8 MHz.A time interval of length 20 μs at the beginning of each block is used for the transmis?sion of a test signal generated for the channel estimation.All values are computed and stored using International System of Units(SI)base units.
An example configuration of this simulation is shown in Fig.1.The large red and black dots represent the transmitter and receiver,respectively.The smaller dots represent the scatterers which are coloured according to the phase offsets ob?served on the corresponding transmission path (shown as lines)with red representing zero offset and cyan representing a phase offset of π.
At each simulation steptand for eachι,the path length re?flected by theι-th scatterand its derivative with respect to timeare com?puted.For each scattererι,we recorded the corresponding transmission delay time,the constant phase offset,and the Doppler frequencycaused by theι-th scatterer follow?ing the rules?2π,and,respectively,as well as the received signal amplitudecomputed using the free-space propaga?tion model[1].(Here,c0refers to the speed of light in vacuum.)
In a setting without line of sight,using linearisation of the phase offset with respect to the Doppler frequency,the timevariant channel impulse response evaluated at timet+τfor each simulation steptand smallτresulting from the multipath transmission simulated using the above parameters can be ap?proximated by

For any signal{Sτ}0≤τ ▲Figure 1.Example configuration of the designed simulation. This parametrisation is used in [4] and delivers a realistic approximation of real-world scenarios for numbers of summands greater than 100[5].In order to allow continuous time delays to be applied to discrete time signals,the impulse functionsin (1) and (2) are convolved with a windowed sinc(?)function scaled with a given bandwidth.Overall,the channel transmission including pulse shaping with bandwidth restricted to half the sample rate and additive noise is approximated by re?placing thein (1) and (2) by sin(π(?/2))/(π(?/2))1[-8,8]and adding independent and identically distributed Gaussian white noise~N(0,σ2) to the transmitted signal with powerσ2resulting in a signal-to-noise ratio of 12 dB. For both the channel estimation and prediction,we will work in the frequency domain.Apart from the obvious usefulness of frequency domain estimation and prediction for OFDM sys?tems,working in the frequency domain allows for a channel es?timation scheme of lower computational complexity compared to equalisers operating in the time domain and requiring matrix inversions.Moreover,the frequency domain mode of operation has some benefits on the predictor further detailed in Section 4.Throughout the remainder of this paper,for a discrete time complex-valued signal {Xτ}τ=0,...,N-1,let ?X={?Xf}f=0,...,N-1denote its(discrete)Fourier transform. The time-variant channel transfer functions ?h(?,t+τ) fort=0,...,4 095Tand 0 ≤τ In order to improve the quality of the preliminary estimatorwhich is noise corrupted,the correponding im?pulse responseis windowed by a step function of widthN/2 and then Fourier transformed,i.e.,the estimatorof the channel transfer function is given by where ?-1Y={?-1Yτ}τ=0,...,N-1refers to the inverse Fourier transform of the considered signal{Yf}f=0,...,N-1in the frequen?cy domain.The step of windowing the preliminarily estimated impulse responseis conducted due to the observa?tion of a long noisy tail showing up in the recorded,which,according to the simulation with maximum transmission delay time less thanN/2,should be eliminated; this,at the same time,yields a discrete approximation to convolving the estimat?ed channel transfer function with the kernel sin(π(?/2))/(π(?/2))so that down sampling with step size 2 (instead of the original step size 1) in the frequency domain delivers an error corrected version of the estimated channel transfer func?tion. When applied to a multipath transmission channel with addi?tive white noise such as the transmission channel simulated in Section 2,the above method of estimating the channel transfer function yields a reasonably accurate estimate. The initial resolution level of the frequency spectrum is set toN=29which results in an estimated channel transfer func?tionof lengthN/2=256 for each block beginning at simu?lation stept.Overall,the simulation is run 16 times indepen?dently,which results in 16 independent time series of the formwith. ▲Figure 2.Example plot of estimated transfer functions. An example realisation of transfer functions estimated dur?ing one simulation is shown in Fig.2.Thex-axis represents time,labelled by time stepst,and they-axis represents fre?quency,labelled by indices of subcarriersf.Brightness corre?sponds to amplitude with bright colors representing good signal reception.Colors correspond to phases with red representing phase 0 and cyan representing phase π.One can clearly see the thin dark areas reflecting the effect of frequency selective fading. In order to ensure that the proposed system could indeed be implemented on current cellular radio equipment,the method of estimation of the simulated channel is chosen in such a man?ner that the level of channel information obtained is very simi?lar to that commonly available from the reference signals in re?al-world systems. The time series of estimated channel transfer functions from Section 3 are used as labels for training and testing a carefully chosen convolutional neural network (CNN) that delivers oneor multi-step ahead predictions of the time-variant channel transfer functions resulting from the simulation in Section 2.Since additive noise is included in the simulation of the chan?nel,the trained neural network also contributes to the denois?ing of the channel transfer function along with the channel esti?mation scheme in Section 3. In general,CNNs are a specific architecture of feed-forward neural networks,where linear filters (convolution kernels) in?stead of traditional single weight parameters are used in a shift invariant manner for the transformation between adjacent lay?ers,making use of local temporal and spatial structure of the input signal within a local receptive field.The local receptive field can be enlarged without the need for increasing the num?ber of parameters by means of the dilation parameter.In the one-dimensional case,a CNN with dilation is known as WaveNet that is introduced in [3]for processing audio signals.For more details on CNNs,the readers are referred to[6].Com?pared to traditional fully connected neural networks and recur?rent neural networks such as long short-term memory units(LSTMs)[7],CNNs use fewer parameters and are less receptive to overfitting. The shift invariant nature of CNNs necessitates that the sig?nals processed by a CNN have some amount of homogeneity,as the layers of the CNN have no way of varying the processing performed by them between different regions of the input sig?nal.In our particular case,this means that the method in which predictions are performed for a certain consecutive group of subcarriers is exactly the same as that used for any other group of consecutive subcarriers.This is a reasonable ap?proach,as the manner in which the influence of the channel on the transmission develops over time is indeed very homoge?neous across the entire considered bandwidth.This assump?tion would not hold,if we were to work directly on the time do?main channel impulse response,as most of the power of this impulse response is contained within the first few microsec?onds,suggesting a different approach for processing this earlier part of the impulse response. In our setting,a two-dimensional convolutional neural net?work(CNN)with partial dilation is used for building the predic?tion model,which is described in the remainder of this section. CNNs are a special type of feed-forward neural networks made up of one or several convolutional layers.A feed-forward neural network is a function mapping an input vector to an out?put vector,making use of a set of parameters which are to be adapted through the training.In a multi-layer neural network,this function operates in the form of several such functions in succession,each transforming the corresponding input vector into an output vector.In our setting,for processing the time se?ries of channel transfer functionswith∈?256,we use two-dimensional convolutional layers where each input vector is indexed with three axes related to the real-or-imagi?nary part of the complex plane,the simulation time steps,and the frequency domain,and the transformation is conducted by convolving the input vector with a convolution kernel made up of free parameters to be adapted and adding a free parameter vector called bias to the result.For our purpose of multi-step prediction,we also consider the evolution of the time series over a long period of time,for which we use the so-called dila?tion parameter on the time axis defining the spacing between the free parameters in the convolution kernel.The introduc?tion of the dilation parameter enables us to extend the recep?tive field of the CNN in time without taking extra parameters for fitting. For delivering at mostm-step ahead predictions of the future channel transfer function,we use a 5-layer CNN beginning with 4 consecutive partially dilated convolutional layers along thet-axis with channel sizes 2,6,12,12,6,followed by one convolutional layer with 2moutput channels.In each partially dilated convolutional layer,the size of the free convolutional kernel is set to (4,5) for the time and frequency axis,respec?tively,and the dilation parameter is defined by 4 to the power of the corresponding layer number.The final layer is endowed with 1× 1 convolution kernels.Apart from the last layer,the hyperbolic tangent is used as activation function in each layer.In order to improve the back propagation of the gradient [8],a residual convolutional layer [9] with kernel size(1,1) is added to each partially dilated convolutional layer.The above layout is common in convolutional neural networks and is designed to best adapt to our task and the nature of the input signals.The layout of our CNN is summarised in Table 1.For illustration,a diagram of the dilated layers along thet-axis is presented in Fig.3. During the training,the free parameters in our CNN are ad?justed to the labelled training data by minimising the mean squared error (MSE) of prediction along the negative direction of the gradient of the error function with respect to the parame?ters,for which we use a refined version of stochastic gradient descent (SGD) called ADAM training algorithm [10].The gra?dient for each update is computed by means of the so-called backpropagation algorithm[8]based on the chain rule.tions at most.The MSEs are evaluated for training,validation,and test data (Table 2).The similarity of performance evaluat?ed on all three sub-datasets indicates no significant overfitting. As a baseline,we consider the trivial prediction where all fu?ture values of the time series are set to the latest observed val?ue; the MSE of such a prediction scheme provides a measure for the variation of the underlying time series over time (Table 2).Overall,the instantaneous long-term prediction with our CNN usingm=10 facilitated by employing the dilation param?eter in time delivers much more accurate results than the trivi?al prediction scheme. In Figs.4,5,6,and 7,the power density spectra in dB of an example channel transfer function evaluated at time stepst0andt0+Δtand the Δt-step ahead instantaneous prediction for timet0+Δt,Δt=1,2,5,10,output by the trained CNN withm=10,are plotted in blue,yellow,and green,respectively.Note in particular that most of the negative peaks of the future power density spectra are correctly detected by the predictor,which suggests the utility of our approach in handling situa? ▼Table 1.Layout of the 2D CNN for m-step ahead prediction ▲Figure 3.Structure of the dilated convolutional layers along the t-axis. In our setting,the 16 independent time series of channel transfer functionswith∈?256are each divid?ed into 8 segments which are to be fed into the CNN as input vectors and grouped as training,validation,and test parts with the proportion 6:1:1.The ADAM optimiser with learning rateγ=0.01 is run for 30 training epochs in total. The performance of our approach to delivering multi-step ahead prediction is measured in a setting withm=10 for train?ing the corresponding CNN to output 10-step ahead predic?tions with frequency selective fading in an OFDM transmission scheme(see Section 6 for more discussion). ▼Table 2.Mean squared errors(MSEs)for prediction length Δt from 1 to 10 ▲Figure 4.Power density spectrum in dB for Δt=1. ▲Figure 5.Power density spectrum in dB for Δt=2. ▲Figure 6.Power density spectrum in dB for Δt=5. ▲Figure 7.Power density spectrum in dB for Δt=10. As mentioned in the introduction,the method proposed in the preceding sections can be employed to provide an OFDMA/OFDM radio resource scheduler located in a base station with predictions necessary for an efficient scheduling of radio re?sources.There are two main aspects of the scheduler,which can benefit from this information: The predictions can be used to decide to which user a specif?ic radio resource element should be allocated by estimating the relative usefulness of assigning the element to a specific user compared to the utility another user may have of it.For in?stance,consider the case where two radio resource blocks RRB A and RRB B are assigned to users UE A and UE B,respec?tively.If the predictor predicts that during the next half-frame the part of the spectrum on which RRB A is transmitted will become faded for UE A,but a strong signal could be received by UE B,it would be advantageous to change the allocation and assign RRB B to UE A and RRB A to UE B. The other aspect is that the scheduler may control the choice of encoding used on each of the radio resource ele?ments.In particular,if a prediction reveals that a certain part of the spectrum will become faded for a particular user and a reallocation among the users as in the case discussed previ?ously is not applicable,the scheduler may initiate a change of the employed encoding,for instance from 64QAM1Quadrature amplitude modulationdown to 16QAM,thereby increasing the robustness of the signal and counteracting the decreasing signal-to-noise ratio.In ex?treme cases of frequency selective fading,transmissions on the corresponding frequencies could even be disabled com?pletely. In future research,we hope to expand on both of these topics by developing an adaptive coding scheme and a dynamic scheduler for the multi-user case based on the research per?formed in this article. In this paper,we simulated a multipath transmission scenar?io,implemented a channel estimation scheme,and designed a machine learning model for predicting the resulting channel transfer functions over multiple time steps.Our results show that the machine learning model is capable of capturing charac?teristics of the channel evolution and provides reasonable pre?dictions.We addressed possible applications of the method in real-world systems,which we plan to implement and evaluate in future research.

3 Channel Estimation



4 Channel Prediction


5 Results





6 Discussion
7 Conclusions