Optica Publishing Group

Efficient reservoir computing using field programmable gate array and electro-optic modulation

Open Access

Abstract

We experimentally demonstrate a hybrid reservoir computing system consisting of an electro-optic modulator and a field programmable gate array (FPGA). It implements delay lines and filters digitally for flexible dynamics and high connectivity, while supporting a large number of reservoir nodes. To evaluate the system's performance and versatility, three benchmark tests are performed. The first is the 10th-order Nonlinear Auto-Regressive Moving Average test (NARMA-10), where predictions of 1000 and 25,000 steps yield impressively low normalized root mean square errors (NRMSEs) of 0.142 and 0.148, respectively. Such accurate predictions far into the future speak to its capability for processing large sample sizes, as enabled by the present hybrid design. The second is the Santa Fe laser data prediction, where a normalized mean square error (NMSE) of $6.73\times 10^{-3}$ is demonstrated. The third is isolated spoken digit recognition, with a word error rate close to 0.34%. Accurate, versatile, flexibly reconfigurable, and capable of long-term prediction, this reservoir computing system could find a wealth of impactful applications in real-time information processing, weather forecasting, and financial analysis.

© 2021 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Modern computers based on the von Neumann architecture have been designed for digital information processing and generic computational tasks thereupon. With more and more problems being identified as computationally complex and/or excessively time consuming, alternative computational paradigms are actively explored to solve these problems as they increasingly emerge [1–5]. To that end, a promising approach is to develop brain-inspired architectures for information processing, including a variety of neural networks [6–9]. Yet, those architectures are still implemented through computer simulations on von Neumann computers, and thus remain fundamentally subject to the latter's limitations in speed, parallelism, etc.

Recently, there has been increasing interest in artificial neural networks using optics, leveraging its remarkable speed, multiplexing capability, and low heat deposition [10,11]. Usually, these optical neural networks are trained end-to-end with a known data set to optimize their connectivity and parameters through nonlinear layers [12,13]. However, such training is usually energy and time consuming, and its efficiency varies with the complexity of the task, the size of the network, and the nonlinearity and connectivity between the nodes. Also, it is conducted offline using digital computers, thus failing to account for the uncertainties and fluctuations that are inevitable in any optical system, especially when the optical networks are complex and involve many free-space parts [14,15].

In an effort to address the above difficulties, the idea of reservoir computing (RC) was proposed and widely explored [16–20]. RC originates from liquid-state machines (LSM) [21] and echo state networks (ESN) [22]. Its realizations are generally composed of three parts: an input layer, a reservoir layer, and an output layer. The input signals are first fed into the input layer, then mapped to the reservoir layer, which contains a large number of interconnected nonlinear nodes and performs a nonlinear transformation. Afterwards, the response is read out as a linear weighted sum of the reservoir states in the output layer. Unlike other kinds of recurrent neural networks that are notoriously hard to train, here only the output weights need to be optimized, making the training much simpler. It is this advantage, along with its success on many time-dependent tasks such as chaotic time series prediction and speech recognition [23–35], that has drawn much attention across different application areas.

The first RC system was implemented using analog electronics with a single nonlinear node and delayed feedback [17]. Since then, digital electronics have been increasingly adopted [36–38]. Among them, field programmable gate array (FPGA) electronics can make the system compact, stable, low-cost, and easily configurable with various commercial systems for real-time information processing. More recently, hybrid opto-electronic feedback systems with FPGAs have enabled the development of a coherent Ising machine for solving computationally hard optimization problems [39], and the construction of large networks of identical nodes with arbitrary topology for studying cluster synchronization, chimera states [40], and laminar chaos [41].

Here, we add to the exciting progress made in the RC field by experimentally demonstrating a fully-packaged opto-electronic RC system consisting of a Mach-Zehnder electro-optic modulator (EOM) and an FPGA circuit. In our design, both the delay line and the filters are implemented digitally within the FPGA, which renders the whole system compact and immune to optical drifts and noise, especially compared to fiber-optical realizations [42–44]. Leveraging the filters inside the FPGA, we achieve richer dynamics and more connections between reservoir nodes. To characterize the system, we run three benchmark tasks: the $10^{th}$-order Non-Linear Auto-Regressive Moving Average test (NARMA-10), the Santa Fe laser data prediction, and isolated spoken digit recognition. All exhibit exceptionally high performance, which indicates the robustness and versatility of this RC system.

The remainder of the paper is organized as follows. Section 2 discusses the basic theory of opto-electronic RC systems using delayed feedback. Section 3 describes the experimental setup, including detailed block-level FPGA implementations. Section 4 presents the experimental results of the three benchmark tests used to evaluate the performance of the RC system. Finally, Section 5 concludes the paper.

2. Theory

Figure 1(a) shows the conventional RC model, which consists of an input, a reservoir, and an output layer. The input layer consists of an input vector u($t$) of length $K$, which is fed into the reservoir via fixed but random weighted input connections. These weights scale the input differently for different reservoir nodes. The reservoir layer contains a large number of recurrent and randomly interconnected nonlinear nodes, and nonlinearly projects the inputs into a high-dimensional state space. The dynamics of the reservoir also exhibit a fading memory, where the current reservoir state is influenced only by the recent past. The dynamics of the states in the reservoir layer are given by

$$x_i(t)=f_{NL}\Big(\sum_k w_{ik} x_k(t-\tau)+\sum_j M_{ij} u_j(t)\Big),$$
where $f_{NL}$ is the nonlinear function, $x_i(t)$ is the reservoir state of the $i^{th}$ node at time $t$, $w_{ik}$ is the node inter-connection matrix, $M_{ij}$ is the input weight matrix, and $u_j(t)$ is the $j^{th}$ input.
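For concreteness, this update rule can be sketched in a few lines of Python; the $\tanh$ nonlinearity, the matrix statistics, and the sizes below are illustrative assumptions rather than parameters of any particular system:

```python
import numpy as np

rng = np.random.default_rng(0)
N, K = 50, 3                     # reservoir nodes and input dimension (illustrative)
W = rng.normal(0, 0.1, (N, N))   # fixed random inter-connection matrix w_ik
M = rng.uniform(-1, 1, (N, K))   # fixed random input weight matrix M_ij

def step(x_prev, u):
    """One reservoir update: x_i(t) = f_NL(sum_k w_ik x_k(t-tau) + sum_j M_ij u_j(t)),
    with tanh standing in for the generic nonlinearity f_NL."""
    return np.tanh(W @ x_prev + M @ u)

# Drive the reservoir with a short random input sequence.
x = np.zeros(N)
for t in range(10):
    x = step(x, rng.uniform(0, 1, K))
```

Only `W` and `M` stay fixed; in RC, training later touches the readout alone.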

Fig. 1. (a) Conventional reservoir computing (RC) model and (b) time-delay based RC model.

At the output layer, the response is read out as a linear weighted sum of the node states, described as

$$\hat{y}_j(t)=\sum_iW_{ij} x_i(t),$$
where $W_{ij}$ are the output weights. They are optimized during training to bring the outputs $\hat {y}_j(t)$ as close as possible to the target values $y_j(t)$, using methods such as linear regression or ridge regression.
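Training the readout thus reduces to a least-squares fit over collected reservoir states. A minimal sketch using closed-form ridge regression on synthetic states; the regularization strength, sizes, and data here are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
N, T = 50, 500                          # nodes and training samples (illustrative)
X = rng.normal(size=(T, N))             # stand-in for collected reservoir states x_i(t)
y = X @ rng.normal(size=N) + 0.01 * rng.normal(size=T)   # synthetic target series

# Closed-form ridge regression: W = (X^T X + lam I)^(-1) X^T y.
lam = 1e-3                              # regularization strength (assumed value)
W_out = np.linalg.solve(X.T @ X + lam * np.eye(N), X.T @ y)
y_hat = X @ W_out                       # readout prediction
```

Only this linear readout is trained; the reservoir itself is left untouched, which is what makes RC training simple.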

The above RC architecture requires a very large number of interconnected physical nodes. In contrast, time-delay based RC uses virtual nodes that are temporally spaced, and has only one nonlinear node, as shown in Fig. 1(b). The nonlinearity is implemented electro-optically, using an EOM. The input is preprocessed before being injected into the nonlinear node, a procedure referred to as masking. The preprocessed input signal is time-multiplexed and injected serially into the reservoir. After preprocessing, the input vector resides within the total delay time $\tau$. The temporal spacing between the $N$ nodes is given by $\theta =\tau /N$, as shown in Fig. 1(b).
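The masking step can be sketched as follows; the binary $\pm 1$ mask and the toy node count are assumptions for illustration (the mask is drawn randomly once and then kept fixed):

```python
import numpy as np

rng = np.random.default_rng(2)
N = 8                                     # virtual nodes (small, for illustration)
mask = rng.choice([-1.0, 1.0], size=N)    # fixed random input mask, one entry per node

def preprocess(u):
    """Mask and time-multiplex the input.

    Each scalar input u[t] is held for one full delay interval tau, scaled
    per virtual node; flattening the masked copies yields the serial stream
    injected into the single nonlinear node, one value per spacing theta = tau/N."""
    u = np.atleast_1d(u)
    return np.outer(u, mask).ravel()

stream = preprocess([0.3, 0.7])           # two input samples -> 2*N serial values
```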

Figure 2 shows the schematic of our digital opto-electronic RC implementation of the delay feedback system. The optical power at the output of the EOM is given by the transfer function

$$P(t) \sim \sin^2[\pi v(t)/(2V_\pi)+\phi],$$
where $v(t)$ is the RF signal applied to the EOM, $V_\pi$ is the $\pi$-shift (half-wave) voltage, and $\phi$ is the bias offset.

Fig. 2. Schematic representation of the digital opto-electronic RC.

The photo-detector generates a voltage in response to the applied laser power, given by

$$V_{det}(t) = \alpha \eta R_t P(t),$$
where $\alpha$ is the total insertion loss of the EOM, $\eta$ is the responsivity of the detector, and $R_t$ is the transimpedance gain of the amplifier. The RF input to the EOM is
$$v(t)=G[s(t) + \gamma u(t)],$$
where $G$ is the overall forward gain of the system, $\gamma$ is the input scaling factor, and $u(t)$ is the input to the reservoir. Here $s(t)$ is the filter output after the delay, i.e., the convolution of the delayed detector output with the filter impulse response $h(t)$:
$$s(t)= h(t)*V_{det}(t-\tau),$$
with $\tau$ the total delay.

Taking all into account, the equation of the states reads

$$x(t)=\beta h(t)*\sin^2 \left[x(t-\tau) + \gamma' u(t-\tau)+\phi \right]$$
with $x(t)=\pi G s(t)/(2V_{\pi})$, $\beta =\pi G \eta R_t\alpha |E_0|^2/(2V_{\pi})$, and $\gamma '=G\gamma /(2V_\pi)$. It is then discretized as
$$x[k] =\beta h[k]*\sin^2 \left(x[k-N] + \gamma' u[k-N]+\phi \right),$$
where $k=t/dt$ is the discrete sample index obtained with sampling period $dt$, and $\tau$ is an integer multiple $N$ of $dt$. For a digital filter, $h[k]$ has a finite length $M$ with $M\leq N$. Thus, Eq. (8) takes the explicit form
$$x[k]=\beta \sum_{j=0}^{M-1} h[j]\sin^2 \left(x[k-N-j] + \gamma' u[k-N-j]+\phi \right).$$

From Eq. (9), we can see that the states are coupled through the filter coefficients. The digital filter thus gives the flexibility to implement different network topologies [42].
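Eq. (9) can be simulated directly. The sketch below assumes an illustrative normalized low-pass FIR kernel for $h[k]$ and arbitrary gain, scaling, and bias values; it is not a model of the exact hardware parameters:

```python
import numpy as np

rng = np.random.default_rng(3)
N, M = 400, 40                     # delay in samples and filter length, M <= N (M illustrative)
beta, gamma_p, phi = 0.6, 0.5, 0.1 * np.pi   # feedback gain, input scaling gamma', bias (assumed)
h = np.hanning(M)
h /= h.sum()                       # assumed normalized low-pass FIR kernel h[j]

T = 2000
u = rng.uniform(0, 0.5, T + N + M)            # input stream u[k]
x = np.zeros(T + N + M)                       # reservoir state x[k]

for k in range(N + M, T + N + M):
    # x[k] = beta * sum_j h[j] * sin^2(x[k-N-j] + gamma' u[k-N-j] + phi)
    idx = k - N - np.arange(M)
    x[k] = beta * np.dot(h, np.sin(x[idx] + gamma_p * u[idx] + phi) ** 2)
```

Because $\sin^2 \le 1$ and $h$ is normalized, the state stays bounded by $\beta$; the filter taps `h[j]` are exactly the coefficients that couple the virtual nodes.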

3. Experimental setup

The block diagram of our experimental setup is depicted in Fig. 3(a). A fiber-coupled laser diode (JDS Uniphase CQF975/58) provides a continuous-wave beam at $\sim$1550 nm, which is passed through a fiber polarization controller (FPC) before coupling to an EOM (Lucent Technologies, 2623NA). The EOM takes its RF input from an arbitrary waveform generated by an RF digital-to-analog converter (DAC) and modulates the laser intensity. The modulated intensity is converted to an electrical signal by a photodetector (Thorlabs, DET10C2), whose output is digitized by an analog-to-digital converter (ADC) with a 1 Msps sampling rate controlled by the FPGA. A personal computer (PC) provides the preprocessed data for this RC system via an Ethernet interface. Figure 3(b) shows the FPGA blocks implemented on a Zedboard (Zynq-7000) development platform, with the RC logic written in Verilog. The FPGA interfaces to the ADC (Maxim Integrated MAX11198, 16 bits) and the RF DAC (Analog Devices D5541A, 16 bits) over a Serial Peripheral Interface. The PC preprocesses the input signal with the mask and input scaling factor, then sends it to the Zedboard via Ethernet. The Processing System (PS) on the Zedboard uses Direct Memory Access (DMA) to stream the input to the FPGA logic. The feedback signal from the filter block and the streamed input are summed and passed through a programmable gain-and-offset block before being converted to an electrical signal by the DAC; the gain and offset are key parameters for tuning the RC performance. Similarly, the streamed data from the ADC is fed into a programmable delay block whose delay is limited to 1000 units, each unit representing one virtual-node spacing. The delayed signal then passes through a finite-impulse-response digital bandpass filter with 400 taps. The filtered data implements Eq. (9) and carries the state information, which the PS transmits back to the PC via Ethernet. A calibration DAC (Maxim Integrated MAX5316, 16 bits) compensates for the bias drift of the EOM.

Fig. 3. (a) Experimental setup for the present opto-electronic RC and (b) the block diagram of the hardware implementation using the FPGA board and ADC/DAC.

4. Results

4.1 NARMA10

NARMA is one of the most widely used benchmarks in RC [45]. NARMA10 is a discrete-time nonlinear task with a $10^{th}$-order lag. The series $y_{k+1}$ is generated through the recursive formula

$$y_{k+1} =0.3y_k+0.05y_k \sum_{i=0}^{9}y_{k-i} + 1.5u_k u_{k-9}+0.1.$$
The input $u_k$ is drawn from a uniform distribution on the interval $[0,0.5]$. Due to its nonlinearity and long time lag, NARMA10 poses a challenge for any computational system.
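The recursion above is straightforward to generate numerically; a minimal sketch (the seed and series length are arbitrary choices):

```python
import numpy as np

def narma10(T, seed=0):
    """Generate a NARMA-10 series y and its driving input u of length T."""
    rng = np.random.default_rng(seed)
    u = rng.uniform(0, 0.5, T)            # input drawn uniformly from [0, 0.5]
    y = np.zeros(T)
    for k in range(9, T - 1):
        # y[k+1] = 0.3 y[k] + 0.05 y[k] * sum_{i=0}^{9} y[k-i] + 1.5 u[k] u[k-9] + 0.1
        y[k + 1] = (0.3 * y[k]
                    + 0.05 * y[k] * np.sum(y[k - 9:k + 1])
                    + 1.5 * u[k] * u[k - 9]
                    + 0.1)
    return u, y

u, y = narma10(1000)
```

Every term on the right-hand side is non-negative, so after the 10-step transient the series stays at or above the 0.1 offset.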

To characterize the performance of the RC, the normalized root mean square error (NRMSE) between the target and predicted values is calculated as

$$\textrm{NRMSE} =\sqrt{\frac{1}{m}\frac{\sum_{k=0}^{m}(\hat{y}_{k}-y_{k})^2}{\sigma^2(y_{k})}},$$
where $y_{k}$ is the target, $\hat {y}_k$ is the prediction, $m$ is the total number of samples in the target, and $\sigma$ denotes the standard deviation of the target.
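A direct transcription of this metric:

```python
import numpy as np

def nrmse(y, y_hat):
    """Root mean square error normalized by the target's standard deviation:
    sqrt(mean((y_hat - y)^2) / var(y))."""
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    return np.sqrt(np.mean((y_hat - y) ** 2) / np.var(y))

y = np.array([0.1, 0.2, 0.3, 0.4])
perfect = nrmse(y, y)                       # a perfect prediction gives 0
baseline = nrmse(y, np.full(4, y.mean()))   # predicting the mean gives 1
```

The normalization makes 1.0 the score of the trivial mean predictor, so values well below 1 (such as 0.142 here) indicate genuine predictive power.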

By sweeping the system parameters, the optimum operating point is identified at gain $G =0.58$, input scaling $\gamma =0.5$, bias $\phi =0.1\pi$, and a total of $N=400$ virtual nodes. The benchmark results are tabulated in Table 1. Figure 4 plots the target vs. prediction for 2000 samples. Also plotted is the residue $\textbf{R}$, defined as the difference between target and estimate normalized by the mean of the target:

$$\textbf{R}[k]=m(y_{k}-\hat{y}_{k})/\sum_k y_{k}.$$
As seen, the present RC performs remarkably well in long-term prediction. For 25,000 samples with a training size of 5200, the mean NRMSE is 0.148.

Fig. 4. NARMA10 benchmark results: (a) the amplitude of the target (red line) and the predicted signal (green line) vs. sample index for 2000 samples; (b) the normalized residue of the target and predicted signal.


Table 1. NARMA10 benchmark results with 400 nodes.

4.2 Santa Fe laser data prediction

Data set A from the 1994 time series prediction competition organized by the Santa Fe Institute is a time series obtained by measuring an NH$_3$ laser, and is a good example of realistic data [46]. The Santa Fe laser data task is a one-step-ahead series prediction, and we use 4000 data points in our test case. The performance is measured using the NMSE.

By sweeping the system parameters, the optimum operating point is identified at gain $G =0.58$, input scaling $\gamma =0.0029$, bias $\phi =0.1\pi$, and a total of $N=400$ virtual nodes for the 4000 data points. The results of the experiment are listed in Table 2. Figure 5 plots the target and prediction, along with the residue, for the first 1000 samples.

Fig. 5. Santa Fe benchmark results: (a) the amplitude of the target (red line) and the predicted signal (green line) vs. sample index; (b) the normalized residue of target and prediction.


Table 2. Santa Fe laser data single-step benchmark results.

4.3 Isolated spoken digit recognition

Speech recognition is a commonly used benchmark for testing the performance of neural networks. Acoustic features are more pronounced in the frequency domain than in the time domain. Hence the input audio data are first pre-processed by decomposing the time-domain signal into frequency-time information. In our implementation, we use Lyon's cochlear ear model to obtain a cochleagram, which mimics the filtering that occurs in nature [47].

The input audio data for this experiment are from the AudioMNIST database [48], a free and open database containing 30,000 audio samples of spoken digits (0-9) from 60 different speakers, recorded at a 48 kHz sampling rate. To use these audio files with our RC, we first down-sample the audio to 12 kHz using the librosa package [49] and pad each file with zeros of random length at the beginning and end to reach a total sample size of $12{,}000$. Next we generate the cochleagram as shown in Fig. 6(b), calculated using the "lyon 1.0.0" Python package. In this test, we consider the first 5 speakers from the AudioMNIST database to create 20 balanced subsets, each containing 5 speakers $\times$ 10 digits $\times$ 2 utterances, i.e., 100 audio samples. Each training and testing dataset contains 5 such subsets, with the first 4 used for training and the last one for testing. To obtain unbiased results, 50 datasets are formed using random combinations of 5 of the 20 subsets, and the word error rate (WER) is measured for each combination.
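The train/test subset construction described above can be sketched as follows; this is pure bookkeeping, with the audio loading and cochleagram generation via librosa and lyon omitted:

```python
import numpy as np

rng = np.random.default_rng(4)
n_subsets = 20            # 20 balanced subsets, each 5 speakers x 10 digits x 2 utterances
subsets = np.arange(n_subsets)

# Form 50 datasets, each a random draw of 5 distinct subsets:
# the first 4 train the readout, the remaining one tests it.
datasets = []
for _ in range(50):
    pick = rng.choice(subsets, size=5, replace=False)
    datasets.append({"train": pick[:4], "test": pick[4:]})
```

Measuring the WER once per dataset and averaging over the 50 draws removes any bias from a particular subset ordering.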

Fig. 6. Graphical illustration of the isolated spoken digit recognition task. (a) Uniformly distributed input mask with values in [−1,1] and dimension $N \times N_{ch}$, where $N=400$ nodes and $N_{ch}=77$. (b) Cochleagram of dimension $N_{ch} \times N_T$ generated from the audio file for digit 9. (c) The product of the input mask and the cochleagram is serialized and injected into the reservoir. (d) Output-layer weights of dimension $N_{d} \times N$, where the number of output classes is $N_{d}=10$. (e) The reservoir output is serially captured and reshaped to $N \times N_T$, giving the node-state matrix. (f) The product of the output weights with the reservoir state matrix gives the estimation matrix. The output class is predicted as the maximum argument of the row-wise mean of the estimation matrix.

Figure 6 illustrates the flow of the spoken digit recognition task. The input mask is a real-valued matrix of dimension $N \times N_{ch}$ drawn from a uniform distribution on [−1,1], where $N$ is the number of virtual nodes and $N_{ch}$ is the number of cochleagram channels. Figure 6(b) shows the cochleagram for digit 9, of dimension $N_{ch} \times N_T$, where $N_T$ indexes the new time representation and depends on the decimation factor. As shown in Fig. 6(c), the product of the input mask and the cochleagram is flattened and serially injected into the reservoir. The output of the reservoir is de-serialized to obtain the reservoir state matrix of dimension $N \times N_T$, as shown in Fig. 6(e). The estimation matrix is calculated by multiplying the output weight matrix with the reservoir state matrix, and the output class is predicted as the maximum argument of the row-wise mean of the estimation matrix [50].
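The mask-multiply, serialize, de-serialize, and readout steps can be sketched end-to-end. Here a random matrix stands in for a real cochleagram, an element-wise $\tanh$ stands in for the physical reservoir, and the output weights are random rather than trained; all three are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(5)
N, N_ch, N_T, N_d = 400, 77, 60, 10       # nodes, cochleagram channels, time steps, classes

mask = rng.uniform(-1, 1, (N, N_ch))      # input mask, as in Fig. 6(a)
coch = rng.random((N_ch, N_T))            # stand-in cochleagram (real data: Lyon's model)

# Mask the cochleagram and flatten column by column into a serial input stream.
stream = (mask @ coch).ravel(order="F")

# Stand-in reservoir response; de-serialize into the N x N_T state matrix of Fig. 6(e).
states = np.tanh(stream).reshape(N, N_T, order="F")

W_out = rng.normal(size=(N_d, N))         # output weights (random here; trained in practice)
est = W_out @ states                      # N_d x N_T estimation matrix, Fig. 6(f)
pred = int(np.argmax(est.mean(axis=1)))   # class = argmax of the row-wise mean
```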

After tuning the system parameters based on their error evaluations, we found the optimum operating point at gain $G =0.5$, input scaling $\gamma =300$, bias $\phi =0.35$, and a total of $N=400$ virtual nodes. The results of the experiment are listed in Table 3.


Table 3. WER for the isolated spoken digit recognition task in the training and testing phases.

5. Conclusions

We have experimentally demonstrated an opto-electronic RC system using an EOM and an FPGA. It takes advantage of the electro-optic nonlinear transformation in the EOM, and of the flexible signal generation, controllable timing, highly programmable signal filtering, and stable delay logic in the FPGA. The resultant system is a stable and fully functional reservoir computer that supports many nodes and high connectivity, is easily configurable for multifaceted tasks, and allows online training. We have tested it with three benchmark tasks: NARMA-10, Santa Fe laser prediction, and isolated spoken digit recognition. It achieves an NRMSE of 0.142 for NARMA-10, an NMSE of 6.73$\times 10^{-3}$ for Santa Fe, and a WER of $\sim$0.34% for the speech recognition. These results are compared with the state of the art in Table 4. As seen, the present RC system tops the prediction accuracy in the first two tests, and is the only reported system that performs well in all three tasks. Moreover, it is able to accurately predict 25,000 steps of the NARMA-10 series with an impressively low NRMSE of 0.148. Such high performance across these varied benchmarks clearly demonstrates the advantages of our system for complex and versatile tasks.

Our system currently operates at a low speed, mostly limited by the sampling rate of the ADC and the settling time of the DAC. A significant speedup is expected by choosing a faster ADC and DAC. It is also promising to replace the existing bulk-optical EOM with photonic integrated chips, where sophisticated and tailored nonlinear transformations can be realized using nested optical circuits on a single chip [56,57]. Finally, an FPGA-based opto-electronic RC system of this or similar design is easily reconfigurable to accommodate even more neurons, higher connectivity, and arbitrary topology. Also, the performance and robustness of the present RC could be further improved by online training. These upgrades will invite important applications in weather and financial forecasting, real-time information processing, and so on.


Table 4. Performance metric comparison of various RC systems. AWG: arbitrary waveform generator; DAQ: data acquisition.

Disclosures

The authors declare no conflicts of interest.

References

1. Q. Liu, L. Wang, A. G. Frutos, A. E. Condon, R. M. Corn, and L. M. Smith, “DNA computing on surfaces,” Nature 403(6766), 175–179 (2000). [CrossRef]  

2. P. McMahon, A. Marandi, Y. Haribara, R. Hamerly, C. Langrock, S. Tamate, T. Inagaki, H. Takesue, S. Utsunomiya, K. Aihara, R. Byer, M. Fejer, H. Mabuchi, and Y. Yamamoto, “A fully programmable 100-spin coherent ising machine with all-to-all connections,” Science 354(6312), 614–617 (2016). [CrossRef]  

3. R. A. Nawrocki, R. M. Voyles, and S. E. Shaheen, “A mini review of neuromorphic architectures and implementations,” IEEE Trans. Electron Devices 63(10), 3819–3829 (2016). [CrossRef]  

4. F. Amato, A. López, E. M. Peña-Méndez, P. Vaňhara, A. Hampl, and J. Havel, “Artificial neural networks in medical diagnosis,” J. Appl. Biomed. 11(2), 47–58 (2013). [CrossRef]  

5. S. Kumar, H. Zhang, and Y.-P. Huang, “Large-scale ising emulation with four body interaction and all-to-all connections,” Commun. Phys. 3(1), 108 (2020). [CrossRef]  

6. A. Krizhevsky, I. Sutskever, and G. Hinton, “Imagenet classification with deep convolutional neural networks,” Neural Information Processing Systems 25 (2012).

7. Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature 521(7553), 436–444 (2015). [CrossRef]  

8. V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, “Human-level control through deep reinforcement learning,” Nature 518(7540), 529–533 (2015). [CrossRef]  

9. D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, “Mastering the game of go with deep neural networks and tree search,” Nature 529(7587), 484–489 (2016). [CrossRef]  

10. Y. Shen, N. C. Harris, S. Skirlo, M. Prabhu, T. Baehr-Jones, M. Hochberg, X. Sun, S. Zhao, H. Larochelle, D. Englund, and M. Soljacic, “Deep learning with coherent nanophotonic circuits,” Nat. Photonics 11(7), 441–446 (2017). [CrossRef]  

11. J. Feldmann, N. Youngblood, C. D. Wright, H. Bhaskaran, and W. H. P. Pernice, “All-optical spiking neurosynaptic networks with self-learning capabilities,” Nature 569(7755), 208–214 (2019). [CrossRef]  

12. X. Sui, Q. Wu, J. Liu, Q. Chen, and G. Gu, “A review of optical neural networks,” IEEE Access 8, 70773–70783 (2020). [CrossRef]  

13. T. Bu, S. Kumar, H. Zhang, I. Huang, and Y.-P. Huang, “Single-pixel pattern recognition with coherent nonlinear optics,” Opt. Lett. 45(24), 6771–6774 (2020). [CrossRef]  

14. Y. Zuo, B. Li, Y. Zhao, Y. Jiang, Y.-C. Chen, P. Chen, G.-B. Jo, J. Liu, and S. Du, “All-optical neural network with nonlinear activation functions,” Optica 6(9), 1132–1137 (2019). [CrossRef]  

15. T. Zhou, L. Fang, T. Yan, J. Wu, Y. Li, J. Fan, H. Wu, X. Lin, and Q. Dai, “In situ optical backpropagation training of diffractive optical neural networks,” Photonics Res. 8(6), 940–953 (2020). [CrossRef]  

16. D. Verstraeten, B. Schrauwen, M. D’Haene, and D. Stroobandt, “An experimental unification of reservoir computing methods,” Neural networks : official journal Int. Neural Netw. Soc. 20(3), 391–403 (2007). [CrossRef]  

17. L. Appeltant, M. C. Soriano, G. Van der Sande, J. Danckaert, S. Massar, J. Dambre, B. Schrauwen, C. R. Mirasso, and I. Fischer, “Information processing using a single dynamical node as complex system,” Nat. Commun. 2(1), 468 (2011). [CrossRef]  

18. P. Antonik, M. Haelterman, and S. Massar, “Brain-Inspired Photonic Signal Processor for Generating Periodic Patterns and Emulating Chaotic Systems,” Phys. Rev. Appl. 7(5), 054014 (2017). [CrossRef]  

19. D. Brunner, B. Penkovsky, B. A. Marquez, M. Jacquot, I. Fischer, and L. Larger, “Tutorial: Photonic neural networks in delay systems,” J. Appl. Phys. 124(15), 152004 (2018). [CrossRef]  

20. Y. K. Chembo, “Machine learning based on reservoir computing with time-delayed optoelectronic and photonic systems,” Chaos 30(1), 013111 (2020). [CrossRef]  

21. W. Maass, T. Natschläger, and H. Markram, “Real-time computing without stable states: A new framework for neural computation based on perturbations,” Neural Comput. 14(11), 2531–2560 (2002). [CrossRef]  

22. H. Jaeger and H. Haas, “Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication,” Science 304(5667), 78–80 (2004). [CrossRef]  

23. K. Vandoorne, W. Dierckx, B. Schrauwen, D. Verstraeten, R. Baets, P. Bienstman, and J. Van Campenhout, “Toward optical signal processing using Photonic Reservoir Computing,” Opt. Express 16(15), 11182 (2008). [CrossRef]  

24. K. Vandoorne, J. Dambre, D. Verstraeten, B. Schrauwen, and P. Bienstman, “Parallel Reservoir Computing Using Optical Amplifiers,” IEEE Trans. Neural Netw. 22(9), 1469–1481 (2011). [CrossRef]  

25. F. Duport, B. Schneider, A. Smerieri, M. Haelterman, and S. Massar, “All-optical reservoir computing,” Opt. Express 20(20), 22783 (2012). [CrossRef]  

26. D. Brunner, M. C. Soriano, C. R. Mirasso, and I. Fischer, “Parallel photonic information processing at gigabyte per second data rates using transient states,” Nat. Commun. 4(1), 1364 (2013). [CrossRef]  

27. C. Mesaritakis, V. Papataxiarhis, and D. Syvridis, “Micro ring resonators as building blocks for an all-optical high-speed reservoir-computing bit-pattern-recognition system,” J. Opt. Soc. Am. B 30(11), 3048 (2013). [CrossRef]  

28. A. Dejonckheere, F. Duport, A. Smerieri, L. Fang, J.-L. Oudar, M. Haelterman, and S. Massar, “All-optical reservoir computer based on saturation of absorption,” Opt. Express 22(9), 10868 (2014). [CrossRef]  

29. T. Yamane, Y. Katayama, R. Nakane, G. Tanaka, and D. Nakano, “Wave-Based Reservoir Computing by Synchronization of Coupled Oscillators,” in Neural Information Processing, vol. 9491S. Arik, T. Huang, W. K. Lai, and Q. Liu, eds. (Springer International Publishing, Cham, 2015), pp. 198–205. Series Title: Lecture Notes in Computer Science.

30. C. Mesaritakis, A. Bogris, A. Kapsalis, and D. Syvridis, “High-speed all-optical pattern recognition of dispersive Fourier images through a photonic reservoir computing subsystem,” Opt. Lett. 40(14), 3416 (2015). [CrossRef]  

31. L. Larger, A. Baylón-Fuentes, R. Martinenghi, V. S. Udaltsov, Y. K. Chembo, and M. Jacquot, “High-Speed Photonic Reservoir Computing Using a Time-Delay-Based Architecture: Million Words per Second Classification,” Phys. Rev. X 7(1), 011015 (2017). [CrossRef]  

32. G. Tanaka, R. Nakane, T. Yamane, S. Takeda, D. Nakano, S. Nakagawa, and A. Hirose, “Waveform classification by memristive reservoir computing,” in Neural Information Processing, vol. 10637D. Liu, S. Xie, Y. Li, D. Zhao, and E.-S. M. El-Alfy, eds. (Springer International Publishing, Cham, 2017), pp. 457–465. Series Title: Lecture Notes in Computer Science.

33. G. Van der Sande, D. Brunner, and M. C. Soriano, “Advances in photonic reservoir computing,” Nanophotonics 6(3), 561–576 (2017). [CrossRef]  

34. Y. Hou, G. Xia, W. Yang, D. Wang, E. Jayaprasath, Z. Jiang, C. Hu, and Z. Wu, “Prediction performance of reservoir computing system based on a semiconductor laser subject to double optical feedback and optical injection,” Opt. Express 26(8), 10211 (2018). [CrossRef]  

35. A. Lugnan, A. Katumba, F. Laporte, M. Freiberger, S. Sackesyn, C. Ma, E. Gooskens, J. Dambre, and P. Bienstman, “Photonic neuromorphic information processing and reservoir computing,” APL Photonics 5(2), 020901 (2020). [CrossRef]  

36. D. Canaday, A. Griffith, and D. J. Gauthier, “Rapid time series prediction with a hardware-based reservoir computer,” Chaos: An Interdiscip. J. Nonlinear Sci. 28(12), 123119 (2018). [CrossRef]  

37. B. Penkovsky, L. Larger, and D. Brunner, “Efficient design of hardware-enabled reservoir computing in fpgas,” J. Appl. Phys. 124(16), 162101 (2018). [CrossRef]  

38. I. Estébanez, I. Fischer, and M. C. Soriano, “Constructive role of noise for high-quality replication of chaotic attractor dynamics using a hardware-based reservoir computer,” Phys. Rev. Appl. 12(3), 034058 (2019). [CrossRef]  

39. F. Böhm, G. Verschaffelt, and G. Van der Sande, “A poor man’s coherent ising machine based on opto-electronic feedback systems for solving optimization problems,” Nat. Commun. 10(1), 3538 (2019). [CrossRef]  

40. J. D. Hart, D. C. Schmadel, T. E. Murphy, and R. Roy, “Experiments with arbitrary networks in time-multiplexed delay systems,” Chaos 27(12), 121103 (2017). [CrossRef]  

41. J. D. Hart, R. Roy, D. Müller-Bender, A. Otto, and G. Radons, “Laminar chaos in experiments: Nonlinear systems with time-varying delays and noise,” Phys. Rev. Lett. 123(15), 154101 (2019). [CrossRef]  

42. L. Larger, M. C. Soriano, D. Brunner, L. Appeltant, J. M. Gutierrez, L. Pesquera, C. R. Mirasso, and I. Fischer, “Photonic information processing beyond Turing: an optoelectronic implementation of reservoir computing,” Opt. Express 20(3), 3241 (2012). [CrossRef]  

43. F. Duport, A. Smerieri, A. Akrout, M. Haelterman, and S. Massar, “Fully analogue photonic reservoir computer,” Sci. Rep. 6(1), 22381 (2016). [CrossRef]  

44. X. Bao, Q. Zhao, and H. Yin, “Efficient optoelectronic reservoir computing with three-route input based on optical delay lines,” Appl. Opt. 58(15), 4111 (2019). [CrossRef]  

45. H. Jaeger, “Adaptive nonlinear system identification with echo state networks,” in NIPS, (2002).

46. C. Chatfield and A. S. Weigend, “Time series prediction: forecasting the future and understanding the past (N. A. Gershenfeld and A. S. Weigend, eds., Addison-Wesley, Reading, MA, 1994),” Int. J. Forecast. 10(1), 161–163 (1994). [CrossRef]  

47. R. Lyon, “A computational model of filtering, detection, and compression in the cochlea,” in ICASSP ’82. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 7 (1982), pp. 1282–1285.

48. S. Becker, M. Ackermann, S. Lapuschkin, K.-R. Müller, and W. Samek, “Interpreting and explaining deep neural networks for classification of audio signals,” CoRR abs/1807.03418 (2018).

49. B. McFee, V. Lostanlen, A. Metsai, M. McVicar, S. Balke, C. Thomé, C. Raffel, F. Zalkow, A. Malek, Dana, K. Lee, O. Nieto, J. Mason, D. Ellis, E. Battenberg, S. Seyfarth, R. Yamamoto, K. Choi, viktorandreevichmorozov, J. Moore, R. Bittner, S. Hidaka, Z. Wei, nullmightybofo, D. Hereñú, F.-R. Stöter, P. Friesch, A. Weiss, M. Vollrath, and T. Kim, “librosa/librosa: 0.8.0,” (2020).

50. F. Abreu Araujo, M. Riou, J. Torrejon, S. Tsunegi, D. Querlioz, K. Yakushiji, A. Fukushima, H. Kubota, S. Yuasa, M. D. Stiles, and J. Grollier, “Role of non-linear data processing on speech recognition task in the framework of reservoir computing,” Sci. Rep. 10(1), 328 (2020). [CrossRef]  

51. F. Duport, A. Smerieri, A. Akrout, M. Haelterman, and S. Massar, “Virtualization of a photonic reservoir computer,” J. Lightwave Technol. 34(9), 2085–2091 (2016). [CrossRef]  

52. M. Hermans, P. Antonik, M. Haelterman, and S. Massar, “Embodiment of learning in electro-optical signal processors,” Phys. Rev. Lett. 117(12), 128301 (2016). [CrossRef]  

53. M. C. Soriano, S. Ortín, D. Brunner, L. Larger, C. R. Mirasso, I. Fischer, and L. Pesquera, “Optoelectronic reservoir computing: tackling noise-induced performance degradation,” Opt. Express 21(1), 12 (2013). [CrossRef]  

54. R. Martinenghi, S. Rybalko, M. Jacquot, Y. K. Chembo, and L. Larger, “Photonic Nonlinear Transient Computing with Multiple-Delay Wavelength Dynamics,” Phys. Rev. Lett. 108(24), 244101 (2012). [CrossRef]  

55. Y. Paquot, F. Duport, A. Smerieri, J. Dambre, B. Schrauwen, M. Haelterman, and S. Massar, “Optoelectronic Reservoir Computing,” Sci. Rep. 2(1), 287 (2012). [CrossRef]  

56. J.-Y. Chen, Z.-H. Ma, Y. M. Sua, Z. Li, C. Tang, and Y.-P. Huang, “Ultra-efficient frequency conversion in quasi-phase-matched lithium niobate microrings,” Optica 6(9), 1244 (2019). [CrossRef]  

57. M. Jin, J.-Y. Chen, Y. M. Sua, and Y.-P. Huang, “High-extinction electro-optic modulation on lithium niobate thin film,” Opt. Lett. 44(5), 1265–1268 (2019). [CrossRef]  

Figures (6)

Fig. 1. (a) Conventional reservoir computing (RC) model and (b) time-delay based RC model.
Fig. 2. Schematic representation of the digital opto-electronic RC.
Fig. 3. (a) Experimental setup for the present opto-electronic RC and (b) block diagram of the hardware implementation using the FPGA board and ADC/DAC.
Fig. 4. NARMA10 benchmark results: (a) amplitude of the target (red line) and the predicted signal (green line) versus sample index for 2000 samples, and (b) normalized residue between target and prediction.
Fig. 5. Santa Fe benchmark results: (a) amplitude of the target (red line) and the predicted signal (green line) versus sample index, and (b) normalized residue between target and prediction.
Fig. 6. Graphical illustration of the isolated spoken digit recognition task. (a) Uniformly distributed input mask with values in the range [−1, 1] and dimension $N \times N_{ch}$, where $N = 400$ nodes and $N_{ch} = 77$. (b) Cochleagram of dimension $N_{ch} \times N_T$ generated from the audio file for digit 9. (c) The product of the input mask and the cochleagram is serialized and injected into the reservoir. (d) Output-layer weights of dimension $N_{d} \times N$, where the number of output classes is $N_{d} = 10$. (e) The reservoir output is serially captured and reshaped to $N \times N_T$, giving the node (state) matrix. (f) The product of the output weights and the reservoir state matrix gives the estimation matrix. The output class is predicted by taking the maximum argument of the row-wise mean of the estimation matrix.
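The pipeline described in the Fig. 6 caption can be sketched numerically as follows. This is a minimal illustration with random stand-in data: the mask, cochleagram, reservoir response (a simple tanh here), and readout weights are all hypothetical placeholders, whereas the experiment uses real cochleagrams and a trained readout.

```python
import numpy as np

rng = np.random.default_rng(0)
N, N_ch, N_T, N_d = 400, 77, 60, 10      # nodes, channels, time frames, classes

mask = rng.uniform(-1.0, 1.0, (N, N_ch))  # (a) input mask with values in [-1, 1]
cochleagram = rng.random((N_ch, N_T))     # (b) stand-in for a real cochleagram

# (c) mask x cochleagram gives an N x N_T matrix, serialized into the reservoir;
# (e) a tanh stands in for the captured reservoir state matrix of the same shape.
states = np.tanh(mask @ cochleagram)

W_out = rng.standard_normal((N_d, N))     # (d) readout weights (untrained here)
estimation = W_out @ states               # (f) N_d x N_T estimation matrix
predicted_class = int(np.argmax(estimation.mean(axis=1)))  # row-wise mean, argmax
```

The argmax over the time-averaged estimation rows implements the class decision described in panel (f).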

Tables (4)

Table 1. NARMA10 benchmark results with 400 nodes.

Table 2. Santa Fe laser data single step benchmark results.

Table 3. WER for the isolated spoken digit recognition task in the training and testing phases.

Table 4. Performance metric comparison of various RC systems. AWG: Arbitrary Waveform Generator; DAQ: Data Acquisition.

Equations (12)

$$x_i(t) = f_{NL}\Big(\sum_k w_{ik}\, x_k(t-\tau) + \sum_j M_{ij}\, u_j(t)\Big),$$
$$\hat{y}_j(t) = \sum_i W_{ij}\, x_i(t),$$
$$P(t) \propto \sin^2\!\big[\pi v(t)/(2V_\pi) + \phi\big],$$
$$V_{\mathrm{det}}(t) = \alpha \eta R_t\, P(t),$$
$$v(t) = G\,[s(t) + \gamma u(t)],$$
$$s(t) = h(t) \ast V_{\mathrm{det}}(t-\tau),$$
$$x(t) = \beta\, h(t) \ast \sin^2\!\big[x(t-\tau) + \gamma u(t-\tau) + \phi\big],$$
$$x[k] = \beta\, h[k] \ast \sin^2\!\big(x[k-N] + \gamma u[k-N] + \phi\big),$$
$$x[k] = \beta \sum_{j=0}^{M-1} h[j]\, \sin^2\!\big(x[k-N-j] + \gamma u[k-N-j] + \phi\big).$$
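The final discrete-time equation, $x[k] = \beta \sum_{j=0}^{M-1} h[j]\, \sin^2(x[k-N-j] + \gamma u[k-N-j] + \phi)$, can be iterated directly. The sketch below does so with illustrative parameter values (the filter taps $h$, gain $\beta$, input scaling $\gamma$, offset $\phi$, and delay $N$ are placeholders, not the experimental settings):

```python
import numpy as np

def run_reservoir(u, h, beta=0.9, gamma=0.5, phi=0.2, N=8):
    """Iterate x[k] = beta * sum_j h[j] * sin^2(x[k-N-j] + gamma*u[k-N-j] + phi),
    assuming zero initial conditions (x and u are taken as 0 before index 0)."""
    x = np.zeros(len(u))
    for k in range(len(u)):
        acc = 0.0
        for j, hj in enumerate(h):
            idx = k - N - j
            if idx >= 0:
                acc += hj * np.sin(x[idx] + gamma * u[idx] + phi) ** 2
        x[k] = beta * acc
    return x

u = np.random.default_rng(0).uniform(0.0, 0.5, 200)
states = run_reservoir(u, h=[0.6, 0.3, 0.1])
```

Because $\sin^2(\cdot) \le 1$, the states remain bounded by $\beta \sum_j h[j]$, which keeps the iteration stable for these parameter choices.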
$$y_{k+1} = 0.3\,y_k + 0.05\,y_k \sum_{i=0}^{9} y_{k-i} + 1.5\,u_k u_{k-9} + 0.1.$$
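The NARMA-10 recurrence above can be generated as follows. The driving input $u$ drawn uniformly from $[0, 0.5]$ is the convention commonly used for this benchmark and is assumed here; the source does not state its input distribution explicitly.

```python
import numpy as np

def narma10(K, seed=0):
    """Generate a NARMA-10 target sequence of length K from the recurrence
    y[k+1] = 0.3 y[k] + 0.05 y[k] * sum_{i=0}^{9} y[k-i] + 1.5 u[k] u[k-9] + 0.1,
    with input u ~ Uniform[0, 0.5] (assumed convention)."""
    rng = np.random.default_rng(seed)
    u = rng.uniform(0.0, 0.5, K)
    y = np.zeros(K)
    for k in range(9, K - 1):
        y[k + 1] = (0.3 * y[k]
                    + 0.05 * y[k] * y[k - 9:k + 1].sum()  # sum of last 10 outputs
                    + 1.5 * u[k] * u[k - 9]
                    + 0.1)
    return u, y

u, y = narma10(1000)
```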
$$\mathrm{NRMSE} = \sqrt{\frac{1}{m}\sum_{k=0}^{m} \frac{(\hat{y}_k - y_k)^2}{\sigma^2(y_k)}},$$
$$R[k] = m\,(y_k - \hat{y}_k)\Big/\sum_k y(k).$$
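Both error measures admit a direct implementation. In the sketch below, the residue's normalization $\sum_k y(k)/m$ is interpreted as the mean of the target sequence, which follows the reconstruction above:

```python
import numpy as np

def nrmse(y_hat, y):
    """Normalized root mean square error: RMSE divided by the target std."""
    y_hat, y = np.asarray(y_hat, float), np.asarray(y, float)
    return float(np.sqrt(np.mean((y_hat - y) ** 2) / np.var(y)))

def residue(y_hat, y):
    """Per-sample residue R[k] = m*(y[k] - y_hat[k]) / sum_k y[k],
    i.e. the error normalized by the target mean."""
    y_hat, y = np.asarray(y_hat, float), np.asarray(y, float)
    return len(y) * (y - y_hat) / y.sum()

y = np.array([1.0, 2.0, 3.0])
```

A constant prediction equal to the target mean gives NRMSE = 1, which is why values well below 1 (such as the 0.142 reported for NARMA-10) indicate genuine predictive power.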