The present paper documents the research towards the development of an efficient algorithm to compute the result from a multiple-input-single-output Neural Network using floating-point arithmetic on FPGA. The proposed algorithm focus on optimizing pipeline delays by splitting the "Multiply and accumulate" algorithm into separate steps using partial products. It is a revisit of the classical algorithm for NN computation, able to overcome the main computation bottleneck in FPGA environment. The proposed algorithm can be implemented into an architecture that fully exploits the pipeline performance of the floating-point arithmetic blocks, thus allowing a very fast computation for the neural network. The performance of the proposed architecture is presented using as target a Cyclone II FPGA Device.
Laudani, A., Lozito G., M., RIGANTI FULGINEI, F., Salvini, A. (2014). An Efficient Architecture for Floating Point based MISO Neural Neworks on FPGA. In Proceedings of the 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation (pp.12-17). Cambridge : Davide AlDabas et al. [10.1109/UKSim.2014.15].
An Efficient Architecture for Floating Point based MISO Neural Neworks on FPGA
LAUDANI, ANTONINO;Lozito G. M;RIGANTI FULGINEI, Francesco;SALVINI, Alessandro
2014-01-01
Abstract
The present paper documents the research towards the development of an efficient algorithm to compute the result from a multiple-input-single-output Neural Network using floating-point arithmetic on FPGA. The proposed algorithm focus on optimizing pipeline delays by splitting the "Multiply and accumulate" algorithm into separate steps using partial products. It is a revisit of the classical algorithm for NN computation, able to overcome the main computation bottleneck in FPGA environment. The proposed algorithm can be implemented into an architecture that fully exploits the pipeline performance of the floating-point arithmetic blocks, thus allowing a very fast computation for the neural network. The performance of the proposed architecture is presented using as target a Cyclone II FPGA Device.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.