This paper surveys the improvement of artificial neural networks (ANN) performance through parallel programming using GPU and FPGA architectures. It discusses various training strategies, evaluates different approaches based on speed and performance, and highlights the advantages of GPU over CPU for tasks like pattern recognition and clustering. The document also examines historical developments in ANN and outlines configurations and methods for effective parallelization in neuron training and execution phases.