This document describes the design and implementation of a double precision floating point unit (FPU) for optimized speed. It discusses the need for high-speed arithmetic operations in applications such as digital signal processing. It presents the architecture of the proposed FPU, which includes blocks for floating point multiplication and addition. It also discusses the implementation of pipelined 64-bit floating point multiplication and addition units using techniques like carry lookahead addition and hybrid multiplication. Simulation results on a FPGA platform show that the proposed pipelined design achieves higher throughput than existing non-pipelined approaches.