This document summarizes genomic big data management, integration and mining. It discusses the exponential growth of biological data due to advances in sequencing technologies. Next generation sequencing techniques generate large amounts of short DNA reads. Several public databases contain heterogeneous biological data sources. Effective data management and integration methods are needed to analyze these large and complex datasets. Supervised machine learning can be used to extract knowledge and classify samples. Tools like CAMUR apply rule-based classification to problems like analyzing gene expression from cancer datasets. Future work involves advanced integration systems and new big data approaches for biological data.