This document discusses genomics workloads and the storage infrastructure required to support them. It begins with an introduction to genomics and the growth of the field, then examines the characteristics of genomic sequencing workloads, including their multi-step processing and file-based nature. It outlines key storage requirements, such as high throughput, large-scale file ingestion, and support for POSIX and other access protocols. The document proposes a software-defined, clustered file system such as IBM Spectrum Scale to provide scalable, high-performance file storage as a building block of a composable infrastructure for genomics applications. It also provides an example architecture and performance results for GATK-based analysis.