This document discusses using reinforcement learning for beam selection in wireless communication networks. It proposes a simulation environment called "RadioStrike" built in Unreal Engine to generate data and train reinforcement learning agents. The document provides background on machine learning for communications, beam selection techniques, and introduces some basic reinforcement learning concepts. It also outlines strategies for participants in the ITU ML5G challenge to approach the beam selection reinforcement learning problem, including providing sample code and simpler baseline problems to get started.