Casey Stella presents on using natural language processing (NLP) techniques like word2vec to analyze structured medical data. Clinical encounters can be viewed as "sentences" with vital signs, labs, procedures, diagnoses and drugs as "words". Word2vec is applied to a dataset of 197,340 clinical records to learn relationships between these clinical concepts without domain expertise. The results are explored using Spark and IPython Notebook on Hadoop.