The document presents 'Eureka', a system designed to efficiently discover training examples from dispersed data sources, particularly images, leveraging edge computing and an iterative discovery workflow. It emphasizes reducing the human labeling effort while improving domain experts' productivity by utilizing early discard methods and specialized hardware. Eureka significantly outperforms traditional brute-force methods, yielding a substantial reduction in the amount of data needing expert review.