big data big data technology warsaw summit hadoop machine learning apache flink kafka spark allegro apache kafka streaming stream processing postgres ing coreintel google bigquery 1200 node cluster hadoop cluster spark streaming criteo ucs director cisco aci (application centric infrastructure) flink-as-a-service sql apache cassandra sentione elasticsearch cluster deep water elasticsearch development unstructured data hpe controlpoint sentiment analysis clickstream backend events algorytm ml numpy theano - cpu vs gpu clean code naspers group avito analytics etsy bootstrapping scala statistics simplicity real-time processing mapreduce jupyter dc/os apache mesos kubernetes docker swarm fandom airflow cisco alterdata real-time big data analytics c-store dbms cisco ucs (unified computing system) intent adversarial examples apache nifi fraud detection realtime processing spark sql spark ml machine learning algorithms a/b testing hbase cassandra data science data sciencist teamwork work skills uber sas viya factorization machines recommendation system sparse data bpa bpa summit rpa automation business process automation robotics business process process automation robotic process automation cassandra clusters spotify entreprise adoption hadoop integration in bi ecosystem scaling solutions in enterprise data teams organization analytics workflow structured streaming snappy in-memory vespa recommendations targeting search nlp news private cloud google compute platform migration hybrid platforms privacy gdpr data pipelines data engineering data processing engine event sourcing security
See more