This presentation includes an intro to bioinformatics with an emphasis on human genome re-sequencing and how Hadoop and Neo4j can be used together to open striking possibilities.
This document provides an overview and objectives of a Python course for big data analytics. It discusses why Python is well-suited for big data tasks due to its libraries like PyDoop and SciPy. The course includes demonstrations of web scraping using Beautiful Soup, collecting tweets using APIs, and running word count on Hadoop using Pydoop. It also discusses how Python supports key aspects of data science like accessing, analyzing, and visualizing large datasets.
Slides from my talk on API Design Patterns at ScalaBay Meetup at Netflix on 09/09/2014.
http://www.meetup.com/Scala-Bay/events/195982742/
This document discusses API anti-patterns, which are commonly occurring solutions to problems that seem good on the surface but are not actually good solutions. It provides examples of anti-patterns related to request parameters, response codes, and organizational structure of APIs. The document advocates for RESTful design practices and using HTTP methods and status codes as intended to clearly represent operations.
As companies like Facebook and Google have introduced us to Graph Search and the Knowledge Graph, developers are learning the benefits of graph database architectures. Graph databases, like Neo4j, have increased in popularity by nearly 250% from last year - the highest among all other DBMS categories, according to db-engines.com. Join Kenny Bastani as we look at the benefits of using a graph database, explore various use cases and walkthrough creating a movie recommendation app on Neo4j 2.0.
The majority of NoSQL meetups in London are hosted on meetup.com and luckily for us meetup.com has an API that allows us to extract all the corresponding data - groups, events, venues, members and RSVPs.
In this talk Mark will show how we can use R to gain quick insights into the data using tools like dplyr and ggplot2. We'll also do some social network analysis of the attendees of London's meetup scene using igraph.
Finally we'll look at how we could bring together all these insights into a brand new Clojure front end for the meetup website.
Creating Open Data with Open Source (beta2)Sammy Fung
The document discusses creating open data using open source tools. It provides an overview of open data and Tim Berners-Lee's 5 star deployment scheme for open data. The author then describes using Python and the Scrapy framework to crawl websites and extract structured data to create open datasets. Specific examples discussed are the WeatherHK and TCTrack projects, which extract weather data from government websites. The author also proposes the hk0weather open source project to convert Hong Kong weather data into JSON format. The goal is to make more government data openly available in reusable, machine-readable formats.
This document summarizes an introductory webinar on building an enterprise knowledge graph from RDF data using TigerGraph. It introduces RDF and knowledge graphs, demonstrates loading DBpedia data into a TigerGraph graph database using a universal schema, and provides examples of queries to extract information from the graph such as related people, publishers by location, and related topics for a given predicate. The webinar encourages attendees to learn more about graph databases and TigerGraph through additional resources and future webinar episodes.
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Austin Ogilvie
The document outlines Greg Lamp's presentation at a Data Science MD Meetup in October 2014 about Applied Data Science with Yhat. The presentation covers the challenges of building analytical applications, a case study of a beer recommender system built in Python using beer review data, and a demonstration of deploying the model through Yhat's platform. It concludes with a question and answer section.
Building a Distributed Build System at Google ScaleAysylu Greenberg
It’s hard to imagine a modern developer workflow without a sufficiently advanced build system: Make, Gradle, Maven, Rake, and many others. In this talk, we’ll discuss the evolution of build systems that leads to distributed build systems, like Google's BuildRabbit. Then, we’ll dive into how we can build a scalable system that is fast and resilient, with examples from Google. We’ll conclude with the discussion of general challenges of migrating systems from one architecture to another.
JSON and Oracle Database: A Brave New WorldDaniel McGhan
A world of apps built in JavaScript, using JSON as their data exchange format, relying on APIs to get the job done - does Oracle Database have a place in this world? Can it offer UI developers what they need to get their job done as productively and successfully as possible? Absolutely! In this session, attendees will explore the new support for JSON in Oracle Database SQL and PL/SQL and learn how to help front-end developers build secure, high-performance applications.
Comprehensive Container Based Service Monitoring with Kubernetes and IstioFred Moyer
This document summarizes Fred Moyer's talk on comprehensive container-based service monitoring with Kubernetes and Istio. The talk covered Istio architecture and deployment, using the Istio sample bookinfo application, and monitoring the application with Istio metrics and Grafana dashboards. It also discussed Istio Mixer metrics adapters, math and statistics concepts like histograms and quantiles, and monitoring concepts like service level objectives, indicators, and agreements. The talk provided exercises for attendees to deploy sample applications and create custom metrics adapters.
The document describes the Neo4j graph database and platform vision. It discusses key components like index-free adjacency, ACID transactions, clustering, and hardware optimizations. It outlines use cases for graph analytics, transactions, AI, and data integration. It also covers drivers, APIs, visualization, and administration tools. Finally, it previews upcoming innovations in Neo4j 3.4 like geospatial support, native string indexes, and rolling upgrades.
Creando microservicios con Java, Microprofile y TomEE - Baranquilla JUGCésar Hernández
En esta sesión los asistentes presenciaron la base teórica y práctica para la creación de micro servicios con Java, JakartaEE, MicroProfile utilizando TomEE como servidor de aplicaciones.
Drill can query JSON data stored in various data sources like HDFS, HBase, and Hive. It allows running SQL queries over JSON data without requiring a fixed schema. The document describes how Drill enables ad-hoc querying of JSON-formatted Yelp business review data using SQL, providing insights faster than traditional approaches.
Developing in R - the contextual Multi-Armed Bandit editionRobin van Emden
The document discusses R package development. It covers that R is dominant in statistics research and is an interpreted language. It also supports multiple programming paradigms like imperative, functional and object oriented programming. It discusses different class systems in R like S3, S4 and the newer R6 class. It emphasizes that R6 class provides a better approach. The document also highlights the importance of skills like semantic development skills, syntactic development skills and domain knowledge for R development.
HEPData is a repository for data from high energy physics (HEP) experiments dating back to the 1950s. It provides physicists with access to the underlying data and tables from published papers. The new HEPData system offers simplified submission processes, standard data formats, versioning, and assigning DOIs to help data providers share their work. It also improves access and search capabilities for data consumers through features like publication-driven and data-driven searching, semantic publishing, data conversion tools, and access through analysis environments like ROOT and Mathematica.
The document describes Krist Wongsuphasawat's background and work in data visualization. It notes that he has a PhD in Computer Science from the University of Maryland, where he studied information visualization. He currently works as a data visualization scientist at Twitter, where he builds internal tools to analyze log data and monitor changes over time. Some of his projects include Scribe Radar, which allows users to search through and visualize client event data in order to find patterns and monitor effects of product changes. The document provides details on his approaches for dealing with large log datasets and visualizing user activity sequences.
Data Science Amsterdam - Massively Parallel Processing with Procedural LanguagesIan Huston
The goal of in-database analytics is to bring the calculations to the data, reducing transport costs and I/O bottlenecks. With Procedural Languages such as PL/Python and PL/R data parallel queries can be run across terabytes of data using not only pure SQL but also familiar Python and R packages. The Pivotal Data Science team have used this technique to create fraud behaviour models for each individual user in a large corporate network, to understand interception rates at customs checkpoints by accelerating natural language processing of package descriptions and to reduce customer churn by building a sentiment model using customer call centre records.
http://www.meetup.com/Data-Science-Amsterdam/events/178974942/
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Codemotion
Once you start working with Big Data systems, you discover a whole bunch of problems you won’t find in monolithic systems. Monitoring all of the components becomes a big data problem itself. In the talk we’ll mention all of the aspects that you should take in consideration when monitoring a distributed system using tools like: Web Services,Spark,Cassandra,MongoDB,AWS. Not only the tools, what should you monitor about the actual data that flows in the system? We’ll cover the simplest solution with your day to day open source tools, the surprising thing, that it comes not from an Ops Guy.
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Demi Ben-Ari
Once you start working with distributed Big Data systems, you start discovering a whole bunch of problems you won’t find in monolithic systems.
All of a sudden to monitor all of the components becomes a big data problem itself.
In the talk we’ll mention all of the aspects that you should take in consideration when monitoring a distributed system once you’re using tools like:
Web Services, Apache Spark, Cassandra, MongoDB, Amazon Web Services.
Not only the tools, what should you monitor about the actual data that flows in the system?
And we’ll cover the simplest solution with your day to day open source tools, the surprising thing, that it comes not from an Ops Guy.
Fully Tested: From Design to MVP In 3 WeeksSmartBear
In this presentation Daniel Giordano, Product Marketing Manager at SmartBear, will cover how to speed up your development with a design first mind set, virtualizing services and dependencies to enhance collaboration between developers & testers, & end-to-End testing strategies for an immature product.
The document discusses SYSTAP and their graph database product Blazegraph. It provides an overview of SYSTAP and Blazegraph, highlighting that Blazegraph can scale to handle large graph datasets with billions or trillions of edges through various deployment options including embedded, high availability, scale-out, and GPU acceleration configurations. The document also discusses how Blazegraph is being used by organizations for applications like knowledge graphs, genomics, and defense/intelligence.
Massively Parallel Processing with Procedural Python by Ronert Obst PyData Be...PyData
The Python data ecosystem has grown beyond the confines of single machines to embrace scalability. Here we describe one of our approaches to scaling, which is already being used in production systems. The goal of in-database analytics is to bring the calculations to the data, reducing transport costs and I/O bottlenecks. Using PL/Python we can run parallel queries across terabytes of data using not only pure SQL but also familiar PyData packages such as scikit-learn and nltk. This approach can also be used with PL/R to make use of a wide variety of R packages. We look at examples on Postgres compatible systems such as the Greenplum Database and on Hadoop through Pivotal HAWQ. We will also introduce MADlib, Pivotal’s open source library for scalable in-database machine learning, which uses Python to glue SQL queries to low level C++ functions and is also usable through the PyMADlib package.
Spark is a powerhouse for large datasets, but when it comes to smaller data workloads, its overhead can sometimes slow things down. What if you could achieve high performance and efficiency without the need for Spark?
At S&P Global Commodity Insights, having a complete view of global energy and commodities markets enables customers to make data-driven decisions with confidence and create long-term, sustainable value. 🌍
Explore delta-rs + CDC and how these open-source innovations power lightweight, high-performance data applications beyond Spark! 🚀
Ad
More Related Content
Similar to Hadoop and Neo4j: A Winning Combination for Bioinformatics (20)
Creating Open Data with Open Source (beta2)Sammy Fung
The document discusses creating open data using open source tools. It provides an overview of open data and Tim Berners-Lee's 5 star deployment scheme for open data. The author then describes using Python and the Scrapy framework to crawl websites and extract structured data to create open datasets. Specific examples discussed are the WeatherHK and TCTrack projects, which extract weather data from government websites. The author also proposes the hk0weather open source project to convert Hong Kong weather data into JSON format. The goal is to make more government data openly available in reusable, machine-readable formats.
This document summarizes an introductory webinar on building an enterprise knowledge graph from RDF data using TigerGraph. It introduces RDF and knowledge graphs, demonstrates loading DBpedia data into a TigerGraph graph database using a universal schema, and provides examples of queries to extract information from the graph such as related people, publishers by location, and related topics for a given predicate. The webinar encourages attendees to learn more about graph databases and TigerGraph through additional resources and future webinar episodes.
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Austin Ogilvie
The document outlines Greg Lamp's presentation at a Data Science MD Meetup in October 2014 about Applied Data Science with Yhat. The presentation covers the challenges of building analytical applications, a case study of a beer recommender system built in Python using beer review data, and a demonstration of deploying the model through Yhat's platform. It concludes with a question and answer section.
Building a Distributed Build System at Google ScaleAysylu Greenberg
It’s hard to imagine a modern developer workflow without a sufficiently advanced build system: Make, Gradle, Maven, Rake, and many others. In this talk, we’ll discuss the evolution of build systems that leads to distributed build systems, like Google's BuildRabbit. Then, we’ll dive into how we can build a scalable system that is fast and resilient, with examples from Google. We’ll conclude with the discussion of general challenges of migrating systems from one architecture to another.
JSON and Oracle Database: A Brave New WorldDaniel McGhan
A world of apps built in JavaScript, using JSON as their data exchange format, relying on APIs to get the job done - does Oracle Database have a place in this world? Can it offer UI developers what they need to get their job done as productively and successfully as possible? Absolutely! In this session, attendees will explore the new support for JSON in Oracle Database SQL and PL/SQL and learn how to help front-end developers build secure, high-performance applications.
Comprehensive Container Based Service Monitoring with Kubernetes and IstioFred Moyer
This document summarizes Fred Moyer's talk on comprehensive container-based service monitoring with Kubernetes and Istio. The talk covered Istio architecture and deployment, using the Istio sample bookinfo application, and monitoring the application with Istio metrics and Grafana dashboards. It also discussed Istio Mixer metrics adapters, math and statistics concepts like histograms and quantiles, and monitoring concepts like service level objectives, indicators, and agreements. The talk provided exercises for attendees to deploy sample applications and create custom metrics adapters.
The document describes the Neo4j graph database and platform vision. It discusses key components like index-free adjacency, ACID transactions, clustering, and hardware optimizations. It outlines use cases for graph analytics, transactions, AI, and data integration. It also covers drivers, APIs, visualization, and administration tools. Finally, it previews upcoming innovations in Neo4j 3.4 like geospatial support, native string indexes, and rolling upgrades.
Creando microservicios con Java, Microprofile y TomEE - Baranquilla JUGCésar Hernández
En esta sesión los asistentes presenciaron la base teórica y práctica para la creación de micro servicios con Java, JakartaEE, MicroProfile utilizando TomEE como servidor de aplicaciones.
Drill can query JSON data stored in various data sources like HDFS, HBase, and Hive. It allows running SQL queries over JSON data without requiring a fixed schema. The document describes how Drill enables ad-hoc querying of JSON-formatted Yelp business review data using SQL, providing insights faster than traditional approaches.
Developing in R - the contextual Multi-Armed Bandit editionRobin van Emden
The document discusses R package development. It covers that R is dominant in statistics research and is an interpreted language. It also supports multiple programming paradigms like imperative, functional and object oriented programming. It discusses different class systems in R like S3, S4 and the newer R6 class. It emphasizes that R6 class provides a better approach. The document also highlights the importance of skills like semantic development skills, syntactic development skills and domain knowledge for R development.
HEPData is a repository for data from high energy physics (HEP) experiments dating back to the 1950s. It provides physicists with access to the underlying data and tables from published papers. The new HEPData system offers simplified submission processes, standard data formats, versioning, and assigning DOIs to help data providers share their work. It also improves access and search capabilities for data consumers through features like publication-driven and data-driven searching, semantic publishing, data conversion tools, and access through analysis environments like ROOT and Mathematica.
The document describes Krist Wongsuphasawat's background and work in data visualization. It notes that he has a PhD in Computer Science from the University of Maryland, where he studied information visualization. He currently works as a data visualization scientist at Twitter, where he builds internal tools to analyze log data and monitor changes over time. Some of his projects include Scribe Radar, which allows users to search through and visualize client event data in order to find patterns and monitor effects of product changes. The document provides details on his approaches for dealing with large log datasets and visualizing user activity sequences.
Data Science Amsterdam - Massively Parallel Processing with Procedural LanguagesIan Huston
The goal of in-database analytics is to bring the calculations to the data, reducing transport costs and I/O bottlenecks. With Procedural Languages such as PL/Python and PL/R data parallel queries can be run across terabytes of data using not only pure SQL but also familiar Python and R packages. The Pivotal Data Science team have used this technique to create fraud behaviour models for each individual user in a large corporate network, to understand interception rates at customs checkpoints by accelerating natural language processing of package descriptions and to reduce customer churn by building a sentiment model using customer call centre records.
http://www.meetup.com/Data-Science-Amsterdam/events/178974942/
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Codemotion
Once you start working with Big Data systems, you discover a whole bunch of problems you won’t find in monolithic systems. Monitoring all of the components becomes a big data problem itself. In the talk we’ll mention all of the aspects that you should take in consideration when monitoring a distributed system using tools like: Web Services,Spark,Cassandra,MongoDB,AWS. Not only the tools, what should you monitor about the actual data that flows in the system? We’ll cover the simplest solution with your day to day open source tools, the surprising thing, that it comes not from an Ops Guy.
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Demi Ben-Ari
Once you start working with distributed Big Data systems, you start discovering a whole bunch of problems you won’t find in monolithic systems.
All of a sudden to monitor all of the components becomes a big data problem itself.
In the talk we’ll mention all of the aspects that you should take in consideration when monitoring a distributed system once you’re using tools like:
Web Services, Apache Spark, Cassandra, MongoDB, Amazon Web Services.
Not only the tools, what should you monitor about the actual data that flows in the system?
And we’ll cover the simplest solution with your day to day open source tools, the surprising thing, that it comes not from an Ops Guy.
Fully Tested: From Design to MVP In 3 WeeksSmartBear
In this presentation Daniel Giordano, Product Marketing Manager at SmartBear, will cover how to speed up your development with a design first mind set, virtualizing services and dependencies to enhance collaboration between developers & testers, & end-to-End testing strategies for an immature product.
The document discusses SYSTAP and their graph database product Blazegraph. It provides an overview of SYSTAP and Blazegraph, highlighting that Blazegraph can scale to handle large graph datasets with billions or trillions of edges through various deployment options including embedded, high availability, scale-out, and GPU acceleration configurations. The document also discusses how Blazegraph is being used by organizations for applications like knowledge graphs, genomics, and defense/intelligence.
Massively Parallel Processing with Procedural Python by Ronert Obst PyData Be...PyData
The Python data ecosystem has grown beyond the confines of single machines to embrace scalability. Here we describe one of our approaches to scaling, which is already being used in production systems. The goal of in-database analytics is to bring the calculations to the data, reducing transport costs and I/O bottlenecks. Using PL/Python we can run parallel queries across terabytes of data using not only pure SQL but also familiar PyData packages such as scikit-learn and nltk. This approach can also be used with PL/R to make use of a wide variety of R packages. We look at examples on Postgres compatible systems such as the Greenplum Database and on Hadoop through Pivotal HAWQ. We will also introduce MADlib, Pivotal’s open source library for scalable in-database machine learning, which uses Python to glue SQL queries to low level C++ functions and is also usable through the PyMADlib package.
Spark is a powerhouse for large datasets, but when it comes to smaller data workloads, its overhead can sometimes slow things down. What if you could achieve high performance and efficiency without the need for Spark?
At S&P Global Commodity Insights, having a complete view of global energy and commodities markets enables customers to make data-driven decisions with confidence and create long-term, sustainable value. 🌍
Explore delta-rs + CDC and how these open-source innovations power lightweight, high-performance data applications beyond Spark! 🚀
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul
Artificial intelligence is changing how businesses operate. Companies are using AI agents to automate tasks, reduce time spent on repetitive work, and focus more on high-value activities. Noah Loul, an AI strategist and entrepreneur, has helped dozens of companies streamline their operations using smart automation. He believes AI agents aren't just tools—they're workers that take on repeatable tasks so your human team can focus on what matters. If you want to reduce time waste and increase output, AI agents are the next move.
Dev Dives: Automate and orchestrate your processes with UiPath MaestroUiPathCommunity
This session is designed to equip developers with the skills needed to build mission-critical, end-to-end processes that seamlessly orchestrate agents, people, and robots.
📕 Here's what you can expect:
- Modeling: Build end-to-end processes using BPMN.
- Implementing: Integrate agentic tasks, RPA, APIs, and advanced decisioning into processes.
- Operating: Control process instances with rewind, replay, pause, and stop functions.
- Monitoring: Use dashboards and embedded analytics for real-time insights into process instances.
This webinar is a must-attend for developers looking to enhance their agentic automation skills and orchestrate robust, mission-critical processes.
👨🏫 Speaker:
Andrei Vintila, Principal Product Manager @UiPath
This session streamed live on April 29, 2025, 16:00 CET.
Check out all our upcoming Dev Dives sessions at https://community.uipath.com/dev-dives-automation-developer-2025/.
This is the keynote of the Into the Box conference, highlighting the release of the BoxLang JVM language, its key enhancements, and its vision for the future.
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc
Most consumers believe they’re making informed decisions about their personal data—adjusting privacy settings, blocking trackers, and opting out where they can. However, our new research reveals that while awareness is high, taking meaningful action is still lacking. On the corporate side, many organizations report strong policies for managing third-party data and consumer consent yet fall short when it comes to consistency, accountability and transparency.
This session will explore the research findings from TrustArc’s Privacy Pulse Survey, examining consumer attitudes toward personal data collection and practical suggestions for corporate practices around purchasing third-party data.
Attendees will learn:
- Consumer awareness around data brokers and what consumers are doing to limit data collection
- How businesses assess third-party vendors and their consent management operations
- Where business preparedness needs improvement
- What these trends mean for the future of privacy governance and public trust
This discussion is essential for privacy, risk, and compliance professionals who want to ground their strategies in current data and prepare for what’s next in the privacy landscape.
Book industry standards are evolving rapidly. In the first part of this session, we’ll share an overview of key developments from 2024 and the early months of 2025. Then, BookNet’s resident standards expert, Tom Richardson, and CEO, Lauren Stewart, have a forward-looking conversation about what’s next.
Link to recording, presentation slides, and accompanying resource: https://bnctechforum.ca/sessions/standardsgoals-for-2025-standards-certification-roundup/
Presented by BookNet Canada on May 6, 2025 with support from the Department of Canadian Heritage.
Social Media App Development Company-EmizenTechSteve Jonas
EmizenTech is a trusted Social Media App Development Company with 11+ years of experience in building engaging and feature-rich social platforms. Our team of skilled developers delivers custom social media apps tailored to your business goals and user expectations. We integrate real-time chat, video sharing, content feeds, notifications, and robust security features to ensure seamless user experiences. Whether you're creating a new platform or enhancing an existing one, we offer scalable solutions that support high performance and future growth. EmizenTech empowers businesses to connect users globally, boost engagement, and stay competitive in the digital social landscape.
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell
With expertise in data architecture, performance tracking, and revenue forecasting, Andrew Marnell plays a vital role in aligning business strategies with data insights. Andrew Marnell’s ability to lead cross-functional teams ensures businesses achieve sustainable growth and operational excellence.
Train Smarter, Not Harder – Let 3D Animation Lead the Way!
Discover how 3D animation makes inductions more engaging, effective, and cost-efficient.
Check out the slides to see how you can transform your safety training process!
Slide 1: Why 3D animation changes the game
Slide 2: Site-specific induction isn’t optional—it’s essential
Slide 3: Visitors are most at risk. Keep them safe
Slide 4: Videos beat text—especially when safety is on the line
Slide 5: TechEHS makes safety engaging and consistent
Slide 6: Better retention, lower costs, safer sites
Slide 7: Ready to elevate your induction process?
Can an animated video make a difference to your site's safety? Let's talk.
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfAbi john
Analyze the growth of meme coins from mere online jokes to potential assets in the digital economy. Explore the community, culture, and utility as they elevate themselves to a new era in cryptocurrency.
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Aqusag Technologies
In late April 2025, a significant portion of Europe, particularly Spain, Portugal, and parts of southern France, experienced widespread, rolling power outages that continue to affect millions of residents, businesses, and infrastructure systems.
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...Alan Dix
Talk at the final event of Data Fusion Dynamics: A Collaborative UK-Saudi Initiative in Cybersecurity and Artificial Intelligence funded by the British Council UK-Saudi Challenge Fund 2024, Cardiff Metropolitan University, 29th April 2025
https://alandix.com/academic/talks/CMet2025-AI-Changes-Everything/
Is AI just another technology, or does it fundamentally change the way we live and think?
Every technology has a direct impact with micro-ethical consequences, some good, some bad. However more profound are the ways in which some technologies reshape the very fabric of society with macro-ethical impacts. The invention of the stirrup revolutionised mounted combat, but as a side effect gave rise to the feudal system, which still shapes politics today. The internal combustion engine offers personal freedom and creates pollution, but has also transformed the nature of urban planning and international trade. When we look at AI the micro-ethical issues, such as bias, are most obvious, but the macro-ethical challenges may be greater.
At a micro-ethical level AI has the potential to deepen social, ethnic and gender bias, issues I have warned about since the early 1990s! It is also being used increasingly on the battlefield. However, it also offers amazing opportunities in health and educations, as the recent Nobel prizes for the developers of AlphaFold illustrate. More radically, the need to encode ethics acts as a mirror to surface essential ethical problems and conflicts.
At the macro-ethical level, by the early 2000s digital technology had already begun to undermine sovereignty (e.g. gambling), market economics (through network effects and emergent monopolies), and the very meaning of money. Modern AI is the child of big data, big computation and ultimately big business, intensifying the inherent tendency of digital technology to concentrate power. AI is already unravelling the fundamentals of the social, political and economic world around us, but this is a world that needs radical reimagining to overcome the global environmental and human challenges that confront us. Our challenge is whether to let the threads fall as they may, or to use them to weave a better future.
Quantum Computing Quick Research Guide by Arthur MorganArthur Morgan
This is a Quick Research Guide (QRG).
QRGs include the following:
- A brief, high-level overview of the QRG topic.
- A milestone timeline for the QRG topic.
- Links to various free online resource materials to provide a deeper dive into the QRG topic.
- Conclusion and a recommendation for at least two books available in the SJPL system on the QRG topic.
QRGs planned for the series:
- Artificial Intelligence QRG
- Quantum Computing QRG
- Big Data Analytics QRG
- Spacecraft Guidance, Navigation & Control QRG (coming 2026)
- UK Home Computing & The Birth of ARM QRG (coming 2027)
Any questions or comments?
- Please contact Arthur Morgan at [email protected].
100% human made.
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungenpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-nomad-web-best-practices-und-verwaltung-von-multiuser-umgebungen/
HCL Nomad Web wird als die nächste Generation des HCL Notes-Clients gefeiert und bietet zahlreiche Vorteile, wie die Beseitigung des Bedarfs an Paketierung, Verteilung und Installation. Nomad Web-Client-Updates werden “automatisch” im Hintergrund installiert, was den administrativen Aufwand im Vergleich zu traditionellen HCL Notes-Clients erheblich reduziert. Allerdings stellt die Fehlerbehebung in Nomad Web im Vergleich zum Notes-Client einzigartige Herausforderungen dar.
Begleiten Sie Christoph und Marc, während sie demonstrieren, wie der Fehlerbehebungsprozess in HCL Nomad Web vereinfacht werden kann, um eine reibungslose und effiziente Benutzererfahrung zu gewährleisten.
In diesem Webinar werden wir effektive Strategien zur Diagnose und Lösung häufiger Probleme in HCL Nomad Web untersuchen, einschließlich
- Zugriff auf die Konsole
- Auffinden und Interpretieren von Protokolldateien
- Zugriff auf den Datenordner im Cache des Browsers (unter Verwendung von OPFS)
- Verständnis der Unterschiede zwischen Einzel- und Mehrbenutzerszenarien
- Nutzung der Client Clocking-Funktion
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPathCommunity
Join this UiPath Community Berlin meetup to explore the Orchestrator API, Swagger interface, and the Test Manager API. Learn how to leverage these tools to streamline automation, enhance testing, and integrate more efficiently with UiPath. Perfect for developers, testers, and automation enthusiasts!
📕 Agenda
Welcome & Introductions
Orchestrator API Overview
Exploring the Swagger Interface
Test Manager API Highlights
Streamlining Automation & Testing with APIs (Demo)
Q&A and Open Discussion
Perfect for developers, testers, and automation enthusiasts!
👉 Join our UiPath Community Berlin chapter: https://community.uipath.com/berlin/
This session streamed live on April 29, 2025, 18:00 CET.
Check out all our upcoming UiPath Community sessions at https://community.uipath.com/events/.
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPathCommunity
Ad
Hadoop and Neo4j: A Winning Combination for Bioinformatics
1. {GraphConnect NYC}
Hadoop and Graph Databases
(Neo4j): Winning Combination for
Bioinformatics
Jonathan Freeman
@freethejazz
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
2. Hadoop + Neo4j = Bioanalytics Win
Open Software Integrators
●
Jonathan Freeman
@freethejazz
Founded January 2008 by Andrew C. Oliver
○ Durham, NC
Revenue and staff has at least doubled every year since
2009.
●
New office (2012) in Chicago, IL
○ We're hiring associate to senior level as well as UI Developers
(JQuery, Javascript, HTML, CSS)
○ Up to 50% travel (probably less), salary + bonus, 401k, health,
etc etc
○ Preferred: Java, Tomcat, JBoss, Hibernate, Spring, RDBMS,
JQuery
○ Nice to have: Hadoop, Neo4j, MongoDB, Ruby a/o at least one
Cloud platform
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
3. Hadoop + Neo4j = Bioinformatics Win
Questions to answer
●
●
●
●
uhh, bioinformatics?
What is Hadoop? Why is it a good fit?
And Neo4j? Why the combination?
I want this now! How do I do it?!?!
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
Jonathan Freeman
@freethejazz
5. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
“
dynamic
information processing
system
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
6. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
Life
http://www.labtimes.org/labtimes/issues/lt2011/lt07/lt_2011_07_26_29.pdf
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
7. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
● Storing/Retrieving Biological Data
● Organizing Biological Data
● Analyzing Biological Data
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
8. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
Biological Data
● amino acid sequences
● nucleotide sequences
● protein structures
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
9. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
●
●
●
●
●
Genetic sequence analysis
Tracing biological evolution
Analysis of gene expression
Studying mutations in cancer
Predicting protein structure and
function
● Molecular Interaction
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
10. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
●
●
●
●
●
Genetic sequence analysis
Tracing biological evolution
Analysis of gene expression
Studying mutations in cancer
Predicting protein structure and
function
● Molecular Interaction
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
11. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
Full Human Genome Sequencing Then
13 Years
$2,700,000,000
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
12. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
Full Human Genome Sequencing Then
1 Day
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
$5,000
14. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
So what are we
waiting for?
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
25. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
Infrastructure for distributed computing
HDFS
MapReduce
A distributed file system.
An implementation of a
programming model for
processing very large data sets.
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}
29. Hadoop + Neo4j = Bioinformatics Win
Jonathan Freeman
@freethejazz
Infrastructure for distributed computing
HDFS
MapReduce
A distributed file system.
An implementation of a
programming model for
processing very large data sets.
{Open Software Integrators} { www.osintegrators.com} {@osintegrators}