It has always been a tedious task to choose the right configuration for MongoDB on AWS EC2
It is always challenging and takes a lots of time to make your system Production Ready.
Here is a quick guide on how to setup MongoDB on AWS EC2.
IaaS: Virtual Machines provides an overview of virtual machines on the Microsoft Azure platform. It discusses that virtual machines can be used to deploy various application types like web applications, services, and websites. It also describes the different sizes and configurations of virtual machines available on Azure, including their CPU, memory, disk space allocations. The document outlines key considerations for using virtual machines like storage, availability sets, service level agreements and licensing.
Running Solr at Memory Speed with Alluxio - Timothy Potter, LucidworksLucidworks
- Alluxio is a virtual distributed storage system that brings data closer to compute by caching frequently accessed data in memory, allowing applications to access data at memory speed.
- Solr can be configured to use Alluxio as its storage layer by mounting the Solr root directory in Alluxio and configuring Solr to use Alluxio as its storage system.
- This allows indexing and querying of data to occur directly from memory cached data in Alluxio, improving performance over accessing data from remote storage systems like S3, HDFS, etc.
Cloud init and cloud provisioning [openstack summit vancouver]Joshua Harlow
Evil Superuser's HOWTO: Launching instances to do your bidding.
You click 'run' on the OpenStack dashboard, or launch a new instance via the api. Some provisioning magic happens and soon you've got a server created especially for you. Did you ever wonder what magic happens to a standard image on boot? Have you wanted to launch instances and have them into your infrastructure with no manual interaction? Cloud-init is software that runs in most linux instances. It can take your input and do your bidding. Learn what things cloud-init magically does for you and how you can make it do more. Also, take advantage of the after-talk to pester cloud-init developers on what is missing or throw rotten fruits in their direction.
This document discusses Azure Backup (Recovery Services) and provides an overview of its key concepts and usage scenarios. Azure Backup allows backing up of on-premises servers and virtual machines to Azure storage. It uses storage vaults mapped to Azure Blob storage to store backup recovery points. The Azure Backup Agent installs on machines to perform backups and restores and manage the backup schedule. Site Recovery allows disaster recovery between on-premises and Azure environments, or between two on-premises sites. References and contacts are provided for further information.
Cloud Init is a package that handles early initialization of cloud instances. It retrieves instance metadata like the server name and IP from the cloud provider. It then runs scripts and modules to configure the server, such as adding users, installing packages, and configuring services. It works across many Linux distributions and retrieves data through metadata services, config drives, or ISO images depending on the cloud platform. Administrators can write their own modules to customize the initialization and troubleshoot using logs and files stored in /var/lib/cloud.
MS Cloud Day - Building web applications with Azure storageSpiffy
This document provides an overview and agenda for a Microsoft Cloud Day session on building web applications with Azure Storage. The session will cover Blob, Table, and Queue storage capabilities in Azure, including how to create storage accounts, upload and retrieve blobs, create and query tables, and use queues for communication between services. Attendees will learn best practices for scalability when using Azure Storage.
This document discusses lessons learned from building the Wakoopa software. It outlines how the company scaled its infrastructure from one server to support 200 million users. Key strategies included splitting the server into staging, production and database roles, using background job queues, optimizing database queries, archiving old hourly usage data, and plans to implement MySQL partitioning and other technologies to improve performance.
Using schedulers like Marathon and Aurora help to get your applications scheduled and executing on Mesos. In many cases it makes sense to build a framework and integrate directly. This talk will breakdown what is involved in building a framework, how to-do this with examples and why you would want to-do this. Frameworks are not only for generally available software applications (like Kafka, HDFS, Spark ,etc) but can also be used for custom internal R&D built software applications too.
Overview of Windows Azure Virtual Machines - the IaaS offering in the Windows Azure platform. The presentation covers the compute, storage and network features of Virtual Machines. It also describes how best to deploy Windows Azure cloud services and VMs.
- Just Eat is a leading digital marketplace for takeaway food delivery founded in 2001 operating in 13 markets globally. It has processed up to 2,500 orders per minute at peak times.
- Just Eat migrated to AWS 5 years ago and runs hundreds of EC2 instances at peak dinner times using scheduled scaling, CloudFormation, and other AWS services.
- AWS Lambda was introduced in 2014 and Just Eat started using it for micro tasks like resetting delivery times, publishing SNS messages, and provisioning instance access to reduce infrastructure costs and management compared to running EC2 fleets.
Setup Kubernetes Cluster On AWS Using KOPSSivaprakash
This document provides steps to set up a Kubernetes cluster on AWS using KOPS. It describes installing required tools like kubectl and KOPS, creating an SSH key and Route53 host on AWS, generating a Kubernetes cluster across multiple Availability Zones, and exposing sample applications. Finally, it covers deleting the cluster.
Originally delivered as Lightning Talk at Lucene Eurocon 2011 in Barcelona, this quick presentation shows how to use Sematext's SPM service to monitor Solr, OS, JVM, and more.
This document discusses Terraform, an open source tool for building, changing, and versioning infrastructure safely and efficiently. It provides declarative configuration files to manage networks, virtual machines, containers, and other infrastructure resources. The document introduces Terraform and how it works, provides examples of Terraform code and its output, and offers best practices for using Terraform including separating infrastructure code from application code, using modules, and managing state. Terraform allows infrastructure to be treated as code, provides a faster development cycle than other tools like CloudFormation, and helps promote a devOps culture.
Docker is all the rage these days. While one doesn't hear much about Solr on Docker, we're here to tell you not only that it can be done, but also share how it's done.
We'll quickly go over the basic Docker ideas - containers are lighter than VMs, they solve "but it worked on my laptop" issues - so we can dive into the specifics of running Solr on Docker.
We'll do a live demo showing you how to run Solr master - slave as well as SolrCloud using containers, how to manage CPU assignments, constraint memory and use Docker data volumes when running Solr in containers. We will also show you how to create your own containers with custom configurations.
Finally, we'll address one of the core Solr questions - which deployment type should I use? We will demonstrate performance differences between the following deployment types:
- Single Solr instance running on a bare metal machine
- Multiple Solr instances running on a single bare metal machine
- Solr running in containers
- Solr running on virtual machine
- Solr running on virtual machine using unikernel
For each deployment type we'll address how it impacts performance, operational flexibility and all other key pros and cons you ought to keep in mind.
This document summarizes Erlang processes and scheduling in Erlang. It discusses the process control block, stack and heap, message passing, the scheduler, workload balancing across schedulers, and garbage collection. The scheduler prioritizes ports, maximum, high and normal/low priority queues. It will consume all reductions for a process or pause it waiting for a message. The garbage collector uses copying collection for process heaps and reference counting for shared binaries.
A meticulous presentation on Authorization, Encryption & Authentication of the security features in MySQL 8.0 by Vignesh Prabhu, Database reliability engineer, Mydbops.
AWS re:Invent re:Cap 행사에서 발표된 강연 자료입니다. 아마존 웹서비스의 양승도 솔루션스 아키텍트가 발표한 내용입니다.
내용 요약: re:Invent에서 새로 발표된 Amazon Aurora는 My-SQL과 호환되는 관계형 데이터베이스 엔진으로, 상용 고급 데이터베이스의 속도 및 가용성과 오픈소스 데이터베이스의 간편함 및 비용효율성을 모두 갖춘 서비스입니다. 이 세션에서는 MySQL과 Oracle, Microsoft SQL Server, PostgreSQL에 이어 Amazon RDS로 구동되는 다섯 번째 데이터베이스 엔진 Amazon Aurora의 특징과 이점에 대해 소개하도록 하겠습니다.
This document provides an introduction and overview of Apache Mesos. It begins by describing Mesos' origins at companies like Google and how it enables fine-grained resource sharing in data centers. It then discusses concepts like schedulers, executors, frameworks and how Mesos allows building distributed applications and data center infrastructure. The document also covers Mesos concepts such as resources, attributes, roles, constraints and how tools like Marathon interact with Mesos. Finally, it provides examples of Mesos frameworks for technologies like Kafka and Cassandra.
Apache CloudStack (ACS) is an open source cloud computing platform that can manage virtual machines hosted on various hypervisors including VMware vSphere, XenServer, KVM, and Open Virtualization. ACS integrates with vSphere to provide dynamic scaling of CPU and RAM for virtual machines, hot plug additional NICs, CPU/RAM overcommit, distributed resource scheduling, full clone support, VM and volume snapshots, storage migration, and zone-wide block storage. The integration does not currently support features like storage distributed resource scheduling, Cisco 1000v virtual switches, or private VLANs.
DataStax: Backup and Restore in Cassandra and OpsCenterDataStax Academy
Cassandra and OpsCenter has a range of backup and restore topics. I will start with a basic overview of Cassandra backup/restore, walking through the operational steps to provide the understanding required to perform an on disk backup and restore. Expanding on this overview, I'll cover the limitations (including schema requirements) and their impact on the restore process. Further, I'll discuss commit log archiving and point in time restore operations. After covering the underlying operations, I'll wrap up with a discussion of how OpsCenter automates this process and leverages S3.
The document discusses integrating Docker, Mesos, Spark, Marathon, and Chronos into a unified big data platform. Docker provides containerization capabilities, while Mesos is a distributed resource manager that supports running Docker containers. Spark can run natively on Mesos by running Spark as a Docker container within Mesos. Marathon and Chronos help manage long-running services and cron jobs on Mesos. The author will demonstrate how to put these technologies together into an integrated system and address running Spark on Mesos in Docker containers.
This is my talk from the July LVL.UP KL meeting (formerly WebCamp KL) held on August 6th at Mindvalley, Bangsar.
The talk covers a basic introduction to scalability, 5 things to consider/think about and 5 things you can do build at scale.
WebCampKL Group is here - https://www.facebook.com/groups/webcamp/
The video of this talk is available here: http://youtu.be/Djs-8lGpz_U (also added as the 19th slide).
AWS Summit London 2014 - JUST EAT - High Availability and Rapid Changedaniel-richardson
Nearly 1,000 takeaways ordered a minute from hungry consumers, with near real time confirmation from restaurants and delivery of their food just 45 minutes later is a hard technical challenge.
AWS allows the many small engineering teams at JUST EAT to take responsibility to meet that challenge, as they build and operate a platform that delivers a takeaway experience for consumers to love.
Learn how we migrated our e-commerce platform to AWS and organise both our platform and teams around the the twin goals of rapid change and high availability. During the session we break things live in production, and watch how the JUST EAT platform is designed around AWS to recover quickly and automatically.
AWS CloudFormation template with single & redundant systemNaoya Hashimoto
* Use CloudFormation to create Stacks composed of VPC, Internet Gateway, Route Table, ELB, EC2 Instance, EBS Volumes
* Single pattern with EC2 Instances WEB server and DB server with the same AZ
* Redundant pattern with EC2 Instances WEB server and DB server with multi-AZ
This webinar discusses RISO Inc.'s experience migrating their on-premise data center to the AWS cloud with assistance from Apps Associates. [1] Apps Associates designed and implemented the new infrastructure on AWS, migrating applications like Oracle ERP and SQL servers. [2] This provided benefits like a 35% reduction in backup costs, 50% fewer IT vendors, and the ability to relocate offices without interrupting operations. [3] The webinar explores considerations for cloud migrations and the hybrid cloud model.
Overview of Windows Azure Virtual Machines - the IaaS offering in the Windows Azure platform. The presentation covers the compute, storage and network features of Virtual Machines. It also describes how best to deploy Windows Azure cloud services and VMs.
- Just Eat is a leading digital marketplace for takeaway food delivery founded in 2001 operating in 13 markets globally. It has processed up to 2,500 orders per minute at peak times.
- Just Eat migrated to AWS 5 years ago and runs hundreds of EC2 instances at peak dinner times using scheduled scaling, CloudFormation, and other AWS services.
- AWS Lambda was introduced in 2014 and Just Eat started using it for micro tasks like resetting delivery times, publishing SNS messages, and provisioning instance access to reduce infrastructure costs and management compared to running EC2 fleets.
Setup Kubernetes Cluster On AWS Using KOPSSivaprakash
This document provides steps to set up a Kubernetes cluster on AWS using KOPS. It describes installing required tools like kubectl and KOPS, creating an SSH key and Route53 host on AWS, generating a Kubernetes cluster across multiple Availability Zones, and exposing sample applications. Finally, it covers deleting the cluster.
Originally delivered as Lightning Talk at Lucene Eurocon 2011 in Barcelona, this quick presentation shows how to use Sematext's SPM service to monitor Solr, OS, JVM, and more.
This document discusses Terraform, an open source tool for building, changing, and versioning infrastructure safely and efficiently. It provides declarative configuration files to manage networks, virtual machines, containers, and other infrastructure resources. The document introduces Terraform and how it works, provides examples of Terraform code and its output, and offers best practices for using Terraform including separating infrastructure code from application code, using modules, and managing state. Terraform allows infrastructure to be treated as code, provides a faster development cycle than other tools like CloudFormation, and helps promote a devOps culture.
Docker is all the rage these days. While one doesn't hear much about Solr on Docker, we're here to tell you not only that it can be done, but also share how it's done.
We'll quickly go over the basic Docker ideas - containers are lighter than VMs, they solve "but it worked on my laptop" issues - so we can dive into the specifics of running Solr on Docker.
We'll do a live demo showing you how to run Solr master - slave as well as SolrCloud using containers, how to manage CPU assignments, constraint memory and use Docker data volumes when running Solr in containers. We will also show you how to create your own containers with custom configurations.
Finally, we'll address one of the core Solr questions - which deployment type should I use? We will demonstrate performance differences between the following deployment types:
- Single Solr instance running on a bare metal machine
- Multiple Solr instances running on a single bare metal machine
- Solr running in containers
- Solr running on virtual machine
- Solr running on virtual machine using unikernel
For each deployment type we'll address how it impacts performance, operational flexibility and all other key pros and cons you ought to keep in mind.
This document summarizes Erlang processes and scheduling in Erlang. It discusses the process control block, stack and heap, message passing, the scheduler, workload balancing across schedulers, and garbage collection. The scheduler prioritizes ports, maximum, high and normal/low priority queues. It will consume all reductions for a process or pause it waiting for a message. The garbage collector uses copying collection for process heaps and reference counting for shared binaries.
A meticulous presentation on Authorization, Encryption & Authentication of the security features in MySQL 8.0 by Vignesh Prabhu, Database reliability engineer, Mydbops.
AWS re:Invent re:Cap 행사에서 발표된 강연 자료입니다. 아마존 웹서비스의 양승도 솔루션스 아키텍트가 발표한 내용입니다.
내용 요약: re:Invent에서 새로 발표된 Amazon Aurora는 My-SQL과 호환되는 관계형 데이터베이스 엔진으로, 상용 고급 데이터베이스의 속도 및 가용성과 오픈소스 데이터베이스의 간편함 및 비용효율성을 모두 갖춘 서비스입니다. 이 세션에서는 MySQL과 Oracle, Microsoft SQL Server, PostgreSQL에 이어 Amazon RDS로 구동되는 다섯 번째 데이터베이스 엔진 Amazon Aurora의 특징과 이점에 대해 소개하도록 하겠습니다.
This document provides an introduction and overview of Apache Mesos. It begins by describing Mesos' origins at companies like Google and how it enables fine-grained resource sharing in data centers. It then discusses concepts like schedulers, executors, frameworks and how Mesos allows building distributed applications and data center infrastructure. The document also covers Mesos concepts such as resources, attributes, roles, constraints and how tools like Marathon interact with Mesos. Finally, it provides examples of Mesos frameworks for technologies like Kafka and Cassandra.
Apache CloudStack (ACS) is an open source cloud computing platform that can manage virtual machines hosted on various hypervisors including VMware vSphere, XenServer, KVM, and Open Virtualization. ACS integrates with vSphere to provide dynamic scaling of CPU and RAM for virtual machines, hot plug additional NICs, CPU/RAM overcommit, distributed resource scheduling, full clone support, VM and volume snapshots, storage migration, and zone-wide block storage. The integration does not currently support features like storage distributed resource scheduling, Cisco 1000v virtual switches, or private VLANs.
DataStax: Backup and Restore in Cassandra and OpsCenterDataStax Academy
Cassandra and OpsCenter has a range of backup and restore topics. I will start with a basic overview of Cassandra backup/restore, walking through the operational steps to provide the understanding required to perform an on disk backup and restore. Expanding on this overview, I'll cover the limitations (including schema requirements) and their impact on the restore process. Further, I'll discuss commit log archiving and point in time restore operations. After covering the underlying operations, I'll wrap up with a discussion of how OpsCenter automates this process and leverages S3.
The document discusses integrating Docker, Mesos, Spark, Marathon, and Chronos into a unified big data platform. Docker provides containerization capabilities, while Mesos is a distributed resource manager that supports running Docker containers. Spark can run natively on Mesos by running Spark as a Docker container within Mesos. Marathon and Chronos help manage long-running services and cron jobs on Mesos. The author will demonstrate how to put these technologies together into an integrated system and address running Spark on Mesos in Docker containers.
This is my talk from the July LVL.UP KL meeting (formerly WebCamp KL) held on August 6th at Mindvalley, Bangsar.
The talk covers a basic introduction to scalability, 5 things to consider/think about and 5 things you can do build at scale.
WebCampKL Group is here - https://www.facebook.com/groups/webcamp/
The video of this talk is available here: http://youtu.be/Djs-8lGpz_U (also added as the 19th slide).
AWS Summit London 2014 - JUST EAT - High Availability and Rapid Changedaniel-richardson
Nearly 1,000 takeaways ordered a minute from hungry consumers, with near real time confirmation from restaurants and delivery of their food just 45 minutes later is a hard technical challenge.
AWS allows the many small engineering teams at JUST EAT to take responsibility to meet that challenge, as they build and operate a platform that delivers a takeaway experience for consumers to love.
Learn how we migrated our e-commerce platform to AWS and organise both our platform and teams around the the twin goals of rapid change and high availability. During the session we break things live in production, and watch how the JUST EAT platform is designed around AWS to recover quickly and automatically.
AWS CloudFormation template with single & redundant systemNaoya Hashimoto
* Use CloudFormation to create Stacks composed of VPC, Internet Gateway, Route Table, ELB, EC2 Instance, EBS Volumes
* Single pattern with EC2 Instances WEB server and DB server with the same AZ
* Redundant pattern with EC2 Instances WEB server and DB server with multi-AZ
This webinar discusses RISO Inc.'s experience migrating their on-premise data center to the AWS cloud with assistance from Apps Associates. [1] Apps Associates designed and implemented the new infrastructure on AWS, migrating applications like Oracle ERP and SQL servers. [2] This provided benefits like a 35% reduction in backup costs, 50% fewer IT vendors, and the ability to relocate offices without interrupting operations. [3] The webinar explores considerations for cloud migrations and the hybrid cloud model.
AWS provides a comprehensive set of global cloud computing services including compute, storage, databases, analytics, networking, mobile, developer tools, management tools, IoT, security and enterprise applications. Some key services highlighted include EC2 for virtual servers, S3 for object storage, RDS for managed relational databases, DynamoDB for NoSQL database services, EBS for block storage volumes, VPC for virtual networking, IAM for access management, CloudFront for content delivery and Route 53 for DNS services. AWS operates across multiple geographic regions and availability zones for reliability and high availability.
MongoDB supports replication for failover and redundancy. In this session we will introduce the basic concepts around replica sets, which provide automated failover and recovery of nodes. We'll cover how to set up, configure, and initiate a replica set; methods for using replication to scale reads; and proper architecture for durability.
This document provides instructions for setting up a MongoDB replica set across multiple virtual machines. It describes installing MongoDB on each VM, creating directories to store data, configuring the yum repository, and installing MongoDB packages. It then explains how to initialize and configure a local 3-node replica set, add members, and check the replica set status. Finally, it briefly discusses connecting to primary and secondary members, performing CRUD operations, and setting up MongoDB Management Service (MMS) for monitoring and backups.
- Replica sets in MongoDB allow for replication across multiple servers, with one server acting as the primary and able to accept writes, and other secondary servers replicating the primary.
- If the primary fails, the replica set will automatically elect a new primary from the secondary servers and continue operating without interruption.
- The replica set configuration specifies the members, their roles, and settings like heartbeat frequency to monitor member health and elect a primary if needed.
Getting started with replica set in MongoDBKishor Parkhe
The document provides instructions for setting up and administering replica sets and sharded clusters in MongoDB. It describes initializing and configuring replica sets, adding members, and handling failures. It also explains the components of sharded clusters, requirements for sharding, and steps for enabling and administering sharding, including adding shards, sharding data, and commands for viewing sharding status.
Back to Basics: Build Something Big With MongoDB MongoDB
1. Replica sets allow for high availability and redundancy by creating copies of data across multiple nodes. The replica set lifestyle involves creation, initialization, handling failures and failovers, and recovery from failures.
2. When developing with replica sets, developers must consider consistency models such as strong consistency, delayed consistency, and write concerns to determine how and when data is written and acknowledged. Tagging and read preferences also allow control over where data is read from and written to.
3. Sharding provides horizontal scalability by partitioning data across multiple machines or replica sets. The data is split into chunks based on a user-defined shard key and distributed across shards. A config server stores metadata about chunk mappings and locations,
In this session we will cover wide area replica sets and using tags for backup. Attendees should be well versed in basic replication and familiar with concepts in the morning's basic replication talk. No beginner topics will be covered in this session
Database replication involves keeping identical copies of data on different servers to provide redundancy and minimize downtime. Replication is recommended for databases in production from the start. A MongoDB replica set consists of a primary server that handles client requests and secondary servers that copy the primary's data. Replica sets can include up to 50 members with 7 voting members and use an oplog to replicate operations from the primary to secondaries. For elections and writes to succeed, a majority of voting members must be reachable.
This document discusses MongoDB replication using replica sets. It describes how to configure and administer replica sets, which allow for asynchronous master-slave replication and automatic failover between members. Replica sets maintain multiple copies of data across multiple servers, provide redundancy and high availability, and can elect a new primary if one fails. The document outlines different replication topologies and member types in a replica set, and how replica sets integrate with sharded clusters in MongoDB.
Netflix is the world's largest online movie rental service with over 10 million subscribers. Their mission is to connect people with movies they'll love. They are moving to the cloud with AWS to leverage its massive scale and mature application services for better availability, operational simplicity, and encoding of content for streaming across multiple devices. Netflix uses many AWS services like EC2, S3, SQS, and EMR and is addressing challenges of reliability, persistence, security and data consistency. Engineers are encouraged to join Netflix.
Advanced data migration techniques for Amazon RDSTom Laszewski
Migrating on premise data from Oracle and MySQL Databases to AWS Oracle and MySQL RDS. These techniques will work for AWS EC2 as well. Scripts included in the slides.
MongoDb scalability and high availability with Replica-SetVivek Parihar
One of the much awaited features in MongoDB 1.6 is replica sets, MongoDB replication solution providing automatic failover and recovery.
MongoDB High Availabiltity with Replica Sets
This talk will cover -
• What is Replica Set?
• Replication Process
• Advantaged of Replica Set vs master/slave
• How to set up replica set on production Demo
This video is tutorial for setting up the MongoDb replica-set ion production environment. In this i took 3 instances which have already mongo installed and running. This tutorial consists-:
1.Setup the each instance of replica set
2.modify the mongodb.conf to include replica set information
3.configure the servers to include in replica set
4.then cross checking if we kill one primary then secondary becomes primary or not.
Setting up mongodb sharded cluster in 30 minutesSudheer Kondla
The document describes how to configure and deploy a MongoDB sharded cluster with 6 virtual machines in 30 minutes. It provides step-by-step instructions on installing MongoDB, setting up the config servers, adding shards, and enabling sharding for databases and collections. Key aspects include designating MongoDB instances as config servers, starting mongos processes connected to the config servers, adding shards by hostname and port, and enabling sharding on specific databases and collections with shard keys.
This document provides information about MongoDB replication and sharding. It discusses what replication is, how to set up replication on Windows including starting primary and secondary servers and verifying replication. It also discusses best practices for replication including always using replica sets, using replica sets to offload reads from primary, and using an odd number of replicas. The document also discusses how to set up MongoDB replication on Linux in a step-by-step process and how to check the replication status. It provides commands for adding and removing MongoDB instances from a replica set and making a primary secondary. Finally, it discusses what sharding is in MongoDB, the concept of sharding keys, and provides a high-level overview of implementing sharding in MongoDB including using
CoreOS, or How I Learned to Stop Worrying and Love SystemdRichard Lister
Ric Lister presents patterns for running Docker in production on CoreOS, including a simple homogeneous operations cluster where sidekick units announce services in etcd and a reverse proxy discovers them, an etcd and workers pattern for low-traffic sites behind a load balancer, and an immutable servers pattern without etcd for high-traffic microservices with strict change control. He also discusses logging to ship container output off hosts, various monitoring options, alternative operating systems like RancherOS and Atomic, and scheduler options like Kubernetes, Mesos, and Deis.
This document provides instructions for setting up a small 3 node Hadoop cluster on Amazon EC2. It covers configuring EC2 instances, installing Java and Hadoop, configuring the Hadoop nodes and services, and running a sample MapReduce job to validate the cluster. The goal is to provide a simple tutorial for getting started with Hadoop on EC2 for learning purposes.
This document provides instructions for setting up an Apache Hadoop cluster on Macintosh OSX. It describes installing and configuring Java, Hadoop, Hive, and MySQL on a "namenode" machine and multiple "datanode" machines. Key steps include installing software via Homebrew, configuring host files and SSH keys for passwordless login, creating configuration files for core Hadoop components and copying them to all datanodes, and installing scripts to help manage the cluster. The goal is to have a basic functioning Hadoop cluster on Mac OSX for testing and proof of concept purposes.
This document provides guidance on deploying and upgrading a MongoDB sharded cluster. It discusses the components of a sharded cluster including config servers, shards, and mongos processes. It recommends a production deployment have at least 3 config servers, 3 nodes per shard replica set, and multiple mongos instances. The document outlines steps for deploying each component, including initializing replica sets and adding shards. It also provides a checklist for upgrading between minor and major versions, such as changes to configuration options, deprecated operations, and connectivity changes.
This document provides an overview and instructions for deploying, upgrading, and troubleshooting a MongoDB sharded cluster. It describes the components of a sharded cluster including shards, config servers, and mongos processes. It provides recommendations for initial deployment including using replica sets for shards and config servers, DNS names instead of IPs, and proper user authorization. The document also outlines best practices for upgrading between minor and major versions, including stopping the balancer, upgrading processes in rolling fashion, and handling incompatible changes when downgrading major versions.
This document provides guidance on deploying and upgrading a MongoDB sharded cluster. It discusses the components of a sharded cluster including config servers, shards, and mongos processes. It recommends a production deployment has at least 3 config servers, 3 nodes per shard replica set, and multiple mongos instances. The document outlines steps for deploying each component, including initializing replica sets and adding shards. It also provides a checklist for upgrading between minor and major versions, such as changes to configuration options, deprecated operations, and connectivity changes.
This document provides an overview and introduction to MongoDB including:
- MongoDB installation using yum or binary, roles in MongoDB, and how to create users
- Basic MongoDB commands like use, show dbs, and dropDatabase
- Tracing slow queries using the database profiler and viewing results in the system.profile collection
- Important monitoring commands like db.serverStatus(), currentOp(), db.stats(), and replica set commands like rs.status() and rs.printReplicationInfo()
The document provides instructions for installing and configuring the Eyeball XMPP Server, which is a scalable instant messaging server that supports client-to-client, client-to-server, and server-to-server communication. It describes installing prerequisites like databases and ODBC drivers, configuring the edge and state server components, creating database schemas and users, enabling TLS encryption, and configuring licensing and server-to-server communication.
MongoDB is a non-relational database that supports document-based queries, indexing of all fields, master-slave replication for high availability, automatic sharding of data across multiple servers, and MapReduce for flexible aggregation. It uses dynamic schemas and embeds documents which can store binary data. Queries in MongoDB support ad-hoc queries on documents using standard operators and indexes can be applied on any field.
Oracle 11g Installation With ASM and Data Guard SetupArun Sharma
In this article we will look at Oracle 11g installation with ASM storage and also setup physical standby on ASM.
We will be following below steps for our configuration:
Setup Primary Server
Setup Standby Server
Full article link is here: https://www.support.dbagenesis.com/post/oracle-11g-installation-with-asm-and-data-guard-setup
This document provides information about installing and configuring Linux, Apache web server, PostgreSQL database, and Apache Tomcat on a Linux system. It discusses installing Ubuntu using VirtualBox, creating users and groups, setting file permissions, important Linux files and directories. It also covers configuring Apache server and Tomcat, installing and configuring PostgreSQL, and some self-study questions about the Linux boot process, run levels, finding the kernel version and learning about NIS, NFS, and RPM package management.
The document provides information on MongoDB replication and sharding. Replication allows for redundancy and increased data availability by synchronizing data across multiple database servers. A replica set consists of a primary node that receives writes and secondary nodes that replicate the primary. Sharding partitions data across multiple machines or shards to improve scalability and allow for larger data sets and higher throughput. Sharded clusters have shards that store data, config servers that store metadata, and query routers that direct operations to shards.
The document discusses installing and configuring MySQL on Linux. It provides steps to install MySQL using RPM files, set passwords for security, test the installation, and configure applications to connect to the database. It also covers basic and advanced MySQL commands like CREATE TABLE, SELECT, JOIN, and more.
This document provides instructions for installing Hadoop on a small cluster of 4 virtual machines for testing purposes. It describes downloading and extracting Hadoop, configuring environment variables and SSH keys, editing configuration files, and checking the Hadoop status page to confirm the installation was successful.
This document provides an overview of how to set up and manage a MongoDB sharded cluster. It describes the key components of a sharded cluster including shards, config servers, and mongos query routers. It then provides step-by-step instructions for deploying, upgrading, and troubleshooting a sharded cluster. The document explains how to configure shards, config servers, and mongos processes. It also outlines best practices for upgrading between minor and major versions of MongoDB.
5/13/13 presentation to Austin DevOps Meetup Group, describing our system for deploying 15 websites and supporting services in multiple languages to bare redhat 6 VMs. All system-wide software is installed using RPMs, and all application software is installed using GIT or Tarball.
Introduction to automation in the cloud, why it's needed, what are the tools or ways of working, the processes, the best practises with some examples and takeaways.
Nagios Conference 2014 - Mike Weber - Expanding NRDS Capabilities on Linux Sy...Nagios
Mike Weber's presentation on Expanding NRDS Capabilities on Linux Systems.
The presentation was given during the Nagios World Conference North America held Oct 13th - Oct 16th, 2014 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/conference
Configure & send push notification on i os deviceShepHertz
To help with the configuration of push notifications in iOS apps, we have come up with a comprehensive tutorial that covers some of the difficult steps such as configuring your own service and creating p12 files among other necessary tasks.
Complete steps to Integrate Push Notification for Your Cocos2dx App with Push...ShepHertz
Integrating push notification for ios and android in cocos2dx is always a big challenge for developers. So, this blog will walk you through step-by-step integration process by taking the examples from the sample.
A Complete Tutorial to Develop Real-Time Fighting Game with Cocos2dxShepHertz
A Comprehensive tutorial to develop real time fighting game in Cocos2dx. To make thing easier you can check out this developed real-time fighting game demo in cocos2dx who help developers bootstrap easily. This demo’s some of key steps involved in doing this and the entire source code can be downloaded or viewed from our git repo.
Travel and aviation marketing automation and omnichannel travelShepHertz
Accelerate Digitization in Travel and Aviation by providing omni-channel customer experience using Predictive Analytics, Marketing Automation, Api Management.
Gaming marketing automation and multiplayer game developmentShepHertz
Develop rich & engaging games with AppWarp multiplayer engine and boost user acquisition, engagement, retention & conversion with App42 Marketing Automation.
Banking Services Marketing Automation and Omni-channel BankingShepHertz
Accelerate Digitization in banking by providing omni-channel customer experience using Predictive Analytics, Marketing Automation, Api Management with ShepHertz.
ShepHertz is a leading Cloud and On-Premise Ecosystem provider for App and Game development.
It's flagship product App42 comprises of App42 Game Mechanics APIs and Marketing Automation, AppWarp – Multiplayer Gaming Engine (Cloud + On-Premise), and App42 Gaming PaaS.
ShepHertz's endeavor is to provide Game App developers with easy and extremely cost-effective tools and services they need to succeed in today’s multi-device world while reducing their development time, time to Market, cost, effort and infra management disturbances through the productivity and efficiency of the platform itself.
Push Notification with Unity in iOS using App42 BackendShepHertz
Integrating App42 Push Notification service with Unity3D iOS App. With Unity you can create rich, interactive entertainment or multimedia experience. This presentation will help you learn how to use Unity Push Notification feature with App42 Backend as a Service APIs. Check out our website http://www.shephertz.com
App42 Student Lab - Android Game Dev Series V 0.1ShepHertz
ShepHertz has started App42 Student Lab to provide umpteen learning and practical simulation to budding developers.
The Android Game Dev Series is the first workshop that is being conducted in association with Entrepreneurship Cell, IIT Kharagpur. The Series consists of three Webinars and a month-long Hackathon for college students to learn and bring their ideas to reality.
In this first Webinar V0.1 we have covered: basic steps in developing an App or a Game, configuring Java, Android SDK Setup, developing a sample Tic-Tac-Toe Game on Android, and integrating a basic Leaderboard using App42.
ShepHertz - A Complete Cloud Ecosystem for your AppsShepHertz
The document discusses App42, a platform that provides APIs, SDKs, and services to help reduce development time and costs for app developers. It offers tools for user acquisition, engagement, analytics, and multiplayer gaming. App42 provides backend functionality, analytics dashboards, A/B testing, push notifications, and pricing plans for infrastructure usage.
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell
With expertise in data architecture, performance tracking, and revenue forecasting, Andrew Marnell plays a vital role in aligning business strategies with data insights. Andrew Marnell’s ability to lead cross-functional teams ensures businesses achieve sustainable growth and operational excellence.
This is the keynote of the Into the Box conference, highlighting the release of the BoxLang JVM language, its key enhancements, and its vision for the future.
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfSoftware Company
Explore the benefits and features of advanced logistics management software for businesses in Riyadh. This guide delves into the latest technologies, from real-time tracking and route optimization to warehouse management and inventory control, helping businesses streamline their logistics operations and reduce costs. Learn how implementing the right software solution can enhance efficiency, improve customer satisfaction, and provide a competitive edge in the growing logistics sector of Riyadh.
Role of Data Annotation Services in AI-Powered ManufacturingAndrew Leo
From predictive maintenance to robotic automation, AI is driving the future of manufacturing. But without high-quality annotated data, even the smartest models fall short.
Discover how data annotation services are powering accuracy, safety, and efficiency in AI-driven manufacturing systems.
Precision in data labeling = Precision on the production floor.
Book industry standards are evolving rapidly. In the first part of this session, we’ll share an overview of key developments from 2024 and the early months of 2025. Then, BookNet’s resident standards expert, Tom Richardson, and CEO, Lauren Stewart, have a forward-looking conversation about what’s next.
Link to recording, presentation slides, and accompanying resource: https://bnctechforum.ca/sessions/standardsgoals-for-2025-standards-certification-roundup/
Presented by BookNet Canada on May 6, 2025 with support from the Department of Canadian Heritage.
How Can I use the AI Hype in my Business Context?Daniel Lehner
𝙄𝙨 𝘼𝙄 𝙟𝙪𝙨𝙩 𝙝𝙮𝙥𝙚? 𝙊𝙧 𝙞𝙨 𝙞𝙩 𝙩𝙝𝙚 𝙜𝙖𝙢𝙚 𝙘𝙝𝙖𝙣𝙜𝙚𝙧 𝙮𝙤𝙪𝙧 𝙗𝙪𝙨𝙞𝙣𝙚𝙨𝙨 𝙣𝙚𝙚𝙙𝙨?
Everyone’s talking about AI but is anyone really using it to create real value?
Most companies want to leverage AI. Few know 𝗵𝗼𝘄.
✅ What exactly should you ask to find real AI opportunities?
✅ Which AI techniques actually fit your business?
✅ Is your data even ready for AI?
If you’re not sure, you’re not alone. This is a condensed version of the slides I presented at a Linkedin webinar for Tecnovy on 28.04.2025.
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersToradex
Toradex brings robust Linux support to SMARC (Smart Mobility Architecture), ensuring high performance and long-term reliability for embedded applications. Here’s how:
• Optimized Torizon OS & Yocto Support – Toradex provides Torizon OS, a Debian-based easy-to-use platform, and Yocto BSPs for customized Linux images on SMARC modules.
• Seamless Integration with i.MX 8M Plus and i.MX 95 – Toradex SMARC solutions leverage NXP’s i.MX 8 M Plus and i.MX 95 SoCs, delivering power efficiency and AI-ready performance.
• Secure and Reliable – With Secure Boot, over-the-air (OTA) updates, and LTS kernel support, Toradex ensures industrial-grade security and longevity.
• Containerized Workflows for AI & IoT – Support for Docker, ROS, and real-time Linux enables scalable AI, ML, and IoT applications.
• Strong Ecosystem & Developer Support – Toradex offers comprehensive documentation, developer tools, and dedicated support, accelerating time-to-market.
With Toradex’s Linux support for SMARC, developers get a scalable, secure, and high-performance solution for industrial, medical, and AI-driven applications.
Do you have a specific project or application in mind where you're considering SMARC? We can help with Free Compatibility Check and help you with quick time-to-market
For more information: https://www.toradex.com/computer-on-modules/smarc-arm-family
Generative Artificial Intelligence (GenAI) in BusinessDr. Tathagat Varma
My talk for the Indian School of Business (ISB) Emerging Leaders Program Cohort 9. In this talk, I discussed key issues around adoption of GenAI in business - benefits, opportunities and limitations. I also discussed how my research on Theory of Cognitive Chasms helps address some of these issues
Big Data Analytics Quick Research Guide by Arthur MorganArthur Morgan
This is a Quick Research Guide (QRG).
QRGs include the following:
- A brief, high-level overview of the QRG topic.
- A milestone timeline for the QRG topic.
- Links to various free online resource materials to provide a deeper dive into the QRG topic.
- Conclusion and a recommendation for at least two books available in the SJPL system on the QRG topic.
QRGs planned for the series:
- Artificial Intelligence QRG
- Quantum Computing QRG
- Big Data Analytics QRG
- Spacecraft Guidance, Navigation & Control QRG (coming 2026)
- UK Home Computing & The Birth of ARM QRG (coming 2027)
Any questions or comments?
- Please contact Arthur Morgan at [email protected].
100% human made.
Quantum Computing Quick Research Guide by Arthur MorganArthur Morgan
This is a Quick Research Guide (QRG).
QRGs include the following:
- A brief, high-level overview of the QRG topic.
- A milestone timeline for the QRG topic.
- Links to various free online resource materials to provide a deeper dive into the QRG topic.
- Conclusion and a recommendation for at least two books available in the SJPL system on the QRG topic.
QRGs planned for the series:
- Artificial Intelligence QRG
- Quantum Computing QRG
- Big Data Analytics QRG
- Spacecraft Guidance, Navigation & Control QRG (coming 2026)
- UK Home Computing & The Birth of ARM QRG (coming 2027)
Any questions or comments?
- Please contact Arthur Morgan at [email protected].
100% human made.
2. Background
It has always been a tedious task to choose the right configuration for
MongoDB on AWS EC2
Choosing the right configuration in this environment is always challenging
and it takes a lots of time to make your system Production Ready.
3. What does it take?
All it needs is two machines that will be used as PRIMARY (Master)
and SECONDARY (Slave) node and one ARBITER machine for the
replica set.
However, it might get changed based on your application
requirement and you can opt for higher number of nodes based on
your need
ARBITER is only required in case of even number replica set. If you
want to maintain replica set with one PRIMARY and two SECONDARY,
ARBITER is not required
4. Hardware Requirement
Two 64 bit EC2 instances of medium/large or higher configuration
based on your App requirement for PRIMARY and SECONDARY node
(There is a data storage limitation of using 32 bit machine and can only
support upto 2.5 GB storage)
A small 32 bit EC2 machine for MongoDB ARBITER
It is recommended –
• to have machines in different availability zone to make it High
available in-case of a shutdown of one availability zone
• to use Ext4 EBS volume to support I/O suspend and write-cache
flushing for multi-disk consistent snapshots
5. Installation Steps
Create and Launch an EC2 instance of required configuration as
stated above for PRIMARY, SECONDARY and ARBITER nodes
Create an EBS volume of required size to be used for MongoDB storage
for both nodes
Connect to EC2 instances on PRIMARY and SECONDARY node via SSH
Make an Ext4 file system on both nodes via sudo mkfs -t ext4
/dev/<Created EBS Volume>
Create directory /data/db or any other of your own choice and mount
it to attached volume using sudo mount -a /dev/< Created EBS
Volume > /data/db
6. Contd..
Edit your /etc/fstab to enumerate it on start up of instance using
sudo echo ‘/dev/sdf /data/db auto noatime,noexec,nodiratime 0 0’
>> /etc/fstab
Download and Install MongoDB on all instances
Start the PRIMARY node with following command in MongoDB
directory using mongod --rest --replSet myHASet (where myHASet is
the name of Replica set; you can choose any name of your choice)
Go to Mongo terminal in MongoDB directory.
Initialize the set using command rs.initiate() on mongo terminal
Check the status of Replica set after initialization using rs.status()
command.
7. Contd..
If initialization is success you will see OK in the output something like
this
{
"set" : "sample",
"myState" : 1,
"members" : [
{
"name" : "<PRIMARY_HOSTNAME>:27017",
"self" : true
}
],
"ok" : 1
}
You can also check the status on
http://<PRIMARY_NODE>:27017/_replSet
Your Primary node is ready to use now. You can insert/update document
on this node
8. Contd..
Now start the SECONDARY node with same command as on primary
mongod --rest --replSet myHASet
Tell the PRIMARY node to add SECONDARY node in replica set. Go to
mongo console on PRIMARY node and add this using
rs.add(“<SECONDARY_HOSTNAME>”);
If addition is successful you will see the response
Once your SECONDARY node is attached to replica set you can check
the status on http://<PRIMARY_NODE>:27017/_replSet
Now start the ARBITER node using mongod --rest --replSet myset -oplogSize 8
9. Contd..
Add the ARBITER node in replica set using command rs.add( {
_id:2, host:”<ARBITER_HOSTNAME>”, arbiterOnly:true } )
Once ARBITER is added successfully, you are done with the
configuration and your replica set is ready to use.
Got o http://<PRIMARY_NODE>:27017/_replSet and you should
be able to see the status of each node.
To test the replica, take down the primary node, and see if
SECONDARY is able to pick up and will become PRIMARY node.
You can fire the command db.isMaster() to check the status if
SECONDARY node has turned up as Master node.
10. Connecting Replica
• After you have setup the replica set successfully, you can connect with it
using JAVA driver from your client application
• You can use the following code snippet for making connection to replica set
List addrs = new ArrayList();
addrs.add( new ServerAddress(“<PRIMARY_HOST>",”<MONGO_PORT>" ) );
addrs.add( new ServerAddress(“<SECONDARY_HOST>",“<MONGO_PORT>"));
Mongo m = new Mongo(addrs);
DB db = m.getDB(“<NAME_OF_DB>");
• MongoDB driver is smart enough to connect to PRIMARY node only, in-case if
PRIMARY node is down, it will automatically switch to another node for
communication
11. Conclusion
Here is an honest attempt to guide you to setup MongoDB on AWS
EC2. Though this is an open forum and you all are open to post your
comments if I have missed anything
Also, if you don’t want to get into setting up the infrastructure and
administration for MongoDB, you can directly use our App42 NoSQL
Cloud Storage Service.
This service can be accessed using our REST API or using native
platform SDKs available in different languages like iOS, Android,
J2ME, JAVA, PHP, Ruby, Windows Phone and C#
12. Links for Reference :
http://www.shephertz.com
http://api.shephertz.com
http://appwarp.shephertz.com
http://app42paas.shephertz.com/
Follow us on:
Contact: [email protected]
Skype: ShepHertz