This document provides an overview of Apache Cassandra, including: - Cassandra is an open source distributed database designed to handle large amounts of data across commodity servers. - It was originally created at Facebook and is influenced by Amazon Dynamo and Google Bigtable. - Cassandra uses a peer-to-peer distributed architecture with no single point of failure and supports replication across multiple data centers. - It uses a column-oriented data model with tunable consistency levels and supports the Cassandra Query Language (CQL) which is similar to SQL. - Major companies that use Cassandra include Facebook, Netflix, Twitter, IBM and more for its scalability, availability and flexibility.