Cassandra is a extremely scalable NoSQL data source. Cassandra is architected to handle real-time big data workloads throughout numerous data locations with no individual level of disaster, providing companies with highly database efficiency and constant accessibility and it is free, open-source, allocated storage space system for handling considerable amounts of arranged data. It varies from conventional relational data source management systems in some significant ways. It is designed to range to a very huge size across many investment servers.

Cassandra was designed for fixing the problem of mailbox search at Facebook or myspace. It brings together Amazon Dynamo’s completely allocated style with Search engines Bigtable’s column-oriented information style. Facebook or myspace open-sourced Cassandra in 2008 and it became an Apache Incubator venture. In beginning 2010, Cassandra became a top-level Apache venture. These days there are thousands of Cassandra deployments in development, such as at organizations such as Blockbuster online, Tweets, Rackspace, and cisco.

Advantages of Cassandra :

  • Flexible and Scalable
  • Trustworthy
  • Long-lasting
  • Statistics Without ET
  • Performance
  • Features of Cassandra :
  • Decentralized
  • Supports duplication and multiple information middle replication
  • Scalability
  • Fault-tolerant
  • Tunable consistency
  • MapReduce support
  • Query language

Cassandra can consist of with Hadoop to provide only one solution for both research and real-time needs. Hadoop MapReduce provides the ability to run huge methodical problems against terabytes of details and it provides caching on each of its nodes. Managed with Cassandra’s scalability functions, you can progressively add nodes to the categories to keep as much of your details kept in storage space as you need. Thus, there is no need for a personal caching aspect.