RKCP Information Technology




Cassandra
Home » Courses   » Cassandra

Cassandra

This training helps the participants to learn Big Data Introduction, NoSQL DB Introduction, Cassandra Data Model, Cassandra Modelling & Architecture, Cassandra API, CQSHL, Cassandra Administration, Cassandra Analytics & Search Clusters

Overview

Apache Cassandra is a distributed database of second generation originally open-sourced by Facebook. Its write-optimized, shared-nothing architecture results in an excellent performance and scalability.

Cassandra moves away from the master-slave model to using a peer-to-peer model. This means there is no single master rather all the nodes are potentially masters. This helps to make the writes and reads extremely scalable and even allow nodes to function in case of partition tolerance.


Objective

Apache Cassandra is an open-source project and a distributed NoSQL database of second-generation. For higher availability and scalability of the database this is the best choice. Cassandra supports replication across multiple data-centers. It offers tunable consistency to make the write and read processes highly scalable.

This Apache Cassandra training provides an overview of the following:

  • Fundamentals of Big data and NoSQL database
  • Cassandra and the features provided by it.
  • Architecture of Cassandra and its data model
  • Installing, configuring, and monitoring Cassandra
  • Hadoop ecosystem of products around Cassandra


Prerequisites

  • Knowledge of any SQL database is preferred
  • Knowledge of Java is preferred (Not mandatory for Developers)

Syllabus

Expand All
  • 1. What is Big Data?
    • Landscape of Technology
    • Big Data Relevance
    • Distributed Systems and its Challenges
  • 2. Why to use NoSQL Database
    • Relational DB vs. NoSQL
    • Types of NoSQL Database
    • Landscape of NoSQL
    • CAP Theorem and Eventual Consistency
    • Key Characteristics of NoSQL Database
    • ACID vs BASE
  • 3. Fundamentals of Cassandra
    • Its Distributed and Decentralized
    • Scalability is Elastic
    • Its availability is High and it has Fault Tolerance
    • Consistency is Tunable
    • Its Row-Oriented
    • Its Schema-Free
    • Its Performance is High
  • 4. Data Model of Cassandra
    • The Relational Data Model of Cassandra
    • Simple Introduction
    • Clusters
    • Keyspaces
    • Hands-on Session
  • 5. Installation and setting up of Cassandra
    • Single Node Setup
    • Multi-Node Cluster Setup
    • Key Configurations needed for Cassandra
    • CLI and Hands-On with Cassandra
  • 6. Modeling of Cassandra
    • Cassandra (Column Family NoSQL DB)
    • Key Concepts - Key Space - Column Family - Column Family Options - Wide Rows, Skinny Row - Column Sorting - Super Columns - Counter Column Family - Composite Keys and Columns - Time To Live -
    • Secondary Index in Cassandra
    • Differences between Custom Index and Secondary Index
    • Differences between Relational Modeling and Cassandra Modeling
    • Key Points to note while modeling of a Cassandra Database
    • Patterns and Anit-Patterns in Cassandra Modeling