1 / 8

An introduction to Apache Cassandra

A introduction to Apache Cassandra, what is it and how does it work ? How can it be used with Hadoopand how does it perorm ?

semtechs
Download Presentation

An introduction to Apache Cassandra

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Apache Cassandra • What is it ? • How does it work ? • Hadoop • Tools • Architecture www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  2. Cassandra – What is it ? • Distributed database management system • Designed for big data • Scalable • Fault tolerant • No single point of failure • Has an SQL like query language • NoSQL www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  3. Cassandra – How does it work ? • Organises data into tables • Uses Cassandra Query Language ( CQL )‏ • Does not allow sub queries or joins • Supports Hadoop Map Reduce • Uses asynchronous masterless replication • Gives low latency • Allows indexing • Allows batch analysis via Hadoop www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  4. Cassandra – Hadoop How does Cassandra integrate with Hadoop • Support for Map Reduce • Integration with • Apache Pig • Apache Hive • Can also act as a back end for Solr ! www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  5. Cassandra – Tools • User Interface ( GUI ) • Cassandra GUI • Toad for cloud db's • Administration • OpsCentre • Cassandra Cluster Admin • Other • Client libraries • Java, Python, .Net, Perl etc www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  6. Cassandra – Architecture • A peer to peer cluster • No single point of failure • Tunable consistency • Is performance or accuracy more important ? • Query by key or key range • Row oriented data storage • Rows can hold up to 2 billion columns www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  7. Cassandra – Architecture www.semtech-solutions.co.nz info@semtech-solutions.co.nz

  8. Contact Us • Feel free to contact us at • www.semtech-solutions.co.nz • info@semtech-solutions.co.nz • We offer IT project consultancy • We are happy to hear about your problems • You can just pay for those hours that you need • To solve your problems

More Related