Cassandra and Data Handling

The document discusses various concepts related to Apache Cassandra such as data modeling, compaction strategies, wide rows, real-time ingestion using Spark and Kafka, analytics using Spark, performance tuning and monitoring metrics, and data loading tools like sstableloader and COPY command. It provides answers to questions on Cassandra features like secondary indexes, consistency, replication factor, configuration files, APIs, and best practices for modeling Cassandra data.

Uploaded by

Lynch George


The type of __________ strategy Cassandra performs on your data is configurable and
can significantly affect read performance.

compaction
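
Note: a read may have to consult several SSTables that each hold a version of the same key, and compaction merges them so fewer files are touched. The following is a minimal sketch of that merge, not Cassandra's actual implementation. The `compact` function and the `(timestamp, value)` layout are hypothetical, and tombstones are dropped immediately here, whereas a real cluster keeps them for a grace period.

```python
def compact(*sstables):
    """Merge SSTables: for each key keep the cell with the newest timestamp."""
    merged = {}
    for table in sstables:
        for key, (timestamp, value) in table.items():
            if key not in merged or timestamp > merged[key][0]:
                merged[key] = (timestamp, value)
    # Drop deleted cells (value None) and emit keys in sorted order.
    return {k: merged[k] for k in sorted(merged) if merged[k][1] is not None}


# Two hypothetical SSTables: key -> (write timestamp, value).
older = {"alice": (1, "v1"), "bob": (1, "v1")}
newer = {"alice": (2, "v2"), "bob": (3, None)}  # None marks a delete (tombstone)

compacted = compact(older, newer)
print(compacted)
```

After the merge a read for "alice" finds one current version in one place instead of checking both inputs.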

In which of the following scenarios can we use 'Wide rows'?

All the options

What is the key that dictates how the rows are ordered on reads?
Comparator

Which among the following is undesirable in a relational data model, but not in
Cassandra?
Denormalization
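
Note: denormalization in Cassandra usually means keeping one table per query pattern and writing the same data to each. A minimal sketch using plain dictionaries as stand-ins for tables; the table and function names are hypothetical.

```python
# One "table" per query: look up by id, or look up by email.
users_by_id = {}
users_by_email = {}


def add_user(user_id, email, name):
    """Write the same row to both tables so each query is a single key lookup."""
    row = {"user_id": user_id, "email": email, "name": name}
    users_by_id[user_id] = row
    users_by_email[email] = dict(row)  # deliberate duplicate copy


add_user(1, "ann@example.com", "Ann")
print(users_by_id[1]["name"], users_by_email["ann@example.com"]["user_id"])
```

The cost is duplicated storage and an extra write; the benefit is that neither read needs a join.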

Point out the correct statement :

All of the mentioned

Cassandra searches the __________ to determine the approximate location on disk of
the index entry.
partition summary

Sqoop works via JDBC connection

True

___________________ is the tool that works with Cassandra to make data transfer from
RDBMS systems possible.
Sqoop

Cassandra has support for which among the following RDBMS systems?

All the options

Cassandra supports data transfer from RDBMS systems in and out of Cassandra
True

________ is used to ingest data into Cassandra in real time while working with Spark
Spark Streaming

In real time processing the data under consideration is Data-in-motion

True

Real-time Analytics Use cases include

All the options

Which among the following is possible in cassandra-Spark combination and not
delivered as a feature of cassandra alone
All the options

Does Cassandra support JOIN operations?

Yes, it supports JOINs when used in conjunction with Spark.

____________ is used to connect Spark and Cassandra

Spark connector

Which of the Hadoop components enables you to run analytics on your Cassandra data?
All the options

What is the purpose of using Thrift in Cassandra?

facilitate access to the DB

What is SuperColumn in Cassandra?

unique element

Select the best advantage of Analytics with cassandra?

Fraud detection

What is the use of Source Command in Cassandra?

execute a file

ColumnFamily refers to a structure having infinite number of rows.

True

What is the default log level used by Log4J in cassandra

INFO

What is the optimum number of concurrent_reads per processor core?

Key Cassandra metrics that are important in Performance monitoring

All the options

What is the biggest performance gain in Cassandra write operations?

commit log on a separate disk

Which is the least verbose logging level in Cassandra

error

In cassandra consistency is achieved through consistency tuning mechanisms

True

Which among the following is not a performance measurement tool

None

Use secondary index if you want to query a column which is not a primary key/not
part of composite key
True

Which among the following is a high-level goal of your data model?

minimizing the number of data-duplication -- wrong

_________ is a Cassandra feature that optimizes the cluster consistency process

Hinted handoff
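
Note: with hinted handoff, a write meant for a replica that is down is remembered as a hint and delivered once the replica is reachable again. A minimal sketch of that idea under simplified assumptions; the `Coordinator` class and its methods are hypothetical and leave out timeouts, hint expiry, and storage on disk.

```python
class Coordinator:
    """Toy coordinator that stores hints for replicas that are down."""

    def __init__(self, replica_names):
        self.replicas = {name: {} for name in replica_names}  # name -> key/value store
        self.up = {name: True for name in replica_names}
        self.hints = []  # pending (target replica, key, value) tuples

    def write(self, key, value):
        for name, store in self.replicas.items():
            if self.up[name]:
                store[key] = value
            else:
                # Replica unreachable: keep a hint instead of failing the write.
                self.hints.append((name, key, value))

    def replay_hints(self, name):
        """Deliver stored hints to a replica that has come back up."""
        remaining = []
        for target, key, value in self.hints:
            if target == name and self.up[name]:
                self.replicas[name][key] = value
            else:
                remaining.append((target, key, value))
        self.hints = remaining


coordinator = Coordinator(["n1", "n2", "n3"])
coordinator.up["n3"] = False
coordinator.write("k", "v")        # n3 misses the write, a hint is stored
coordinator.up["n3"] = True
coordinator.replay_hints("n3")     # n3 catches up
print(coordinator.replicas["n3"])
```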

When do you have to avoid using secondary indexes

can use it on any columns without affecting performance -- wrong

Using sstableloader we can load

both pre-existing sstables and external data

What can also be attributed as wide-row in apache Cassandra

Compound Key

What is Replication Factor in Cassandra?
number of data copies existing
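
Note: the replication factor also drives consistency tuning. A quorum is a majority of the replicas, and a read is guaranteed to overlap the latest write when the number of replicas read plus the number written exceeds the replication factor. A short sketch of that arithmetic; the function names are illustrative only.

```python
def quorum(replication_factor: int) -> int:
    """Majority of replicas: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def reads_see_latest_write(read_replicas: int, write_replicas: int,
                           replication_factor: int) -> bool:
    """True when every read set must overlap every write set (R + W > RF)."""
    return read_replicas + write_replicas > replication_factor


rf = 3
q = quorum(rf)
print(q, reads_see_latest_write(q, q, rf), reads_see_latest_write(1, 1, rf))
```

With a replication factor of 3, quorum reads and quorum writes overlap, while reading and writing a single replica each does not.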

Command used to import data from a CSV file into an existing table
COPY FROM

Which directory contains Cassandra configuration files

conf

Which among the following can be used as COPY option

All the options

While loading external data into a cluster

both the options -- wrong

State whether the statement is true or false : Cassandra runs on RedHat

True

COPY command can be used to read data

All the options

Cassandra has API support for which of the following

All the options

sstableloader uses __________ protocol to learn the topology of the cluster.

gossip

Using sstableloader data loading into a live, active cluster is not allowed.
False

Partition index is list of partition keys and the start position of rows in the
data file (on disk).
True
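
Note: the relationship between the partition summary and the partition index can be sketched as a two-step lookup: the summary is a sample of the index that narrows the search to a small range, and the index then gives the exact position in the data file. This is a simplified in-memory illustration; the keys, offsets, and sampling interval are made up.

```python
from bisect import bisect_right

# Hypothetical partition index: (partition key, byte offset in the data file), sorted by key.
partition_index = [("k01", 0), ("k02", 120), ("k03", 260),
                   ("k04", 400), ("k05", 515), ("k06", 700)]

SAMPLE_EVERY = 2  # assumed sampling interval for the summary

# Partition summary: every Nth index entry, stored as (key, position in the index).
partition_summary = [(key, pos) for pos, (key, _) in enumerate(partition_index)
                     if pos % SAMPLE_EVERY == 0]


def find_offset(key):
    """Use the summary to pick an index range, then scan that range for the key."""
    summary_keys = [k for k, _ in partition_summary]
    slot = bisect_right(summary_keys, key) - 1
    if slot < 0:
        return None  # key sorts before every indexed partition
    start = partition_summary[slot][1]
    for index_key, offset in partition_index[start:start + SAMPLE_EVERY]:
        if index_key == key:
            return offset
    return None


print(find_offset("k04"))
```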

What is the default Partitioner in apache Cassandra cluster

Murmur3Partitioner
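
Note: a partitioner hashes each partition key to a token, and the token decides which node owns the row. The sketch below shows the idea on a three-node ring. It is an assumption-heavy illustration: it uses MD5 from the standard library as a stand-in hash (not Murmur3), and the node names and token values are invented.

```python
import hashlib
from bisect import bisect_left


def token(partition_key: str) -> int:
    """Map a key to a signed 64-bit token (MD5 stand-in, not the real Murmur3 hash)."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big", signed=True)


# Hypothetical ring: each node owns the range ending at its token.
ring = sorted([
    (-6_000_000_000_000_000_000, "node-a"),
    (0, "node-b"),
    (6_000_000_000_000_000_000, "node-c"),
])


def owner(partition_key: str) -> str:
    """Return the first node whose token is >= the key's token, wrapping around."""
    tokens = [t for t, _ in ring]
    idx = bisect_left(tokens, token(partition_key)) % len(ring)
    return ring[idx][1]


print(owner("user:42"))
```

The same key always hashes to the same token, so every client agrees on where a partition lives without a central lookup.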

What kind of files can be imported or exported using the COPY command
csv

Is there any relation between the directory that holds sstables and the name of the
keyspace of the sstable
Both names have to be the same

Tool that streams sstables to a live cluster

sstableloader

What is the need of a partition key?

decompression -- wrong

Real-time data ingestion in Cassandra can be done using

both Spark and Kafka

It is wise to use secondary indexes when the column you want to query has
few unique values
True

Hi Shiva,

Thanks.. I completed yesterday only and sent you dumps .

Thanks,
Hiren Kalavadia
TCS – Comcast Relationship

From: Gande, Shiva (Contractor)

Sent: Monday, October 8, 2018 4:35 AM
To: Kalavadia, Hiren (Contractor)
Subject: Cassandra Data Modeling - I got 18 out of 25 some may be wrong,,,

Point out the correct statement

All the options

Cassandra searches the __________ to determine the approximate location on disk of
the index entry.
partition summary

Which among the following is undesirable in a relational data model, but not in
Cassandra?
Denormalization

What is the key that dictates how the rows are ordered on reads?
Comparator

In which of the following scenarios can we use 'Wide rows'

All the options

Cassandra searches the __________ to determine the approximate location on disk of
the index entry
partition record -- Wrong
partition search -- Wrong

The type of __________ strategy Cassandra performs on your data is configurable and
can significantly affect read performance
compaction

Cassandra has support to which among the following RDBMS system

All the options

Cassandra supports data transfer from RDBMS systems in and out of Cassandra
True

What can also be attributed as wide-row in Apache Cassandra
Clustering Key
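
Note: a wide row is a partition that holds many entries, kept in order by the clustering key. A minimal sketch with an in-memory stand-in for a table; the `insert` and `read_partition` helpers and the sensor data are hypothetical.

```python
from bisect import insort
from collections import defaultdict

# partition key -> list of (clustering key, value), kept sorted by clustering key
table = defaultdict(list)


def insert(partition_key, clustering_key, value):
    """Add an entry to its partition, preserving clustering-key order."""
    insort(table[partition_key], (clustering_key, value))


def read_partition(partition_key):
    """Return all entries of one partition, already sorted."""
    return list(table[partition_key])


insert("sensor-1", "09:05", 21.5)
insert("sensor-1", "09:00", 21.0)  # arrives late but is stored in order
insert("sensor-2", "09:01", 19.2)

print(read_partition("sensor-1"))
```

Because the order is maintained at write time, reading a time range from one partition is a sequential scan rather than a sort.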

Real-time data ingestion in Cassandra can be done using

both Spark and Kafka

Source Command in Cassandra is used to?

execute a file

In real time processing the data under consideration is Data-in-motion

True

Which among the following is possible in cassandra-Spark combination and not
delivered as a feature of cassandra alone
All the options

Cassandra supports joins while working in conjunction with spark

True

____________ is used to connect Spark and Cassandra

Spark Combiner -- Wrong

Does Cassandra support JOIN operations?

Yes, it supports JOINs when used in conjunction with Spark

________ is used to ingest data into Cassandra in real time while working with Spark

Spark Streaming

Real-time Analytics Use cases include

All the options

Select the best advantage of Analytics with cassandra

Fraud detection

Which of the Hadoop components enables you to run analytics on your Cassandra data
All the options

ColumnFamily refers to a structure having infinite number of rows

True

What is SuperColumn in Cassandra

column keys

What is the purpose of using Thrift in Cassandra

access to DB

Which is the least verbose logging level in cassandra

error

What is the default log level used by Log4J in cassandra

INFO

What is the biggest performance gain in Cassandra write operations
commit log on a separate disk

Which among the following is not a performance measurement tool

iostat

In cassandra consistency is achieved through consistency tuning mechanisms

True

Key Cassandra metrics that are important in Performance monitoring

All the options

What is the optimum number of concurrent_reads per processor core

Use secondary index if you want to query a column which is not a primary key/not
part of composite key
True

Which among the following is true

All the options

What is the best method to store row data in a sorted order

use a primary key

What is the need of a partition key

identify the partition

Which among the following is true about Thrift API

used to read and write to DB

When do you have to avoid using secondary indexes

less account of unique values

sstableloader uses __________ protocol to learn the topology of the cluster

all of the mentioned

_________ is a Cassandra feature that optimizes the cluster consistency process

hinted handoff

What is Replication Factor in Cassandra

number of data copies existing

Which among the following is a high-level goal of your data model

minimize data duplication

Using sstableloader external data cannot be loaded into the cluster

false

Which among the following is true about COPY command

all the above

Cassandra supports which of the below APIs to retrieve and manipulate data
Thrift API

COPY command can be used to read data

All the options

You can enable or disable hinted handoff in the cassandra.yaml file
true

JMX stands for

Java Management Extensions

Which directory contains Cassandra configuration files

conf

While loading external data into a cluster

both the options

Which of the following is used to load the data in batch

all the options

Using sstableloader we can load

external data

Command used to import data from a CSV file into an existing table
COPY FROM

Thanks,
Shiva Gande
TCS – Comcast Relationship
