Cassandra benchmarking measures the performance of different data modeling with different set of nodes and clients. With comparison of different data modeling, this allows us to find out the optimized database schema design for Cassandra.
cd /temp // Install in temp folder
wget http://downloads.datastax.com/datastax-ddc/datastax-ddc-3.9.0-bin.tar.gz
tar zxvf datastax-ddc-3.9.0-bin.tar.gz
cd /temp // Install in temp folder
wget http://download.nus.edu.sg/mirror/apache/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz
tar xzvf apache-maven-3.3.9-bin.tar.gz
export PATH=/temp/apache-maven-3.3.9/bin:$PATH
cd /temp/datastax-ddc-3.9.0/conf
vim cassandra.yaml
Edit the settings in 'cassandra.yaml' file:
- seeds: Add the IP addresses of the three nodes.
- listen_address: Add in the IP address of the current node in use.
Save the file and restart the cassandra server.
Before running the scripts, make sure that the project is in the home folder. Change directory to the project folder to prepare for benchmarking.
cd Team3-Cassandra
The benchmark.sh script requires 2 arguments that represents the type of dataset (D8 or D40) and number of clients.
a) To bulkload all D8 datasets into the database with 1 node, run bash bulkload.sh 8 1.
b) To bulkload all D8 datasets into the database with 3 nodes, run bash bulkload.sh 8 3.
c) To bulkload all D40 datasets into the database with 1 node, run bash bulkload.sh 40 1.
d) To bulkload all D40 datasets into the database with 3 nodes, run bash bulkload.sh 40 3.
The benchmark.sh script requires 2 arguments that represents the type of dataset (D8 or D40) and number of clients.
a) To benchmark D8 datasets with 10 clients, run bash benchmark.sh 8 10.
b) To benchmark D8 datasets with 20 clients, run bash benchmark.sh 8 20.
c) To benchmark D8 datasets with 40 clients, run bash benchmark.sh 8 40.
d) To benchmark D40 datasets with 10 clients, run bash benchmark.sh 40 10.
e) To benchmark D40 datasets with 20 clients, run bash benchmark.sh 40 20.
f) To benchmark D40 datasets with 40 clients, run bash benchmark.sh 40 40.
ps -ax | grep cassandra //Look for the pid in the output “XXXX pts/0 Sl 0:19 java”
kill XXXX
https://maven.apache.org/install.html
http://www.mkyong.com/maven/install-maven-on-mac-osx/

