Overview

Note: The Workload Services Framework is a benchmarking framework and is not intended to be used for the deployment of workloads in production environments. It is recommended that users consider any adjustments which may be necessary for the deployment of these workloads in a production environment including those necessary for implementing software best practices for workload scalability and security.

Introduction¶

This workload runs Cassandra without Kubernetes and Docker/Containerd. It deploys Cassandra server and clients directly on bare metal or cloud VM instances.

Quick start¶

How to run.
It is better to have no less than 8 CPU cores, 16GB RAM on node. And no less than 4GB for each Cassandra instance.
Step 1: Clone code from https://github.com/intel/workload-services-framework
Step 2: Switch to code root path and run command "mkdir build".
Step 3: Go to path "../script/terraform" and edit file terraform-congig.static|aws|gcp|...|.tf to set your node information.
Step 4: Go to path "../build" and Run below command to configure the ctest.
"cmake -DBACKEND=terraform -DPLATFORM=EMR -DTERRAFORM_SUT=static -DREGISTRY="your registry IP:port" -DRELEASE=":Cassandra" -DBENCHMARK="Cassandra" ..". According to your test bed, 'DPLATFORM' can be changed to EMR. 'DCUMULUS_SUT' can be changed to aws or azure.
Step 5: Switch to path ../build/Workload/Cassandra and run "make" command.
Step 6: Run below command to list all test cases. "./ctest.sh -N"
Step 7: Run test case by below command. "./ctest.sh -R test_cassandran" "_1n" means running on single nodes. "_2n" means running on two nodes, server and client separated.
Run for performance data on EMR bare mental Below steps and configure is for Intel(R) Xeon(R) EMR on bare mental. Pre-requiement for resource:
a). 2 nodes(run test case "standalone_2n_pkm")
b). Enable SCN-2
c). 8 nvme disk mounted to /mnt/disk1/, /mnt/disk2/.../mnt/disk8/ on server node. The nvme disk top performace should be not lower than INTEL SSDPF21Q016TB*.
d). Disk space required. Each Cassandra instance will generated about 250GB~300GB data on disk. So each mvme disk free space should be no less than 300GB.

Steps:

Step 2: Switch to code root path and run command "mkdir build".
Step 3: Go to path "../script/terraform" and edit file terraform-congig.static.tf to set your node information.
Step 4: Go to path "../build" and Run below command to configure the ctest.
"cmake -DBACKEND=terraform -DPLATFORM=EMR -DTERRAFORM_SUT=static -DREGISTRY="your registry IP:port" -DRELEASE=":Cassandra" -DBENCHMARK="Cassandra" ..".
Step 5: Go to path "../build/Workload/Cassandra" and and run "make" command.
Step 6: Copy ../workload/Cassandra/test-config/bare-mental/test-config-emr.yaml to current directory.
Step 7: Run below command to list all test cases. "./ctest.sh -N"
Step 8: Run test case by below command.
"./ctest.sh --testcase test_static_cassandra_standalone_2n_pkm --config=test-config-emr.yaml -V"

Current Support Machine Instance¶

Bare mental.
EMR, its configure file is test-configure/bare-mental/test-config-erm.yaml
SPR, its configure file is test-configure/bare-mental/test-config-spr.yaml
ICX, its configure file is test-configure/bare-mental/test-config-icx.yaml
cloud.
This code can be run on aws, gcp, azure and ali cloud.

Known issues and Limitation¶

Not supprot run Cassandra with Java OpenJdk14 on ARM platform.
Host OS limitation: Only support running on Ubuntu24.04 and Centos currently.

Customize Test Configurations¶

Refer to ctest.md to customize test parameters.

Parameters for workload configure: * CLIENT_DURATION - The during time of cassandra client running. In order to warm up, it's better to set this value more than 10 minutes. * CLIENT_THREADS - Thread number in one client instance to run concurrently. * CLIENT_INSERT - Operation ratio for write. Deault value is 20. * CLIENT_SIMPLE - Operation ratio for read. Default is 80. * CLIENT_POP_MAX - Cassandra DB entries. * INSTANCE_NUM - The nummer of Cassandra server and client instance to run concurrently. * CASSANDRA_NUMACTL_VCORES_ENABLE - For standalone settings. Cassandra server instance will pinned to vcpus(half phsical vcpus, half virtual vcpus). Default is 'false' * JVM_HEAP_SIZE - JVM configure for '-Xms' and '-Xmx'. Here set min (-Xms) and max (-Xmx) heap sizes to the same value to avoid stop-the-world GC pauses during resize. If set value larger than free memory size, it will be adjust to '80% * free memeory size' * JDK_VERSION - Two optios, 'JDK11' and 'JDK14'. JDK11 means use openjdk11. JDK14 means use openjdk14. Default value is 'JDK11'. * CASSANDRA_DISK_MOUNT - Boolean variable(true|false). Default value is 'false'. If set to 'true', it will use '/mnt/disk[0...INSTANCE_NUM]' for cassandre instance. The purpose is to set each cassandra server instance has its independent disk. * CASSANDRA_CONCURENT_READS - Setting for concurrent_reads in cassandra.yaml * CASSANDRA_CONCURENT_WRITES - Setting for concurrent_writes in cassandra.yaml * DEPLOY_MODE - Cassandra server deploy mode, Currently only support standalone mode.

Test Case¶

cassandra-stress user is used for the test case where it interleaves user provided queries with configurable ratio and distribution.

There are 4 test cases available: * test_*_cassandra_gated 1. This testcase is gated. * test_*_cassandra_Standalone_2n_pkm 1. This testcase is deploy cassandra as standalone mode, not cluste. And for Post-Si performance analysis. Default trace collection from log "Begin performance testing" and end at "End performance testing".

Docker Image¶

As deploying Cassandra server and clients directly on bare mental or cloud VM instance, no docker image related.

KPI¶

Run the kpi.sh script to parse the KPIs from the output logs.

**test_cassandra_user**
---------------------------------------------
    Partition rate(pk/s) : 193587
    Row rate(row/s)      : 2033872
    Latency mean(ms)     : 1.3
    Latency median(ms)   : 1.1
    Latency 95th percentile(ms)   : 2.3
    Latency 99th percentile(ms)   : 2.3
    Latency 99.9th percentile(ms) : 16.3
    Latency max(ms)      : 229.2
    Total operation time : 00:10:00
---------------------------------------------
*Finally Op rate(op/s): 193587

Note:

op/s    Number of operations per second performed during the run.
pk/s    Number of partition operations per second performed during the run.
row/s   Number of row operations per second performed during the run.
mean    Average latency in milliseconds for each operation during that run.
med     Median latency in milliseconds for each operation during that run.
.99     99% of the time the latency was less than the number displayed in the column.

Index Info¶

Name: Cassandra
Category: DataServices
Platform: SPR, ICX, EMR, SRF
Keywords:
Permission: