|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(16), jvm_TxInjector_1(16)
|
OS Image Description |
os_1
|
Tuning |
- cpupower -c all frequency-set -g performance
- tuned-adm profile throughput-performance
- ulimit -n 1024000
- echo 960000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 800000000 > /proc/sys/kernel/sched_latency_ns
- echo 40000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 410000000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 2000000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 9000 > /proc/sys/kernel/sched_nr_migrate
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo 0 > /proc/sys/vm/numa_stat
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms3g -Xmx3g -Xmn2g -XX:+UseParallelGC -XX:ParallelGCThreads=1 -XX:CICompilerCount=2
|
Tuning |
Used numactl to interleave memory on all CPUs
|
Notes |
None
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms31g -Xmx31g -Xmn29g -XX:AllocatePrefetchInstr=2 -XX:+UseParallelGC -XX:ParallelGCThreads=16 -XX:LargePageSizeInBytes=2m -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseLargePages -XX:SurvivorRatio=8 -XX:TargetSurvivorRatio=95 -XX:MaxTenuringThreshold=15 -XX:InlineSmallCode=11k -XX:MaxGCPauseMillis=100 -XX:LoopUnrollLimit=200 -XX:+UseTransparentHugePages -XX:TLABAllocationWeight=2 -XX:ThreadStackSize=140 -XX:CompileThresholdScaling=120 -XX:CICompilerCount=4 -XX:AutoBoxCacheMax=32 -XX:OnStackReplacePercentage=100 -XX:TLABSize=1m -XX:MinTLABSize=1m -XX:-ResizeTLAB -XX:TLABWasteTargetPercent=1 -XX:TLABWasteIncrement=1 -XX:YoungPLABSize=1m -XX:OldPLABSize=1m
|
Tuning |
Used numactl to affinitize each Backend JVM to 6 Core / 12 Threads - Group1: --physcpubind=0-5,96-101 --localalloc
- Group2: --physcpubind=6-11,102-107 --localalloc
- Group3: --physcpubind=12-17,108-113 --localalloc
- Group4: --physcpubind=18-23,114-119 --localalloc
- Group5: --physcpubind=24-29,120-125 --localalloc
- Group6: --physcpubind=30-35,126-131 --localalloc
- Group7: --physcpubind=36-41,132-137 --localalloc
- Group8: --physcpubind=42-47,138-143 --localalloc
- Group9: --physcpubind=48-53,144-149 --localalloc
- Group10: --physcpubind=54-59,150-155 --localalloc
- Group11: --physcpubind=60-65,156-161 --localalloc
- Group12: --physcpubind=66-71,162-167 --localalloc
- Group13: --physcpubind=72-77,168-173 --localalloc
- Group14: --physcpubind=78-83,174-179 --localalloc
- Group15: --physcpubind=84-89,180-185 --localalloc
- Group16: --physcpubind=90-95,186-191 --localalloc
|
Notes |
None
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms3g -Xmx3g -Xmn2g -XX:+UseParallelGC -XX:ParallelGCThreads=1 -XX:CICompilerCount=2
|
Tuning |
Used numactl to affinitize each Transaction Injector JVM to 6 Core/12 Threads - Group1: --physcpubind=0-5,96-101 --localalloc
- Group2: --physcpubind=6-11,102-107 --localalloc
- Group3: --physcpubind=12-17,108-113 --localalloc
- Group4: --physcpubind=18-23,114-119 --localalloc
- Group5: --physcpubind=24-29,120-125 --localalloc
- Group6: --physcpubind=30-35,126-131 --localalloc
- Group7: --physcpubind=36-41,132-137 --localalloc
- Group8: --physcpubind=42-47,138-143 --localalloc
- Group9: --physcpubind=48-53,144-149 --localalloc
- Group10: --physcpubind=54-59,150-155 --localalloc
- Group11: --physcpubind=60-65,156-161 --localalloc
- Group12: --physcpubind=66-71,162-167 --localalloc
- Group13: --physcpubind=72-77,168-173 --localalloc
- Group14: --physcpubind=78-83,174-179 --localalloc
- Group15: --physcpubind=84-89,180-185 --localalloc
- Group16: --physcpubind=90-95,186-191 --localalloc
|
Notes |
None
|
|