High-volume sampling: algorithms’ comparison

Goal: To determine the most efficient way to perform high-volume sampling.

Set-up: The sampler's goal is to give permission to sample use a given sampling rate; (approximately) each 1000th request should result in actual sampling. There are several samplers:
  1. AtomicSampler: uses AtomicInteger to keep track of sampling attempts; 
  2. RandomSampler: uses an instance-wide Random instance to determine if a sample should be taken. 
  3. RandomSampler2: uses a local Random instance to determine if a sample should be taken, which is initialized each time. 
  4. XorShiftSampler: uses a XOR-Shift algorithm to produce psedo-random numbers (http://www.javamex.com/tutorials/random_numbers/xorshift.shtml
Before actual measurement each sampler is called 50K times to warm it up and give a hot-spot compiler a chance to compile the samplers' bytecode into a native high-performance code.

Then 10 runs are performed, using a variety of scenarios. 10M sampling attempts are performed during each run. Time to complete each run is recorded and then the average is calculated.

The scenarios:
  1. Single-threaded sampling. All runs are done using one thread and one instance of sampler. 
  2. Multi-threaded sampling (10 threads, 1 sampler per thread). 
  3. Multi-threaded sampling (10 threads sharing 1 sampler instance). 
The average times (in ms) on i5 Windows box are:




1 thread

10 threads,
1 sampler
per thread

10 threads
sharing 1 sampler instance

AtomicSampler

125

676

7664
RandomSampler
143
588
7039
RandomSampler2
622
6758
6519
XorShiftRandomSampler
211
813
791

Conclusion: in multithreaded environment sharing Random variable produce worse results than recreating a new Random each time (even if the latter means creating and gc-ing random instances).

Source code is accessible at https://github.com/mykyta-protsenko/samplerPerformance.git
The project was compiled and in JDK 1.6 compatibility mode.

Comments

Post a Comment

Popular posts from this blog

Spring Framework and Code Obscurity

Multi-threading skills - what do you mean by that?