Yardstick Apache Spark Benchmarks

Yardstick Apache Spark is a set of Apache Spark benchmarks written on top of the Yardstick framework.

Yardstick Framework

Visit the Yardstick repository for detailed information on how to run Yardstick benchmarks and generate graphs.

The documentation below describes configuration parameters in addition to standard Yardstick parameters.

Installation

1. Create a local clone of the Yardstick Apache Spark repository.
2. Import the Yardstick Apache Spark POM file into your project.
3. Run the mvn package command.

Provided Benchmarks

The following benchmarks are provided:

SparkSqlQueryBenchmark - benchmarks SQL query operations.
SparkQueryDSLBenchmark - benchmarks query DSL operations.

Writing Apache Spark Benchmarks

All benchmarks extend the SparkAbstractBenchmark class. A new benchmark should also extend this abstract class and implement the test method, which is the method that is actually benchmarked.
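
As a rough illustration of this pattern, the sketch below uses a hypothetical stand-in base class, AbstractBenchmarkSketch, instead of the real SparkAbstractBenchmark, whose constructor and helper methods are not described in this README. The test(Map<Object, Object>) signature is an assumption (simplified here to throw no checked exceptions), and SumBenchmark and runIterations are invented for the example; check the project sources for the actual API.

```java
import java.util.HashMap;
import java.util.Map;

public class BenchmarkSketch {
    /** Hypothetical stand-in for SparkAbstractBenchmark; the real class differs. */
    abstract static class AbstractBenchmarkSketch {
        /** Assumed signature: runs one benchmarked operation; false stops the run. */
        abstract boolean test(Map<Object, Object> ctx);
    }

    /** Hypothetical benchmark: sums a small range in place of a real Spark query. */
    static class SumBenchmark extends AbstractBenchmarkSketch {
        long lastResult;

        @Override
        boolean test(Map<Object, Object> ctx) {
            long sum = 0;
            for (int i = 1; i <= 1000; i++)
                sum += i;
            lastResult = sum;
            return true;
        }
    }

    /** Calls test() repeatedly, as a driver would, and counts completed iterations. */
    static int runIterations(AbstractBenchmarkSketch benchmark, int iterations) {
        Map<Object, Object> ctx = new HashMap<>();
        int done = 0;
        for (int i = 0; i < iterations; i++) {
            if (!benchmark.test(ctx))
                break;
            done++;
        }
        return done;
    }

    public static void main(String[] args) {
        SumBenchmark benchmark = new SumBenchmark();
        int done = runIterations(benchmark, 5);
        System.out.println(done + " iterations, last result " + benchmark.lastResult);
    }
}
```

In a real benchmark the body of test would issue the Spark operation being measured, and the Yardstick driver scripts, not a hand-written loop, would invoke it.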

Running Apache Spark Benchmarks

Before running Apache Spark benchmarks, run the mvn package command. It compiles the project and unpacks the scripts from the yardstick-resources.zip file into the bin directory.

Properties And Command Line Arguments

Note that this section describes only the configuration parameters specific to Apache Spark benchmarks, not those of the Yardstick framework. To run Apache Spark benchmarks and generate graphs, use the Yardstick framework scripts in the bin folder.

Refer to the Yardstick documentation for common Yardstick properties and command line arguments for running Yardstick scripts.

The following benchmark properties can be defined in the benchmark configuration:

-b or --backups - Sets the storage level to MEMORY_ONLY_2 (each partition is replicated on two cluster nodes). The default is MEMORY_ONLY.
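
The effect of this option can be sketched as a simple mapping from the command line to a storage level name. The README does not show how the benchmarks parse their arguments, so the sketch assumes the flag is a plain boolean switch; the StorageLevelOption class and its resolve method are hypothetical. Storage levels are returned as strings so that the example runs without Spark on the classpath.

```java
import java.util.Arrays;

public class StorageLevelOption {
    /**
     * Returns the storage level name implied by the arguments:
     * MEMORY_ONLY_2 when -b or --backups is present, MEMORY_ONLY otherwise.
     * Assumption: the flag takes no value.
     */
    static String resolve(String[] args) {
        boolean backups = Arrays.stream(args)
            .anyMatch(a -> a.equals("-b") || a.equals("--backups"));
        return backups ? "MEMORY_ONLY_2" : "MEMORY_ONLY";
    }

    public static void main(String[] args) {
        System.out.println(resolve(args));
    }
}
```

In a Spark program the returned name would correspond to the storage level passed when persisting a dataset; MEMORY_ONLY_2 trades extra memory and network traffic for a second copy of each partition.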

License

Yardstick Apache Spark is available under the Apache 2.0 open source license.
