WBDB2013.cn 2013 : Third Workshop on Big Data Benchmarking
Call For Papers
Call for Papers
Third Workshop on Big Data Benchmarking
July 16-17, 2013 Xi'an, China
The Third Workshop on Big Data Benchmarking (3rd WBDB) will be held on July
16-17, 2013 in Xi'an, China, hosted by the Shanxi HPC Center - following upon
the first and second workshops held in May 2012 in San Jose, CA and in December
2012 in Pune, India, respectively.
The objective of the WBDB workshops is to make progress towards developing
industry standard benchmarks for evaluating hardware and software systems for
big data applications.
The BigData Top100 List
The BigData Top100 List concept emerged from discussions at previous WBDB
workshops and related meetings. The list would rank big data systems according
to their performance on selected big data analytics workloads, enabling
comparisons among different big data solutions.
A successful benchmark would be simple to implement and execute; cost
effective, so that the benefits of executing the benchmark justify its expense;
timely, with benchmark versions keeping pace with rapid changes in the
marketplace; and verifiable so that results of the benchmark can be validated
via independent means.
Themes for WBDB2013.cn
The 3rd WBDB will emphasize two benchmark proposals that are currently being
considered: one based on a Deep Analytics Pipeline for event processing and a
second based on extending the TPC-DS benchmark with semistructured and
unstructured data and new queries targeted at those data, called BigBench.
The priority is to address the following issues in the context of these
* Data generation: Models and procedures for generating synthetic data with
* Workload: Representative big data business problems and corresponding
specific implementations for each step and/or query in the workload.
* Benchmark execution: Rules and regulations for running the benchmark;
data scale factors; benchmark versioning; benchmark metrics.
* Metrics for efficiency: Measuring the efficiency of the solution, e.g. based
on costs of acquisition, ownership, energy and/or other factors.
Papers on early implementations of the Deep Analytics Pipeline or BigBench, or
describing lessons learned in benchmarking big data applications are solicited.
Discussions of enhancements to these benchmarks are also encouraged, for
example, including more data genres (e.g. graphs) in the workload; considering
a range of machine learning and other algorithms, etc. Papers proposing other
benchmarking alternatives will also be considered.
Meeting Registration / Attendance
* The workshop will include invited talks, regular presentations, “lightning”
talks, and discussion sessions.
* A modest registration fee will be charged to cover workshop expenses.
* Attendance will be capped to ensure effective discussions and active
* In selecting papers, preference will be given to papers that directly
address the themes of the workshop and to diversity of representation across
organizations and institutions.
* Extended versions of selected papers will be published in the Springer
Verlag Lecture Notes in Computer Science series.
Papers should be formated using the Springer LNCS style. Workshop papers should
have 4 to 8 pages. For the post workshop proceedings 16 pages will be accepted.
* Paper submission deadline: May 1st, 2013
* Notification of acceptance: May 15th, 2013
* Workshop: July 16-17, 2013
* Full length proceedings submission: August 16, 2013
Further information can be found on the workshop website:
Submission should be done via the CMT conference management service:
Chaitan Baru - CLDS, UCSD
Milind Bhandarkar - Greenplum
Eyal Gutkind - Mellanox
Jian Li - IBM
Tong Liu - Mellanox (Local arrangements)
Raghunath Nambiar - Cisco
Meikel Poess - Oracle
Tilmann Rabl - msrg.org, U Toronto
Xiaohui Zhou - Xi’an University (Local host)