Third Workshop on Big Data Benchmarking (3rd WBDB), 2013


When Jul 16, 2013 - Jul 17, 2013
Where Xi'an, China
Submission Deadline May 1, 2013
Notification Due May 15, 2013
Final Version Due Aug 16, 2013
Categories    big data   benchmarking   bigdata top100

Call For Papers

Third Workshop on Big Data Benchmarking

July 16-17, 2013 Xi'an, China

The Third Workshop on Big Data Benchmarking (3rd WBDB) will be held on July
16-17, 2013 in Xi'an, China, hosted by the Shanxi HPC Center. It follows the
first and second workshops, held in May 2012 in San Jose, CA and in December
2012 in Pune, India, respectively.

The objective of the WBDB workshops is to make progress towards developing
industry standard benchmarks for evaluating hardware and software systems for
big data applications.

The BigData Top100 List
The BigData Top100 List concept emerged from discussions at previous WBDB
workshops and related meetings. The list would rank big data systems according
to their performance on selected big data analytics workloads, enabling
comparisons among different big data solutions.

A successful benchmark would be simple to implement and execute;
cost-effective, so that the benefits of executing the benchmark justify its
expense; timely, with benchmark versions keeping pace with rapid changes in
the marketplace; and verifiable, so that results of the benchmark can be
validated via independent means.

Themes
The 3rd WBDB will emphasize two benchmark proposals that are currently being
considered: one based on a Deep Analytics Pipeline for event processing and a
second based on extending the TPC-DS benchmark with semistructured and
unstructured data and new queries targeted at those data, called BigBench.
The priority is to address the following issues in the context of these
benchmark proposals:
* Data generation: Models and procedures for generating synthetic data with
requisite properties.
* Workload: Representative big data business problems and corresponding
specific implementations for each step and/or query in the workload.
* Benchmark execution: Rules and regulations for running the benchmark;
data scale factors; benchmark versioning; benchmark metrics.
* Metrics for efficiency: Measuring the efficiency of the solution, e.g. based
on costs of acquisition, ownership, energy and/or other factors.

Papers on early implementations of the Deep Analytics Pipeline or BigBench, or
describing lessons learned in benchmarking big data applications are solicited.
Discussions of enhancements to these benchmarks are also encouraged, for
example, including more data genres (e.g. graphs) in the workload or
considering a range of machine learning and other algorithms. Papers proposing
other benchmarking alternatives will also be considered.

Meeting Registration / Attendance
* The workshop will include invited talks, regular presentations, “lightning”
talks, and discussion sessions.
* A modest registration fee will be charged to cover workshop expenses.
* Attendance will be capped to ensure effective discussions and active
participation.
* In selecting papers, preference will be given to papers that directly
address the themes of the workshop and to diversity of representation across
organizations and institutions.
* Extended versions of selected papers will be published in the Springer
Verlag Lecture Notes in Computer Science series.

Paper format
Papers should be formatted using the Springer LNCS style. Workshop papers
should be 4 to 8 pages long. For the post-workshop proceedings, 16 pages will
be accepted.

Important Dates:
* Paper submission deadline: May 1st, 2013
* Notification of acceptance: May 15th, 2013
* Workshop: July 16-17, 2013
* Full length proceedings submission: August 16, 2013

Further information can be found on the workshop website:

Submission should be done via the CMT conference management service:

Organizing Committee
Chaitan Baru - CLDS, UCSD
Milind Bhandarkar - Greenplum
Eyal Gutkind - Mellanox
Jian Li - IBM
Tong Liu - Mellanox (Local arrangements)
Raghunath Nambiar - Cisco
Meikel Poess - Oracle
Tilmann Rabl - U Toronto
Xiaohui Zhou - Xi’an University (Local host)
