The PBBS Benchmarks

New version of pbbs benchmarks

RemoveDuplicates (DDUP)

Given a sequence of elements that can be hashed and compared for equality, remove all duplicates from the sequence. The result must contain exactly one copy of each distinct element in the input, and the elements can appear in any order.
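As an illustration of the problem statement only, here is a minimal sequential sketch in Python. It is not the benchmark's implementation; the function name `remove_duplicates` is hypothetical, and the use of a built-in hash set is an assumption chosen for brevity.

```python
from typing import Hashable, Iterable, List, TypeVar

T = TypeVar("T", bound=Hashable)


def remove_duplicates(seq: Iterable[T]) -> List[T]:
    """Return one copy of each distinct element of seq.

    Hypothetical reference sketch: relies only on hashing and
    equality, the two operations the problem statement requires.
    """
    seen = set()   # elements already emitted
    result = []    # one copy of each distinct element
    for x in seq:
        if x not in seen:
            seen.add(x)
            result.append(x)
    return result
```

This sketch happens to keep elements in order of first occurrence, but the problem statement allows any order, so an implementation is free to return the distinct elements in a different arrangement.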

Default Input Distributions

The test distributions are the following:

For the large inputs, n = 100 million; for the small inputs, n = 10 million.

Input and Output File Formats

The input and output data need to be in the sequence file format, both with the same element type.

The elements in the output file can be in any order.
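Because the output order is unconstrained, a correctness check has to compare the output against the input as sets rather than position by position. The following is a hedged sketch of such a check on sequences already loaded into memory; the name `is_valid_result` is hypothetical, and parsing of the sequence file format is deliberately left out.

```python
from typing import Hashable, Sequence


def is_valid_result(inp: Sequence[Hashable], out: Sequence[Hashable]) -> bool:
    """Check that out holds exactly one copy of each distinct element of inp.

    Hypothetical checker: order-independent, so any permutation of
    the distinct input elements is accepted.
    """
    out_set = set(out)
    # No element may appear twice in the output.
    no_repeats = len(out_set) == len(out)
    # The output must cover every input element and add none.
    same_elements = out_set == set(inp)
    return no_repeats and same_elements
```

For example, for the input `[3, 1, 3, 2, 1]` the outputs `[2, 3, 1]` and `[1, 2, 3]` both pass, while `[3, 1]` (missing an element) and `[3, 1, 2, 3]` (a repeated element) do not.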