Sideways Information Passing for Joins using Bloom Filters #27
Hi, dear DuckDB team. In this PR, I would like to add a Bloom Filter Pushdown for the left side of the join.
The main idea is that if we detect that a join is selective, we build a bloom filter during join hash table population and then push it down to the probe side during scan. It reuses much of the existing infrastructure for min/max join table filters.
A Bloom filter is a space-efficient probabilistic data structure that can quickly test whether an element is definitely not in a set or possibly in a set, allowing false positives but never false negatives. Based on the hash of the key, we get an offset to a slot and then set 4 bits in this slot, determined by the hash.
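To illustrate that scheme, here is a minimal, hypothetical sketch of such a blocked Bloom filter in C++; the class name, slot layout, and mask derivation are assumptions for illustration, not the PR's actual code:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical blocked Bloom filter: the upper hash bits select one
// 64-bit slot, and 4 bits within that slot are derived from the lower
// hash bits. Names and layout are illustrative, not the PR's code.
class BlockedBloomFilter {
public:
	explicit BlockedBloomFilter(size_t slot_count) : slots(slot_count, 0) {}

	// Derive the 4-bit mask for a slot: take 6 hash bits at a time to
	// pick a bit position in [0, 64).
	static uint64_t MaskFromHash(uint64_t hash) {
		uint64_t mask = 0;
		for (int k = 0; k < 4; k++) {
			mask |= 1ULL << ((hash >> (6 * k)) & 63);
		}
		return mask;
	}

	void Insert(uint64_t hash) {
		slots[(hash >> 32) % slots.size()] |= MaskFromHash(hash);
	}

	// False positives are possible; false negatives are not, because the
	// probe derives exactly the same slot and mask as the insert.
	bool MightContain(uint64_t hash) const {
		uint64_t mask = MaskFromHash(hash);
		return (slots[(hash >> 32) % slots.size()] & mask) == mask;
	}

private:
	std::vector<uint64_t> slots;
};
```

Because insert and probe derive the same slot and mask from a given hash, an inserted key always passes the probe; only other keys that happen to set the same bits cause false positives.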
There are still some things unfinished that I am not sure how to approach, like serialization and planning. Looking forward to the feedback on these points!
Advantages
1. Faster Probing
Probing the Bloom filter is 2-4x faster than probing the hash table, for several reasons: (1) the Bloom filter requires 12 bits per inserted key, while the hash table allocates a 64-bit slot per key, so the Bloom filter is roughly 5x smaller than the hash table; (2) probing the Bloom filter takes fewer than 20 instructions and is completely branchless, unlike the linear-probing code, which has to follow the probing chain.
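To illustrate the branchless claim, a probe loop over a batch of precomputed hashes might look like the following sketch; the function name, slot layout, and mask derivation are illustrative assumptions, not the PR's code:

```cpp
#include <cstddef>
#include <cstdint>

// Branchless probe sketch: build a selection vector of surviving rows
// without a per-row branch by advancing the output cursor by 0 or 1.
size_t ProbeBatch(const uint64_t *slots, size_t slot_count,
                  const uint64_t *hashes, size_t count, uint32_t *sel_out) {
	size_t found = 0;
	for (size_t i = 0; i < count; i++) {
		uint64_t h = hashes[i];
		uint64_t mask = 0;
		for (int k = 0; k < 4; k++) {
			mask |= 1ULL << ((h >> (6 * k)) & 63);
		}
		uint64_t slot = slots[(h >> 32) % slot_count];
		// write unconditionally, then advance by the match result (0 or 1)
		sel_out[found] = static_cast<uint32_t>(i);
		found += static_cast<size_t>((slot & mask) == mask);
	}
	return found;
}
```

The unconditional write plus a 0/1 increment is the usual trick for building selection vectors without data-dependent branches, in contrast to chasing a hash-table probe chain.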
2. Low False-Positive Rate
The false-positive rate of the Bloom filter is ~2%, which is much lower than that of the range-based filter pushdown currently in DuckDB. This means that many tuples are filtered out during the table scan, so fewer tuples need to be decompressed, which is especially helpful for IMDB on the M4, where FSST decompression is a big part of the total execution time.
3. More Optimal Join Plans
By pushing down Bloom filters, we can get more optimal join plans at runtime, which gives us some of the benefits of Robust Predicate Transfer. This is done by decomposing joins into Lookup and Expand, where the Bloom filter is the pushed-down lookup and the actual join is the expand phase. In a pipeline with two joins, where the first join explodes the data size and the second join filters it back down, pushing the Bloom filter to the table scan ensures that only tuples that survive both joins are fed into the expensive first join, avoiding unnecessary intermediate explosion.
Limitations & TODOs
1. Serialization Issues and TableFilter::ToExpression
TableFilters in DuckDB support being serialized and transformed into an expression. With simple predicates, this is trivial, as one only needs to serialize the expression type and the constant. Serializing the Bloom filter table filter would imply that the Bloom filter itself needs to be serialized, and it can get quite big. I am not sure how to approach this. Also, the Bloom filter cannot be turned into an expression. The current hack is to turn it into a constant TRUE. This, of course, needs fixing.
2. Bloom Filter Build & Probe Overhead
Building a Bloom filter adds approximately 20% overhead to the join's build phase. Since Bloom filters only provide benefits for selective joins, we must decide when this overhead is justified. Probing a Bloom filter that is not selective is pure overhead as well.
The ideal case would be the following: during the probe pipeline, we measure the selectivity of the join, and if it is below a certain threshold, we build the Bloom filter and push it down. For this to happen, we need to pause the pipeline to create a task to build the Bloom filter. Pausing pipelines is, to my knowledge, only possible for SINK and SOURCE, and the probe is a streaming operator. Also, the table scan does not allow updates to the table filters while the pipeline is running. Therefore, I settled for a less perfect but easier-to-implement approach: during planning, I check whether the build side is selective by looking for a filter on it. If there is one, I build the Bloom filter during the join's build phase and push it down. When the probe pipeline starts, I check the selectivity of the Bloom filter for the first 20 vectors; if the selectivity is higher than 25%, I disable the Bloom filter.
This means that in the worst case, we only pay the overhead of building the Bloom filter without using it. The picture below illustrates this. It shows the speedup between DuckDB main and this PR for a single-join query. The join is between the same table A, which contains primary keys, but the build side is filtered by the selectivity shown on the X-axis. This means that for a selectivity of 1%, the build side is 1% the size of the probe side, while for a selectivity of 100% (i.e., no filter), the build and probe sides have the same size. We can also see that we get higher speedups for larger probe sides, as the Bloom filter can reside in faster cache levels.
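The sampling logic described above (measure the keep rate over the first 20 vectors and disable the filter above 25%) could be sketched roughly like this; the struct, member names, and exact accounting are hypothetical, not the PR's code:

```cpp
#include <cstddef>

// Hypothetical adaptive monitor: sample the Bloom filter's selectivity
// over the first probe vectors and disable it if it keeps too many rows.
struct BloomFilterMonitor {
	static constexpr size_t SAMPLE_VECTORS = 20;
	static constexpr double DISABLE_THRESHOLD = 0.25; // keep rate above this -> disable

	size_t vectors_seen = 0;
	size_t tuples_in = 0;
	size_t tuples_out = 0;
	bool enabled = true;

	// Call once per probed vector with the input and surviving tuple counts.
	void Update(size_t in, size_t out) {
		if (!enabled || vectors_seen >= SAMPLE_VECTORS) {
			return; // sampling phase is over, keep the current decision
		}
		vectors_seen++;
		tuples_in += in;
		tuples_out += out;
		if (vectors_seen == SAMPLE_VECTORS && tuples_in > 0) {
			double keep_rate = double(tuples_out) / double(tuples_in);
			if (keep_rate > DISABLE_THRESHOLD) {
				enabled = false; // the filter is not selective enough
			}
		}
	}
};
```

With this shape, the worst case is exactly what the text describes: the build overhead is paid, the first 20 vectors are probed, and then the filter is switched off for the rest of the pipeline.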
3. Compatibility with Compressed Materialization of Join Keys
Compressed Materialization sandwiches join operators and temporarily compresses columns to materialize less data. While this is, of course, an awesome optimization, it hinders Bloom filter pushdown, as the compression would also need to be pushed down into the table filters. Currently, DuckDB does not support filter pushdown with compressed materialization, which leaves room for further optimization.
4. Parallelism
The Bloom filter pushdown reduces the need to probe the hash table, which scales well with multiple cores (it is read-only), but requires building the Bloom filter, which does not scale as well as probing because we need atomics to write thread-safely. This means there are higher speedups for lower core counts (see Benchmarking).
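A minimal sketch of that thread-safe build path, assuming a 64-bit slot layout and an atomic OR per insert (class and function names are illustrative, not the PR's code):

```cpp
#include <atomic>
#include <cstddef>
#include <cstdint>
#include <vector>

// Concurrent Bloom filter build: threads OR their bit mask into a slot
// with fetch_or. The atomic read-modify-write is what limits build
// scalability, while the probe side stays read-only and scales well.
class ConcurrentBloomBuild {
public:
	explicit ConcurrentBloomBuild(size_t slot_count) : slots(slot_count) {
		for (auto &s : slots) {
			s.store(0, std::memory_order_relaxed);
		}
	}

	static uint64_t MaskFromHash(uint64_t hash) {
		uint64_t mask = 0;
		for (int k = 0; k < 4; k++) {
			mask |= 1ULL << ((hash >> (6 * k)) & 63);
		}
		return mask;
	}

	void Insert(uint64_t hash) {
		// relaxed ordering suffices here: we only need the bits to end up
		// set before the build phase finishes, not any cross-thread ordering
		slots[(hash >> 32) % slots.size()].fetch_or(MaskFromHash(hash),
		                                            std::memory_order_relaxed);
	}

	bool MightContain(uint64_t hash) const {
		uint64_t mask = MaskFromHash(hash);
		return (slots[(hash >> 32) % slots.size()].load(std::memory_order_relaxed) &
		        mask) == mask;
	}

private:
	std::vector<std::atomic<uint64_t>> slots;
};
```

Each insert is a contended read-modify-write on shared cache lines, which is why the build scales worse than the purely read-only probe.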
5. Not all the benefits of RPT
While this PR supports unidirectional, sideways information passing, Robust Predicate Transfer can also push filter information from the probe side of a join to the build side. Benchmarking this PR against the RPT PR showed that RPT's speedups are >2 times higher.
6. Single Key Bloom Filter
Right now, the Bloom filter only supports single-key joins, as the TableFilter API only operates on a single Vector, not on a DataChunk.
7. Interoperability with Min/Max Filters
Right now, I disabled the Min/Max filters, making them optional, if there is a Bloom filter. This is because, if there is a Bloom filter, the Min/Max filters will remain active even if they return IS_ALWAYS_TRUE, which adds about 3% overhead for both TPC-H and IMDB. Please let me know what your preferred behavior is.
Benchmarking
The following benchmark shows the speedup of this PR over the current main. All benchmarks were run using eight threads.