This is an attempt at the One Billion Row Challenge, but in Rust.
All of the following attempts were run on my personal laptop, a Dell XPS 15 9510, in AC mode, with the following specs:
- OS: Arch Linux x86_64, kernel: 6.6.32-1-lts
- CPU: Intel Core i7-11800H
- RAM: 32 GB
The following are the different iterations of the parsing function.
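
Before diving in, note that the snippets reference two structs whose definitions are not shown. Presumably they look roughly like this (a reconstruction from the fields the code accesses, so the exact field types are assumptions):

// Presumed supporting types (not shown in the post), reconstructed from
// the fields the snippets access.
struct StationAggregateTmp {
    min: f64,
    max: f64,
    total: f64,
    count: u64,
}

pub struct StationAggregate {
    pub name: String,
    pub min: f64,
    pub max: f64,
    pub mean: f64,
}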

The first approach is the naive one: read each line of the file through a
BufReader using the built-in lines method, split each line at ';', and then
parse each half individually without any custom parsing. (This is clearly
very slow; just counting the lines with a similar approach takes ~54 secs.)

const BUFFER_SIZE: usize = 3 * 1024;

fn parse_lines<R: Read>(reader: BufReader<R>) -> Vec<StationAggregate> {
    let mut results = HashMap::new();
    for line in reader.lines() {
        let line = line.unwrap();
        let (name, temp) = line
            .split_once(';')
            .map(|(name, temp)| (name.to_owned(), temp.parse().unwrap()))
            .unwrap();
        let entry = results.entry(name).or_insert(StationAggregateTmp {
            min: temp,
            max: temp,
            total: 0f64,
            count: 0,
        });
        if temp < entry.min {
            entry.min = temp;
        } else if temp > entry.max {
            entry.max = temp;
        }
        entry.total += temp;
        entry.count += 1;
    }
    let mut results = results
        .into_iter()
        .map(|(name, aggregate)| StationAggregate {
            name,
            min: aggregate.min,
            max: aggregate.max,
            mean: aggregate.total / aggregate.count as f64,
        })
        .collect::<Vec<_>>();
    results.sort_by(|lhs, rhs| lhs.name.cmp(&rhs.name));
    results
}

My initial thought was that, since reading takes so long, dramatically increasing the buffer size might reduce the time spent on IO. There was some improvement in the overall runtime (~2 secs), but it was not as significant as I expected.
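
The snippets never show how the reader is constructed; presumably it is built with BufReader::with_capacity so that the constant below takes effect. A minimal sketch, with open_input as a hypothetical helper:

use std::fs::File;
use std::io::BufReader;

// Hypothetical helper (not from the post): builds the BufReader that
// parse_lines consumes, honoring the enlarged BUFFER_SIZE capacity.
fn open_input(path: &str) -> std::io::Result<BufReader<File>> {
    let file = File::open(path)?;
    Ok(BufReader::with_capacity(BUFFER_SIZE, file))
}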

const BUFFER_SIZE: usize = 1024 * 1024;

The idea here is to minimize the overhead caused by UTF-8 string validation and
unnecessary copying of the station name. Instead of using the convenient
lines method on BufReader, which produces an iterator over '\n'-delimited
lines, I use read_until with b'\n' to avoid the bytes-to-UTF-8-string
conversion overhead. I also avoid copying the station name on each HashMap
access, copying it only when a station name is encountered for the first time.

fn parse_lines<R: Read>(mut reader: BufReader<R>) -> Vec<StationAggregate> {
    let mut results = HashMap::new();
    let mut buf = Vec::with_capacity(MAX_LINE_LEN);
    loop {
        let bytes_read = reader
            .read_until(b'\n', &mut buf)
            .expect("failed to read from BufReader");
        if bytes_read == 0 {
            break;
        }
        // Find the ';' separating the station name from the temperature.
        let mut split_idx = 0;
        loop {
            if buf[split_idx] == b';' {
                break;
            }
            split_idx += 1;
        }
        let name = &buf[..split_idx];
        let temp = String::from_utf8_lossy(&buf[split_idx + 1..bytes_read - 1])
            .parse()
            .expect("failed to parse temp");
        // Copy the name into an owned key only the first time it is seen.
        if !results.contains_key(name.as_ref()) {
            results.insert(
                name.to_vec(),
                StationAggregateTmp {
                    min: temp,
                    max: temp,
                    total: 0f64,
                    count: 0,
                },
            );
        }
        let entry = results.get_mut(name).unwrap();
        if temp < entry.min {
            entry.min = temp;
        } else if temp > entry.max {
            entry.max = temp;
        }
        entry.total += temp;
        entry.count += 1;
        buf.clear();
    }
    let mut results = results
        .into_iter()
        .map(|(name, aggregate)| StationAggregate {
            name: unsafe { String::from_utf8_unchecked(name) },
            min: aggregate.min,
            max: aggregate.max,
            mean: aggregate.total / aggregate.count as f64,
        })
        .collect::<Vec<_>>();
    results.sort_by(|lhs, rhs| lhs.name.cmp(&rhs.name));
    results
}

Since we know the maximum and minimum possible temperature values for every entry, we can write a custom parsing function for them. The implementation below is fairly crude, but it still managed to squeeze out a bit of extra performance.

#[inline]
fn parse_float_limited(bytes: &[u8]) -> f64 {
    let is_negative = bytes[0] == b'-';
    let bytes = if is_negative { &bytes[1..] } else { bytes };
    // Locate the decimal point.
    let mut period_idx = 0;
    loop {
        if bytes[period_idx] == b'.' {
            break;
        }
        period_idx += 1;
    }
    // Accumulate the integer part, most significant digit first.
    let mut i = 0;
    let mut result = 0;
    let mut base = 10u64.pow((period_idx - 1) as u32);
    while base >= 1 {
        result += (bytes[i] - b'0') as u64 * base;
        base /= 10;
        i += 1;
    }
    // The format guarantees exactly one digit after the decimal point.
    let decimal = (bytes[period_idx + 1] - b'0') as f64 / 10.0;
    if is_negative {
        -(result as f64) - decimal
    } else {
        result as f64 + decimal
    }
}
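
A quick sanity check (mine, not from the post), assuming the 1BRC temperature format of an optional '-', one or two integer digits, and exactly one fractional digit:

#[test]
fn parse_float_limited_sanity() {
    // Hypothetical test cases covering positive, negative, and two-digit
    // integer parts; comparisons are approximate to sidestep float rounding.
    assert!((parse_float_limited(b"3.4") - 3.4).abs() < 1e-9);
    assert!((parse_float_limited(b"-12.3") + 12.3).abs() < 1e-9);
    assert!((parse_float_limited(b"99.9") - 99.9).abs() < 1e-9);
}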

pub fn parse_lines<R: Read>(mut reader: BufReader<R>) -> Vec<StationAggregate> {
    // ... unchanged ...
    let temp = parse_float_limited(&buf[split_idx + 1..bytes_read - 1]);
    // ... unchanged ...
}

In the flamegraph I noticed that a lot of time was spent accessing the
HashMap, so I tried using the Entry API, which lets me hash the
station name only once per line.
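
In isolation, the pattern looks like this (a generic sketch, not taken from the post), replacing the earlier contains_key + insert + get_mut triple lookup with a single entry call:

use std::collections::HashMap;

// Entry API: one hash lookup yields a handle that can either insert a
// default value or update the existing one in place.
fn bump(counts: &mut HashMap<Vec<u8>, u64>, name: &[u8]) {
    *counts.entry(name.to_vec()).or_insert(0) += 1;
}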

impl Default for StationAggregateTmp {
    fn default() -> Self {
        Self {
            min: f64::MAX,
            max: f64::MIN,
            total: 0f64,
            count: 0,
        }
    }
}

pub fn parse_lines<R: Read>(mut reader: BufReader<R>) -> Vec<StationAggregate> {
    let mut results = HashMap::new();
    let mut buf = Vec::with_capacity(MAX_LINE_LEN);
    loop {
        let bytes_read = reader
            .read_until(b'\n', &mut buf)
            .expect("failed to read from BufReader");
        if bytes_read == 0 {
            break;
        }
        let mut split_idx = 0;
        loop {
            if buf[split_idx] == b';' {
                break;
            }
            split_idx += 1;
        }
        let name = &buf[..split_idx];
        let temp = parse_float_limited(&buf[split_idx + 1..bytes_read - 1]);
        let entry = results
            .entry(name.to_vec())
            .or_insert_with(StationAggregateTmp::default);
        if temp < entry.min {
            entry.min = temp;
        } else if temp > entry.max {
            entry.max = temp;
        }
        entry.total += temp;
        entry.count += 1;
        buf.clear();
    }
    let mut results = results
        .into_iter()
        .map(|(name, aggregate)| StationAggregate {
            name: unsafe { String::from_utf8_unchecked(name) },
            min: aggregate.min,
            max: aggregate.max,
            mean: aggregate.total / aggregate.count as f64,
        })
        .collect::<Vec<_>>();
    results.sort_by(|lhs, rhs| lhs.name.cmp(&rhs.name));
    results
}

Next, I parallelized both the file IO and the parsing, using as many OS threads as my laptop has cores (16). The main challenge is that lines vary in width, so I can't split the work cleanly on byte offsets alone.
My current approach is to split the work roughly by byte offset (section_size = file_size / available_threads), then record the start and end offsets of the lines that are cut off at the split points and process them later. Concretely: when a chunk boundary falls mid-line, the thread that owns the start of that line records where the line begins, the next thread records where it ends, and these cut-off lines are re-read and parsed on the main thread once all workers finish.

pub fn process(file_path: PathBuf) -> Vec<StationAggregate> {
    let mut file = File::open(&file_path).unwrap();
    let file_size = file.metadata().unwrap().size();
    let available_threads = thread::available_parallelism().unwrap().get();
    let section_size = file_size / available_threads as u64;
    let remaining_intervals = Arc::new(Mutex::new(vec![(0, 0); available_threads - 1]));
    let mut handles = Vec::with_capacity(available_threads);
    for i in 0..available_threads as u64 {
        let mut current = i * section_size;
        let end = current + section_size;
        let mut file = File::open(&file_path).unwrap();
        file.seek(std::io::SeekFrom::Start(current)).unwrap();
        let remaining_intervals_shared = Arc::clone(&remaining_intervals);
        let mut reader = BufReader::with_capacity(BUFFER_SIZE, file);
        let handle = thread::spawn(move || {
            let mut results = HashMap::with_capacity(HASHMAP_SIZE);
            let mut buf = Vec::with_capacity(MAX_LINE_LEN);
            // Find the end of the last entry from the previous chunk
            if i > 0 {
                let leftover_bytes = reader
                    .read_until(b'\n', &mut buf)
                    .expect("failed to read from BufReader");
                remaining_intervals_shared.lock().unwrap()[(i - 1) as usize].1 =
                    current as usize + leftover_bytes;
                current += leftover_bytes as u64;
                buf.clear();
            }
            loop {
                let bytes_read = reader
                    .read_until(b'\n', &mut buf)
                    .expect("failed to read from BufReader");
                current += bytes_read as u64;
                if bytes_read == 0 {
                    break;
                }
                // Record the start of the line that crosses this chunk's end
                if current >= end {
                    // No interval to record for the last chunk
                    if (i as usize) < available_threads - 1 {
                        remaining_intervals_shared.lock().unwrap()[i as usize].0 =
                            current as usize - bytes_read;
                    }
                    break;
                }
                let (name, temp) = parse_entry(&buf, bytes_read);
                let entry = results
                    .entry(name.to_vec())
                    .or_insert_with(StationAggregateTmp::default);
                if temp < entry.min {
                    entry.min = temp;
                }
                if temp > entry.max {
                    entry.max = temp;
                }
                entry.total += temp;
                entry.count += 1;
                buf.clear();
            }
            results
        });
        handles.push(handle);
    }
    // Merge results from all threads
    let mut results = handles
        .into_iter()
        .map(|handle| handle.join().unwrap())
        .fold(HashMap::with_capacity(HASHMAP_SIZE), |mut acc, mut map| {
            for (k, v) in map.drain() {
                let entry = acc.entry(k).or_insert_with(StationAggregateTmp::default);
                if v.min < entry.min {
                    entry.min = v.min;
                }
                if v.max > entry.max {
                    entry.max = v.max;
                }
                entry.total += v.total;
                entry.count += v.count;
            }
            acc
        });
    let mut buf = [0u8; MAX_LINE_LEN];
    // Read the remainders (the lines cut off at chunk boundaries)
    for (start, end) in remaining_intervals.lock().unwrap().iter() {
        file.seek(std::io::SeekFrom::Start(*start as u64)).unwrap();
        let bytes_read = file.read(&mut buf).unwrap();
        debug_assert!(bytes_read >= end - start);
        let buf = &buf[..end - start];
        let (name, temp) = parse_entry(buf, end - start);
        let entry = results
            .entry(name.to_vec())
            .or_insert_with(StationAggregateTmp::default);
        if temp < entry.min {
            entry.min = temp;
        }
        if temp > entry.max {
            entry.max = temp;
        }
        entry.total += temp;
        entry.count += 1;
        // line_count is a debug-only counter defined elsewhere (not shown in these snippets).
        #[cfg(debug_assertions)]
        line_count.fetch_add(1, Ordering::Relaxed);
    }
    let mut results = results
        .into_iter()
        .map(|(name, aggregate)| StationAggregate {
            name: unsafe { String::from_utf8_unchecked(name) },
            min: aggregate.min,
            max: aggregate.max,
            mean: aggregate.total / aggregate.count as f64,
        })
        .collect::<Vec<_>>();
    results.sort_by(|lhs, rhs| lhs.name.cmp(&rhs.name));
    results
}

fn parse_entry(buf: &[u8], bytes_read: usize) -> (&[u8], f64) {
    // Find the ';' separating the station name from the temperature.
    let mut split_idx = 0;
    loop {
        if buf[split_idx] == b';' {
            break;
        }
        split_idx += 1;
    }
    let name = &buf[..split_idx];
    let temp = parse_float_limited(&buf[split_idx + 1..bytes_read - 1]);
    (name, temp)
}
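
For completeness, a hypothetical driver (not part of the post; the output format here is an assumption):

use std::path::PathBuf;

// Hypothetical entry point: run the parallel parser and print each
// station's aggregates as name=min/mean/max, in station-name order.
fn main() {
    let results = process(PathBuf::from("measurements.txt"));
    for station in results {
        println!(
            "{}={:.1}/{:.1}/{:.1}",
            station.name, station.min, station.mean, station.max
        );
    }
}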