I write about bioinformatics: pairwise alignment, minimizers, HPC, data structures (text indexing, perfect hashing), and some DevOps. You might also be interested in my surveys and lablogs. Be warned that some posts are quick or unfinished notes.
Popular
- December 2024 Static search trees: 40x faster than binary search
- January 2024 One Billion Row Challenge
Recent
- March 2026 NordVPN refund dark patterns
- March 2026 Route Planning using Customizable Contraction Hierarchies
- February 2026 Wheeler graphs
- February 2026 DEFLATE, gzip, zlib, libz, et al.
- January 2026 QuadRank: Engineering a High Throughput Rank
- January 2026 Recent results on hash tables
- January 2026 Releasing Rust SIMD binaries to GitHub, BioConda, and PyPI
- January 2026 Trying to understand DDR memory
- January 2026 Quotes from "The Evolution of Mathematical Software"
Thesis chapters
- April 2025 Thesis: Optimal Throughput Bioinformatics
- April 2025 A History of Pairwise Alignment
- April 2025 Low Density Minimizers
- April 2025 Beyond Global Alignment
- April 2025 High Throughput Bioinformatics
Papers
- January 2026 QuadRank: Engineering a High Throughput Rank
- August 2025 QuickHeap: the fastest priority queue
- February 2025 PtrHash: Minimal Perfect Hashing at RAM Throughput
- July 2024 SimdMinimizers: Computing random minimizers, fast
- June 2024 A near-tight lower bound on minimizer density
- March 2024 A*PA2: Up to 19x faster exact global alignment
Featured
- February 2026 Wheeler graphs
- February 2026 DEFLATE, gzip, zlib, libz, et al.
- January 2026 Releasing Rust SIMD binaries to GitHub, BioConda, and PyPI
- January 2026 Trying to understand DDR memory
- December 2025 Asymptotic elevators
- December 2025 Overview of static data structures
- November 2025 Distributing Rust SIMD Binaries
- August 2025 Three log scientist
- March 2025 SimdSketch: a fast bucket sketch
- February 2025 Thoughts on Consensus MPHF and tiny pointers
- June 2024 CPU performance
- September 2023 PtrHash: Notes on adapting PTHash in Rust
- August 2023 Bitpacking and approximate string matching
- November 2022 String algorithm visualizations
- September 2022 Revised Oxford Bioinformatics latex template
- April 2022 Benchmark attention points
- May 2021 28000x speedup with Numba.CUDA