Ragnar Groot Koerkamp on CuriousCoding

[WIP] Minimizers and more

Tue, 05 Nov 2024 00:00:00 +0100

Table of Contents

1 Introduction
- 1.1 Overview
- 1.2 Previous reviews
2 Theory of sampling schemes
- 2.1 Questions
- 2.2 Types of schemes
- 2.3 Parameter regimes
- 2.4 Different perspectives
- 2.5 UHS vs minimizer scheme
- 2.6 (Asymptotic) bounds
- 2.7 Lower bounds
3 Minimizer schemes
- 3.1 Orders
- 3.2 UHS-based and search-based schemes
- 3.3 Pure schemes
- 3.4 Other variants
  - Selection schemes
  - Canonical minimizers
4 Open questions

1 Introduction

sadf

Lots of DNA data
Most algorithms deal with k-mers.
k-mers overlap, and hence considering all of them is redundant.
Thus: sample a subset of the kmers.
Must be ’locally consistent’ and deterministic to be useful.
Enter random minimizers.
Parameter $w$: guarantee that at least one k-mer is sampled out of every window of $w$ k-mers.
Density $d$: (expected) overall fraction of sampled k-mers.
Obviously, $d\geq 1/w$
For random mini, $d=2/(w+1)$.
Lower density => fewer k-mers, smaller indices, faster algorithms.
Question: How small density can we get for given $k$ and $w$?

1.1 Overview

Figure 1: An overview of the papers this post discusses, showing authors and categories of each paper.

Comments on GreedyMini

Mon, 04 Nov 2024 00:00:00 +0100

Table of Contents

Overview
Detailed comments
Comments on “Expected density of random minimizers”

These are some (biased) comments on “Generating Low-Density Minimizers” (Golan et al. 2024), which introduces the GreedyMini minimizer scheme.

At the bottom, there are also some comment on Golan and Shur (2024).

Overview

The GreedyMinimizer is very cute and simple idea, and works as follows:

Comments on 'When Less is More' minimizer review

Tue, 15 Oct 2024 00:00:00 +0200

Table of Contents

The importance of ordering
Asymptotically optimal minimizers

These are some (biased) comments on “When Less Is More: Sketching with Minimizers in Genomics” (Ndiaye et al. 2024).

The importance of ordering

the interest lies in constructing a minimizer with a density within a constant factor, i.e., $O(1/w)$ for any $k$. With lexicographic ordering, minimizers can achieve such density, but with large $k$ values ($\geq \log_{|Σ|}(w)-c$ for a constant $c$), which might not be desirable (Zheng, Kingsford, and Marçais 2020). However, random ordering can result in a lower density than that of the lexicographic ordering. Thus, random ordering (implemented with pseudo-random hash functions) is usually used in practice.

A lemma on suffix array searching

Sat, 05 Oct 2024 00:00:00 +0200

Table of Contents

1 Suffix arrays
2 Searching methods
3 Analysing the faster search

We’ll prove that using the “faster” binary search algorithm (see 2.2) that tracks the LCP with the left and right boundary of the remaining search interval has amortized runtime

\[ O\Big(\lg_2(n) + |P| + |P| \cdot \lg_2(Occ(P))\Big), \] when $P$ is a randomly sampled fixed-length pattern from the text and $Occ(P)$ counts the number of occurrences of $P$ in the text.

FM-index implementations

Wed, 02 Oct 2024 00:00:00 +0200

Here I’ll briefly list some FM-index and related implementations around the web. Implementations seem relatively inconsistent, mostly because the FM-index is more of a ‘wrapper’ type around a given Burrows-Wheeler-transform and an occurrences list. Both can be implemented in various ways. In particular occurrences should be stored using a wavelet tree for optimal compressing.

The nucleic-acid repo contains a completely unoptimised version.
The Rust-bio crate contains a generic FM-index. It stores a sampled occurrences array, so that space is relatively small but lookups take $O(k)$ time for sampling factor $k$.
SDSL contains a wavelet tree and compressed suffix array implementation based on it, that provides the same functionality as an FM-index.
There is the Quad Wavelet Tree (QWT) Rust crate (Ceregini, Kurpicz, and Venturini 2023). This uses a 4-ary tree instead of the usual binary wavelet tree, and improves latency by around a factor 2 over SDSL wavelet trees.
Dominik Kempa has the Faster-Minuter index (Gog et al. 2019) that contains an improved wavelet tree as well.
GEM-Cutter contain a GPU implementation of the FM-index (Chacon et al. 2015).
There is also RopeBWT3 (Li 2024), which is basically a run-length compressed BWT with a B+ tree on top for fast queries.

[WIP] Progress on fast suffix array searching

Tue, 01 Oct 2024 00:00:00 +0000

Here’s a lablog.

Background

Compare with suffix arrays with a twist: https://www.cai.sk/ojs/index.php/cai/article/view/2019_3_555
Compare with https://github.com/mranisz/sa, which is based on Compact and hash based variants of the suffix array
- https://journals.pan.pl/dlibra/publication/121376/edition/105762/content

Here’s a bike

A figure of a bike.

Binary searching

Eytzinger

Btrees

Multithreading

Practical minimizers

Thu, 12 Sep 2024 00:00:00 +0200

Table of Contents

1 Sampling schemes
- 1.1 Definitions
- 1.2 Miniception
- 1.3 Mod-minimizer
- 1.4 Forward scheme lower bound
- 1.5 Open syncmer minimizer
- 1.6 Open-closed minimizer
- 1.7 New: General mod-minimizer
- 1.8 Variant: Open-closed minimizer using offsets
2 Selection schemes
- 2.1 Definition
- 2.2 Bd-anchors
- 2.3 New: Smallest unique substring anchors
- 2.4 New: Anti lexicographic sorting
3 More sampling schemes
- 3.1 Anti-lex sus-anchors
- 3.2 Threshold anchors
- 3.3 The $t$-gap disappears for large alphabets
4 Computing the density of forward schemes
- 4.1 WIP: Anti lexicographic sus-anchor density
5 Open questions
6 Ideas

This post introduces some new practical sampling schemes. It builds on:

Calling Rust from Python

Tue, 10 Sep 2024 00:00:00 +0200

Table of Contents

1 Steps
- 1.1 Using kwargs
2 TODOs

Using PyO3 and maturin, it’s very easy to call Rust code from Python. I’m mostly following the guide at pyo3.rs, but leaving out some thing related to python environments.

1 Steps

Install maturin. I use the Arch package but you can also do a pip install in the environment below.
Make sure you have a lib target, and add cdylib as a crate-type.

AI reading list

Mon, 09 Sep 2024 00:00:00 +0200

[WIP] Faster binary search

Sun, 08 Sep 2024 00:00:00 +0200

Table of Contents

1 High level ideas
- 1.1 Resources
- 1.2 Code
2 To measure
3 TODO Memory efficiency
- 3.1 B-tree

1 High level ideas

Prefix table: for each 20-bit prefix, store the corresponding range of the array.
Interpolation: Make one or more interpolation steps. Could store max resulting error.
- Drawback: can cause an unpredictable number of resulting iterations.
Batching: process multiple (8-32) queries at the same time, hiding memory latency
Query bucketing: given >>1M of queries, partition them into 1M buckets and answer bucket by bucket.
Eytzinger layout
B-tree layout
prefetching (either next Eytzinger iteration, or in the batch)

1.1 Resources

Algorithmica: https://en.algorithmica.org/hpc/data-structures/
Khuong and Morin (2017)
https://www.cai.sk/ojs/index.php/cai/article/view/2019_3_555

1.2 Code

github:RagnarGrootKoerkamp/suffix-array-searching
- Some initial binary search and Btree variants.
github:RagnarGrootKoerkamp/cpu-benchmarks
- low-level CPU benchmarks to get upper bounds on potential performance

2 To measure

Max random access cacheline throughput (1 and many threads)
- Also variants for fetching 2/3/4 consecutive cachelines.

TODO 3 Memory efficiency

Suppose our task is to find an integer $q$ in a sorted list $A$ of length $n$. One option is to use binary search, but using a B-tree or the Eytzinger layout turns out faster when $n$ is large. See the excellent paper Khuong and Morin (2017) for background and detailed comparisons.

PACE 24

Thu, 05 Sep 2024 00:00:00 +0200

Table of Contents

1 General observations
2 Heuristic track
3 Parameterized track
4 Exact track

In this post I will collect some high level ideas and approaches used to solve the PACE 2024 challenge. Very briefly, the goal is to write fast solvers for NP-hard problems. The problem for the 2024 edition is one-side crossing minimization: Given is a bipartite graph $(A, B)$ that is drawn in standard way with the nodes of both $A$ and $B$ on a line, where the order of the nodes of $A$ is fixed. The goal is to find a permutation of $B$ that minimizes the number of edge crossings when all edges are drawn as straight lines.

[WIP] Feynman problems

Mon, 12 Aug 2024 00:00:00 +0200

Table of Contents

1 Space dust

1 Space dust

What is the total mass of space dust hitting the earth during the Perseids meteor shower?

References

Title

Fri, 09 Aug 2024 00:00:00 +0200

Table of Contents

1 First part
- 1.1 subsection
2 Second part

1 First part

hello

list 1
more list

asdf

first
second

col	col
val	val

1.1 subsection

centered

col 1

tab	more
2	1

col 2

tab	more
2	1

2 Second part

Computing random minimizers, fast

Fri, 12 Jul 2024 00:00:00 +0200

Table of Contents

1 Introduction
- 1.1 Results
2 Random minimizers
3 Algorithms
4 Analysing what we have so far
5 Rolling our own hash
6 SIMD sliding window
- 6.1 Results
  - Human genome results
7 TODO Cleanup, Testing, Super-k-mers, and canonical k-mers

1 Introduction

In this post, we will develop a fast implementation of random minimizers.

A near-tight lower bound on minimizer density

Tue, 25 Jun 2024 00:00:00 +0200

Table of Contents

Succinct background
- Definitions
- Lower bounds
A new lower bound
Discussion
Post scriptum
Acknowledgement

The results of this post are now available in a pre-print: DOI, PDF:

Kille, Bryce, Ragnar Groot Koerkamp, Drake McAdams, Alan Liu, and Todd Treangen. 2024. “A near-Tight Lower Bound on the Density of Forward Sampling Schemes.” Biorxiv. https://doi.org/10.1101/2024.09.06.611668.

In this post I will prove a new lower bound on the density of any minimizer or forward sampling scheme: \[ d(f) \geq \frac{\lceil\frac{w+k}{w}\rceil}{w+k} = \frac{\lceil\frac{\ell+1}{w}\rceil}{\ell+1}. \]

[WIP] High throughput searching - Part 1

Sun, 16 Jun 2024 00:00:00 +0200

Table of Contents

Hardware
Details of caches and memory
Latency, bandwidth, and throughput
Measuring latency
TODO Memory bandwidth
TODO High throughput random access
NOTES
TODO

This (planned) series of posts has the aim to write a high performance search algorithm for suffix arrays. We will start with a classic binary search implementation and make incremental improvements to it. But that is planned for Part 3.

Tools for suffix array searching

Fri, 14 Jun 2024 00:00:00 +0200

Table of Contents

1 Sapling
2 PLA-Index
3 LISA: learned index

Let’s summarize some tools for efficiently searching suffix arrays.

1 Sapling

Sapling (Kirsche, Das, and Schatz 2020) works as follows:

Choose a parameter $p$ store for each of the $2^p$ $p$-bit prefixes the corresponding position in the suffix array.
When querying, first find the bucket for the query prefix. Then do a linear interpolation inside the bucket.
Search the area $[-E, +E]$ around the interpolated position, where $E$ is a bound on the error of the linear approximation. In practice $E$ is only a $95\%$-confidence bound, and if the true value is not in the range, a linear search with steps of size $E$ is done.

The paper also introduces a neural network approach to approximating buckets, but this takes over a day to learn and is slower to query in practice.

Crates for suffix array construction

Thu, 13 Jun 2024 00:00:00 +0200

Popular C libraries are:

Both have a ..64 variant that supports input strings longer than 2GB.

Rust wrappers:

divsufsort: rust reimplementation, does not support large inputs.
cdivsufsort: c-wrapper, does not support large inputs
livdivsufsort-rs: c-wrapper, does support large inputs
sais: unrelated to the original library; does not implement a linear time algorithm anyway
libsais-rs: Daniel Liu’s fork-of-fork of the original, but not on crates.io. Supports multithreading using OpenMP and wraps both the original and 64bit version.
simple-saca: Daniel Liu’s bounded-context suffix array construction that is faster than divsufsort and libsais, but does not return a true fully sorted suffix array.

References

Thoughts on POASTA

Tue, 28 May 2024 00:00:00 +0200

Table of Contents

Summary
Background
Review comments

Here are some thoughts on POASTA (van Dijk et al. 2024), a recent affine-cost sequence-to-DAG (POA) aligner inspired by WFA and using A*.

Summary

Take a query and a directed acyclic graph (DAG).
Align the query to the full DAG. It’s like global alignment for graphs.
- In fact I think the graph doesn’t actually have to be acyclic, as long as it has a start and end. (When there is a cycle, the maximum remaining path length is simply $\infty$.)
Do greedy extension of matches, similar to WFA and A*PA.
- Note that this is not as strong as full diagonal transition as done by WFA and gWFA (graph WFA for unit costs only), which only consider farthest reaching states.
In fact, this is the first implementation of affine-cost WFA!
It also uses A* with the classic gap-cost heuristic extended to graphs.
- For each point in the graph the minimal and maximal remaining distance is computed, and if the remaining query length is outside this range, the difference to get into the range is a lowerbound on number of indels.
Greedy extension is applied (although this is inherent when using WFA).
Suboptimal states in superbubbles are pruned using additional logic.

Background

Daniel: why is nobody doing exact banded alignment, i.e., simple band doubling, for exact DP-based alignment. We are still not convinced that A*/WFA is faster than DP, especially when divergence is not super low ($<1\%$).

Review comments

Fig 1 confuses me: (partly Daniel)

A*PA2: Up to 19x faster exact global alignment

Sat, 23 Mar 2024 00:00:00 +0100

Table of Contents

Abstract
1 Introduction
- 1.1 Contributions
- 1.2 Previous work
  - 1.2.1 Needleman-Wunsch
  - 1.2.2 Graph algorithms
  - 1.2.3 Computational volumes
  - 1.2.4 Parallelism
  - 1.2.5 Tools
2 Preliminaries
3 Methods
- 3.1 Band-doubling
- 3.2 Blocks
- 3.3 Memory
- 3.4 SIMD
- 3.5 SIMD-friendly sequence profile
- 3.6 Traceback
- 3.7 A*
  - 3.7.1 Bulk-contours update
  - 3.7.2 Pre-pruning
- 3.8 Determining the rows to compute
  - 3.8.1 Sparse heuristic invocation
- 3.9 Incremental doubling
4 Results
- 4.1 Setup
- 4.2 Comparison with other aligners
- 4.3 Effects of methods
5 Discussion
Acknowledgements
Conflict of interest
6 Appendix
- 6.1 Bitpacking
- 6.2 Comparison with other aligners
- 6.3 Effects of methods

\begin{equation*} \newcommand{\g}{g^*} \newcommand{\h}{h^*} \newcommand{\f}{f^*} \newcommand{\cgap}{c_{\textrm{gap}}} \newcommand{\xor}{\ \mathrm{xor}\ } \newcommand{\and}{\ \mathrm{and}\ } \newcommand{\st}[2]{\langle #1, #2\rangle} \newcommand{\matches}{\mathcal M} \end{equation*}

Review of refined minimizes

Fri, 26 Jan 2024 00:00:00 +0100

Table of Contents

Summary
Main issues
1. Introduction
2. Methods
- 2.3 heuristic
3. Results
Discussion
Code

These are my review-like notes on refined minimizers, introduced in Pan and Reinert (2024).

Summary

The paper introduces refined minimizers, a new scheme for sampling canonical minimizers that is less biased than the usual scheme.

Instead of taking the minimum of the minimizer of the forward and reverse strand, the minimizer of the strand with the higher GT density is chosen.
The less bias towards small minimizers causes a more equal distribution of frequency of selected kmers.

Main issues

The methods contain a number of mistakes in the math and proofs.
The limit to $|s|$ needs to be made much more precise. In fact it is a $k\to\infty$ limit (rather than a $w\to\infty$ limit), which seems not as useful in practice.
A comparison to NtHash2 should be made, for both kmer frequency distribution and speed.
The provided code (github:xp3i4/mini_benchmark) segfaults and is undocumented.

1. Introduction

the minimizer concept is a data structure: to me, minimizers by themselves are not a data structure.
$w>k$: not needed. $w\geq 1$ is sufficient.
In many places, \citep citations like (Pan and Reinert 2024) would have been more appropriate then \citet ones like Pan and Reinert (2024).
of a predefined ordering scheme: the minimum of/over some set with respect to some ordering scheme.
nitpicky imprecision: $X$ is the set of positions of kmers, not simply the set of kmer strings themselves. (Or I suppose $X$ could be a list of kmers.) (Otherwise we have $|X| \leq 4^k$ and $|S|\to\infty$ so that $\rho\to 0$.)
a k-mer $X = x$ => Why not just $x$? The notation is confusing.
$n(x)/|S|$ is not really an average (there is only one string $S$); rather it’s a density.
The definition of $V$ is not clear to me. What is random? What is counted?
3. Its density converges => For $w\to \infty$ or $k\to\infty$ or both?
CMP (branch conditions) can be one of the slowest instructions on modern hardware. Branch misses in an inner loop for minimizer computation can severely affect performance.
Simple operations and L1 accesses can be pipelined and latency can be hidden, making them take 2-4x time less in practice. This makes branch-misses up to 4 times as bad, relatively.
Are lexicographic minimizers used much in practice?

2. Methods

There are a number of mistaken in the math here here and some unclarities that could use fixing.

Mod-minimizers and other minimizers

Thu, 18 Jan 2024 00:00:00 +0100

\[ \newcommand{\d}{\mathrm{d}} \newcommand{\L}{\mathcal{L}} \]

This post introduces some background for minimizers and some experiments for a new minimizer variant. That new variant is now called the mod-minimizer and published at WABI24 (Groot Koerkamp and Pibiri 2024). This also includes a review of existing methods, including pseudocode for most of the methods covered below.

Intro to Rust

Tue, 16 Jan 2024 00:00:00 +0100

These are notes for a quick introduction to Rust.

Overview

Statically typed & Compiled language.
Great developer experience:
- cargo build system
- rust-analyzer LSP

Rust features

Basics

C++	Rust
std::size_t	usize
std::pointerdiff_t	isize
int	i32
unsigned int	u32
long long	i64
unsigned long long	u64

string	String
string_view	&str
byte	u8
char	char

vector<T>	Vec<T>
array<int, 4>	[u32; 4]
int[]	&[u32]

T	T
const T&	&T
T&	&mut T
T*	unsafe { T* }
unique_ptr<T>	Box<T>

optional<T>	Option<T>
variant<T, E>	Result<T, E>

C++	Rust
for(int i = 0; i < n; ++i) {}	for i in 0..n {}
while(true) {}	loop {}
while(f()) {}	while f() {}
do { } while (f());	loop { if !f() { break; } }
switch x { case 1:; }	match x { 1 => {} }

C++	Rust
cout << “text” << endl;	println!(“text”);
cout << 1+1 << endl;	println!("{}", 1+1);
cout << n << endl;	println!("{n}");

Basic syntax

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56


// trailing return type
fn square(x: u32) -> u32 {
 // return on last line can be omitted
 x*x
}

// mutable reference
fn increment(x: &mut u32) {
 *x += 1;
}

fn main(){
 // Introduce variables with let.
 // Types are automatically inferred.
 let a = 1;
 // b is mutable.
 let mut b = 1;
 b += 1;
 // c, d, and e are usize:
 let c: usize = 1;
 let d = 1usize;
 let e = usize::MAX;

 // No parentheses needed.
 if a > b {
 // This shouldn't happen.
 panic!();
 }

 // 0..n is a `Range`.
 // Ranges are `IntoIterator` and converted into an iterator, which is looped over.
 for i in 0..n {
 // Print i to a line on stderr.
 eprintln!("{i}");
 }

 while b < 1000 {
 increment(b);
 }

 loop {
 b = square(b);
 if b > 1000000000 {
 break;
 }
 }

 // Pattern matching
 match 5 {
 0 => panic!(),
 1 => todo!(),
 2..3 => eprintln!("small"),
 x if x%2==0 => eprintln!("even {x}"),
 _ => eprintln!("odd");
 }
}

Expressions everywhere!

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16


let a = { 1 + 1 };
let b = if a > 10 { a } else { 10 };
let c = loop {
 break 3;
};
let d = {
 let mut x = 1;
 while x < 1000 {
 x *= 2;
 }
 x
};
let a = match Some(5) {
 None => 0,
 Some(x) => 2*x,
};

Closures

1
2
3
4


let double = |x| 2*x;
let a = double(1);
let multiply = |x: usize, y: usize| -> usize { x * y };
let b = multiply(2, 3);

Pattern matching

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18


let a: Option<i32> = Some(1);
match a {
 Some(0) => eprintln!("I am 0"),
 Some(x) if x % 2 == 0 => eprintln!("I am even"),
 Some(x) => eprintln!("I am {x}"),
 None => eprintln!("I am none"),
}

if let Some(x) = a {
 eprintln!("a = Some({x})");
}

let Some(x) = a else {
 return;
};
eprintln!("{x}");

let x = a.unwrap();

References

Ownership

Containers

1
2
3
4
5


// Create an array
let a: [usize; 10] = [1; 10];
// Create a vec
let v: Vec<usize> = vec![1usize; 10];
assert_eq!(&a, &v, "Slices are not equal!");

Traits

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28


trait MyTrait {
 fn my_fn(&self);
}

impl MyTrait for usize {
 fn my_fn(&self) {
 eprintln!("I am a usize!");
 }
}

impl MyTrait for i32 {
 fn my_fn(&self) {
 eprintln!("I am a i32!");
 }
}

fn f(t: impl MyTrait) {
 t.my_fn();
}

fn main() {
 let a = 1; // i32 by default
 a.my_fn();
 let b = 1usize;
 b.my_fn();
 f(a);
 f(b);
}

Iterators

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21


for i in 0..10 {
 eprintln!("i={}", i);
}

let v = (0..10).collect_vec();

for x in &v {
 eprintln!("x={}", x);
}

for (i, x) in v.iter().enumerate() {
 eprintln!("{i:>2} => {x}");
}

for x in v.iter().filter(|x| **x % 2 == 0) {
 eprintln!("x={}", x);
}

for x in v.iter().map(|x| x * x) {
 eprintln!("square: {}", x);
}

Common libraries

See blessed.rs for a list of commonly used and recommended libraries.

Notes on bidirectional anchors

Mon, 15 Jan 2024 00:00:00 +0100

Table of Contents

Paper overview
Remarks on the paper
Thoughts

\[ \newcommand{\A}{\mathcal{A}_\ell} \newcommand{\T}{\mathcal{T}_\ell} \]

These are some notes on Bidirectional String Anchors (Loukides, Pissis, and Sweering 2023), also called bd-anchors.

Resources:

Loukides and Pissis (2021): preceding conference paper with subset of content.
Loukides, Pissis, and Sweering (2023): The paper discussed here.
Ayad, Loukides, and Pissis (2023): follow-up/second paper containing
- a faster average-case $O(n)$ construction algorithm;
- a more memory efficient construction algorithms for the index.
https://github.com/solonas13/bd-anchors: code for first paper
https://github.com/lorrainea/BDA-index: code for follow-up paper

The remainder of this post is split into an overview of the paper, Remarks on the paper, and further Thoughts.

Notes on SsHash

Mon, 15 Jan 2024 00:00:00 +0100

Table of Contents

Paper summary
Remarks
Ideas

\[\newcommand{\S}{\mathcal{S}}\]

Paper summary

Intro

SsHash (Pibiri 2022) is a datastructure for indexing kmers. Given a set of kmers $\S$, it supports two operations:

$Lookup(g)$: return the unique id $i\in [|\S|]$ of the kmer $g$.
$Access(i)$: return the kmer corresponding to id $i$.

It also supports streaming queries, looking up all kmers from a longer string consecutively, by expoiting the overlap between them.

One Billion Row Challenge

Wed, 03 Jan 2024 00:00:00 +0100

Table of Contents

External links
The problem
Initial solution: 105s
First flamegraph
Bytes instead of strings: 72s
Manual parsing: 61s
Inline hash keys: 50s
Faster hash function: 41s
A new flame graph
Perf it is
Something simple: allocating the right size: 41s
memchr for scanning: 47s
memchr crate: 29s
get_unchecked: 28s
Manual SIMD: 29s
Profiling
Revisiting the key function: 23s
PtrHash perfect hash function: 17s
Larger masks: 15s
Reduce pattern matching: 14s
Memory map: 12s
Parallelization: 2.0s
Branchless parsing: 1.7s
Purging all branches: 1.67s
Some more attempts
Faster perfect hashing: 1.55s
Bug time: Back up to 1.71s
Temperatures less than 100: 1.62s
Computing min as a max: 1.50
Intermezzo: Hyperthreading: 1.34s
Not parsing negative numbers: 1.48s
More efficient parsing: 1.44s
Fixing undefined behaviour: back to 1.56s
Lazily subtracting b'0': 1.52s
Min/max without parsing: 1.55s
Parsing using a single multiplication: doesn’t work
Parsing using a single multiplication does work after all! 1.48s
A side note: ASCII
Skip parsing using PDEP: 1.42s
- Improved
- A further note
Branchy min/max: 1.37s
No counting: 1.34s
Arbitrary long city names: 1.34
4 entries in parallel: 1.23s
Mmap per thread
Reordering some operations: 1.19s
Reordering more: 1.11s
Even more ILP: 1.05
Compliance 1, OK I’ll count: 1.06
TODO
Postscript

Since everybody is doing it, I’m also going to take a stab at the One Billion Row Challenge.

Perfect NtHash for Robust Minimizers

Sun, 31 Dec 2023 00:00:00 +0100

Table of Contents

NtHash
Minimizers
- Robust minimizers
Is NtHash injective on kmers?
- Searching for a collision
- Proving perfection
Alternatives
SmHasher results
TODO benchmark NtHash, NtHash2, FxHash

NtHash

NtHash (Mohamadi et al. 2016) is a rolling hash suitable for hashing any kind of text, but made for DNA originally. For a string of length $k$ it is a $64$ bit value computed as:

\begin{equation} h(x) = \bigoplus_{i=0}^{k-1} rot^i(h(x_i)) \end{equation}

A*PA talk @ CWI

Wed, 27 Dec 2023 00:00:00 +0100

I recently gave a talk about A*PA at CWI. Sadly the recording doesn’t show the blackboard, but either way, find it here.

Notes on implementing Longest Common Repeat (LCR)

Wed, 06 Dec 2023 00:00:00 +0100

Table of Contents

Notes
Discussion / TODOs
- Evals

These are my running notes on implementing an algorithm for Longest Common Repeat using minimizers.

Notes

Coloured Tree Problem

See Lemma 3 at here

Generic sparse suffix array

For random strings and $b \leq n / \log n$, direct radix sort on $2log n + log log n$-bit prefixes is sufficient for $O(n)$ runtime. In fact, since computer word size $w\geq \log n$, we only need at most $2$ rounds of radix sort! (See simple-saca.)

ALPACA/PANGAIA winter workshop notes

Mon, 20 Nov 2023 00:00:00 +0100

Table of Contents

Monday
Tuesday
- Variant types
Wednesday

These are notes of discussions at the ALPACA/PANGAIA conference in November 2023.

Monday

I had interesting discussions with Giulio, Paul, and Lucas Robidou.

Fimpera: bloom filter for kmers

Idea: instead of storing $k$mers in bloom filter, store all constituent $s$mers ($s<k$). This allows single-memory-lookup membership queries when streaming $k$mers.

Notes on writing course

Tue, 14 Nov 2023 00:00:00 +0100

Some notes from the writing course I’m taking.

Lecture 1, 14 November

Resources

Searching phrases/alternatives in quotes in Google Scholar can tell which one is more frequently used.

[WIP] PTRhash: Improving the PTHash Minimal Perfect Hash Function

Mon, 23 Oct 2023 00:00:00 +0200

Table of Contents

Abstract
Introduction
Background
- PtHash
- Phobic
PtrHash
Results

Abstract

Motivation: Given a set $S$ of $n$ objects, a minimal perfect hash function (MPHF) is a collision-free bijective map $f$ from the elements of $S$ to $\{0, \dots, n-1\}$. These functions have uses in databases, search engines, and are used in bioinformatics indexing tools such as Pufferfish (using BBHash), SSHash, and Piscem (both using PTHash). This work presents an MPHF that prioritizes query throughput and can be constructed efficiently for billions or more elements using $2$ to $4$ bits of memory per key.

BAPCtools instruction

Tue, 17 Oct 2023 00:00:00 +0200

Steps:

Clone https://github.com/RagnarGrootKoerkamp/BAPCtools

Make an alias to the executable:

1

ln -s ~/git/BAPCtools/bin/tools.py ~/bin/bt

Create a new problem:

1
2
3


cd ~/problems
bt new_problem my_problem_name
cd ~/problems/my_problem_name

You now have the following:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22


.
├── data
│   ├── sample
│   │   └── 1.in # Sample testcase input
│   │   └── 1.ans # Sample testcase output
│   └── secret
├── generators # for later
│   └── ...
├── input_validators # for later
│   └── ...
├── output_validators # for later
│   └── ...
├── problem_statement
│   ├── figure.tex.template
│   ├── problem.en.tex # The problem statement
├── problem.yaml
└── submissions
 ├── accepted
 │   └── name.cpp # A submission
 ├── run_time_error
 ├── time_limit_exceeded
 └── wrong_answer

Edit the statement problem.en.tex
Make some samples by hand in data/samples/*.{in,ans}
Write a solution in submissions/accepted/<yourname>.{py,cpp,java}.
Use bt run or bt run submissions/accepted/<submission> to test your submission.

PTRHash: Notes on adapting PTHash in Rust

Thu, 21 Sep 2023 00:00:00 +0200

Table of Contents

Questions and remarks on PTHash paper
Ideas for improvement
Implementation log
PtrHash, part 2
- Phobic
  - TODO for PTRhash

\[ %\newcommand{\mm}{\,\%\,} \newcommand{\mm}{\bmod} \newcommand{\lxor}{\oplus} \newcommand{\K}{\mathcal K} \]

BBHash: some ideas

Mon, 04 Sep 2023 00:00:00 +0200

Table of Contents

Possible speedup?

BBHash Limasset et al. (2017) uses multiple layers to create a minimal perfect hashing functions (MPFH), that hashes some input set into $[n]$.

(See also my note on PTHash (Pibiri and Trani 2021).)

Simply said, it maps the $n$ elements into $[\gamma \cdot n]$ using hashing function $h_0$. The $k_0$ elements that have collisions are mapped into $[\gamma \cdot k_0]$ using $h_1$. Then, the $k_1$ elements with collisions are mapped into $[\gamma \cdot k_1]$, and so on.

BitPAl bitpacking algorithm

Sun, 03 Sep 2023 00:00:00 +0200

Table of Contents

Problem
Input
Example
Discussion
Found the bug
Outlook

The supplement (download) of the Loving, Hernandez, and Benson (2014) paper introduces a $15$ operation version of Myers (1999) bitpacking algorithm, which uses $16$ operations when modified for edit distance.

I tried implementing it, but it seems to have a bug that I will describe below. The fix is here.

Problem

To recap, this algorithm solves the unit-cost edit distance problem by using bitpacking to compute a $1\times w$ at a time. As input, it takes

[WIP] Bitpacking and string searching

Fri, 11 Aug 2023 00:00:00 +0200

Table of Contents

Intro
Review papers
DP methods
Automata methods
- Hamming distance
- Edit distance
Suffix array methods
- Hamming distance
  - Galil and Giancarlo (1986)
  - Grossi and Luccio (1989)
- Edit distance
Other
- Hyyrö (2008a)
- Hyyrö, Narisawa, and Inenaga (2010)
TODO
- TODO

\[ \newcommand{\ceil}[1]{\lceil#1\rceil} \newcommand{\floor}[1]{\lfloor#1\rfloor} \]

Shortest paths, bucket queues, and A* on the edit graph

Sat, 29 Jul 2023 00:00:00 +0200

Table of Contents

Shortest path algorithms ..
- .. in general
- .. for circuit design
Bucket queues
Shortest path algorithms by Hadlock
- Grid graphs
- Strings
Spouge’s computational volumes

This note summarizes some papers I was reading while investigating the history of A* for pairwise alignment, and related to that the first usage of a bucket queue. Schrijver (2012) provides a nice overview of general shortest path methods.

Shortest path algorithms ..

.. in general

Moore (1959) was already presented in 1957. I did not find a PDF of this paper but Schrijver (2012) summarizes it well: For unit-cost graphs it presents an $O(m)$ BFS algorithm, and for general weighted graphs an $O(mn)$ algorithm.

Research proposal: subquadratic string graph construction

Mon, 10 Jul 2023 00:00:00 +0200

Table of Contents

Introduction
Research plan

This is a research proposal for a 5 month internship at CWI during autumn/winter 2023-2024.

Introduction

An important problem in bioinformatics is genome assembly: DNA sequencing machines read substrings of a full DNA genome, and these pieces must be assembled together to recover the entire genome.

Loukides, Pissis, Thankachan, Zuba :: Suffix-Prefix Queries on a Dictionary

Fri, 07 Jul 2023 00:00:00 +0200

Table of Contents

Comments
A small rant on $τ$-micro-macro trees
Ideas for simplification
Closing thoughts

\[\newcommand{\dol}{\$}\]

These are some comments and new ideas on the paper by Loukides, Pissis, Thankachan, and Zuba (2023).

DSB 2023

Mon, 20 Mar 2023 00:00:00 +0100

These are notes for DSB 2023. They’re not very structured though. I usually find methods more interesting than results.

Day 1, Tuesday

Practical data structures for longest common extensions, Alexander Herlez

LCE: longest common extension: given $i$, $j$, the max $k$ s.t. $A[i, i+k) = A[j, j+k)$.
alg:
- compare first k
- if same: sample a subset and use black-box datastructure.
- similar idea to minhash/mash kmer selection methods, same(?) as syncmers
string synchronizing sets (SSS):
- rolling hash. sample a position when it has minimal hash in it’s window
https://github.com/herlaz/alx
useful for constant time LCE when extensions are length 1000 or longer. Not faster for shorter LCEs.
note: WFA uses this a lot and it’s actually the bottleneck, but really only for short extensions.

Pan-genome de Bruijn graph using the bidirectional FM-index, Lore Depuydt

Goal: graph of pangenome with correspondences to reads
Implicit graph by Beller and Ohlebusch: build on top of FM-index of concatenation
New in Nexus:
- Bidirectional FM-index for bidirectional traversal
  - down/upstream neighbourhoods can be visualized efficiently using traversal
- Search schemes for lossless approximate pattern matching (separate talk)
  - query -> candidate matches -> graph paths -> read paths
  - Lossless aligner; reports many more occurrences than PuffAligner, although this does make it slower for >0 edit distance
- checkpoints k-mers inside long nodes
  - this way, to find the start of a long (compressed) node goes down significantly.
Memory usage: Bidirectional FM-index is >80%.
- Bidir FM is linear in total size of pangenome.
Future: r index, which uses less memory.

Indexing large metagenomics projects with abundances, Pierre Peterlongo

Indexing read sets with abundance in <100GB with fast querying with low ram.
Uses counting bloom filters
fimpera decreases the overestimation, allowing less memory usage.
Once k-mers are sorted, all processes are sequential! 1000x speedup
instead of querying k-mers, store all slightly smaller s-mers and query all of them for much better false positive rates

µ-PBWT: Enabling the Storage and Use of UK Biobank Data on a Commodity Laptop, Davide Cozzi

r-index equivalent of PBWT, using run-length encoding
10-100x less memory than PBWT.
https://github.com/dlcgold/muPBWT

Genome-on-Diet: Taming Large-Scale Genomic Analyses via Sparsified Genomics, Mohammed Alser

Building an index on spaced kmers/patterns
Only some positions are sampled. i.e.: sample every other basepair and build kmers on those.
Index is built on the fly; not preprocessed
to 2x faster and 2x less memory than minimap2.
to 50x faster and 700x less memory than other tools.
summary: subsample 1 in m bits and run minimap on top of that.
similar to spaced seeds, but additionally subsamples the number of kmers.
Faster and better accuracy than spaced seeds.

Spectrum preserving tilings enable sparse and modular reference indexing, Giulio Ermanno Pibiri

Spectrum preserving tiling: Given a set of reads and their DBG, we can for each read store location information per unitig in this larger graph
Use SSHash to store all unitigs.
Sample a subset of unitigs to store location information for. Non-sampled unitigs can still be recovered by querying adjacent unitig locations instead.
Main contribution: reducing space usage of the reverse index: mapping kmers to locations.
https://github.com/COMBINE-lab/pufferfish2

Towards a lower-memory chunked graph data structure inspired by Minecraft, Fawaz Dabbaghie

Chunk big graphs as in minecraft chunks.
First approach: split human genome graph in 1000 parts using BFS’s from random positions
Already big speedup!
TODO: simple idea and super effective. Maybe play around with this at some point

Optimal Worst-Case Design of Gapped k-mer Masks, Sven Rahmann

Gapped kmers are better in the worst case than normal kmers:
- If you can make $x$ substitutions in a length $n$ strings, gapped kmers need a higher $x$ to mutate (break) all of them.
Second optimization goal: count number of recovered positions, instead of number of kmers.
#########_#########_######### (27,29)-mer: in a $n=100$ window, lack of such matches implies at least 4 errors. Normal 27-mers imply at least 3 errors.
boundary effects (changing $n$) are not super strong.
TODO: read old papers and see if this could be used for A*PA
- How about inexact spaced matches?

Locality-Preserving Hashing of k-mers, Yoshihiro Shibuya

Mapping of kmers to $[0, 4^n)$ such that kmers with same minimizer are close.
Split mapping based on whether the minimizer is left-maximal and/or right-maximal inside its super-k-mer.
https://github.com/jermp/lphash
Less than 1.44 bits/element for large k (which is the generic lower bound).

Space-efficient k-mer counting using an implicit sequence representation, Miika Leinonen

Map kmers into a hashtable storing at each index:
- count of kmers mapping here
- the last character of the kmer
- the index of the preceding kmer
Memory usage: 25%-50% of normal hash table
saves more memory for larger k.

VeChat: correcting errors in long reads using variation graphs, Alexander Schönhuth

Error correction using graph cleaning
Clean using repeated steps of:
- remove edges with low coverage
- clean edges with low confidence: the relative weight this edge has with respect to total weight of edges going out of predecessor or going into successor
- clean isolated nodes and leaf edges
Up to 10x less remaining errors than other tools.

Fixing homopolymer errors in HiFi reads using dictionary compression, Diego Diaz-Dominguez

Encode sequence recursively using grammer based
Grammar compression is good on its own
TODO: read paper

Orthanq: orthogonal evidence based haplotype quantification, Hamdiye Uzuner

Variant calling pipeline
https://github.com/orthanq

Day 2, Wednesday

Random Wheeler graphs, Riccardo Maso

missed it :(

Doctoral plan

Mon, 12 Dec 2022 00:00:00 +0100

Research Proposal: Near-linear exact pairwise alignment

Abstract

Pairwise alignment and edit distance specifically is a problem that was first stated around 1968 (Needleman and Wunsch 1970; Vintsyuk 1968). It involves finding the minimal number of edits (substitutions, insertions, deletions) to transform one string/sequence into another. For sequences of length $n$, the original algorithm takes $O(n^2)$ quadratic time (Sellers 1974). In 1983, this was improved to $O(ns)$ for sequences with low edit distance $s$ using Band-Doubling. At the same time, a further improvement to $O(n+s^2)$ expected runtime was presented using the diagonal-transition method (Ukkonen 1983, 1985; Myers 1986).

One Year Of Rust

Thu, 17 Nov 2022 00:00:00 +0100

Table of Contents

Thoughts and remarks
- Good
- Bad
My programming language journey
- Lego mindstorms
- LabVIEW
- C++
- Python
- Rust

These are some notes on my opinions on Rust after one year of using it.

Thoughts and remarks

These pros and cons are mostly relative to C++, the language I used for the past ~10 years.

Good

Sum types!
- Option and enum are so much nicer than optional and in particular variant.
Build system
- Never used 3rd party code/libs in personal C++ projects.
- Use random crates all the time now!
Rust Analyzer
- C++ had YouCompleteMe in Vim, but Rust Analyzer worked out of the box and coding is so much nicer now.
No more need for !=nullptr
- If you want to use an optional, you are forced to check it’s existence.
- with if let Some(value) = optional, you can do the check and unpacking in one go, saving bugs.
C++ move semantics are an afterthought – don’t forget to move() when needed! In Rust, this comes naturally.
Lifetimes save bugs. It’s tedious, but as long as you’re doing nothing crazy you just keep applying suggestions until it works.
Turn your linux app into a webapp in 1 simple step!
I don’t actually believe Rust is hard to learn. Maybe if you’re used to C, but coming from C++11, Rust simply better corresponds to my mental model of code than C++ does. Sure it takes time, but what is the last time you learned a new language? After 1 year, I think I know/understand a larger percentage of Rust than I knew C++ after 10 years.
This is the right time to start! Lots of new cool features recently!
- let-chaing: if let Some(val) = val && val > 0 { .. }
- let-else: let Some(val) = val else { return; };
- GATs
ranges done right. (Well, at least more sane than C++-20/23 ranges)
Traits: tell the compiler that every type with .begin() and .end() is indeed a container.
No header/implementation split, and no header mess. Sane modules that map to directories!
Single source for all documentation!
WASM: taking your SDL2 based rendering and porting it to HTML Canvas in a day is very satisfying!
Expressions everywhere!
- let x = if var {1} else {0};,
- let x = { let a = 1; a };,
- let x = loop { break 10; };.
Simple and consistent struct initialization: let x = X { a: 1, b: 2 };

Bad

No equivalent of template-templates.
Using global/static convenience variables in simple programs is pain.
yes, fighting the borrow checker can be annoying, especially if you don’t know what you’re doing. Just remember: don’t try to make a struct that contains references into itself.
Using generic types has some rough edges
Haven’t used it yet for standalone files. In competitive programming/Project Euler, it’s too much overhead to create a new project for each problem.
Casting between usize, u32, and i32 all the time when using graphics libraries gets boring fast.

My programming language journey

Lego mindstorms

Age 11-13

LabVIEW

Age 13-15

C++

Age 15-26
Oh my god, it’s so nice to just type what you want instead of dragging/dropping boxes and wires
Started at C++11, solving Project Euler problems mostly
Only learned about references after a few years of using C++.
Never really used pointers in my own projects.
I couldn’t tell you how to declare and initialize a native array.
Also, I still can’t write new
Big fanboy; watched ~half the CppCon videos after each edition.
Always excited for the next edition.

Python

Age 21-
BAPCtools, a 5kloc python project

Rust

Age 26-
Started summer 2021 with a small hobby project
Now used in AstarPA, a 14kloc pairwise alignment project
Read all the blogs on r/rust.

The complexity and performance of WFA and band doubling

Thu, 17 Nov 2022 00:00:00 +0100

Table of Contents

Complexity analysis
Implementation efficiency
- Band doubling for affine scores was never implemented
WFA vs band doubling for affine costs
Conclusion
- Future work

This note explores the complexity and performance of band doubling (Edlib) and WFA under varying cost models.

Edlib (Šošić and Šikić 2017) uses band doubling and runs in $O(ns)$ time, for sequence length $n$ and edit distance $s$ between the two sequences.

String algorithm visualizations

Tue, 08 Nov 2022 00:00:00 +0100

Select the algorithm to visualize
Click the buttons, or click the canvas and use the indicated keys

Suffix-array construction is explained here and BWT is explained here.

Source code is on GitHub.

Algorithm
String
Query

Delay (s)

Thoughts on linear programming

Fri, 04 Nov 2022 00:00:00 +0100

Table of Contents

Linear programming
Assumptions
Idea for an algorithm

This note contains some ideas about linear programming and most-orthogonal faces. They’re mostly on an intuitive level and not very formal.

Postscriptum: The ideas here don’t work.

Linear programming

Maximize $\t\x$ subject to $A\x \leq \b$.

$\x$ is a vector of $n$ variables $x_i$.
$A$ is a $m\times n$ matrix: there are $m$ constraints $A_j \x \leq b_j$.

Assumptions

We make the following assumptions:

Local Doubling

Wed, 19 Oct 2022 00:00:00 +0200

Table of Contents

Notation
Needleman-Wunsch: where it all begins
Dijkstra/BFS: visiting fewer states
Band doubling: Dijkstra, but more efficient
GapCost: A first heuristic
Computational volumes: an even smaller search
Cheating: an oracle gave us $g^*$
A*: Better heuristics
Broken idea: A* and computational volumes
Local doubling
- Without heuristic
- With heuristic
Diagonal Transition
A* with Diagonal Transition and pruning: doing less work
Goal: Diagonal Transition + pruning + local doubling
Pruning: Improving A* heuristics on the go
Cheating more: an oracle gave us the optimal path
TODO: aspriation windows

\begin{equation*} \newcommand{\st}[2]{\langle #1,#2\rangle} \newcommand{\g}{g^*} \newcommand{\fm}{f_{max}} \newcommand{\gap}{\operatorname{Gap}} \end{equation*}

BWT and FM-index

Tue, 18 Oct 2022 00:00:00 +0200

Table of Contents

Burrows-Wheeler Transformation (BWT)
Bi-directional BWT

These are some notes about the Burrows-Wheeler Transform (BWT), FM-index, and variants.

See my post on the linear time suffix array construction algorithm for notation and terminology.

At the bottom you can find a visualization.
This page has an interactive demo.

Source code for visualizations is this GitHub repo.

Burrows-Wheeler Transformation (BWT)

The BWT of a string $S$ is generated as follows:

A Combinatorial Identity

Sun, 16 Oct 2022 00:00:00 +0200

Some notes regarding the identity

\begin{equation} \sum_{k=0}^n \binom{2k}k \binom{2n-2k}{n-k} = 4^n \end{equation}

Gould has two derivations:
- The first, from Jensens equality, (18) in (Jensen 1902; Shijie 1303).
- A second via the Chu-Vandermonde convolution:
  
  \begin{equation} \sum_{k=0}^n \binom{x}k \binom{y}{n-k} = \binom{x+y}n \end{equation}
  
  using $x=y=-\frac 12$ and using the $-\frac 12$-transform:
  
  \begin{equation} \binom{-1/2}{n} = (-1)^n\binom{2n}{n}\frac 1 {2^{2n}} \end{equation}
Duarte and de Oliveira (2012) has a combinatorial proof.

References

Duarte, Rui, and António Guedes de Oliveira. 2012. “New Developments of an Old Identity.” https://doi.org/10.48550/ARXIV.1203.5424.

Jensen, J. L. W. V. 1902. “Sur Une Identité D’abel et Sur D’autres Formules Analogues.” Acta Mathematica 26 (0): 307–18. https://doi.org/10.1007/bf02415499.

Shijie, Zhu. 1303. Jade Mirror of the Four Unknowns.

Tensor embedding preserves Hamming distance

Fri, 14 Oct 2022 00:00:00 +0200

Table of Contents

Definitions
Proof of Lemma 1
TODO Proof of Lemma 2

This is a proof that Tensor Embedding (Joudaki, Rätsch, and Kahles 2020) with $ℓ^2$-norm preserves the Hamming distance.

This is in collaboration with Amir Joudaki.

\begin{equation*} \newcommand{\I}{\mathcal I} \newcommand{\EE}{\mathbb E} \newcommand{\var}{\operatorname{Var}} \end{equation*}

Definitions

Notation

The alphabet is $\Sigma$, of size $|\Sigma| = \sigma$.
The set of indices is $\I := \{(i_1, \dots, i_t) \in [n]^t: i_1 < \dots < i_t\}$.
Given a string $a_1\dots a_n = a\in \Sigma^n$, we define the $I$-index as $a_I = (a_{i_1}, \dots, a_{i_t})$.
We write $[ X ]$ for the indicator variable of event $X$, which is $1$ when $X$ holds and $0$ otherwise.

Definition 1: Tensor embedding

Given $a\in \Sigma^n$, the tensor embedding $T_a$ is the $\sigma^t$ tensor given by $T_a[s] = \sum_{I\in \I} [A_I = s]$ for each $s\in \Sigma^t$.

The normalized tensor embedding distance $d_{te}$ between two sequences $a$ and $b$ is defined as

Linear-time suffix array construction

Thu, 13 Oct 2022 00:00:00 +0200

Table of Contents

Notation
Small and Large suffixes
Building the suffix array from a smaller one
Visualization

These are some notes about linear time suffix array (SA) construction algorithms (SACA’s).

At the bottom you can find a visualization.
This page has an interactive demo.

History of suffix array construction algorithms:

1990 first algorithm: Manber and Myers (1993)
2002 small/large suffixes, explained below: Ko and Aluru (2005)
2009 recursion only on LMS suffixes: Nong, Zhang, and Chan (2009)

These slides from Stanford are a nice reference for the last algorithm.

Competitive Programming Lecture

Wed, 28 Sep 2022 00:00:00 +0200

Table of Contents

Contest strategies
Pairwise Alignment using A*
Exercises

Contest strategies

Preparation

Thinking costs energy!
Sleep enough; early to bed the 2 nights before.
No practising on contest day (and the day before); it just takes energy.

During the contest

Eat! At the very least take a break halfway with the entire team and eat some snacks.
Make sure to read all the problems before the end of the contest. In the beginning, split the problems to find the simple ones, but towards the end, find a problem you think you can solve (because of the scoreboard or because you like it), and work on it as a team.

Coding

Ideally, use C++. Otherwise, Python can be used too.
- For big-integer problems, prefer Python.
Use a TCR (e.g. https://github.com/TimonKnigge/TCR): a 25 page document containing algorithms. Ideally, implement all of them yourself so you know how they work. Otherwise download one.
Make a template, and add it to your TCR. One person should type this in the first minutes of the contest and copy it to A.cpp, B.cpp, … .
When you think you solved a problem:
- Decide exactly how the code will look. Maybe write pseudocode on paper.
- For hard problems: verify your solution with a teammate.
- Once the keyboard is free, start typing it out. If needed, ask one teammate to look while you code.
- Typical distribution:
  - 1 person typing
  - 1 person solving a new problem
  - 1 person helping the other 2: spotting typos or working on problems.

Pairwise Alignment using A*

Some resources you can use:

Reducing A* memory usage using fronts

Mon, 26 Sep 2022 00:00:00 +0200

Table of Contents

Motivation
Parititioning A* memory by fronts
- Non-consistent heuristics
- Front indexing
Tracing back the path

Here is an idea to reduce the memory usage of A* by only storing one front at a time, similar to what Edlib and WFA do. Note that for now this will not work, but I’m putting this online anyway.

Motivation

In our implementation of A*PA, we use a hashmap to store the value of $g$ of all visited (explored/expanded) states by A*. This can take up a lot of memory and simply reading/writing $g$ in the hashmap can take over half the total execution time.

Speeding up A*: computational volumes and path-pruning

Fri, 23 Sep 2022 00:00:00 +0200

Table of Contents

Motivation
Summary
Why is A* slow?
Computational volumes
Dealing with pruning
- Thoughts on more aggressive pruning
Algorithm summary
Challenges
Results
What about band-doubling?
- Maybe doubling can work after all?
TODOs
Extensions

This post build on top of our recent preprint Groot Koerkamp and Ivanov (2024) and gives an overview of some of my new ideas to significantly speed up exact global pairwise alignment. It’s recommended you understand the seed heuristic and match pruning before reading this post.

Revised Oxford Bioinformatics latex template

Thu, 22 Sep 2022 12:13:00 +0200

I made an improved version of the Oxford Bioinformatics latex template. See the Github repository.

Linear memory WFA?

Wed, 17 Aug 2022 00:00:00 +0200

Table of Contents

Motivation
Path traceback: two strategies
Observations
- What information is needed for path tracing
A pragmatic solution
Another interpretation
Affine costs
Conclusion

Figure 1: Only the red substitutions and blue indel need to be stored to trace the entire path.

In this post I’ll discuss an idea to run WFA using less memory, while still allowing us to trace back the optimal path from the target state back to the start of the search.

Transforming match bonus into cost

Tue, 16 Aug 2022 00:00:00 +0200

Table of Contents

Tricks with match bonus or how to fool Dijkstra’s limitations
Conclusion

Tricks with match bonus or how to fool Dijkstra’s limitations

The reader is assumed to have basic knowledge about pairwise alignment and graph theory.

Paper styleguide

Sat, 06 Aug 2022 00:00:00 +0200

Table of Contents

Notation
Naming and style

This is a growing list of notation and style decisions Pesho and I made during the writing of our paper, written down so that we don’t have to spend time on it again next time.

Notation

Math

Modulo: $a\bmod m$ for remainder, $a\equiv b\pmod m$ for equivalence.

Alphabet

$\Sigma$, $|\Sigma| = 4$

Sequences

$A = \overline{a_0\dots a_{n-1}} \in \Sigma^*$, $|A| = n$
$B = \overline{b_0\dots b_{m-1}} \in \Sigma^*$, $|B| = m$
Edit distance $\mathrm{ed}(A, B)$
$A_{<i} = \overline{a_0\dots a_{i-1}}$
$A_{\geq i} = \overline{a_i\dots a_{n-1}}$
$A_{i\dots i’} = \overline{a_i\dots a_{i’-1}}$

Edit graph

State $\langle i, j\rangle$
Graph $G(V, E)$ where $V = \{\langle i,j\rangle | 0\leq i\leq n, 0\leq j\leq m\}$
Root state $v_s = \langle 0,0\rangle$
Target state $v_t = \langle n,m\rangle$
Distance $d(u, v)$
Path $\pi$
Shortest path $\pi^*$
Cost of path $cost(\pi)$, $cost(\pi^*) = d(v_s, v_t) = \mathrm{ed}(A, B)$.

Naming and style

Vertex, not node

Diamond optimisation for diagonal transition

Mon, 01 Aug 2022 00:00:00 +0200

Table of Contents

Diamond transition or how technicalities can break concepts
- But let’s take a closer look
- Conclusion

Diamond transition or how technicalities can break concepts

We assume the reader has some basic knowledge about pairwise alignment and in particular the WFA algorithm.

In this post we dive into a potential 2x speedup of WFA — one that turns out not to work.

Let’s take a look at one of the most important and efficient algorithms for pairwise alignment — WFA (Marco-Sola et al. 2020). It already looks good, and is pretty efficient. In Table 1, which copies the style of Figure 1 in Eizenga and Paten (2022), rows are wavefronts, and columns are diagonals. Light-blue states are stored in memory. Green shows the current state being computed, and dark-blue shows the cells the green cell depends on.

Bidirectional A*

Thu, 28 Jul 2022 17:59:00 +0200

These are some links and papers on bidirectional A* variants. Nothing insightful at the moment.

small lecture: introduces $h_f(u) = \frac 12 (\pi_f(u) - \pi_r)$. Not found a paper yet.
An Improved Bidirectional Heuristic Search Algorithm (Champeaux 1977): introduces a bidirectional variant
Bidirectional Heuristic Search Again (Champeaux 1983): fixes a bug in the above paper
Efficient modified bidirectional A* algorithm for optimal route-finding: Didn’t read closely yet.
A new bidirectional algorithm for shortest paths (Pijls 2008): Actually a new methods. Seems to cite useful papers.
There 2 papers that cite this one may also be interesting.

The BiWFA meeting condition

Mon, 11 Jul 2022 00:00:00 +0200

cross references: BiWFA GitHub issue

It seems that getting the meeting/overlap condition of BiWFA (Marco-Sola et al. (2023), Algorithm 1 and Lemma 2.1) correct is tricky.

Let $p := \max(x, o+e)$ be the maximal cost of any edge in the edit graph. As in the BiWFA paper, let $s_f$ and $s_r$ be the distances of the forward and reverse fronts computed so far.

We prove the following lemma:

Lemma Once BiWFA has expanded the forward and reverse fronts up to $s_f$ and $s_r$ and has found some path of cost $s \leq s_f + s_r$, expanding the fronts until $s’_f + s’_r \geq s+p+o$ is guaranteed to find a shortest path.

A* variants

Sun, 12 Jun 2022 12:04:00 +0200

These are some quick notes listing papers related to A* itself and variants. In particular, here I’m interested in papers that update $h$ during the A* search, as a background for pruning.

Specifically, our version of pruning increases $h$ during a single A* search, and in fact the heuristic becomes in-admissible after pruning.

Changing $h$

The original A* paper has a proof of optimality. Later papers consider this also with heuristics that change their value over time.

IGGSY 22 Slides

Sun, 12 Jun 2022 12:04:00 +0200

These are the slides Pesho Ivanov and I presented at IGGSY 2022 on Astarix and A*PA.

Drive: here

Pdf: here

Benchmark attention points

Thu, 28 Apr 2022 23:33:00 +0200

Benchmarking is harder than you think, even when taking into account this rule.

This post lists some lessons I learned while attempting to run benchmarks for A* pairwise aligner. I was doing this on a laptop, which likely has different characteristics from CPUs in a typical server rack. All the programs I run are single threaded.

Hardware

Do not run while charging the laptop: Charging makes the battery hot and causes throttling. Run either on battery power or with a completely full battery to prevent this.
Disable hyperthreading: Completely disable hyperthreading in the BIOS. Multiple programs running on the same core may fight for resources.

CPU settings

Pin CPU frequency: CPUs, especially laptops, have turboboost, (thermal) throttling, and powersave features. Make sure to pin the CPU core frequency low enough that it can be sustained for long times without throttling.
In my case, the performance governor can fix the CPU frequency. The base frequency of my CPU is 2.6GHz, so that’s where I pinned it.

Motivation

Thu, 28 Apr 2022 23:22:00 +0200

It’s not the need for faster software that motivates; it’s the mathematical discovery that needs sharing.

Variations on the WFA recursion

Sun, 17 Apr 2022 03:14:00 +0200

Table of Contents

Gap open
Gap close
Symmetric alternatives
Another symmetry
Conclusions

cross references: BiWFA GitHub issue

In this post I will explore some variations of the recursion used by WFA/BiWFA for the affine version of the diagonal transition algorithm. In particular, we will go over a gap-close variant, and look into some more symmetric formulations.

Gap open

WFA (Marco-Sola et al. 2020) introduces the affine cost variant of the classic diagonal transition method. Let us call it a gap-open variant, because the gap-open cost $o$ is payed when opening the gap, that is, when jumping from the $M$ layer to the $I$ or $D$ layer.

A survey of exact global pairwise alignment

Fri, 01 Apr 2022 00:00:00 +0200

Note: This is a living document, and will likely remain so for a while. Feel free to suggest missing papers or make a pull request.

AStarix

Fri, 12 Nov 2021 13:05:00 +0100

Papers

AStarix is a method for aligning sequences (reads) to graphs:

Input

A reference sequence or graph
Alignment costs $(\Delta_{match}, \Delta_{subst}, \Delta_{del}, \Delta_{ins})$ for a match, substitution, insertion and deletion
Sequence(s) to align

Output

An optimal alignment of each input sequence

The input is a reference graph (automaton really) $G_r = (V_r, E_r)$ with edges $E_r \subseteq V_r\times V_r\times \Sigma$ that indicate the transitions between states.

Neighbour joining

Fri, 12 Nov 2021 11:57:00 +0100

Neighbour joining (NJ, paper) is a phylogeny reconstruction method. It differs from UPGMA in the way it computes the distances between clusters.

This algorithm first assumes that the phylogeny is a star graph. Then it finds the pair of vertices that when merged and split out gives the minimal total edge length $S_{ij}$ of the new almost-star graph. (See eq. (4) and figure 2a and 2b in the paper.) \[ S_{i,j} = \frac1{2(n-2)} \sum_{k\not\in \{i,j\}}(d(i, k)+d(j,k)) + \frac 12 d(i,j)+\frac 1{n-2} \sum_{k<l,\, k, l\not\in\{i,j\}}d(k,l). \] After subtracting the sum of all pairwise distances (which is a constant) and multiplying by $2(n-2)$, we obtain the familiar \[ Q(i, j) = (n-2) d(i, j) - \sum_{k=1}^n d(i, k) - \sum_{k=1}^n d(j, k). \] Thus, we merge the two vertices that minimize $Q$. The distance from the merging of vertices $i$ and $j$ to each other vertex $k$ is $d_{(i-j)k} = (d_{i,k} + d_{j,k})/2$.

UPGMA

Thu, 28 Oct 2021 11:56:00 +0200

Unweighted pair group method with arithmetic mean (UPGMA) is a phylogeny reconstruction method.

Input: Matrix of pairwise distances
Output: Phylogeny
Algorithm: Repeatedly merge the nearest two clusters. The distance between clusters is the average of all pairwise distances between them. When merging two clusters, the distances of the new cluster are the weighted averages of distances from the two clusters being merged.
Complexity: $O(n^3)$ naive, $O(n^2 \ln n)$ using heap.

RTFE

Fri, 22 Oct 2021 15:16:00 +0200

Read The F*ing Error

When you complain about an error without reading it first.
When you assume you understand the problem halfway through reading the error, and only after more debugging you realize you failed to read properly.

1st law of Procrastination

Fri, 22 Oct 2021 11:46:00 +0200

Important deadlines require important procrastination.

Data should be reviewed

Fri, 22 Oct 2021 11:41:00 +0200

Experiments and their analysis should be reproducible, and all data/figures in a paper should be reviewable. Pipelines (e.g. snakemake files) to generated them should be attached to the paper.

I’ve asked for automated scripts to reproduce test data on 3+ github repositories now, and got a satisfactory answer zero times:

WFA: https://github.com/smarco/WFA/issues/26

Link to a datadump on the block-aligner repository. Good to have actual data, but exactly how this data was created is unclear to me.

Hugo and ox-hugo

Thu, 14 Oct 2021 00:00:00 +0200

Here’s the customary how I made this site using X post.

This site is built using Hugo and ox-hugo.

The source is written in Org mode, which is converted to markdown by ox-hugo. To get started yourself, check out the initial commit of the source repository and build from there.

Some notes:

I’m using the Hugo-coder theme.
Since the conversion from Org to markdown is done using an Emacs plugin, the emacs folder contains a simple init.el to import ox-hugo and a function to export all *.org files in the repository apart from those inside the emacs folder itself.
The makefile contains the build-content rule to call the conversion, and build-site to invoke Hugo. Just running make will do both of these and serve the site locally.

Hello, World!

Wed, 13 Oct 2021 00:00:00 +0200

1

print("Hello, World!")

1

std::cout << "Hello, World!" << std::endl;

Spaced k-mer and assembler methods

Wed, 14 Jul 2021 00:00:00 +0200

Table of Contents

Spaced $k$-mers
Minimap
SPAdes
MUMmer4
BLASR
Bowtie 2
Patternhunter
Spaced seeds improve $k$-mer-based metagenomic classification
LoMeX
Meeting notes

Concepts:

Mapping Map a sequence onto a reference genome/dataset
Assembly Build a genome from a set of reads
- de novo (implied): without using a reference genome
- Otherwise just called mapping

Typical complicating factors:

read errors
non-uniform coverage
insert size variation
chimeric reads (?)
bireads
non-uniform read coverage (as in metagenomics, i.e. multi cell assembly)

Spaced $k$-mers

Also called

Ideas for assembling [long] reads

Fri, 09 Jul 2021 00:00:00 +0200

\[ \newcommand{\vp}{\varphi} \newcommand{\A}{\mathcal A} \newcommand{\O}{\mathcal O} \newcommand{\N}{\mathbb N} \newcommand{\Z}{\mathbb Z} \newcommand{\ed}{\mathrm{ed}} \newcommand{\mh}{\mathrm{mh}} \newcommand{\hash}{\mathrm{hash}} \]

Here is an idea for an algorithm to assemble long reads.

Go over all sequences and sketch their windows using the Hamming distance preserving sketch method described here. This method may need some tweaking to also work with an indel rate of around 10%.
Let’s say we find a pair of matching windows between reads $A$ and $B$ starting at positions $i$ and $j$. This indicates that $A$ and $B$ may be related with an offset of $j-i$.

Hamming Similarity Search

Thu, 08 Jul 2021 00:00:00 +0200

Table of Contents

Background
Introduction
Hamming Similarity Search
Phylogeny reconstruction
- Running the algorithm
Assembly

\[ \newcommand{\vp}{\varphi} \newcommand{\A}{\mathcal A} \newcommand{\O}{\mathcal O} \newcommand{\N}{\mathbb N} \newcommand{\ed}{\mathrm{ed}} \newcommand{\mh}{\mathrm{mh}} \newcommand{\hash}{\mathrm{hash}} \]

Background

Quickly finding similar pieces of DNA within large datasets is at the core of computational biology. This has many applications:

Alignment: Given two pieces of related DNA, align them to find where mutations (i.e. substitutions, insertions, or deletions) occur.

Detached fullscreen in Sway

Fri, 02 Jul 2021 00:00:00 +0200

Xrefs: PR for Sway | AUR package sway-inhibit-fullscreen-git

Once upon a time, Chromium had a bug where using $mod+f in i3 to fullscreen the Chromium window changed the window to occupy the entire screen, but didn’t actually make Chromium enter full screen mode. According to some, those¹ were² the³ good⁴ days⁵^,⁶. Watching 4 YouTube streams in parallel was still possibly, back in those days:

Without patches, the best we can do nowadays⁷ is the following

Open source contributions

Fri, 02 Jul 2021 00:00:00 +0200

Table of Contents

My aur packages
Some issues I reported/fixed

My aur packages

List on aur.archlinux.org

bapctools-git: BAPCtools is used for developing ICPC style programming contest problems.
feh-preload-next-image-git: Branch of Feh that loads the next image to speed up browsing images in a remote directory.
i3-focus-last-git: Window switcher for i3/sway.
python-pyexiftool-nocheck: the original python-pyexiftool is outdated, orphaned, and still depends on python2.
sway-inhibit-fullscreen-git: Sway branch that adds the inhibit_fullscreen toggle command. Bind this to e.g. $mod+Shift+f to disconnect the Sway full screen status from the application full screen status. Used to e.g. watch YouTube videos in Chromium in full screen mode, but in a window that is only a quarter of the screen.

Some issues I reported/fixed

vimium/issues/3557 after searching and hitting Enter, vimium freezes and needs a mouse click to unfreeze. n and N do not work.
- fix here: add a missing return statement after many hours of debugging
vimium/issues/3844 Bug: Last characters dropped when using custom search engine
- duplicate of issues/3567
- PR pull/3846
sway/pull/6286 Add anything option to Grimshot to allow selecting either a window, output, or manual area.
mako/issues/358 Feature Request: Dismiss all notifications with given category
slurp/issues/87 Moving the mouse to a different output does not always grey out rectangles
slurp/issues/86 Clicking outside all rectangles should cancel
sway/issues/6299 Unexpected next_on_output behaviour with workspace_auto_back_and_forth
- follow-up PRs: 6332: cleanup, 6334: more cleanup, 6335: .clang-format
Signal-Desktop/issues/5307 ctrl-2 broken under Wayland

Powersearch with Vimium

Fri, 02 Jul 2021 00:00:00 +0200

Related posts: Dark mode with Vimium

Vimium (Github, Chromium extension) is not only a great way to navigate webpages; it’s also a great help to quickly search many webpages.

I am using it many times a day to search for just the documentation I need. Some of the search engines I have configured:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21


# Documentation
archwiki: https://wiki.archlinux.org/index.php?search=%s ArchWiki
aur: https://aur.archlinux.org/packages/?K=%s AUR
cpp: https://en.cppreference.com/mwiki/index.php?search=%s CppReference
github: https://github.com/search?q=%s GitHub
hoogle: https://www.haskell.org/hoogle/?hoogle=%s Hoogle
oeis: https://oeis.org/search?q=%s OEIS
python: https://docs.python.org/3.7/search.html?q=%s Python
wiki: https://en.wikipedia.org/w/index.php?title=Special:Search&search=%s Wikipedia

# Translation
nlen: https://translate.google.nl/#nl/en/%s Dutch -> English
ennl: https://translate.google.nl/#en/nl/%s English -> Dutch
deen: https://translate.google.nl/#de/en/%s German -> English
ende: https://translate.google.nl/#en/de/%s English -> German

# Other
imdb: https://www.imdb.com/find?q=%s IMDB
# `pe 123` to jump straight to problem 123.
pe: https://projecteuler.net/problem=%s Project Euler
kattis: https://open.kattis.com/search?q=%s Kattis

Any of these can now be activated by typing their shorthand in the omnibar (which is usually activated by pressing o or O). Vimium even has live preview support for Wikipedia and Google search, as well as some others.

Wayland utilities

Fri, 02 Jul 2021 00:00:00 +0200

This post goes over some useful utilities I have been using on my Wayland system.

Screen brightness: `light`

Light is a nice tool to manage screen and keyboard brightness.

Install light
Add your user to the video group: usermod -aG video <user>

I really like the light -T flag, which multiplies the current brightness by some value. This way you can have fine grained control both for very low and very high brightness values. To prevent yourself from decreasing the brightness all the way to 0, you can run e.g. light -N 0.2 to set a minimium screen brightness of 0.2. This value will be stored in your config directory under ~/.config/light/.

Browsing in the dark with Vimium and Dark Reader

Thu, 01 Jul 2021 00:00:00 +0200

Table of Contents

Chromium theme
Dark Reader
Vimium

Let’s quickly go over some settings you can change for a better dark mode experience in Chromium.

Chromium theme

First of all, you can make Chromium itself use a dark theme. This will ensure both a dark tab bar and nice dark settings pages. As explained here, you’ll need to change the following:

Run chromium with the flags

Window switching in Sway

Thu, 01 Jul 2021 00:00:00 +0200

Sway has many commands for switching the active workspace and focused window. However, I find that most of my window switching comes down to a few simple commands that focus a specific application, or open it first when it has no open windows yet. E.g.:

$mod+s: open and/or focus slack
$mod+i: open and/or focus signal
$mod+m: open and/or focus emacs
$mod+c: open and/or focus chromium

In addition to this, some apps like emacs have a separate $mod+Shift+m command that always opens a new window/instance.

Clean your homedir with XDG Base Dir

Wed, 30 Jun 2021 00:00:00 +0200

Xrefs: XDG specification | ArchWiki | Reddit post

In case you are, like me, tired of applications polluting your homedir with config and data files, the XDB Base Directory Specification (ArchWiki) has your back.

You probably saw the ~/.config directory already, and in fact, many programs can be told to use this directory instead of polluting your homedir. The ArchWiki page has a list of many applications and which environment variables need to be set to change the location of their configuration.

Emacs Doom

Wed, 30 Jun 2021 00:00:00 +0200

Table of Contents

Configuration
- init.el
- config.el
Running as server and client
Wayland
Useful commands
Emacs as mail client

Install Doom Emacs as explained in the readme.

Alongside it, you’ll want to install ripgrep and fd for better search integration, and possibly ttf-font-awesome for better icons.

Configuration

Instead of the default ~/emacs.d/ and ~/doom.d/ config directories, you can also use ~/.config/emacs/ and ~/.config/doom/.

init.el

My init.el is mostly default, and enables the languages I regularly use, with LSP support where possible:

Environment variables done once

Wed, 30 Jun 2021 00:00:00 +0200

Xrefs: GitHub issue

One problem I had with my Sway setup is that setting environment variables in my config.fish (the Fish equivalent to .bashrc or .zshrc) is not always sufficient.

In particular, I need my environment variables to be available in at least the following places:

my Fish shell,
applications launched from Sway (e.g. using keybindings),
applications launched as a systemd service (e.g. the Emacs server daemon).

Setting variables in the shell profile has the problem that they are not picked up by systemd services. Another option seems to be ~/.pam_environment, but this is deprecated.

28000x speedup with Numba.CUDA

Mon, 24 May 2021 00:00:00 +0200

Xrefs: r/CUDA, Numba discourse

X1 Extreme Gen 3 - Migrating to Wayland

Sun, 16 May 2021 00:00:00 +0200

I got a new laptop, so this felt like the right time to migrate to Wayland.

Delta

what	before	after
hardware
laptop	Asus UX501V	Lenovo X1 Extreme Gen 3
CPU	i7-6700HQ	i7-10750H
GPU	GTX 960M	GTX 1650
RAM	16GB	64GB
OS
bootloader	Grub	EFISTUB
OS	Windows + Arch dualboot	Windows + Arch dualboot
networking	netctl	systemd-networkd
dns/dhcp	dhcpcd	systemd-resolved
wifi	wpa_supplicant	iwd
Wayland
display/login manager	-	-
display server	X	Wayland
window manager	i3	Sway
bar	i3blocks	waybar
backlight	xbacklight	light
night mode	redshift	gammastep
clipboard	-	wl-clipboard, clipman
program launcher	rofi	rofi [wayland]
password finder	rofi-pass	rofi-pass-git
key remapping	setxkbmap, xcape, xmodmap	interception-tools
Tools
terminal emulator	urxvt	foot
shell	zsh	fish
shell highlighting	zsh-syntax-highlight	-
environment variables	.zshrc	environment.d
text editor	vim	emacs doom
aur helper	packer	yay
directory usage	du	dust
password manager	pass	pass
search tool	ag [silver searcher]	ag + ripgrep + fd
file browser	terminal	terminal + ranger
calculator	qalc	qalc, rofi-calc
notification deamon	dunst	mako
image viewer	feh	feh + sxiv
music	-	spotifyd + spotify-tui + waybar custom/media
screenshot	teiler+scrot	grimshot (from sway) + custom upload wrapper
wallpaper	feh	sway
messenger	slack, signal-desktop	slack, signal-desktop [wayland]
system monitor	htop	htop
remote shell	ssh	ssh
browser	chromium	chromium
pdf viewer	zathura	zathura
vpn	-	openconnect

see also the i3 -> sway migration guide.

SE Endurance: Early game

Mon, 26 Apr 2021 00:00:00 +0200

Xrefs: Reddit

This is the start of a series of posts on our (philae, winston) play through Factorio with the Space Exploration mod.

After lots of struggling, we recently finished our first SE world after 624 in-game hours. Since this was also our first/second Factorio world, the start was very inefficient and we learned a lot of things along the way. In this new map, which we call Endurance (after the interplanetary spaceship in Interstellar), we will apply what we learned, and share it with the world :)

Hashcode 2021 Finals

Sat, 24 Apr 2021 00:00:00 +0200

Xrefs: Problem | Scoreboard
Team: cat /dev/random | grep "to be or not to be"
Who: Jan-Willem Buurlage, Ragnar Groot Koerkamp, Timon Knigge, Abe Wits
Score: 274253375
Rank: 19 of 38

Not good.
Not bad.
Definitely ugly.

Linkerrijtje (aka top half).

I would have liked to write that I’m happy with the result, but to be fair–I’m not. Just the fact that I can’t sleep and feel the need to write this in the middle of the night surely is indication of this. Plenty of things could have gone better, and there are so many things we (I) could have done differently to actually break that elusive 300M barrier (and potentially get that top 10 place we were hoping for), that I’ll definitely be sad for a little while.

Hashcode 2021: A lucky ride

Mon, 01 Mar 2021 00:00:00 +0100

Xrefs: Problem | Scoreboard | Codeforces announcement, this blog | Hacker News
Team: cat /dev/random | grep "to be or not to be"
Who: Jan-Willem Buurlage, Ragnar Groot Koerkamp, Timon Knigge, Abe Wits
Score: 10282641
Rank: 16

Since we did quite well, here is a write-up of our participation in Hashcode 2021.

Prep

All four of us had previously participated in Hashcode, but this was the first time in the current composition. Since we estimated our chances of getting through to the finals to be more than nothing, we decided to practice some previous Hashcode problems. Not all test sessions were equally successful, but we did manage to get a good division of work: while I start by immediately writing the IO Input and Output classes, and the Output::score() function, the others always start with reading the statement, analysing the testcases, and writing at least one greedy solution. This already is a big step up from previous years/teams, where usually everybody would write the IO themselves.