* Lect 17 & 18
  - GWAS -> Generalized Least Squares
  - Sequence of GLS
  - Optimization: Code motion -> reduction of complexity
  - Optimization: BLAS2 to BLAS3 -> efficiency
  - I/O: double buffering
    - https://arxiv.org/pdf/1210.7325v1.pdf
  - Parallelism: multithreaded BLAS + explicit threading
    - https://arxiv.org/pdf/1210.7683v1.pdf
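
A rough sketch of the two optimization items (code motion, BLAS2 to BLAS3), not the lecture's actual code. It assumes the usual GWAS setup: a sequence of GLS problems b_i = (X_i^T M^-1 X_i)^-1 X_i^T M^-1 y in which M and y are shared and X_i = [X_L | x_i] differs only in its last column (one SNP per problem). All names (`gls_naive`, `gls_hoisted`, `X_L`, `X_R`) are made up for illustration, and NumPy's generic `solve` stands in for the triangular solves (trsv/trsm) a tuned implementation would call.

```python
import numpy as np


def gls_naive(M, X_L, X_R, y):
    """Solve every GLS problem from scratch (the baseline)."""
    results = []
    for i in range(X_R.shape[1]):
        X_i = np.column_stack([X_L, X_R[:, i]])
        Minv_X = np.linalg.solve(M, X_i)  # O(n^3) work repeated for every i
        Minv_y = np.linalg.solve(M, y)    # identical for every i, yet recomputed
        results.append(np.linalg.solve(X_i.T @ Minv_X, X_i.T @ Minv_y))
    return np.array(results)


def gls_hoisted(M, X_L, X_R, y):
    """Same results, with loop-invariant work moved out of the loop."""
    m = X_R.shape[1]
    p = X_L.shape[1] + 1

    # Code motion: factor M = L L^T once and whiten the shared operands once.
    L = np.linalg.cholesky(M)
    y_t = np.linalg.solve(L, y)
    XL_t = np.linalg.solve(L, X_L)

    # BLAS2 -> BLAS3: whiten all SNP columns in one matrix-matrix operation
    # instead of one matrix-vector solve per SNP.
    XR_t = np.linalg.solve(L, X_R)

    # Pieces of the small p x p normal equations, computed for all i at once.
    S_TL = XL_t.T @ XL_t                       # fixed top-left block
    r_T = XL_t.T @ y_t                         # fixed top of the right-hand side
    S_BL = XR_t.T @ XL_t                       # row i: bottom-left block of problem i
    S_BR = np.einsum("ij,ij->j", XR_t, XR_t)   # entry i: x_i^T M^-1 x_i
    r_B = XR_t.T @ y_t                         # entry i: x_i^T M^-1 y

    results = np.empty((m, p))
    for i in range(m):
        # Only cheap O(p^2) assembly and an O(p^3) solve remain in the loop.
        S = np.empty((p, p))
        S[:-1, :-1] = S_TL
        S[-1, :-1] = S_BL[i]
        S[:-1, -1] = S_BL[i]
        S[-1, -1] = S_BR[i]
        r = np.append(r_T, r_B[i])
        results[i] = np.linalg.solve(S, r)
    return results
```

With n samples and m SNPs, the baseline pays O(n^3) per problem, while the hoisted version pays O(n^3) once for the Cholesky factorization plus O(n^2) per SNP inside a single matrix-matrix solve. In an out-of-core setting `X_R` would arrive in blocks, which is where the double-buffering item comes in: the next block is read while the current one is being processed.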