9.3
Harmonic series
(Naturtonreihe)
Fundamental frequency $$f$$
$$f, 2f, 3f, 4f, 5f, \ldots$$
9.2
Octave equivalence
$$\ldots \overset{\wedge}{=} A_4 \overset{\wedge}{=} A_5 \overset{\wedge}{=} A_6 \overset{\wedge}{=} \ldots$$
$$\ldots \overset{\wedge}{=} 440 Hz \overset{\wedge}{=} 880 Hz \overset{\wedge}{=} 1760 Hz \overset{\wedge}{=} \ldots$$
9.4
Tones and semitones
$$f + 2f + 3f + 4f + 5f + \ldots$$
$$f + \ldots + 1.25f + \ldots + 1.5f + \ldots + 1.75f + \ldots + 2f$$
12 semitones: A, A#, B, C, C#, D, D#, E, F, F#, G, G#
9.5
Tuning
Equal temperament (12-TET)
$$f_i = 440\ Hz \cdot 2^{\frac{i}{12}}, i \in \mathbb{Z}$$
Tone |
$$A_4$$ | $${A\sharp}_4$$ | $$B_4$$ | $$C_4$$ | $${C\sharp}_4$$ |
$$D_4$$ | $${D\sharp}_4$$ | $$E_4$$ | $$F_4$$ | $${F\sharp}_4$$ |
$$G_4$$ | $${G\sharp}_4$$ | $$A_5$$ |
Hertz |
440 = 440 |
466.16 | 493.88 | 523.25 |
554.37 ≈ 550 |
587.33 | 622.25 |
659.26 ≈ 660 |
698.46 | 739.99 | 783.99 | 830.61 |
880 = 880 |
10.7
Roughness
Given two complex tones
$$\color{orange}{T_1 = \{(a_1, f_1), (a_2, f_2), (a_3, f_3), \dots \}},$$
$$\color{lime}{T_2 = \{(a_4, f_4), (a_5, f_5), (a_6, f_6), \dots \}}$$
$$Roughness(T_1, T_2) = \frac{\sum\limits_{\color{orange}{(a_i, f_i) \in T_1}} \sum\limits_{\color{lime}{(a_j, f_j) \in T_2}} \color{orange}{a_i} \cdot \color{lime}{a_j} \cdot Roughness(\color{orange}{f_i}, \color{lime}{f_j})}{\sum\limits_{\color{orange}{(a_i, f_i) \in T_1}} \sum\limits_{\color{lime}{(a_j, f_j) \in T_2}} \color{orange}{a_i} \cdot \color{lime}{a_j}} \color{gray}{\in [0, 1]}$$
12.1
“Techniques for
automatic dissonance suppression
in harmonic mixing”
Master thesis by Vittorio Maffei (2014-2015)
12.2
12.3
Preprocessing
Tracks converted to mono, 44,100 Hz
Tempo changed to 120 bpm
8 second samples = 16 beats
12.4
Short time Fourier transform (STFT)
Blackman window
4096 window size, 256 hop size
4096 bins, 5000 Hz max frequency
→ 20 strongest partials extracted
12.5
Residual extraction
Split signal into sinusoids and residuals
12.6
Temporal averaging
Averaged to 16th notes
1379 windows → 64 windows
12.7
Optimal pitch-shift
12.8
Dissonance suppression
12.9
Partials suppression