Yeah I think parent poster confused a detail or two.
20hz-20khz is the human range, more or less. For most adults it's more like 15khz.
The 44.1khz sampling rate of CD audio was chosen because it can represent frequencies up to 22.05hz (44.1khz / 2) which safely covers the frequency range of human hearing with a bit of headroom for good measure.
The problem is that you have to implement an anti-aliasing filter that goes from 0dB at 20kHz to -96dB at 22kHz. This isn't a very practical filter to build, and will typically add all kinds of ripple to the top end.
The reason for 96kHz or even 192kHz sampling is that your filter can have a much gentler roll-off.
That's why they chose 44.1khz for redbook audio, right? Takes you up to 22khz, but most people can only hear up to 15khz or less, so you've got some headroom to play with if things aren't perfect between 15-22khz where our hearing isn't super sensitive anyway.
That's not really true. 2kHz is a pretty big range to build a filter in (and with digital you can trivially sample at 192khz FFT and delete the high frequency data)
And the studio can master the recording from a high sample rate recording using an enormous FIR filter, because latency and computing power are mostly irrelevant.
High sample rate equipment just allows you to listen to studio intermediate data without having to have it downsampled first.
But there are problems. For example, if there is any non-linearity in the equipment this can easily cause inaudible tones in the ultrasonic range to intermodulate with each other to produce tones that are audible.
But surely at 22khz you can only oscillate between 2 values if sampling at 44khz, whereas 2khz gets 20 values in the same 180 degrees of the wave? So doesn’t this mean you lose fidelity at higher frequencies?
A sine wave is a sine wave; it can't contain any "additional information" that would gain anything from getting encoded by more data points.
If it would, it wouldn't be a sine wave anymore, and you could decompose it into its component of phase and amplitude shifted sine waves, which will occupy all kinds of places in the frequency domain, many of them higher than your initial 22 kHz.
That's in fact what we mean when we say "a 22 kHz signal": An ensemble of all of these overlapping waves represented as a single signal. If you're sampling that with an ADC, you're not only sampling the the 22 kHz wave, but you're also sampling it – and that, but only that, can in fact be described with a single bit per ADC readout.
Looking at it from an information theoretical point of view, the Kolgomorov complexity of "a sine wave of 22 kHz and amplitude x" is pretty minimal – just from that sentence, you can perfectly recreate that signal with no sampling whatsoever, and one ADC readout tells you its amplitude and phase. It can carry a bandwidth of 0 Hz.
The magical thing about Nyquist is that this isn't a problem. Two values is enough to define a pure sine wave at 20khz, and all you need is the ability to represent sine waves at every frequency (per Fourier analysis).
This reaches the edge of my knowledge, but I think this is basically because the discrete values are 'slewed' (aka integrated aka low-pass filtered) into a continuous signal by the DAC.
No. Having the other 20 in-between values doesn't help improve the "accuracy" of the 2 khz wave. An x Khz wave is always an x Khz wave. That is to say the x Khz part of the signal is by definition the part of the signal which looks like sin([freq][time]2pi). Your soundwave is a sum of these Eg
a_i sin(2pitf)
where each a_i is the amount of a given frequency present. If every wave sample you have goes [1,-1,1,-1,1,-1...] then 100% of your wave is accounted for as a sin wave at the sampling rate. There is zero additional frequencies present with unknown amounts. If your wave samples went like [0,1,0,-1,0,1,0,-1...] the only frequency of sound present in the signal is the one at half the sampling rate. The whole wave is accounted for.
On a more physical level, its ok if your sample only sees +1/-1 every other sample, because when that electrical signal turns into pushing a magnet the magnet has to accelerate smoothly in accordance to Newtons laws. In other words, flipping the voltage between +-1 as a 40khz square wave becomes a perfect 20khz sine wave when it reaches the speaker.
20hz-20khz is the human range, more or less. For most adults it's more like 15khz.
The 44.1khz sampling rate of CD audio was chosen because it can represent frequencies up to 22.05hz (44.1khz / 2) which safely covers the frequency range of human hearing with a bit of headroom for good measure.