It’s not that hard to grasp I don’t think. If you understand graphs of soundwaves, it’s literally just the wave scratched into the plastic. The movement of the needle dictates the movement of the speaker membrane which results in the same movement in your eardrum. Which is what you percieve as sound.
What I don’t get, personally, is how this one scratched-in groove wave can contain a bassline, a melody and a singing voice and they all can be differentiated coming out of the speaker.
How speakers work in general is just black magic to me, actually.
An easier way to understand it, without knowing the math, is to know how it’s made. You play audio into a very similar device and it’s needle scratches the grooves. When you then have a needle pick up the grooves it’s moving the exact same way the needle was forced to move by the original.
It’s similar to how a speaker and a microphone are basically the same device. If you take a speaker and plug it into a microphone input, it still works (though they’re tuned differently so it’s not as good). A microphone has a crystal vibrate, which creates an electric signal. If you play that electric signal into a crystal it vibrates and creates the same sound.
There’s no math or anything being done for this to work. It’s purely mechanical. It’s just a copy of what the needle did when sound was played into it, so another needle running through it recreates the same sound. You can use math to represent it, but none is being done by the device (other than just the laws of physics).
Basically any wave can be created by adding together individual frequencies, and with some fancy math it’s possible to go the other way with a Fourier transform and get how loud every frequency is (like is displayed in a spectrogram).
I think the real black magic is in how our ears and brains can decode the mess of information coming in and identify meaningful patterns.
A nice related bookmark I have saved. There’s also interactive pert with a slider where you can draw a wave and then add more and more sinusoids to get closer and closer. And of course the square wave, with infinite sinusoids (theoretically). https://www.jezzamon.com/fourier/
Speakers do the simplest thing possible and literally just vibrate. A recording being played literally just recreates a recorded vibration. It’s a tiny choreography that your ears are incredibly sensitive for.
All the fancy stuff happens in our brains, after our ears has split up the sound around us into different ranges of frequencies (you can think of the hairs in the inner ears as tuning forks). We learn to recognize which frequencies goes together, and then we learn how the frequencies from multiple sources can overlap, and we learn what it all means
The real crazy part is how something as simple as sound can carry so much information and how reliably our brains can tell it all apart and make sense of it
But all sounds are vibration. If you capture the vibration, you capture all of the sound. The “different sounds” are all a single pattern of vibration; it’s the brain and inner ear that decodes the vibration into separate sounds. And hence it can also be difficult to do, depending on what the sounds are.
All the sounds get mixed together as they approach you (as they compress the same air), by the time it gets to your ear it can be represented by one complex wave.
If you can build up intuition around Fourier decomposition I think it gets much easier to understand.
Multiple things going on at the same frequency are indistinguishable (up to a phase). Lots of stuff going on at different frequency can be separated. Light also has frequency (color) and volume (intensity)—it may be more intuitive to conceptualize in this way.
Ironically, I work a lot with Fourier Transform. Still feels like magic. I even taught it! I’m trying to develop more intuition about it (vs hard knowledge)
Yeah, waves add. Which, well they add from the center which looks weird and bumpy. What’s more amazing is how good our ears are at picking out differences (it’s like 100x more sensitive to differences than other senses) so it can tell what all those individual waves would be so we can still hear the guitar vs drums vs bass vs vocals when it’s all one wave combined.
It’s not that hard to grasp if you read up a bit. You are probably born early 1900’s and have never heard of stereophonic recordings. But fear not!! What you are seeing is left + right channel (mono). The left - right channel is encoded vertically. So your left channel is mono + vertical divided by 2, and the right is (mono - vertical) divided by 2.
It’s not that hard to grasp I don’t think. If you understand graphs of soundwaves, it’s literally just the wave scratched into the plastic. The movement of the needle dictates the movement of the speaker membrane which results in the same movement in your eardrum. Which is what you percieve as sound.
What I don’t get, personally, is how this one scratched-in groove wave can contain a bassline, a melody and a singing voice and they all can be differentiated coming out of the speaker.
How speakers work in general is just black magic to me, actually.
Exactly. It makes sense to me conceptually, but it still blows my mind
An easier way to understand it, without knowing the math, is to know how it’s made. You play audio into a very similar device and it’s needle scratches the grooves. When you then have a needle pick up the grooves it’s moving the exact same way the needle was forced to move by the original.
It’s similar to how a speaker and a microphone are basically the same device. If you take a speaker and plug it into a microphone input, it still works (though they’re tuned differently so it’s not as good). A microphone has a crystal vibrate, which creates an electric signal. If you play that electric signal into a crystal it vibrates and creates the same sound.
There’s no math or anything being done for this to work. It’s purely mechanical. It’s just a copy of what the needle did when sound was played into it, so another needle running through it recreates the same sound. You can use math to represent it, but none is being done by the device (other than just the laws of physics).
So there’s this thing called a Fourier series…
Basically any wave can be created by adding together individual frequencies, and with some fancy math it’s possible to go the other way with a Fourier transform and get how loud every frequency is (like is displayed in a spectrogram).
I think the real black magic is in how our ears and brains can decode the mess of information coming in and identify meaningful patterns.
A nice related bookmark I have saved. There’s also interactive pert with a slider where you can draw a wave and then add more and more sinusoids to get closer and closer. And of course the square wave, with infinite sinusoids (theoretically).
https://www.jezzamon.com/fourier/
This reminds me, Bartosz Ciechanowski has an explainer about the whole sound thing.
That’s because it doesn’t, your brain does
Speakers do the simplest thing possible and literally just vibrate. A recording being played literally just recreates a recorded vibration. It’s a tiny choreography that your ears are incredibly sensitive for.
All the fancy stuff happens in our brains, after our ears has split up the sound around us into different ranges of frequencies (you can think of the hairs in the inner ears as tuning forks). We learn to recognize which frequencies goes together, and then we learn how the frequencies from multiple sources can overlap, and we learn what it all means
The real crazy part is how something as simple as sound can carry so much information and how reliably our brains can tell it all apart and make sense of it
This.
Well, the big ones anyway. The little ones use crystals.
Nobody knows what magnets are. The best science can tell us is that you shouldn’t drop them into a glass of water.
Why?
The US president told the navy that putting magnets in water could disable them, he also told Fox News twice that no one knows how magentas work.
https://www.snopes.com/fact-check/trump-nobody-knows-what-magnet-is-quote/
Here’s a video on the topic.
I was going to say, “I’m sure technology connections has done a video on this.”
This is from a video about headphones. His layman’s explanation at the timestamp is probably the best I’ve heard it told:
https://youtu.be/4-rgrSFrwrs?t=647
Even simpler to visualize: Its the movement of the membrane of the speaker/microphone turned into a physical line.
That explains just a tiny part. There are so many different sounds at the same volume and frequency
But all sounds are vibration. If you capture the vibration, you capture all of the sound. The “different sounds” are all a single pattern of vibration; it’s the brain and inner ear that decodes the vibration into separate sounds. And hence it can also be difficult to do, depending on what the sounds are.
All the sounds get mixed together as they approach you (as they compress the same air), by the time it gets to your ear it can be represented by one complex wave.
Like when you flatten all the layers in a graphics project.
No, because flattening the layers is a lossy operation but this is reversible.
Hmm good point!
This is a really interesting analogy. I perceive more than I see.
deleted by creator
If you can build up intuition around Fourier decomposition I think it gets much easier to understand.
Multiple things going on at the same frequency are indistinguishable (up to a phase). Lots of stuff going on at different frequency can be separated. Light also has frequency (color) and volume (intensity)—it may be more intuitive to conceptualize in this way.
Ironically, I work a lot with Fourier Transform. Still feels like magic. I even taught it! I’m trying to develop more intuition about it (vs hard knowledge)
I forget most of them, but I remember there being several concepts in calculus that straight up felt like magic once I finally understood them.
Yeah, waves add. Which, well they add from the center which looks weird and bumpy. What’s more amazing is how good our ears are at picking out differences (it’s like 100x more sensitive to differences than other senses) so it can tell what all those individual waves would be so we can still hear the guitar vs drums vs bass vs vocals when it’s all one wave combined.
Magic!
It’s not that hard to grasp if you read up a bit. You are probably born early 1900’s and have never heard of stereophonic recordings. But fear not!! What you are seeing is left + right channel (mono). The left - right channel is encoded vertically. So your left channel is mono + vertical divided by 2, and the right is (mono - vertical) divided by 2.