Now, however much we would like to believe that every problem and its solution is black and white, life really doesn’t work that way. And when we are talking about something that is invisible in itself (I mean sound, in case you were wondering), the shade chart between black and white becomes even more diffused. When something is invisible, the confusion when referring to it becomes acute. You might be sitting in the front rows of the venue at a Zakir Hussain concert and feeling a little uncomfortable with the overall volume whenever Ustadji is smacking the skin of the tabla daya with his powerful strokes, but the person sitting right beside you might feel that the high intensity volume is exactly what makes the concert exciting. Or, you might be unfortunate to find yourself behind the last barricade at some concert, some 120 feet from the stage, and feeling you can’t hear the vocals clearly. But just beside you an elderly gentleman feels this is absolutely the perfect volume, very close to how he listens to his radio at home. Now consider a small venue, which has 500 audience. If you ask each of them about their preference of how it should sound, you will get data that will be pretty much useless in coming to a logical conclusion about how to run the audio of the show. So then, how do we deal with this conundrum? How do we come up with a stable solution when the problem itself is a moving target? If the audience were asked to come up with a solution amongst themselves before the show starts, I am sure it would be something akin to our honourable ministers trying to decide on a matter in the Parliament.
But that is exactly the responsibility the sound engineer (or whoever is handling the sound) has to take for a live performance — deciding on behalf of a room of 500 people or a field of 50,000 how it should sound so everyone who paid for the tickets goes home happy. Needless to say, more often than not, he is criticised by a certain percentage of the audience after every show because remember, there is no common target of how it should sound that the whole audience will agree on.
About the clarity of the vocal, or any other instrument for that matter, you could argue why the volume can’t be increased and made more audible like you do on your earphones. You are partly right, it is just a matter of pushing the fader (the volume control for the vocal or other instruments) and making it clearer/louder. But if it was that easy, don’t you think this problem wouldn’t be so common and the audio person in charge would have done it, instead of inviting displeased comments from the audience? How much volume you can push for the clarity of vocals (or any mic-ed instrument) depends on a lot of factors, like how adequate the sound system is for that venue, how they have been placed and angled with respect to the mic and how loud the voice itself is. Imagine running a 1 tonne AC on your residential meter. It is going to run fine even on the hottest of summers as long as the room is adequately sized. But if you try to cool a room double the size of what this 1 tonne can cover, it will struggle. Conversely, if you add another AC for the other room on top of this 1 tonne one, you run the risk of frying your electric supply. You need to pay for a meter of higher load capacity. But we wouldn’t go and blame the AC manufacturer or the person responsible for installing the AC in our home in this scenario. The AC here is analogical to the sound system, the electric supply is analogical to the strength of the voice being sung into the mic and the room size is analogical to the size of the venue.
Let’s talk about the other facet of the problem — the overall volume distribution. Things might get a bit technical here onwards, so it might be more applicable for people who are involved in the organising of events where sound reinforcement is necessary. The general rule of thumb when talking about volume distribution is to have a sound system that can produce a non-distorted and full-bandwidth level of 110db SPL with a variance of + or - 3db between the front row and the last row of audience. Read that a couple of times if needed, it could be a little difficult to grasp but I will try and simplify it. What this means is, if you are standing in the middle of a venue of 50 feet by 30 feet, and you play a song to check, the sound system should be able to play it fine at a volume of 110db SPL in the middle of the venue. If you move to the first row with the song volume unchanged, the volume from the speakers should not be louder than 113db and at the last row not less than 107db. You can then be pretty confident that during the performance, the audience at the front won’t be complaining it’s too loud and the ones at the back complaining it’s too soft while people in the middle cannot understand what the other 2/3rd are whining about. So it is a kind of minimising the huge difference between the shades of black and white. Equality is never possible in audio transmission, it is always about choosing the best compromise.
Also, I mentioned the terms non-distorted and full-bandwidth, which means we cannot use speakers that can attain the 110db number but cannot reproduce sound with those qualities. An example of such a speaker is the image below.
A typical horn-only speaker that can easily reach high levels of SPL but severely lack in low distortion and full bandwidth though it's good enough just for transmission of speech |
A line array speaker system by JBL. Notice the 8 identical speaker boxes that make up the array. Pic credit: desch-audio.de |
More common speaker that you see around known as Point Source Speaker |