In Reply to: RE: Yep, Faulty Logic posted by theaudiohobby on April 26, 2008 at 23:24:40
There are serious methodological reasons for considering single blind tests inferior to double blind tests, and the Clever Hans problem is a perfect example. Single blind tests are particularly unconvincing to others who are not personally familiar with all of the players. That is not to say they are worthless, but they are unlikely to convince third parties, as is needed for an art or science to progress. But faulty logic is possible with double blind tests as well. As commonly conducted by audio hobbyists, double blind tests are reliable only when they conclude that something was heard; as generally conducted, they lack the statistical power needed to conclude that nothing was heard.
Experimental Science needs the support of Mathematics, particularly mathematical theories of causality that justify the use of statistics. When conducting a sequence of experiments, one needs to start with a model of what is possible and what is likely. One then conducts the experiments, applies statistical methods, and refines the model.
A simple causal model suffices in the case of the successful amateur ABX listening test. One makes the highly plausible assumption that the source of randomness is unknown to and uninfluenced by the test subject, affecting the subject only through the physical mechanisms of hearing. Then, when the statistics are analyzed, one concludes that the subject heard the stimulus. (If the test setup were poorly designed, for example if it displayed the random number on a screen, then this conclusion would be invalid. Note also that even if the test were done perfectly, it would fail to convince a person who didn't buy into the underlying model. For example, an audio skeptic who believed in ESP (!) might conclude that a successful test subject used his psychic powers and not his ears.)
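To make the successful case concrete, the usual analysis treats the run as a one-sided binomial test against chance guessing. A rough sketch in Python, using a made-up score of 14 correct out of 16 rather than data from any actual test:

from math import comb

def abx_p_value(correct, trials):
    # Chance that pure 50/50 guessing scores at least this well.
    return sum(comb(trials, k) for k in range(correct, trials + 1)) / 2 ** trials

print(abx_p_value(14, 16))  # ~0.002

A p-value that small is why a well-run positive result is convincing, at least to anyone who accepts the underlying causal model.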
Now consider a slightly more complex model. Suppose that a sound is near the threshold of audibility and a subject can detect its presence or absence 5% better than chance. In other words, if the sound is present the subject will say he heard it 55% of the time and say he didn't hear it 45% of the time, while if the sound is absent he will say he heard it 45% of the time and say he didn't hear it 55% of the time. Can the subject hear the sound? I would say yes, albeit just barely.
What will happen if a traditional 16-trial ABX test is run? It will almost certainly fail to show that the subject heard the sound. However, it would be an abuse of statistics to conclude that the subject did not hear the sound.
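As a rough illustration, assume the common passing criterion of 12 or more correct out of 16, which keeps the false-positive rate under 5%, and run the same binomial arithmetic for the 55% listener:

from math import comb

def binom_tail(n, p, k_min):
    # P(at least k_min successes in n trials, each succeeding with probability p)
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(k_min, n + 1))

print(binom_tail(16, 0.50, 12))  # ~0.038: a pure guesser rarely passes
print(binom_tail(16, 0.55, 12))  # ~0.085: the 55% listener also fails over 90% of the time

A failed 16-trial test is exactly what this model predicts, whether or not the subject can hear the sound; that is what a lack of statistical power means.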
When experiments are run which have marginal results, it is not uncommon for people with different underlying causal models to reach opposing conclusions. In many cases, Science progresses only after scientists with outdated models die and are replaced by a newer generation.
Tony Lauck
"Perception, inference and authority are the valid sources of knowledge" - P.R. Sarkar
Follow Ups:
Well, a poor test is a poor test, so it does not add much to the discussion to say that a poor dbt will produce unreliable results. That's equivalent to saying that a well-driven Yugo will outsprint a poorly driven Ferrari 550 Maranello; well, yes, it's obvious. At any rate, the dbts cited in this thread were professionally conducted, and the conclusions are pretty much consistent with prevailing psychoacoustic theory.
Music making the painting, recording it the photograph
"Well, a poor test is a poor test, so it does not add much to the discussion to say that a poor dbt will produce unreliable results."
We are agreed that poor tests are poor tests. Unfortunately, poor tests are often cited in this forum as proving things they don't, and, more unfortunately, poor tests are sometimes published in refereed journals. There are a few practical problems that have kept me from finding the "good" tests to see if perhaps they can be extended (or possibly refuted):
(1) Greedy journals make the literature expensive to read for those of us who live in rural America and do not have easy access to technical libraries.
(2) Journals generally fail to fully describe models and experimental procedures and rarely disclose the underlying raw data.
(3) I lack a concise bibliography of the "good" tests, and in light of the other difficulties I find it excessively burdensome to perform an ab initio literature search.
I have a longstanding interest in audio epistemology and would pursue this in more detail were I given a good starting point. However, in this day of desktop publishing and effectively free communication, I consider most journal publishers to be in the same category as the RIAA, namely parasites. I am reluctant to pay good money out of my pocket to read an article unless it is likely to be relevant. I have less reluctance to spend money on textbooks or monographs.
Any suggestions would be helpful.
Tony Lauck
"Perception, inference and authority are the valid sources of knowledge" - P.R. Sarkar
"I have a longstanding interest in audio epistemology and would persue this in more detail were I to be given a good starting point. However, in this day of desktop publishing and effectively free communication, I consider most journal publishers in the same category as the RIAA, namely parasites. I am reluctant to pay good money out of my pocket to read an article unless it is likely to be relevant. I have less reluctance to spend money on text books or monographs.
Any suggestions would be helpful."
Hi Tony,
If you're interested in AES articles, feel free to shoot me an email, and I can provide, err, "more information" ;-).