When reading much of the academic literature on virtual space, I often feel confused, as if I was missing some fundamental fact (or assumption). I think I've finally found out why.
In Mark B. N. Hansen's
New Philosophy for New Media, the author refers to vision as "
the most disembodied register of aesthetic experience" (p. 12), and cites touch and hearing as examples of "
the more embodied registers". The author offers no explanation nor proof of any sort, scientific or otherwise, to back up this claim. It is treated as a given, as self-evident physical fact.
But for me, vision is neither more nor less of an embodied process than audio. It is, like audio, one of the five physical senses. It doesn't help that Hansen refers to them as "registers" rather than senses (on the next page he uses "register" with a different meaning - it is this kind of elastic use of vocabulary that makes much new media theory so difficult to follow), but as far as I can see he simply means
senses as in the five
senses. Since they are
physical senses, how could one be more or less embodied than another?
I have yet to read the entire book, but it does appear to be an attempt to show that vision is actually an embodied process. He says this agrees with the interests of contemporary artists, whose work is aimed at "dismantling the supposed purity of vision and exposing its dirty, embodied underside". (It is interesting that he considers bodies to be dirty).
When I read this, I realised that a lot of new media theory adopts this assumption that vision is less embodied than audio. Since I don't share this assumption, and had never heard it explicitly stated, I didn't know it was happening, which explains why I find so much of the theory confusing.
I understand that the assumption is there now, but I still don't understand why.
Is it a question of framing? That the framed image (and this might explain why so much new media theory relates it to cinema - a relation I've always found tenuous) is viewed differently from 'realworld' vision. In other words, when viewing a picture or a video, do theorists discard the physical visual information surrounding the frame, seeing the framed image as existing separately from its physical context (ie, the room it's in)?
If so, my audio and performance background would explain why I don't share this attitude, since I think audio is always perceived within the sonic environment. While concert halls aim (but fail) to isolate the audio experience, most contemporary music either simply accepts it (rock music in a pub) or actively embraces it (ambient music).