Here is a complexity theory perspective. Be warned that it may differ wildly from someone whose primary focus is logic.
I think the appropriate definition of a "visual proof" would boil down to giving an appropriate definition of what a verifier does with such a proof. Proof systems in complexity theory are measured by (a) how much the prover and verifier "interact", (b) the allowed lengths of potential proofs, and (c) the power of the verifier.
In the framework you are proposing, the prover simply gives the verifier a proof and walks away. So (a) is already determined. Also, the visual proof is presumably written on a small sheet of paper, so (b) is essentially determined (let's say the visual proof can be encoded in length that is at most a fixed polynomial in the length of the claim).
That leaves (c), which is where proof complexity gets interesting. It turns out that verifiers can be surprisingly weak and still verify the proof of any statement which has short proofs (where "short" is "fixed polynomial"). For example, if you require that proofs be written on a two-dimensional grid, then for every theorem with short proofs, there are proofs of the theorem which can be verified by a two-dimensional finite automaton, see
J. Hartmanis, D. Ranjan R. Chang, and P. Rohatyi. On IP = PSPACE and theorems with narrow proofs. EATCS-Bulletin, 41:166–174, 1990.
A verifier could be a "streaming" algorithm: it could be randomized, go over the proof in just one pass, and use a tiny amount of workspace relative to the length of the proof, see
Richard J. Lipton: Efficient Checking of Computations. STACS 1990: 207-215
The famous PCP theorem (of Arora et al.) tells us that the verifier could even be a "spot checker" which is randomized and only probes the proof at a constant number of points (which depend on the random coin tosses).
All of these are effectively different ways of characterizing the class NP: the polynomial time verifier in the definition of NP can be replaced with verifiers of the above kind.
So I believe that a good characterization of "visual proof" would turn out to give yet another way in which a simple verifier can check the proof of a theorem. However it is natural to think that maybe not all theorems with short proofs should have short visual proofs, so perhaps it is too ambitious to think that all of NP should have "visual proofs". Hence your definition problem will be a delicate combination of figuring out what the verifier should be able to do in a visual proof, and what kinds of true statements should admit such proofs. Good luck!
Addendum (added 7/1/10). The following neat paper on "Approximate Testing of Visual Properties" (doi: 10.1007/978-3-540-45198-3_31) by Sofya Raskhodnikova looks very relevant: http://people.csail.mit.edu/sofya/pixels.pdf
Best Answer
There is a body of very interesting work surrounding the proof complexity of various formulations of the well-known pigeon-hole principle, the fact that there is no injective function from a set of size $m$ to a set of size $n$, when $m\gt n$. It turns out that the difficulty of proving this depends on how much bigger $m$ is than $n$, and so we have actually a variety of pigeon-hole principles here:
Pigeon-hole principle. There is no injective map $n+1\to n$ for natural numbers $n$.
Moderately weak pigeon-hole principle. There is no injection $2n\to n$, for positive natural numbers $n$.
Weak pigeon-hole principle. There is no injection from $n^2\to n$ for natural numbers $n>1$.
Very weak pigeon-hold principle. There is no injection $\infty\to n$, for any finite $n$.
The principles become easier to prove, of course, as the domain gets larger, and there are shorter proofs when the domain is very large. The paper Proof complexity of pigeon-hole principles by Alexander Rozborov contains many interesting results on the computational proof complexity of these various formulations of the pigeon-hole principle, and in particular of the necessary long lengths of proofs in certain systems. Meanwhile, results of Sam Buss, Polynomial size proofs of the propositional pigeon-hole principle show that there is a dependence on the particular proof system that is adopted, for in certain Frege system one can still obtain polynomial size proofs.