You are probably aware of Google Code Search. It’s a very neat tool.
How hard is it to extract the thousands of valid email addresses that appear in source code files?
Do a code search for < .*@.*\..*>
Then use this shell script (stupid WordPress does some funky escaping with my HTML...)
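For illustration, here is a minimal sketch of what such an extraction step could look like. It is not the original script. It assumes you saved a results page by hand to a file (the name results.html is made up), that addresses show up either literally or with HTML-escaped angle brackets, and that a grep supporting -o (GNU or BSD) is available. The function name extract_emails is hypothetical too.

```shell
#!/bin/sh
# Hypothetical helper: pull e-mail-like strings out of a saved results page.
# Reads HTML on stdin and prints each distinct address once, sorted.
extract_emails() {
    # Decode the entities a page is likely to use around "<user@host>".
    sed -e 's/&lt;/</g' -e 's/&gt;/>/g' -e 's/&#64;/@/g' |
    # Keep only the parts of each line that look like user@host.tld.
    grep -Eo '[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}' |
    # Drop duplicates.
    sort -u
}

# Usage (results.html is an assumed filename):
#   extract_emails < results.html
```

The pattern is deliberately loose: it matches anything shaped like an address, so expect some false positives and filter further if you care about validity.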
Rinse and repeat with page 2 of 4150000...