It would be near impossible to support sound. As it is right now, the DS client downloads a .jpeg screenshot of the screen from the server. Then whenever you move the mouse or anything, it sends the information to the server, and the program moves the mouse. The DS's wireless capabilities aren't fast enough to stream audio without having it to buffer every 15 seconds.