I am not completely sure about the audio thing...
In general, however, things such as arts and esd are sound servers, or programs that take all sound output from other programs, mix them, and send them to the dsp device on your soundcard.
ALSA and OSS are not sound servers... they are methods that allow a program to write directly to the dsp chip (/dev/dsp) so when one program has /dev/dsp open another program cannot write to the file.
So probably one of the reasons why arts sounds worse than oss is because it has to do a lot of encoding/decoding to get everything in the correct bitrate so that the soundcard can understand.
hopefully, somebody who REALLY knows what they're talking about can add on to my explanation. As far as video, I have very little idea.
|