You are right, voice adding is adding waves, not simply adding numbers. But pcm data are numbers already. So it's still some kind of addition, not 1+1=2 style, more like 1+1=1.3 style.
For the echo, I'm not convinced your answer is correct.
Thanks a lot. I think I'm satisfied with what I learned already.