It could also exaggerate people's facial expressions and even include thought bubbles to help people interpret social cues.
Sounds, too, could be labeled with sound-effect text placed near their sources.