A signal indicative of sound detected by at least one sensor at an audio device is received. The audio device may at least partially covers a pinna and the detected sound may interact with at least a torso of a human body, but can also interact with the head and shoulder. The signal is modulated with a non-linear transfer function to generate a modulated signal indicative of one or more audio cues for spatializing the detected sound while the audio device at least partially covers a pinna. Sound is output by the audio device based on the modulated signal.