I think he means real time.

Yeah, have a look at DirectSound, the FMOD library, or OpenAL (I don't know much about that one, though).

Both DirectSound and FMOD support recording from a microphone, so I guess it could be an easy step up from there to capturing the full 'what you hear' output.