An audio output device is provided. The audio output device determines a similarity between a first external subject and a second external subject based on first biometric information about the first external object associated with the audio output device and second biometric information about the second external object associated with an external audio output device, and controls the audio output device and the external audio output device to operation in coordination or independently based on the similarity.