An apnea episode determination device includes, a processor and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute, detecting a breathing segment and a midway segment from a sound signal during sleep, the breathing segment being considered to include a breathing sound, the midway segment existing in between the breathing segments calculating an acoustic feature based on a background noise component and a signal component excluding the background noise component, which are included in the midway segment and determining that the midway segment is an apnea episode when the acoustic feature meets a preset condition.