A closed-loop deep brain stimulation (DBS) system comprising: a physiological sensor; a multi-electrode DBS lead; an adaptive control system in communication with the physiological sensor; and an implantable pulse generator (IPG) responsive to the adaptive control system. The adaptive control system comprises a learning module operable to learn the optimal stimulation parameters, to classify patient conditions responsive to the physiological sensor, and to associate each of a plurality of patient conditions with its optimal stimulation parameters. The adaptive control system learns to deliver the optimal stimulation parameters using the recursive Q-learning formula of Watkins and Dayan, so that the closed-loop system finds the optimal stimulation parameters online.
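
The learning step can be illustrated with a minimal tabular sketch of the Watkins and Dayan recursion, Q(s,a) <- Q(s,a) + alpha [r + gamma max_a' Q(s',a') - Q(s,a)]. Everything specific here is an assumption made for illustration and is not taken from the system described above: a state stands for a classified patient condition, an action for one candidate set of stimulation parameters, and the reward for a score derived from the physiological sensor. The names `q_update`, `choose_action`, and `run_demo`, the toy reward, and the epsilon-greedy exploration rule are all hypothetical.

```python
import random


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one Q-learning update in place and return the new Q(state, action)."""
    best_next = max(q[next_state])  # value of the greedy action in the next state
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy choice (an assumed exploration rule, not from the source)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q[state]))
    row = q[state]
    return row.index(max(row))


def run_demo(steps=5000, seed=0):
    """Toy online loop: 2 hypothetical patient conditions, 3 hypothetical
    stimulation parameter sets. Returns the learned greedy action per condition."""
    rng = random.Random(seed)
    n_states, n_actions = 2, 3
    # Assumed ground truth for the toy reward: which parameter set suits
    # which condition. A real system would score the sensor signal instead.
    best = {0: 2, 1: 0}
    q = [[0.0] * n_actions for _ in range(n_states)]
    state = 0
    for _ in range(steps):
        action = choose_action(q, state, 0.2, rng)
        reward = 1.0 if action == best[state] else 0.0
        next_state = rng.randrange(n_states)  # condition changes at random
        q_update(q, state, action, reward, next_state)
        state = next_state
    return [row.index(max(row)) for row in q]


if __name__ == "__main__":
    print(run_demo())
```

Because the table is updated after every stimulation step rather than from a stored batch, the sketch learns online in the sense used above; the learning rate `alpha`, discount `gamma`, and exploration rate are arbitrary placeholder values.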