Deep Q-learning