Technical report: real pole balancing

ttj10@eng.cam.ac.uk
Tue Jan 12 07:01:53 EST 1993


The following technical report is available via the Cambridge
University ftp archive svr-ftp.eng.cam.ac.uk. Instructions for
retrieval from the archive follow the summary.

------------------------------------------------------------------------------
		 
		 Pole Balancing on a Real Rig using a
		  Reinforcement Learning Controller


		  Timothy Jervis and Frank Fallside


	     Cambridge University Engineering Department
		      Cambridge CB2 1PZ, England
				   


			       Abstract

In 1983, Barto, Sutton and Anderson [Barto83] published details
of an adaptive controller which learnt to balance a simulated inverted
pendulum. This reinforcement learning controller balanced the
pendulum as a by-product of avoiding a cost signal delivered to the
controller when the pendulum fell over. This paper describes their
controller learning to balance a real inverted pendulum.  As far as
the authors are aware, this is the first example of a reinforcement
learning controller applied to a real inverted pendulum and learning
in real time.

The results show that the controller was able to improve its
performance as it learnt, and that the task is computationally
tractable. However, the implementation was not straightforward.
Although some of the controller's parameters were tuned automatically
by learning, some were not and had to be carefully set for successful
control. This limits the usefulness of this kind of learning
controller to small problems which are likely to be better controlled
by other means. Before a learning controller can tackle more difficult
problems, a more powerful learning scheme has to be found.

------------------------------------------------------------------------------

			  FTP INSTRUCTIONS

		unix> ftp svr-ftp.eng.cam.ac.uk
		Name: anonymous
		Password: (your_userid at your_site)
		ftp> cd reports
		ftp> binary
		ftp> get jervis_tr115.ps.Z
		ftp> quit
		unix> uncompress jervis_tr115.ps.Z
		unix> 

		If "ftp svr-ftp.eng.cam.ac.uk" does not work,
		you might try "ftp 129.169.24.20".
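
The interactive session above can also be scripted. What follows is
only a sketch using Python's standard ftplib module, assuming the
archive is still reachable at the host named in this announcement.
The names fetch_report and main are illustrative and are not part of
the report or the archive. The binary transfer corresponds to the
"binary" step above; the downloaded file is still compressed and must
be passed to uncompress afterwards.

```python
from ftplib import FTP

# Values taken from the instructions above; the host may no longer exist.
HOST = "svr-ftp.eng.cam.ac.uk"
DIRECTORY = "reports"
FILENAME = "jervis_tr115.ps.Z"


def fetch_report(ftp, directory, filename, local_path):
    """Download `filename` from `directory` over an already logged-in session.

    `ftp` is any object with ftplib.FTP's cwd() and retrbinary() methods.
    """
    ftp.cwd(directory)  # same as "cd reports"
    with open(local_path, "wb") as out:
        # retrbinary always transfers in binary mode ("binary" + "get").
        ftp.retrbinary("RETR " + filename, out.write)
    return local_path


def main(password="your_userid@your_site"):
    """Anonymous login followed by the download (hypothetical entry point)."""
    with FTP(HOST) as ftp:
        ftp.login(user="anonymous", passwd=password)
        fetch_report(ftp, DIRECTORY, FILENAME, FILENAME)
```

Calling main() performs the network transfer; nothing is downloaded
on import. Keeping the session object as a parameter of fetch_report
lets the numeric address be substituted for the host name without
changing the download step.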


